2025-12-04T08:53:38.7031671Z Current runner version: '2.329.0' 2025-12-04T08:53:38.7034609Z Runner name: 'linux.rocm.gpu.gfx942.1.b-gwk9b-runner-ppkbp' 2025-12-04T08:53:38.7035012Z Runner group name: 'default' 2025-12-04T08:53:38.7035468Z Machine name: 'linux' 2025-12-04T08:53:38.7036642Z ##[group]GITHUB_TOKEN Permissions 2025-12-04T08:53:38.7037685Z Contents: read 2025-12-04T08:53:38.7037960Z Metadata: read 2025-12-04T08:53:38.7038241Z ##[endgroup] 2025-12-04T08:53:38.7039264Z Secret source: Actions 2025-12-04T08:53:38.7039548Z Prepare workflow directory 2025-12-04T08:53:38.7276970Z Prepare all required actions 2025-12-04T08:53:38.7297290Z Getting action download info 2025-12-04T08:53:39.2955167Z Download action repository 'pytorch/pytorch@main' (SHA:ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32) 2025-12-04T08:54:06.1233038Z Download action repository 'pytorch/test-infra@main' (SHA:39aa74d619174326f4e2fb0e216151c2f29d9ffd) 2025-12-04T08:54:07.1610140Z Download action repository 'actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-12-04T08:54:08.0363828Z Download action repository 'aws-actions/configure-aws-credentials@ececac1a45f3b08a01d2dd070d28d111c5fe6722' (SHA:ececac1a45f3b08a01d2dd070d28d111c5fe6722) 2025-12-04T08:54:08.8303333Z Getting action download info 2025-12-04T08:54:09.0312832Z Download action repository 'actions/checkout@v4' (SHA:34e114876b0b11c390a56381ad16ebd13914f8d5) 2025-12-04T08:54:09.8067680Z Getting action download info 2025-12-04T08:54:10.0341651Z Download action repository 'nick-fields/retry@v3.0.0' (SHA:7152eba30c6575329ac0576536151aca5a72780e) 2025-12-04T08:54:10.7974857Z Getting action download info 2025-12-04T08:54:11.0426769Z Uses: pytorch/pytorch/.github/workflows/_rocm-test.yml@refs/heads/main (ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32) 2025-12-04T08:54:11.0432770Z ##[group] Inputs 2025-12-04T08:54:11.0433250Z build-environment: linux-jammy-rocm-py3.10 2025-12-04T08:54:11.0443990Z test-matrix: {"include": [{"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]} 2025-12-04T08:54:11.0454881Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:11.0455819Z sync-tag: 2025-12-04T08:54:11.0457099Z timeout-minutes: 300 2025-12-04T08:54:11.0457439Z tests-to-include: 2025-12-04T08:54:11.0457753Z dashboard-tag: 2025-12-04T08:54:11.0458487Z disable-monitor: true 2025-12-04T08:54:11.0458855Z monitor-log-interval: 5 2025-12-04T08:54:11.0459240Z monitor-data-collect-interval: 1 2025-12-04T08:54:11.0459651Z ##[endgroup] 2025-12-04T08:54:11.0460299Z Complete job name: linux-jammy-rocm-py3.10 / test (default, 4, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T08:54:11.0851659Z ##[group]Run pytorch/pytorch/.github/actions/checkout-pytorch@main 2025-12-04T08:54:11.0851932Z with: 2025-12-04T08:54:11.0852029Z no-sudo: true 2025-12-04T08:54:11.0852131Z submodules: recursive 2025-12-04T08:54:11.0852232Z fetch-depth: 0 2025-12-04T08:54:11.0852371Z env: 2025-12-04T08:54:11.0852469Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:11.0852613Z ##[endgroup] 2025-12-04T08:54:11.0912016Z ##[group]Run echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T08:54:11.0912383Z echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2025-12-04T08:54:11.0920232Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:11.0920392Z env: 2025-12-04T08:54:11.0920489Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:11.0920601Z ##[endgroup] 2025-12-04T08:54:11.1229702Z ##[group]Run actions/checkout@v4 2025-12-04T08:54:11.1230161Z with: 2025-12-04T08:54:11.1230523Z ref: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:11.1230940Z fetch-depth: 0 2025-12-04T08:54:11.1231238Z submodules: recursive 2025-12-04T08:54:11.1231564Z show-progress: false 2025-12-04T08:54:11.1231892Z repository: pytorch/pytorch 2025-12-04T08:54:11.1232444Z token: *** 2025-12-04T08:54:11.1232726Z ssh-strict: true 2025-12-04T08:54:11.1233008Z ssh-user: git 2025-12-04T08:54:11.1233317Z persist-credentials: true 2025-12-04T08:54:11.1233656Z clean: true 2025-12-04T08:54:11.1233986Z sparse-checkout-cone-mode: true 2025-12-04T08:54:11.1234354Z fetch-tags: false 2025-12-04T08:54:11.1234634Z lfs: false 2025-12-04T08:54:11.1234922Z set-safe-directory: true 2025-12-04T08:54:11.1235240Z env: 2025-12-04T08:54:11.1235509Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:11.1235831Z ##[endgroup] 2025-12-04T08:54:11.1929141Z Syncing repository: pytorch/pytorch 2025-12-04T08:54:11.1930935Z ##[group]Getting Git version info 2025-12-04T08:54:11.1931451Z Working directory is '/home/runner/_work/pytorch/pytorch' 2025-12-04T08:54:11.1932181Z [command]/usr/bin/git version 2025-12-04T08:54:11.1932612Z git version 2.52.0 2025-12-04T08:54:11.1933756Z ##[endgroup] 2025-12-04T08:54:11.1939089Z Copying '/home/runner/.gitconfig' to '/home/runner/_work/_temp/e852a08e-539d-4477-8058-9b956e785261/.gitconfig' 2025-12-04T08:54:11.1940227Z Temporarily overriding HOME='/home/runner/_work/_temp/e852a08e-539d-4477-8058-9b956e785261' before making global git config changes 2025-12-04T08:54:11.1941512Z Adding repository directory to the temporary git global config as a safe directory 2025-12-04T08:54:11.1942354Z [command]/usr/bin/git config --global --add safe.directory /home/runner/_work/pytorch/pytorch 2025-12-04T08:54:11.1943527Z [command]/usr/bin/git config --local --get remote.origin.url 2025-12-04T08:54:11.1944056Z https://github.com/pytorch/pytorch 2025-12-04T08:54:11.1945199Z ##[group]Removing previously created refs, to avoid conflicts 2025-12-04T08:54:11.1945841Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-12-04T08:54:11.1946590Z refs/heads/main 2025-12-04T08:54:11.1949405Z [command]/usr/bin/git checkout --detach 2025-12-04T08:54:13.1369105Z HEAD is now at ffd9b0fb4355 Resolve collective autotuning test failure on arm (#168919) 2025-12-04T08:54:13.1434010Z [command]/usr/bin/git branch --delete --force main 2025-12-04T08:54:13.1563384Z Deleted branch main (was ffd9b0fb4355). 2025-12-04T08:54:13.1570818Z ##[endgroup] 2025-12-04T08:54:13.1574098Z [command]/usr/bin/git submodule status 2025-12-04T08:54:13.1858108Z 7e1e1fe3858c63c251c637ae41a20de425dde96f android/libs/fbjni (v0.1.0-12-g7e1e1fe) 2025-12-04T08:54:13.1929454Z 4dfe081cf6bcd15db339cf2680b9281b8451eeb3 third_party/FP16 (4dfe081) 2025-12-04T08:54:13.1998057Z b408327ac2a15ec3e43352421954f5b1967701d1 third_party/FXdiv (b408327) 2025-12-04T08:54:13.2100249Z c07e3a0400713d546e0dea2d5466dd22ea389c73 third_party/NNPACK (c07e3a0) 2025-12-04T08:54:13.2150106Z 3ebbc93ded7285963bff932c678fa367eb393ba6 third_party/NVTX (v3.1.0-313-g3ebbc93) 2025-12-04T08:54:13.2221239Z 1d8f600fd424278486eade7ed3e877c99f0846b1 third_party/VulkanMemoryAllocator (v2.1.0-982-g1d8f600) 2025-12-04T08:54:13.2533870Z 51a0103656eff6fc9bfd39a4597923c4b542c883 third_party/XNNPACK (remotes/origin/ds/ndk-1243-g51a0103656) 2025-12-04T08:54:13.2569004Z 01aae101b9e5e94d6c16a9514c9fb8df99c93150 third_party/aiter (v0.1.1-92-g01aae101) 2025-12-04T08:54:13.2597754Z 299e5928955cc62af9968370293b916f5130916f third_party/benchmark (v1.9.3) 2025-12-04T08:54:13.2667158Z 7fe50dc3da2069d6645d9deb8c017a876472a977 third_party/composable_kernel (rocm-6.4.3-459-g7fe50dc3d) 2025-12-04T08:54:13.2757768Z 89c932f313c6437c38f2982869beacc89c2f2246 third_party/cpp-httplib (v0.26.0) 2025-12-04T08:54:13.2860896Z f858c30bcb16f8effd5ff46996f0514539e17abc third_party/cpuinfo (f858c30) 2025-12-04T08:54:13.2901795Z 0b1577c8c83401237d601d0d0db5210506705396 third_party/cudnn_frontend (v0.5-61-g0b1577c) 2025-12-04T08:54:13.2996019Z f88806b1e31dfa579842638740216dd41fc6c588 third_party/cutlass (v4.3.1) 2025-12-04T08:54:13.3023146Z c0b988d39a9e47c794d699f29930ed4d7c7e13a4 third_party/fbgemm (v1.4.0-rc1-2-gc0b988d39) 2025-12-04T08:54:13.3089654Z 979702c87a8713a8e0a5e9fee122b90d2ef13be5 third_party/flash-attention (v2.7.4) 2025-12-04T08:54:13.3105840Z a2cd1ea3b6d3fee220106b5fed3f7ce8da9eb757 third_party/flatbuffers (v24.12.23) 2025-12-04T08:54:13.3375283Z 407c905e45ad75fc29bf0f9bb7c5c2fd3475976f third_party/fmt (12.1.0) 2025-12-04T08:54:13.3469856Z 3fb5c176c17c765a3492cd2f0321b0dab712f350 third_party/gemmlowp/gemmlowp (remotes/origin/revert-87-master-135-g3fb5c17) 2025-12-04T08:54:13.3578398Z 54cbae0d3a67fa890b4c3d9ee162b7860315e341 third_party/gloo (remotes/origin/gh/c-p-i-o/1/base-37-g54cbae0) 2025-12-04T08:54:13.3726802Z 52eb8108c5bdec04579160ae17225d66034bd723 third_party/googletest (release-1.8.0-3544-g52eb8108) 2025-12-04T08:54:13.3792420Z 719d8e6cd7f7a0e01b155657526d693acf97c2b3 third_party/ideep (pytorch-rls-v3.7.1) 2025-12-04T08:54:13.3833024Z dec1d23ca65ab069d225dfe40dea14f455170959 third_party/ittapi (v3.25.5) 2025-12-04T08:54:13.3972039Z 31f85df8fbd89c188f14ef10f1ec65379786b943 third_party/kineto (heads/main) 2025-12-04T08:54:13.4002979Z d7770c89632329a9914ef1a90289917597639cbe third_party/kleidiai (v1.15.0) 2025-12-04T08:54:13.4034920Z fbd8b99c2b828428947d70fdc046bb55609be93e third_party/mimalloc (v2.2.4) 2025-12-04T08:54:13.4058379Z 55f93686c01528224f448c19128836e7df245f72 third_party/nlohmann (v3.12.0) 2025-12-04T08:54:13.4260956Z e709452ef2bbc1d113faf678c24e6d3467696e83 third_party/onnx (v1.18.0) 2025-12-04T08:54:13.4277404Z a799f4aed9c94b765dcdaabaeab7d5e7e2310878 third_party/opentelemetry-cpp (v1.14.2) 2025-12-04T08:54:13.4307760Z 0fa0ef591e38c2758e3184c6c23e497b9f732ffa third_party/pocketfft (release_for_eigen-40-g0fa0ef5) 2025-12-04T08:54:13.4513347Z d1eca4e4b421cd2997495c4b4e65cea6be4e9b8a third_party/protobuf (v3.7.0-rc.2-1279-gd1eca4e4b) 2025-12-04T08:54:13.4559182Z 072586a71b55b7f8c584153d223e95687148a900 third_party/psimd (heads/master) 2025-12-04T08:54:13.4616952Z 4fe0e1e183925bf8cfa6aae24237e724a96479b8 third_party/pthreadpool (0.1-144-g4fe0e1e) 2025-12-04T08:54:13.4640626Z f5fbe867d2d26e4a0a9177a51f6e568868ad3dc8 third_party/pybind11 (v3.0.1) 2025-12-04T08:54:13.4705346Z f45429b087dd7d5bc78bb40dc7cf06425c252d67 third_party/python-peachpy (remotes/origin/pre-generated) 2025-12-04T08:54:13.4757693Z 5a1d179df9cf652951b59010a2d2075372d67f68 third_party/sleef (3.8) 2025-12-04T08:54:13.4814940Z 2b4cd91092d335a697416b2a3cb398283246849d third_party/tensorpipe (heads/main) 2025-12-04T08:54:13.4831139Z ##[group]Cleaning the repository 2025-12-04T08:54:13.4838650Z [command]/usr/bin/git clean -ffdx 2025-12-04T08:54:13.4973410Z [command]/usr/bin/git reset --hard HEAD 2025-12-04T08:54:13.5692446Z HEAD is now at ffd9b0fb4355 Resolve collective autotuning test failure on arm (#168919) 2025-12-04T08:54:13.5747121Z ##[endgroup] 2025-12-04T08:54:13.5752021Z ##[group]Disabling automatic garbage collection 2025-12-04T08:54:13.5758228Z [command]/usr/bin/git config --local gc.auto 0 2025-12-04T08:54:13.5796688Z ##[endgroup] 2025-12-04T08:54:13.5797225Z ##[group]Setting up auth 2025-12-04T08:54:13.5807506Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-12-04T08:54:13.5838633Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-12-04T08:54:13.6106507Z Entering 'android/libs/fbjni' 2025-12-04T08:54:13.6141409Z Entering 'third_party/FP16' 2025-12-04T08:54:13.6171499Z Entering 'third_party/FXdiv' 2025-12-04T08:54:13.6208359Z Entering 'third_party/NNPACK' 2025-12-04T08:54:13.6233830Z Entering 'third_party/NVTX' 2025-12-04T08:54:13.6269231Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:13.6302131Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:13.6342477Z Entering 'third_party/aiter' 2025-12-04T08:54:13.6381439Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:13.6420722Z Entering 'third_party/benchmark' 2025-12-04T08:54:13.6454836Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:13.6500986Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:13.6548409Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:13.6595501Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:13.6626357Z Entering 'third_party/cutlass' 2025-12-04T08:54:13.6663530Z Entering 'third_party/fbgemm' 2025-12-04T08:54:13.6692888Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:13.6720872Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:13.6769184Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:13.6796355Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:13.6826965Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:13.6851828Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:13.6884417Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:13.6913349Z Entering 'third_party/flash-attention' 2025-12-04T08:54:13.6947140Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:13.6989338Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:13.7019825Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:13.7045789Z Entering 'third_party/fmt' 2025-12-04T08:54:13.7077288Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:13.7131950Z Entering 'third_party/gloo' 2025-12-04T08:54:13.7157870Z Entering 'third_party/googletest' 2025-12-04T08:54:13.7194448Z Entering 'third_party/ideep' 2025-12-04T08:54:13.7223388Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:13.7270551Z Entering 'third_party/ittapi' 2025-12-04T08:54:13.7308169Z Entering 'third_party/kineto' 2025-12-04T08:54:13.7348630Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:13.7380919Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:13.7421148Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:13.7457962Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:13.7488272Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:13.7527393Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:13.7551671Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:13.7577260Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:13.7601671Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:13.7626815Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:13.7654762Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:13.7682005Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:13.7716021Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:13.7748590Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:13.7774646Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:13.7835546Z Entering 'third_party/kleidiai' 2025-12-04T08:54:13.7876826Z Entering 'third_party/mimalloc' 2025-12-04T08:54:13.7906655Z Entering 'third_party/nlohmann' 2025-12-04T08:54:13.7945095Z Entering 'third_party/onnx' 2025-12-04T08:54:13.7982334Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:13.8022177Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:13.8080614Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:13.8116219Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:13.8150924Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:13.8182917Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:13.8211879Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:13.8256215Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:13.8293198Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:13.8338860Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:13.8372682Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:13.8443532Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:13.8509935Z Entering 'third_party/pocketfft' 2025-12-04T08:54:13.8542337Z Entering 'third_party/protobuf' 2025-12-04T08:54:13.8590094Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:13.8621879Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:13.8650106Z Entering 'third_party/psimd' 2025-12-04T08:54:13.8679245Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:13.8706780Z Entering 'third_party/pybind11' 2025-12-04T08:54:13.8734210Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:13.8777017Z Entering 'third_party/sleef' 2025-12-04T08:54:13.8816464Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:13.8859038Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:13.8905097Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:13.8929676Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:13.8971435Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:13.8996613Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:13.9043393Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-12-04T08:54:13.9067207Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-12-04T08:54:13.9302829Z Entering 'android/libs/fbjni' 2025-12-04T08:54:13.9339836Z Entering 'third_party/FP16' 2025-12-04T08:54:13.9366144Z Entering 'third_party/FXdiv' 2025-12-04T08:54:13.9395474Z Entering 'third_party/NNPACK' 2025-12-04T08:54:13.9439199Z Entering 'third_party/NVTX' 2025-12-04T08:54:13.9474002Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:13.9503489Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:13.9547430Z Entering 'third_party/aiter' 2025-12-04T08:54:13.9574993Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:13.9611170Z Entering 'third_party/benchmark' 2025-12-04T08:54:13.9637894Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:13.9673228Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:13.9699517Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:13.9732236Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:13.9781245Z Entering 'third_party/cutlass' 2025-12-04T08:54:13.9833706Z Entering 'third_party/fbgemm' 2025-12-04T08:54:13.9885467Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:13.9932505Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:13.9980185Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:14.0010011Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:14.0057451Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:14.0090657Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:14.0118226Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:14.0159352Z Entering 'third_party/flash-attention' 2025-12-04T08:54:14.0204045Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:14.0241839Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:14.0283497Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:14.0319871Z Entering 'third_party/fmt' 2025-12-04T08:54:14.0368599Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:14.0405422Z Entering 'third_party/gloo' 2025-12-04T08:54:14.0437887Z Entering 'third_party/googletest' 2025-12-04T08:54:14.0470368Z Entering 'third_party/ideep' 2025-12-04T08:54:14.0496278Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:14.0547137Z Entering 'third_party/ittapi' 2025-12-04T08:54:14.0599035Z Entering 'third_party/kineto' 2025-12-04T08:54:14.0640632Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:14.0686404Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:14.0714261Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:14.0745972Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:14.0775428Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:14.0816685Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:14.0858685Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:14.0887617Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:14.0924555Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:14.0962645Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:14.0997692Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:14.1022282Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:14.1056438Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:14.1111663Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:14.1139506Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:14.1182398Z Entering 'third_party/kleidiai' 2025-12-04T08:54:14.1217844Z Entering 'third_party/mimalloc' 2025-12-04T08:54:14.1256192Z Entering 'third_party/nlohmann' 2025-12-04T08:54:14.1297544Z Entering 'third_party/onnx' 2025-12-04T08:54:14.1353149Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:14.1390146Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:14.1429596Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:14.1464552Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:14.1494469Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:14.1528732Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:14.1569193Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:14.1597838Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:14.1627290Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:14.1662596Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:14.1702842Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:14.1740018Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:14.1780498Z Entering 'third_party/pocketfft' 2025-12-04T08:54:14.1813915Z Entering 'third_party/protobuf' 2025-12-04T08:54:14.1840703Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:14.1867321Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:14.1893911Z Entering 'third_party/psimd' 2025-12-04T08:54:14.1934021Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:14.1958035Z Entering 'third_party/pybind11' 2025-12-04T08:54:14.1995237Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:14.2039294Z Entering 'third_party/sleef' 2025-12-04T08:54:14.2082352Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:14.2119340Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:14.2152163Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:14.2186776Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:14.2215233Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:14.2249247Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:14.2307288Z [command]/usr/bin/git config --local --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.2341138Z [command]/usr/bin/git submodule foreach --recursive git config --local --show-origin --name-only --get-regexp remote.origin.url 2025-12-04T08:54:14.2592313Z Entering 'android/libs/fbjni' 2025-12-04T08:54:14.2604836Z file:/home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T08:54:14.2615926Z Entering 'third_party/FP16' 2025-12-04T08:54:14.2633124Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T08:54:14.2645694Z Entering 'third_party/FXdiv' 2025-12-04T08:54:14.2658940Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T08:54:14.2683264Z Entering 'third_party/NNPACK' 2025-12-04T08:54:14.2703020Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T08:54:14.2725603Z Entering 'third_party/NVTX' 2025-12-04T08:54:14.2742473Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T08:54:14.2754353Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:14.2766447Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T08:54:14.2779130Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:14.2791314Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T08:54:14.2807820Z Entering 'third_party/aiter' 2025-12-04T08:54:14.2824841Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T08:54:14.2837348Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:14.2846904Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T08:54:14.2863361Z Entering 'third_party/benchmark' 2025-12-04T08:54:14.2874505Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:14.2886481Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:14.2898513Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T08:54:14.2913570Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:14.2926132Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T08:54:14.2937942Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:14.2949334Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T08:54:14.2961388Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:14.2973510Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T08:54:14.2986283Z Entering 'third_party/cutlass' 2025-12-04T08:54:14.2999138Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T08:54:14.3014948Z Entering 'third_party/fbgemm' 2025-12-04T08:54:14.3027026Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T08:54:14.3041058Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:14.3058006Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T08:54:14.3070001Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:14.3084030Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T08:54:14.3110355Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:14.3121256Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T08:54:14.3133528Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:14.3151059Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T08:54:14.3182832Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:14.3193385Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T08:54:14.3214894Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:14.3227499Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T08:54:14.3250867Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:14.3262423Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T08:54:14.3290410Z Entering 'third_party/flash-attention' 2025-12-04T08:54:14.3305437Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T08:54:14.3320031Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:14.3332805Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T08:54:14.3353525Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:14.3373815Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T08:54:14.3394904Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:14.3412773Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T08:54:14.3435828Z Entering 'third_party/fmt' 2025-12-04T08:54:14.3455311Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:14.3469001Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:14.3488395Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T08:54:14.3513722Z Entering 'third_party/gloo' 2025-12-04T08:54:14.3535235Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T08:54:14.3548409Z Entering 'third_party/googletest' 2025-12-04T08:54:14.3561431Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:14.3573904Z Entering 'third_party/ideep' 2025-12-04T08:54:14.3587469Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T08:54:14.3599067Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:14.3612071Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T08:54:14.3634745Z Entering 'third_party/ittapi' 2025-12-04T08:54:14.3646339Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T08:54:14.3659727Z Entering 'third_party/kineto' 2025-12-04T08:54:14.3670931Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T08:54:14.3685368Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:14.3695812Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T08:54:14.3719592Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:14.3736256Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T08:54:14.3750281Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:14.3772243Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T08:54:14.3784387Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:14.3801777Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:14.3813636Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:14.3824910Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T08:54:14.3835521Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:14.3858509Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T08:54:14.3875750Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:14.3893369Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T08:54:14.3903717Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:14.3915477Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:14.3924863Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:14.3935286Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T08:54:14.3957770Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:14.3979825Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T08:54:14.4001795Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:14.4014824Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:14.4024768Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:14.4052923Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:14.4066247Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:14.4077225Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:14.4102920Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:14.4119008Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T08:54:14.4130891Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:14.4141929Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T08:54:14.4156378Z Entering 'third_party/kleidiai' 2025-12-04T08:54:14.4174106Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T08:54:14.4194694Z Entering 'third_party/mimalloc' 2025-12-04T08:54:14.4224059Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T08:54:14.4246863Z Entering 'third_party/nlohmann' 2025-12-04T08:54:14.4259454Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T08:54:14.4283236Z Entering 'third_party/onnx' 2025-12-04T08:54:14.4297289Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T08:54:14.4315993Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:14.4328843Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:14.4342237Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:14.4355659Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T08:54:14.4367926Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:14.4378953Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:14.4401708Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:14.4413471Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:14.4426668Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:14.4438315Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T08:54:14.4450264Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:14.4462965Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T08:54:14.4476085Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:14.4493139Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T08:54:14.4507516Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:14.4525865Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T08:54:14.4536562Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:14.4559103Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:14.4581484Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:14.4593545Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:14.4605076Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:14.4618378Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:14.4640575Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:14.4654010Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T08:54:14.4674842Z Entering 'third_party/pocketfft' 2025-12-04T08:54:14.4689905Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T08:54:14.4700221Z Entering 'third_party/protobuf' 2025-12-04T08:54:14.4712444Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T08:54:14.4725496Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:14.4740001Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:14.4749580Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:14.4777216Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:14.4802604Z Entering 'third_party/psimd' 2025-12-04T08:54:14.4822649Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T08:54:14.4834832Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:14.4856786Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T08:54:14.4870742Z Entering 'third_party/pybind11' 2025-12-04T08:54:14.4895111Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:14.4908471Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:14.4921126Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T08:54:14.4942615Z Entering 'third_party/sleef' 2025-12-04T08:54:14.4969508Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T08:54:14.4994616Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:14.5009195Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T08:54:14.5029311Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:14.5054076Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:14.5066316Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:14.5078495Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T08:54:14.5089086Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:14.5101117Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T08:54:14.5112662Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:14.5132112Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:14.5148971Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:14.5167179Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T08:54:14.5211313Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5255025Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5292905Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5315149Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5335283Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5364117Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5383359Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5399598Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5429003Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5456089Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5472823Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5489447Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5510747Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5538725Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5592062Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5594151Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5596066Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5620130Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5648376Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5664826Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5682609Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5699775Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5715973Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5742651Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5758655Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5776756Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5803910Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5833082Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5850418Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5873063Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5892219Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5911076Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5930574Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5955614Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.5984961Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6008379Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6030772Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6052590Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6087073Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6106395Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6125422Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6143188Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6166010Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6183338Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6201038Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6230359Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6248036Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6269619Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6299393Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6316794Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6345342Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6363562Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6381921Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6400598Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6425408Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6444684Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6464726Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6483268Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6502533Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6521823Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6541881Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6559290Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6578010Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6612828Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6642059Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6658635Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6677018Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6697298Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6717267Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6735791Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6753602Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6771756Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6790528Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6809304Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6838989Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6857969Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6876681Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6897323Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6916862Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6937218Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6955712Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:14.6976608Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-12-04T08:54:14.7005990Z ##[endgroup] 2025-12-04T08:54:14.7006521Z ##[group]Fetching the repository 2025-12-04T08:54:14.7010047Z [command]/usr/bin/git -c protocol.version=2 fetch --prune --no-recurse-submodules origin +refs/heads/*:refs/remotes/origin/* +refs/tags/*:refs/tags/* 2025-12-04T08:54:15.3851690Z From https://github.com/pytorch/pytorch 2025-12-04T08:54:15.3853428Z - [deleted] (none) -> ciflow/inductor/160174 2025-12-04T08:54:15.3854048Z - [deleted] (none) -> ciflow/trunk/160174 2025-12-04T08:54:18.5586390Z * [new branch] 2.6.0.dev20241004+ -> origin/2.6.0.dev20241004+ 2025-12-04T08:54:18.5589218Z * [new branch] 2.9.1 -> origin/2.9.1 2025-12-04T08:54:18.5589974Z * [new branch] AaronWang04_addmmfusion_perftest -> origin/AaronWang04_addmmfusion_perftest 2025-12-04T08:54:18.5590729Z * [new branch] Flamefire-patch-1 -> origin/Flamefire-patch-1 2025-12-04T08:54:18.5591427Z * [new branch] HDCharles-2.6.0-release-notes -> origin/HDCharles-2.6.0-release-notes 2025-12-04T08:54:18.5592106Z * [new branch] HOPrintFunc -> origin/HOPrintFunc 2025-12-04T08:54:18.5592703Z * [new branch] IvanKobzarev/stack/1 -> origin/IvanKobzarev/stack/1 2025-12-04T08:54:18.5593306Z * [new branch] NicoshevSVE128 -> origin/NicoshevSVE128 2025-12-04T08:54:18.5593920Z * [new branch] PR-AOTInductorNoneBug -> origin/PR-AOTInductorNoneBug 2025-12-04T08:54:18.5594600Z * [new branch] PR-AOTInductorNoneBugFix -> origin/PR-AOTInductorNoneBugFix 2025-12-04T08:54:18.5595261Z * [new branch] PR-FixConfigsIssue -> origin/PR-FixConfigsIssue 2025-12-04T08:54:18.5596330Z * [new branch] PR-NoneBugFix-viable -> origin/PR-NoneBugFix-viable 2025-12-04T08:54:18.5596952Z * [new branch] PR-ResetToZero -> origin/PR-ResetToZero 2025-12-04T08:54:18.5597574Z * [new branch] Update-Flash-Packaging -> origin/Update-Flash-Packaging 2025-12-04T08:54:18.5598262Z * [new branch] VLA_exp -> origin/VLA_exp 2025-12-04T08:54:18.5598818Z * [new branch] activation_bench -> origin/activation_bench 2025-12-04T08:54:18.5599395Z * [new branch] addmm-heuristic -> origin/addmm-heuristic 2025-12-04T08:54:18.5599978Z * [new branch] adi/onednn_aarch64 -> origin/adi/onednn_aarch64 2025-12-04T08:54:18.5600545Z * [new branch] adi/test -> origin/adi/test 2025-12-04T08:54:18.5601077Z * [new branch] adi/test_bgemm -> origin/adi/test_bgemm 2025-12-04T08:54:18.5601634Z * [new branch] adi/test_m8g -> origin/adi/test_m8g 2025-12-04T08:54:18.5602185Z * [new branch] adi/test_onednn -> origin/adi/test_onednn 2025-12-04T08:54:18.5602767Z * [new branch] adi/test_onednn_v3.9 -> origin/adi/test_onednn_v3.9 2025-12-04T08:54:18.5603383Z * [new branch] adi/test_presve_change -> origin/adi/test_presve_change 2025-12-04T08:54:18.5604154Z * [new branch] adi/test_timm -> origin/adi/test_timm 2025-12-04T08:54:18.5604750Z * [new branch] adi/testpresve_change -> origin/adi/testpresve_change 2025-12-04T08:54:18.5605384Z * [new branch] aditew01/test/vec_bf16 -> origin/aditew01/test/vec_bf16 2025-12-04T08:54:18.5606031Z * [new branch] ah-globalfeedback-hook -> origin/ah-globalfeedback-hook 2025-12-04T08:54:18.5606692Z * [new branch] albanD-patch-1 -> origin/albanD-patch-1 2025-12-04T08:54:18.5607348Z * [new branch] also-surround-shimh -> origin/also-surround-shimh 2025-12-04T08:54:18.5607977Z * [new branch] angelayi/aot_compile -> origin/angelayi/aot_compile 2025-12-04T08:54:18.5608787Z * [new branch] angelayi/aoti_additional_files -> origin/angelayi/aoti_additional_files 2025-12-04T08:54:18.5609480Z * [new branch] angelayi/benchmark -> origin/angelayi/benchmark 2025-12-04T08:54:18.5610229Z * [new branch] angelayi/change_pytree_serialization -> origin/angelayi/change_pytree_serialization 2025-12-04T08:54:18.5610979Z * [new branch] angelayi/cpp_loader -> origin/angelayi/cpp_loader 2025-12-04T08:54:18.5611626Z * [new branch] angelayi/inductor_const -> origin/angelayi/inductor_const 2025-12-04T08:54:18.5612232Z * [new branch] angelayi/lstm -> origin/angelayi/lstm 2025-12-04T08:54:18.5612812Z * [new branch] angelayi/no_so_weight -> origin/angelayi/no_so_weight 2025-12-04T08:54:18.5613429Z * [new branch] angelayi/scan_layers -> origin/angelayi/scan_layers 2025-12-04T08:54:18.5614026Z * [new branch] angelayi/side_eff -> origin/angelayi/side_eff 2025-12-04T08:54:18.5614611Z * [new branch] angelayi/state_dict -> origin/angelayi/state_dict 2025-12-04T08:54:18.5615228Z * [new branch] angelayi/symint_input -> origin/angelayi/symint_input 2025-12-04T08:54:18.5615834Z * [new branch] angelayi/symm_mem -> origin/angelayi/symm_mem 2025-12-04T08:54:18.5616439Z * [new branch] angelayi/test_cpp -> origin/angelayi/test_cpp 2025-12-04T08:54:18.5617025Z * [new branch] angelayi/torch_size -> origin/angelayi/torch_size 2025-12-04T08:54:18.5617607Z * [new branch] annotate_assert -> origin/annotate_assert 2025-12-04T08:54:18.5618297Z * [new branch] annotate_fallback_kernel -> origin/annotate_fallback_kernel 2025-12-04T08:54:18.5619158Z * [new branch] annotation_deepcopy -> origin/annotation_deepcopy 2025-12-04T08:54:18.5619745Z * [new branch] annotation_dynamo -> origin/annotation_dynamo 2025-12-04T08:54:18.5620337Z * [new branch] aot_eager_stack_trace -> origin/aot_eager_stack_trace 2025-12-04T08:54:18.5620930Z * [new branch] aoti-cuda-alloc -> origin/aoti-cuda-alloc 2025-12-04T08:54:18.5621503Z * [new branch] aoti_const_device -> origin/aoti_const_device 2025-12-04T08:54:18.5622100Z * [new branch] aoti_fqn_name_interface -> origin/aoti_fqn_name_interface 2025-12-04T08:54:18.5622772Z * [new branch] aoti_package_weights_binary -> origin/aoti_package_weights_binary 2025-12-04T08:54:18.5623433Z * [new branch] aoti_target_windows -> origin/aoti_target_windows 2025-12-04T08:54:18.5624146Z * [new branch] arsh/feat/inductor_check_profiling -> origin/arsh/feat/inductor_check_profiling 2025-12-04T08:54:18.5624844Z * [new branch] async_tp -> origin/async_tp 2025-12-04T08:54:18.5625491Z * [new branch] atalman-inductor-perf-cu124 -> origin/atalman-inductor-perf-cu124 2025-12-04T08:54:18.5626285Z * [new branch] atalman-inductor-perf-cu124.1 -> origin/atalman-inductor-perf-cu124.1 2025-12-04T08:54:18.5627089Z * [new branch] atalman-patch-2 -> origin/atalman-patch-2 2025-12-04T08:54:18.5627665Z * [new branch] atalman-patch-3 -> origin/atalman-patch-3 2025-12-04T08:54:18.5628331Z * [new branch] atalman-patch-4 -> origin/atalman-patch-4 2025-12-04T08:54:18.5628899Z * [new branch] atalman-patch-5 -> origin/atalman-patch-5 2025-12-04T08:54:18.5629462Z * [new branch] atalman-patch-6 -> origin/atalman-patch-6 2025-12-04T08:54:18.5630031Z * [new branch] atalman-patch-7 -> origin/atalman-patch-7 2025-12-04T08:54:18.5630595Z * [new branch] atalman-patch-8 -> origin/atalman-patch-8 2025-12-04T08:54:18.5631194Z * [new branch] atalman_inductor_2.3.1 -> origin/atalman_inductor_2.3.1 2025-12-04T08:54:18.5631820Z * [new branch] atalman_inductor_2.4.0 -> origin/atalman_inductor_2.4.0 2025-12-04T08:54:18.5632448Z * [new branch] atalman_inductor_2.4.x -> origin/atalman_inductor_2.4.x 2025-12-04T08:54:18.5633132Z * [new branch] attention_benchmarking_clean -> origin/attention_benchmarking_clean 2025-12-04T08:54:18.5633838Z * [new branch] bahuang/dt_fix_scalar_add -> origin/bahuang/dt_fix_scalar_add 2025-12-04T08:54:18.5634488Z * [new branch] bahuang/fix_debug_mode -> origin/bahuang/fix_debug_mode 2025-12-04T08:54:18.5635105Z * [new branch] bahuang/fix_expand -> origin/bahuang/fix_expand 2025-12-04T08:54:18.5635933Z * [new branch] bahuang/test -> origin/bahuang/test 2025-12-04T08:54:18.5636476Z * [new branch] base/1.5 -> origin/base/1.5 2025-12-04T08:54:18.5637144Z * [new branch] batching_sdpa_efficient_attention -> origin/batching_sdpa_efficient_attention 2025-12-04T08:54:18.5637864Z * [new branch] bench_scaled_mm_ops -> origin/bench_scaled_mm_ops 2025-12-04T08:54:18.5638528Z * [new branch] benchmark-updates -> origin/benchmark-updates 2025-12-04T08:54:18.5639151Z * [new branch] benchmarking-script -> origin/benchmarking-script 2025-12-04T08:54:18.5639785Z * [new branch] bertmaher/pinbump26 -> origin/bertmaher/pinbump26 2025-12-04T08:54:18.5640378Z * [new branch] bertrand/cutlass -> origin/bertrand/cutlass 2025-12-04T08:54:18.5640980Z * [new branch] bf/bug-static-input -> origin/bf/bug-static-input 2025-12-04T08:54:18.5641660Z * [new branch] bf/cg-backend -> origin/bf/cg-backend 2025-12-04T08:54:18.5642213Z * [new branch] bf/cg-nccl-test -> origin/bf/cg-nccl-test 2025-12-04T08:54:18.5642793Z * [new branch] bf/cg-remove-check -> origin/bf/cg-remove-check 2025-12-04T08:54:18.5643423Z * [new branch] bf/clean-torchbench-hf -> origin/bf/clean-torchbench-hf 2025-12-04T08:54:18.5644051Z * [new branch] bf/combo-debug-log -> origin/bf/combo-debug-log 2025-12-04T08:54:18.5644625Z * [new branch] bf/cudagraph -> origin/bf/cudagraph 2025-12-04T08:54:18.5645361Z * [new branch] bf/cudagraph-disable-input-mutation -> origin/bf/cudagraph-disable-input-mutation 2025-12-04T08:54:18.5646503Z * [new branch] bf/cudagraph-enable-input-mutation-support-benchmark -> origin/bf/cudagraph-enable-input-mutation-support-benchmark 2025-12-04T08:54:18.5647523Z * [new branch] bf/cudagraph-partition -> origin/bf/cudagraph-partition 2025-12-04T08:54:18.5648256Z * [new branch] bf/donated-buffer-bench -> origin/bf/donated-buffer-bench 2025-12-04T08:54:18.5648894Z * [new branch] bf/dynamo-partition -> origin/bf/dynamo-partition 2025-12-04T08:54:18.5649473Z * [new branch] bf/lite -> origin/bf/lite 2025-12-04T08:54:18.5650165Z * [new branch] bf/pa-non-divisible -> origin/bf/pa-non-divisible 2025-12-04T08:54:18.5650889Z * [new branch] bf/partition-cache-free-symbols -> origin/bf/partition-cache-free-symbols 2025-12-04T08:54:18.5652005Z * [new branch] bf/partition-memory-plan -> origin/bf/partition-memory-plan 2025-12-04T08:54:18.5652684Z * [new branch] bf/partition-move-cpu -> origin/bf/partition-move-cpu 2025-12-04T08:54:18.5653375Z * [new branch] bf/partition-view-fallback -> origin/bf/partition-view-fallback 2025-12-04T08:54:18.5654088Z * [new branch] bf/remove-check-55b0c39d -> origin/bf/remove-check-55b0c39d 2025-12-04T08:54:18.5654724Z * [new branch] bf/timm-nov-26-2025 -> origin/bf/timm-nov-26-2025 2025-12-04T08:54:18.5655391Z * [new branch] bf/transformer-pin-4-57-3 -> origin/bf/transformer-pin-4-57-3 2025-12-04T08:54:18.5656117Z * [new branch] bisect_perf_hf_T5_3acc6eac492 -> origin/bisect_perf_hf_T5_3acc6eac492 2025-12-04T08:54:18.5656836Z * [new branch] bisect_perf_hf_T5_3fcf66f61fb -> origin/bisect_perf_hf_T5_3fcf66f61fb 2025-12-04T08:54:18.5657534Z * [new branch] bisect_perf_hf_T5_4009d154129 -> origin/bisect_perf_hf_T5_4009d154129 2025-12-04T08:54:18.5658316Z * [new branch] bisect_perf_hf_T5_40d0740e73d -> origin/bisect_perf_hf_T5_40d0740e73d 2025-12-04T08:54:18.5658999Z * [new branch] bisect_perf_hf_T5_5268754e -> origin/bisect_perf_hf_T5_5268754e 2025-12-04T08:54:18.5659684Z * [new branch] bisect_perf_hf_T5_7d89a8d385c -> origin/bisect_perf_hf_T5_7d89a8d385c 2025-12-04T08:54:18.5660380Z * [new branch] bisect_perf_hf_T5_b7a25c1ee7c -> origin/bisect_perf_hf_T5_b7a25c1ee7c 2025-12-04T08:54:18.5661068Z * [new branch] bisect_perf_hf_T5_c25b201583f -> origin/bisect_perf_hf_T5_c25b201583f 2025-12-04T08:54:18.5661764Z * [new branch] bisect_perf_hf_T5_c93e57efac0 -> origin/bisect_perf_hf_T5_c93e57efac0 2025-12-04T08:54:18.5662463Z * [new branch] bisect_perf_hf_T5_ca9813ea149 -> origin/bisect_perf_hf_T5_ca9813ea149 2025-12-04T08:54:18.5663141Z * [new branch] bisect_perf_hf_T5_d65f194a -> origin/bisect_perf_hf_T5_d65f194a 2025-12-04T08:54:18.5663805Z * [new branch] bisect_perf_hf_T5_da94ab0b -> origin/bisect_perf_hf_T5_da94ab0b 2025-12-04T08:54:18.5664495Z * [new branch] bisect_perf_hf_T5_da94ab0b_new -> origin/bisect_perf_hf_T5_da94ab0b_new 2025-12-04T08:54:18.5665300Z * [new branch] bisect_perf_hf_T5_db4e8a1d8a8 -> origin/bisect_perf_hf_T5_db4e8a1d8a8 2025-12-04T08:54:18.5665995Z * [new branch] bisect_perf_hf_T5_e0d97e936a2 -> origin/bisect_perf_hf_T5_e0d97e936a2 2025-12-04T08:54:18.5666684Z * [new branch] bisect_perf_hf_T5_f23621ec563 -> origin/bisect_perf_hf_T5_f23621ec563 2025-12-04T08:54:18.5667348Z * [new branch] brister/fx_device_type -> origin/brister/fx_device_type 2025-12-04T08:54:18.5668049Z * [new branch] brister/test_inductor_all_fx -> origin/brister/test_inductor_all_fx 2025-12-04T08:54:18.5668949Z * [new branch] brister/tiled_reduction_no_numel_check -> origin/brister/tiled_reduction_no_numel_check 2025-12-04T08:54:18.5669672Z * [new branch] bwd-backup -> origin/bwd-backup 2025-12-04T08:54:18.5670210Z * [new branch] c57382a49 -> origin/c57382a49 2025-12-04T08:54:18.5670757Z * [new branch] ca_0431d47eaa -> origin/ca_0431d47eaa 2025-12-04T08:54:18.5671308Z * [new branch] ca_fix_0431d47eaa -> origin/ca_fix_0431d47eaa 2025-12-04T08:54:18.5671958Z * [new branch] camyllh/test_setup_hooks_push -> origin/camyllh/test_setup_hooks_push 2025-12-04T08:54:18.5672633Z * [new branch] cccclai-patch-1 -> origin/cccclai-patch-1 2025-12-04T08:54:18.5673517Z * [new branch] cherry-pick-159969-by-pytorch_bot_bot_ -> origin/cherry-pick-159969-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5674428Z * [new branch] cherry-pick-160586-by-pytorch_bot_bot_ -> origin/cherry-pick-160586-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5675319Z * [new branch] cherry-pick-162208-by-pytorch_bot_bot_ -> origin/cherry-pick-162208-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5676199Z * [new branch] cherry-pick-163169-by-pytorch_bot_bot_ -> origin/cherry-pick-163169-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5677096Z * [new branch] cherry-pick-165086-by-pytorch_bot_bot_ -> origin/cherry-pick-165086-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5677981Z * [new branch] cherry-pick-165514-by-pytorch_bot_bot_ -> origin/cherry-pick-165514-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5678940Z * [new branch] cherry-pick-165601-by-pytorch_bot_bot_ -> origin/cherry-pick-165601-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5679825Z * [new branch] cherry-pick-165667-by-pytorch_bot_bot_ -> origin/cherry-pick-165667-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5680708Z * [new branch] cherry-pick-165815-by-pytorch_bot_bot_ -> origin/cherry-pick-165815-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5681600Z * [new branch] cherry-pick-165922-by-pytorch_bot_bot_ -> origin/cherry-pick-165922-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5682478Z * [new branch] cherry-pick-166148-by-pytorch_bot_bot_ -> origin/cherry-pick-166148-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5683365Z * [new branch] cherry-pick-166181-by-pytorch_bot_bot_ -> origin/cherry-pick-166181-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5684253Z * [new branch] cherry-pick-166404-by-pytorch_bot_bot_ -> origin/cherry-pick-166404-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5685128Z * [new branch] cherry-pick-166427-by-pytorch_bot_bot_ -> origin/cherry-pick-166427-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5686014Z * [new branch] cherry-pick-166480-by-pytorch_bot_bot_ -> origin/cherry-pick-166480-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5686895Z * [new branch] cherry-pick-166570-by-pytorch_bot_bot_ -> origin/cherry-pick-166570-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5687772Z * [new branch] cherry-pick-166993-by-pytorch_bot_bot_ -> origin/cherry-pick-166993-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5688719Z * [new branch] cherry-pick-167111-by-pytorch_bot_bot_ -> origin/cherry-pick-167111-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5689688Z * [new branch] cherry-pick-167478-by-pytorch_bot_bot_ -> origin/cherry-pick-167478-by-pytorch_bot_bot_ 2025-12-04T08:54:18.5690449Z * [new branch] cherry_pick_166036_166040 -> origin/cherry_pick_166036_166040 2025-12-04T08:54:18.5691077Z * [new branch] cherry_pick_166457 -> origin/cherry_pick_166457 2025-12-04T08:54:18.5691666Z * [new branch] cherrypick_166338 -> origin/cherrypick_166338 2025-12-04T08:54:18.5692248Z * [new branch] cherrypick_166458 -> origin/cherrypick_166458 2025-12-04T08:54:18.5692822Z * [new branch] cherrypick_166586 -> origin/cherrypick_166586 2025-12-04T08:54:18.5693392Z * [new branch] cherrypick_166956 -> origin/cherrypick_166956 2025-12-04T08:54:18.5693939Z * [new branch] ci_attn -> origin/ci_attn 2025-12-04T08:54:18.5694478Z * [new branch] codex-testing -> origin/codex-testing 2025-12-04T08:54:18.5695340Z * [new branch] codex/add-check_memory_overlap-helper-functions -> origin/codex/add-check_memory_overlap-helper-functions 2025-12-04T08:54:18.5696325Z * [new branch] codex/fix-issue-121219-in-pytorch -> origin/codex/fix-issue-121219-in-pytorch 2025-12-04T08:54:18.5697464Z * [new branch] codex/investigate-segfaults-in-get_tensor_storage_id -> origin/codex/investigate-segfaults-in-get_tensor_storage_id 2025-12-04T08:54:18.5698727Z * [new branch] codex/refactor-lintrunner-config-to-use-uv-run -> origin/codex/refactor-lintrunner-config-to-use-uv-run 2025-12-04T08:54:18.5699596Z * [new branch] compatiblpy39util -> origin/compatiblpy39util 2025-12-04T08:54:18.5700176Z * [new branch] cond_hop_device -> origin/cond_hop_device 2025-12-04T08:54:18.5700733Z * [new branch] context_test -> origin/context_test 2025-12-04T08:54:18.5701500Z * [new branch] copilot/code-style-cleanup-python-pip -> origin/copilot/code-style-cleanup-python-pip 2025-12-04T08:54:18.5702289Z * [new branch] cpio/fix_new_ami_tests -> origin/cpio/fix_new_ami_tests 2025-12-04T08:54:18.5703001Z * [new branch] cpp-docs-dependency-upgrade -> origin/cpp-docs-dependency-upgrade 2025-12-04T08:54:18.5703712Z * [new branch] csl/always_produce_xml -> origin/csl/always_produce_xml 2025-12-04T08:54:18.5704361Z * [new branch] csl/build_test_more_procs -> origin/csl/build_test_more_procs 2025-12-04T08:54:18.5705039Z * [new branch] csl/build_test_more_procs2 -> origin/csl/build_test_more_procs2 2025-12-04T08:54:18.5705658Z * [new branch] csl/clean_up -> origin/csl/clean_up 2025-12-04T08:54:18.5706275Z * [new branch] csl/fix_retry_segfault_exit -> origin/csl/fix_retry_segfault_exit 2025-12-04T08:54:18.5706896Z * [new branch] csl/katex -> origin/csl/katex 2025-12-04T08:54:18.5707448Z * [new branch] csl/larger_runner -> origin/csl/larger_runner 2025-12-04T08:54:18.5708016Z * [new branch] csl/lint_testing -> origin/csl/lint_testing 2025-12-04T08:54:18.5708645Z * [new branch] csl/lint_thing -> origin/csl/lint_thing 2025-12-04T08:54:18.5709239Z * [new branch] csl/lintrunner_stuff -> origin/csl/lintrunner_stuff 2025-12-04T08:54:18.5709852Z * [new branch] csl/manually_gen_json -> origin/csl/manually_gen_json 2025-12-04T08:54:18.5710450Z * [new branch] csl/mps_sharding -> origin/csl/mps_sharding 2025-12-04T08:54:18.5711040Z * [new branch] csl/multistage_docker -> origin/csl/multistage_docker 2025-12-04T08:54:18.5711641Z * [new branch] csl/print_timing -> origin/csl/print_timing 2025-12-04T08:54:18.5712237Z * [new branch] csl/remove_experiment -> origin/csl/remove_experiment 2025-12-04T08:54:18.5712983Z * [new branch] csl/remove_maybe_unused_var -> origin/csl/remove_maybe_unused_var 2025-12-04T08:54:18.5713746Z * [new branch] csl/remove_repo_specific_autolabel -> origin/csl/remove_repo_specific_autolabel 2025-12-04T08:54:18.5714485Z * [new branch] csl/remove_run_parallel -> origin/csl/remove_run_parallel 2025-12-04T08:54:18.5715116Z * [new branch] csl/remove_unused_vars -> origin/csl/remove_unused_vars 2025-12-04T08:54:18.5715717Z * [new branch] csl/revert_open -> origin/csl/revert_open 2025-12-04T08:54:18.5716287Z * [new branch] csl/skip_build -> origin/csl/skip_build 2025-12-04T08:54:18.5716913Z * [new branch] csl/smaller_avx_amx_runenrs -> origin/csl/smaller_avx_amx_runenrs 2025-12-04T08:54:18.5717542Z * [new branch] csl/td_job_level -> origin/csl/td_job_level 2025-12-04T08:54:18.5718260Z * [new branch] csl/test_cuda_build_large_runner -> origin/csl/test_cuda_build_large_runner 2025-12-04T08:54:18.5719066Z * [new branch] csl/test_owners_autograd_dispatch_nn -> origin/csl/test_owners_autograd_dispatch_nn 2025-12-04T08:54:18.5719886Z * [new branch] csl/test_owners_higher_confidence -> origin/csl/test_owners_higher_confidence 2025-12-04T08:54:18.5720686Z * [new branch] csl/upload_json_running -> origin/csl/upload_json_running 2025-12-04T08:54:18.5721296Z * [new branch] csl/win_sccache -> origin/csl/win_sccache 2025-12-04T08:54:18.5721842Z * [new branch] csl/xml_stuff -> origin/csl/xml_stuff 2025-12-04T08:54:18.5722398Z * [new branch] cublasrelax2 -> origin/cublasrelax2 2025-12-04T08:54:18.5722944Z * [new branch] cuda_mempool -> origin/cuda_mempool 2025-12-04T08:54:18.5723525Z * [new branch] custom_lowering_dict -> origin/custom_lowering_dict 2025-12-04T08:54:18.5724177Z * [new branch] d4l3k/debug_plane_frtrace -> origin/d4l3k/debug_plane_frtrace 2025-12-04T08:54:18.5724784Z * [new branch] daxia6/2.8o3 -> origin/daxia6/2.8o3 2025-12-04T08:54:18.5725325Z * [new branch] debug-guard -> origin/debug-guard 2025-12-04T08:54:18.5725911Z * [new branch] delete-quant-docs -> origin/delete-quant-docs 2025-12-04T08:54:18.5726976Z * [new branch] dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.57.0 -> origin/dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.57.0 2025-12-04T08:54:18.5728526Z * [new branch] dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.57.1 -> origin/dependabot/pip/dot-ci/docker/ci_commit_pins/main/transformers-4.57.1 2025-12-04T08:54:18.5729617Z * [new branch] desertfire/test_cpp_wrapper -> origin/desertfire/test_cpp_wrapper 2025-12-04T08:54:18.5730408Z * [new branch] desertfire/triton-cpu-for-aarch64 -> origin/desertfire/triton-cpu-for-aarch64 2025-12-04T08:54:18.5731165Z * [new branch] dev/dhruva/flex_attn_opt -> origin/dev/dhruva/flex_attn_opt 2025-12-04T08:54:18.5731826Z * [new branch] dev/joona/MPSNDArrayAdd -> origin/dev/joona/MPSNDArrayAdd 2025-12-04T08:54:18.5732459Z * [new branch] dev/joona/Unranked -> origin/dev/joona/Unranked 2025-12-04T08:54:18.5733038Z * [new branch] dev/joona/cat -> origin/dev/joona/cat 2025-12-04T08:54:18.5733631Z * [new branch] dev/joona/embeddingbag -> origin/dev/joona/embeddingbag 2025-12-04T08:54:18.5734299Z * [new branch] dev/joona/fix_sdpa_memtest -> origin/dev/joona/fix_sdpa_memtest 2025-12-04T08:54:18.5735004Z * [new branch] dev/joona/getTensorsString -> origin/dev/joona/getTensorsString 2025-12-04T08:54:18.5735725Z * [new branch] dev/joona/mps_linear_macos14 -> origin/dev/joona/mps_linear_macos14 2025-12-04T08:54:18.5736505Z * [new branch] dev/joona/scalar_clamp -> origin/dev/joona/scalar_clamp 2025-12-04T08:54:18.5737105Z * [new branch] dev/joona/sdpa -> origin/dev/joona/sdpa 2025-12-04T08:54:18.5737690Z * [new branch] dev/joona/sdpa_api -> origin/dev/joona/sdpa_api 2025-12-04T08:54:18.5738343Z * [new branch] dev/joona/type_inf -> origin/dev/joona/type_inf 2025-12-04T08:54:18.5738982Z * [new branch] dev/joona/ulpAssertClose -> origin/dev/joona/ulpAssertClose 2025-12-04T08:54:18.5739612Z * [new branch] dev/joona/upsize3d -> origin/dev/joona/upsize3d 2025-12-04T08:54:18.5740182Z * [new branch] disp_counter -> origin/disp_counter 2025-12-04T08:54:18.5740766Z * [new branch] divyanshk-patch-1 -> origin/divyanshk-patch-1 2025-12-04T08:54:18.5741330Z * [new branch] docs -> origin/docs 2025-12-04T08:54:18.5741861Z * [new branch] documentation -> origin/documentation 2025-12-04T08:54:18.5742454Z * [new branch] eager_model_benchmarks -> origin/eager_model_benchmarks 2025-12-04T08:54:18.5743130Z * [new branch] embg/test_inductor_ci_control -> origin/embg/test_inductor_ci_control 2025-12-04T08:54:18.5743949Z * [new branch] embg/triton_l2_prefetch_128B -> origin/embg/triton_l2_prefetch_128B 2025-12-04T08:54:18.5744664Z * [new branch] embg/triton_l2_prefetch_256B -> origin/embg/triton_l2_prefetch_256B 2025-12-04T08:54:18.5745287Z * [new branch] eqy-patch-1 -> origin/eqy-patch-1 2025-12-04T08:54:18.5745837Z * [new branch] eqy-patch-2 -> origin/eqy-patch-2 2025-12-04T08:54:18.5746375Z * [new branch] eqy-patch-3 -> origin/eqy-patch-3 2025-12-04T08:54:18.5746902Z * [new branch] eqy-patch-4 -> origin/eqy-patch-4 2025-12-04T08:54:18.5747439Z * [new branch] eqy-patch-5 -> origin/eqy-patch-5 2025-12-04T08:54:18.5747968Z * [new branch] eqy-patch-6 -> origin/eqy-patch-6 2025-12-04T08:54:18.5748597Z * [new branch] exclamaforte/amd-ma -> origin/exclamaforte/amd-ma 2025-12-04T08:54:18.5749372Z * [new branch] exclamaforte/combo-kernels-perf-run -> origin/exclamaforte/combo-kernels-perf-run 2025-12-04T08:54:18.5750209Z * [new branch] exclamaforte/do_bench_refactor -> origin/exclamaforte/do_bench_refactor 2025-12-04T08:54:18.5751026Z * [new branch] exclamaforte/enable-mem-dep-fusion -> origin/exclamaforte/enable-mem-dep-fusion 2025-12-04T08:54:18.5751987Z * [new branch] exclamaforte/fix-exhaustive-autotuning -> origin/exclamaforte/fix-exhaustive-autotuning 2025-12-04T08:54:18.5752931Z * [new branch] exclamaforte/fix-trace-parsing-fx-svg -> origin/exclamaforte/fix-trace-parsing-fx-svg 2025-12-04T08:54:18.5753922Z * [new branch] exclamaforte/force-pointwise-cat-perf-run -> origin/exclamaforte/force-pointwise-cat-perf-run 2025-12-04T08:54:18.5754783Z * [new branch] exclamaforte/fusion-data -> origin/exclamaforte/fusion-data 2025-12-04T08:54:18.5755531Z * [new branch] exclamaforte/gemm-benchmark-run -> origin/exclamaforte/gemm-benchmark-run 2025-12-04T08:54:18.5756328Z * [new branch] exclamaforte/gemm-export-model -> origin/exclamaforte/gemm-export-model 2025-12-04T08:54:18.5757053Z * [new branch] exclamaforte/gemm-model -> origin/exclamaforte/gemm-model 2025-12-04T08:54:18.5757915Z * [new branch] exclamaforte/gemm-model-all-data-collection -> origin/exclamaforte/gemm-model-all-data-collection 2025-12-04T08:54:18.5758836Z * [new branch] exclamaforte/gemm-to-amd -> origin/exclamaforte/gemm-to-amd 2025-12-04T08:54:18.5759655Z * [new branch] exclamaforte/just-gemm-model -> origin/exclamaforte/just-gemm-model 2025-12-04T08:54:18.5760524Z * [new branch] exclamaforte/just-gemm-model-no-refactor -> origin/exclamaforte/just-gemm-model-no-refactor 2025-12-04T08:54:18.5761411Z * [new branch] exclamaforte/profile-diff-algo -> origin/exclamaforte/profile-diff-algo 2025-12-04T08:54:18.5762272Z * [new branch] exclamaforte/profiler-visualization -> origin/exclamaforte/profiler-visualization 2025-12-04T08:54:18.5763135Z * [new branch] exclamaforte/test_cpp_wrapper_mode -> origin/exclamaforte/test_cpp_wrapper_mode 2025-12-04T08:54:18.5763997Z * [new branch] exclamaforte/update-autotune-configs -> origin/exclamaforte/update-autotune-configs 2025-12-04T08:54:18.5764930Z * [new branch] exclamaforte/update-autotune-configs-2 -> origin/exclamaforte/update-autotune-configs-2 2025-12-04T08:54:18.5765667Z * [new branch] exec -> origin/exec 2025-12-04T08:54:18.5766233Z * [new branch] experimental-mosaic -> origin/experimental-mosaic 2025-12-04T08:54:18.5766845Z * [new branch] export-D61047529 -> origin/export-D61047529 2025-12-04T08:54:18.5767416Z * [new branch] export-D71412006 -> origin/export-D71412006 2025-12-04T08:54:18.5768060Z * [new branch] export-D73042989 -> origin/export-D73042989 2025-12-04T08:54:18.5768799Z * [new branch] export-D78957093 -> origin/export-D78957093 2025-12-04T08:54:18.5769372Z * [new branch] export-D78996107 -> origin/export-D78996107 2025-12-04T08:54:18.5769926Z * [new branch] export-D80823877 -> origin/export-D80823877 2025-12-04T08:54:18.5770577Z * [new branch] export-D80958642 -> origin/export-D80958642 2025-12-04T08:54:18.5771134Z * [new branch] export-D81054193 -> origin/export-D81054193 2025-12-04T08:54:18.5771691Z * [new branch] export-D81204584 -> origin/export-D81204584 2025-12-04T08:54:18.5772248Z * [new branch] export-D81429090 -> origin/export-D81429090 2025-12-04T08:54:18.5772799Z * [new branch] export-D82250826 -> origin/export-D82250826 2025-12-04T08:54:18.5773357Z * [new branch] export-D82253817 -> origin/export-D82253817 2025-12-04T08:54:18.5773919Z * [new branch] export-D83541846 -> origin/export-D83541846 2025-12-04T08:54:18.5774471Z * [new branch] export-D83627170 -> origin/export-D83627170 2025-12-04T08:54:18.5775029Z * [new branch] export-D83766701 -> origin/export-D83766701 2025-12-04T08:54:18.5775581Z * [new branch] export-D83768878 -> origin/export-D83768878 2025-12-04T08:54:18.5776132Z * [new branch] export-D83769447 -> origin/export-D83769447 2025-12-04T08:54:18.5776694Z * [new branch] export-D84089824 -> origin/export-D84089824 2025-12-04T08:54:18.5777250Z * [new branch] export-D84213020 -> origin/export-D84213020 2025-12-04T08:54:18.5777800Z * [new branch] export-D84373821 -> origin/export-D84373821 2025-12-04T08:54:18.5778425Z * [new branch] export-D84612194 -> origin/export-D84612194 2025-12-04T08:54:18.5778986Z * [new branch] export-D84890985 -> origin/export-D84890985 2025-12-04T08:54:18.5779540Z * [new branch] export-D85122326 -> origin/export-D85122326 2025-12-04T08:54:18.5780089Z * [new branch] export-D86256198 -> origin/export-D86256198 2025-12-04T08:54:18.5780648Z * [new branch] export-D86460608 -> origin/export-D86460608 2025-12-04T08:54:18.5781197Z * [new branch] export-D86474796 -> origin/export-D86474796 2025-12-04T08:54:18.5781752Z * [new branch] export-D86712396 -> origin/export-D86712396 2025-12-04T08:54:18.5782403Z * [new branch] export-D87022129 -> origin/export-D87022129 2025-12-04T08:54:18.5782952Z * [new branch] export-D87838959 -> origin/export-D87838959 2025-12-04T08:54:18.5783505Z * [new branch] export-D88319437 -> origin/export-D88319437 2025-12-04T08:54:18.5784219Z * [new branch] exported-model-train-idempotent -> origin/exported-model-train-idempotent 2025-12-04T08:54:18.5784969Z * [new branch] ezyang-titan-october -> origin/ezyang-titan-october 2025-12-04T08:54:18.5785611Z * [new branch] ezyang-titan-october2 -> origin/ezyang-titan-october2 2025-12-04T08:54:18.5786208Z * [new branch] ezyang-war -> origin/ezyang-war 2025-12-04T08:54:18.5786851Z * [new branch] ezyang/wip-aot-descriptors -> origin/ezyang/wip-aot-descriptors 2025-12-04T08:54:18.5787494Z * [new branch] fa_u8_brgemm -> origin/fa_u8_brgemm 2025-12-04T08:54:18.5788156Z * [new branch] fadeputr/sequence_fbgemm -> origin/fadeputr/sequence_fbgemm 2025-12-04T08:54:18.5788784Z * [new branch] fastmath_baseline -> origin/fastmath_baseline 2025-12-04T08:54:18.5789532Z * [new branch] fbcode/warm -> origin/fbcode/warm 2025-12-04T08:54:18.5790148Z * [new branch] fca -> origin/fca 2025-12-04T08:54:18.5790670Z * [new branch] fca2_ca5984c -> origin/fca2_ca5984c 2025-12-04T08:54:18.5791195Z * [new branch] fca5 -> origin/fca5 2025-12-04T08:54:18.5791771Z * [new branch] feature/justknobs-cpp -> origin/feature/justknobs-cpp 2025-12-04T08:54:18.5792421Z * [new branch] feature/numa-forkserver -> origin/feature/numa-forkserver 2025-12-04T08:54:18.5793045Z * [new branch] ffast_math_baseline -> origin/ffast_math_baseline 2025-12-04T08:54:18.5793634Z * [new branch] ffast_math_target -> origin/ffast_math_target 2025-12-04T08:54:18.5794220Z * [new branch] findhao/base_commit -> origin/findhao/base_commit 2025-12-04T08:54:18.5794824Z * [new branch] findhao/base_commit1 -> origin/findhao/base_commit1 2025-12-04T08:54:18.5795439Z * [new branch] findhao/multistream2 -> origin/findhao/multistream2 2025-12-04T08:54:18.5796052Z * [new branch] findhao/multistream5 -> origin/findhao/multistream5 2025-12-04T08:54:18.5796658Z * [new branch] findhao/multistream6 -> origin/findhao/multistream6 2025-12-04T08:54:18.5797290Z * [new branch] findhao/operatorbench3 -> origin/findhao/operatorbench3 2025-12-04T08:54:18.5797941Z * [new branch] findhao/operatorbench5 -> origin/findhao/operatorbench5 2025-12-04T08:54:18.5798617Z * [new branch] findhao/tritonparse -> origin/findhao/tritonparse 2025-12-04T08:54:18.5799317Z * [new branch] fix-ck-gemm-template-format -> origin/fix-ck-gemm-template-format 2025-12-04T08:54:18.5799999Z * [new branch] fix-config-ignore -> origin/fix-config-ignore 2025-12-04T08:54:18.5800582Z * [new branch] fix-dict-guard -> origin/fix-dict-guard 2025-12-04T08:54:18.5801153Z * [new branch] fix_addmm_issue -> origin/fix_addmm_issue 2025-12-04T08:54:18.5801791Z * [new branch] fix_amd_missing_cluster_dims -> origin/fix_amd_missing_cluster_dims 2025-12-04T08:54:18.5802439Z * [new branch] fix_bench_bwd_pass -> origin/fix_bench_bwd_pass 2025-12-04T08:54:18.5803054Z * [new branch] fix_mem_profiler_config -> origin/fix_mem_profiler_config 2025-12-04T08:54:18.5803661Z * [new branch] fix_nvrtc_discovery -> origin/fix_nvrtc_discovery 2025-12-04T08:54:18.5804227Z * [new branch] fix_op_runner -> origin/fix_op_runner 2025-12-04T08:54:18.5804862Z * [new branch] fix_ubn_159469 -> origin/fix_ubn_159469 2025-12-04T08:54:18.5805414Z * [new branch] fixes-triage -> origin/fixes-triage 2025-12-04T08:54:18.5805970Z * [new branch] fixflashinfer -> origin/fixflashinfer 2025-12-04T08:54:18.5806551Z * [new branch] flash_decoding_cpu -> origin/flash_decoding_cpu 2025-12-04T08:54:18.5807133Z * [new branch] flex-flash -> origin/flex-flash 2025-12-04T08:54:18.5807771Z * [new branch] flex_attention_functorch_grad -> origin/flex_attention_functorch_grad 2025-12-04T08:54:18.5808458Z * [new branch] flex_flash -> origin/flex_flash 2025-12-04T08:54:18.5809111Z * [new branch] fmassa/fix_memeff_sharding_rule -> origin/fmassa/fix_memeff_sharding_rule 2025-12-04T08:54:18.5809913Z * [new branch] fmassa/tests_comm_compute_scheduler -> origin/fmassa/tests_comm_compute_scheduler 2025-12-04T08:54:18.5810624Z * [new branch] forkserver_fix -> origin/forkserver_fix 2025-12-04T08:54:18.5811196Z * [new branch] fsdp2_trace_rules -> origin/fsdp2_trace_rules 2025-12-04T08:54:18.5811751Z * [new branch] fx_cpp -> origin/fx_cpp 2025-12-04T08:54:18.5812363Z * [new branch] fy/fix-win -> origin/fy/fix-win 2025-12-04T08:54:18.5812911Z * [new branch] galv-patch-1 -> origin/galv-patch-1 2025-12-04T08:54:18.5813656Z * [new branch] galv/cudagraphs-conditional-nodes-4 -> origin/galv/cudagraphs-conditional-nodes-4 2025-12-04T08:54:18.5814485Z * [new branch] georgehong/cmakelists-patch -> origin/georgehong/cmakelists-patch 2025-12-04T08:54:18.5815159Z * [new branch] gh/AlnisM/1/base -> origin/gh/AlnisM/1/base 2025-12-04T08:54:18.5815742Z * [new branch] gh/AlnisM/1/head -> origin/gh/AlnisM/1/head 2025-12-04T08:54:18.5816337Z * [new branch] gh/EikanWang/67/base -> origin/gh/EikanWang/67/base 2025-12-04T08:54:18.5816949Z * [new branch] gh/EikanWang/67/head -> origin/gh/EikanWang/67/head 2025-12-04T08:54:18.5817560Z * [new branch] gh/Gasoonjia/1/base -> origin/gh/Gasoonjia/1/base 2025-12-04T08:54:18.5818222Z * [new branch] gh/Gasoonjia/1/head -> origin/gh/Gasoonjia/1/head 2025-12-04T08:54:18.5818812Z * [new branch] gh/H-Huang/131/base -> origin/gh/H-Huang/131/base 2025-12-04T08:54:18.5819398Z * [new branch] gh/H-Huang/131/head -> origin/gh/H-Huang/131/head 2025-12-04T08:54:18.5819974Z * [new branch] gh/H-Huang/131/orig -> origin/gh/H-Huang/131/orig 2025-12-04T08:54:18.5820553Z * [new branch] gh/H-Huang/132/base -> origin/gh/H-Huang/132/base 2025-12-04T08:54:18.5821137Z * [new branch] gh/H-Huang/132/head -> origin/gh/H-Huang/132/head 2025-12-04T08:54:18.5821713Z * [new branch] gh/H-Huang/132/orig -> origin/gh/H-Huang/132/orig 2025-12-04T08:54:18.5822286Z * [new branch] gh/H-Huang/180/base -> origin/gh/H-Huang/180/base 2025-12-04T08:54:18.5822859Z * [new branch] gh/H-Huang/180/head -> origin/gh/H-Huang/180/head 2025-12-04T08:54:18.5823438Z * [new branch] gh/H-Huang/180/orig -> origin/gh/H-Huang/180/orig 2025-12-04T08:54:18.5824011Z * [new branch] gh/H-Huang/182/base -> origin/gh/H-Huang/182/base 2025-12-04T08:54:18.5824589Z * [new branch] gh/H-Huang/182/head -> origin/gh/H-Huang/182/head 2025-12-04T08:54:18.5825162Z * [new branch] gh/H-Huang/182/orig -> origin/gh/H-Huang/182/orig 2025-12-04T08:54:18.5825745Z * [new branch] gh/H-Huang/226/base -> origin/gh/H-Huang/226/base 2025-12-04T08:54:18.5826411Z * [new branch] gh/H-Huang/226/head -> origin/gh/H-Huang/226/head 2025-12-04T08:54:18.5826981Z * [new branch] gh/H-Huang/226/orig -> origin/gh/H-Huang/226/orig 2025-12-04T08:54:18.5827557Z * [new branch] gh/H-Huang/228/base -> origin/gh/H-Huang/228/base 2025-12-04T08:54:18.5828179Z * [new branch] gh/H-Huang/228/head -> origin/gh/H-Huang/228/head 2025-12-04T08:54:18.5828761Z * [new branch] gh/H-Huang/228/orig -> origin/gh/H-Huang/228/orig 2025-12-04T08:54:18.5829391Z * [new branch] gh/IvanKobzarev/150/base -> origin/gh/IvanKobzarev/150/base 2025-12-04T08:54:18.5830056Z * [new branch] gh/IvanKobzarev/150/head -> origin/gh/IvanKobzarev/150/head 2025-12-04T08:54:18.5830708Z * [new branch] gh/IvanKobzarev/150/orig -> origin/gh/IvanKobzarev/150/orig 2025-12-04T08:54:18.5831357Z * [new branch] gh/IvanKobzarev/157/base -> origin/gh/IvanKobzarev/157/base 2025-12-04T08:54:18.5832012Z * [new branch] gh/IvanKobzarev/157/head -> origin/gh/IvanKobzarev/157/head 2025-12-04T08:54:18.5832660Z * [new branch] gh/IvanKobzarev/157/orig -> origin/gh/IvanKobzarev/157/orig 2025-12-04T08:54:18.5833301Z * [new branch] gh/IvanKobzarev/159/base -> origin/gh/IvanKobzarev/159/base 2025-12-04T08:54:18.5834026Z * [new branch] gh/IvanKobzarev/159/head -> origin/gh/IvanKobzarev/159/head 2025-12-04T08:54:18.5834673Z * [new branch] gh/IvanKobzarev/159/orig -> origin/gh/IvanKobzarev/159/orig 2025-12-04T08:54:18.5835321Z * [new branch] gh/IvanKobzarev/162/base -> origin/gh/IvanKobzarev/162/base 2025-12-04T08:54:18.5835961Z * [new branch] gh/IvanKobzarev/162/head -> origin/gh/IvanKobzarev/162/head 2025-12-04T08:54:18.5836605Z * [new branch] gh/IvanKobzarev/162/orig -> origin/gh/IvanKobzarev/162/orig 2025-12-04T08:54:18.5837246Z * [new branch] gh/IvanKobzarev/163/base -> origin/gh/IvanKobzarev/163/base 2025-12-04T08:54:18.5837893Z * [new branch] gh/IvanKobzarev/163/head -> origin/gh/IvanKobzarev/163/head 2025-12-04T08:54:18.5838580Z * [new branch] gh/IvanKobzarev/163/orig -> origin/gh/IvanKobzarev/163/orig 2025-12-04T08:54:18.5839228Z * [new branch] gh/IvanKobzarev/166/base -> origin/gh/IvanKobzarev/166/base 2025-12-04T08:54:18.5839875Z * [new branch] gh/IvanKobzarev/166/head -> origin/gh/IvanKobzarev/166/head 2025-12-04T08:54:18.5840517Z * [new branch] gh/IvanKobzarev/166/orig -> origin/gh/IvanKobzarev/166/orig 2025-12-04T08:54:18.5841162Z * [new branch] gh/IvanKobzarev/167/base -> origin/gh/IvanKobzarev/167/base 2025-12-04T08:54:18.5841798Z * [new branch] gh/IvanKobzarev/167/head -> origin/gh/IvanKobzarev/167/head 2025-12-04T08:54:18.5842445Z * [new branch] gh/IvanKobzarev/167/orig -> origin/gh/IvanKobzarev/167/orig 2025-12-04T08:54:18.5843095Z * [new branch] gh/IvanKobzarev/168/base -> origin/gh/IvanKobzarev/168/base 2025-12-04T08:54:18.5843736Z * [new branch] gh/IvanKobzarev/168/head -> origin/gh/IvanKobzarev/168/head 2025-12-04T08:54:18.5844382Z * [new branch] gh/IvanKobzarev/168/orig -> origin/gh/IvanKobzarev/168/orig 2025-12-04T08:54:18.5845028Z * [new branch] gh/IvanKobzarev/169/base -> origin/gh/IvanKobzarev/169/base 2025-12-04T08:54:18.5845669Z * [new branch] gh/IvanKobzarev/169/head -> origin/gh/IvanKobzarev/169/head 2025-12-04T08:54:18.5846309Z * [new branch] gh/IvanKobzarev/169/orig -> origin/gh/IvanKobzarev/169/orig 2025-12-04T08:54:18.5846956Z * [new branch] gh/IvanKobzarev/170/base -> origin/gh/IvanKobzarev/170/base 2025-12-04T08:54:18.5847592Z * [new branch] gh/IvanKobzarev/170/head -> origin/gh/IvanKobzarev/170/head 2025-12-04T08:54:18.5848286Z * [new branch] gh/IvanKobzarev/170/orig -> origin/gh/IvanKobzarev/170/orig 2025-12-04T08:54:18.5849041Z * [new branch] gh/IvanKobzarev/171/base -> origin/gh/IvanKobzarev/171/base 2025-12-04T08:54:18.5849679Z * [new branch] gh/IvanKobzarev/171/head -> origin/gh/IvanKobzarev/171/head 2025-12-04T08:54:18.5850323Z * [new branch] gh/IvanKobzarev/171/orig -> origin/gh/IvanKobzarev/171/orig 2025-12-04T08:54:18.5850967Z * [new branch] gh/IvanKobzarev/172/base -> origin/gh/IvanKobzarev/172/base 2025-12-04T08:54:18.5851609Z * [new branch] gh/IvanKobzarev/172/head -> origin/gh/IvanKobzarev/172/head 2025-12-04T08:54:18.5852250Z * [new branch] gh/IvanKobzarev/172/orig -> origin/gh/IvanKobzarev/172/orig 2025-12-04T08:54:18.5852898Z * [new branch] gh/IvanKobzarev/173/base -> origin/gh/IvanKobzarev/173/base 2025-12-04T08:54:18.5853534Z * [new branch] gh/IvanKobzarev/173/head -> origin/gh/IvanKobzarev/173/head 2025-12-04T08:54:18.5854184Z * [new branch] gh/IvanKobzarev/173/orig -> origin/gh/IvanKobzarev/173/orig 2025-12-04T08:54:18.5854825Z * [new branch] gh/IvanKobzarev/174/base -> origin/gh/IvanKobzarev/174/base 2025-12-04T08:54:18.5855465Z * [new branch] gh/IvanKobzarev/174/head -> origin/gh/IvanKobzarev/174/head 2025-12-04T08:54:18.5856195Z * [new branch] gh/IvanKobzarev/174/orig -> origin/gh/IvanKobzarev/174/orig 2025-12-04T08:54:18.5856842Z * [new branch] gh/IvanKobzarev/175/base -> origin/gh/IvanKobzarev/175/base 2025-12-04T08:54:18.5857479Z * [new branch] gh/IvanKobzarev/175/head -> origin/gh/IvanKobzarev/175/head 2025-12-04T08:54:18.5858187Z * [new branch] gh/IvanKobzarev/175/orig -> origin/gh/IvanKobzarev/175/orig 2025-12-04T08:54:18.5858829Z * [new branch] gh/IvanKobzarev/176/base -> origin/gh/IvanKobzarev/176/base 2025-12-04T08:54:18.5859469Z * [new branch] gh/IvanKobzarev/176/head -> origin/gh/IvanKobzarev/176/head 2025-12-04T08:54:18.5860121Z * [new branch] gh/IvanKobzarev/176/orig -> origin/gh/IvanKobzarev/176/orig 2025-12-04T08:54:18.5860761Z * [new branch] gh/IvanKobzarev/177/base -> origin/gh/IvanKobzarev/177/base 2025-12-04T08:54:18.5861408Z * [new branch] gh/IvanKobzarev/177/head -> origin/gh/IvanKobzarev/177/head 2025-12-04T08:54:18.5862058Z * [new branch] gh/IvanKobzarev/177/orig -> origin/gh/IvanKobzarev/177/orig 2025-12-04T08:54:18.5862698Z * [new branch] gh/IvanKobzarev/178/base -> origin/gh/IvanKobzarev/178/base 2025-12-04T08:54:18.5863345Z * [new branch] gh/IvanKobzarev/178/head -> origin/gh/IvanKobzarev/178/head 2025-12-04T08:54:18.5863988Z * [new branch] gh/IvanKobzarev/178/orig -> origin/gh/IvanKobzarev/178/orig 2025-12-04T08:54:18.5864629Z * [new branch] gh/IvanKobzarev/179/base -> origin/gh/IvanKobzarev/179/base 2025-12-04T08:54:18.5865272Z * [new branch] gh/IvanKobzarev/179/head -> origin/gh/IvanKobzarev/179/head 2025-12-04T08:54:18.5865917Z * [new branch] gh/IvanKobzarev/179/orig -> origin/gh/IvanKobzarev/179/orig 2025-12-04T08:54:18.5866556Z * [new branch] gh/IvanKobzarev/180/base -> origin/gh/IvanKobzarev/180/base 2025-12-04T08:54:18.5867204Z * [new branch] gh/IvanKobzarev/180/head -> origin/gh/IvanKobzarev/180/head 2025-12-04T08:54:18.5867858Z * [new branch] gh/IvanKobzarev/180/orig -> origin/gh/IvanKobzarev/180/orig 2025-12-04T08:54:18.5868566Z * [new branch] gh/IvanKobzarev/181/base -> origin/gh/IvanKobzarev/181/base 2025-12-04T08:54:18.5869207Z * [new branch] gh/IvanKobzarev/181/head -> origin/gh/IvanKobzarev/181/head 2025-12-04T08:54:18.5869862Z * [new branch] gh/IvanKobzarev/181/orig -> origin/gh/IvanKobzarev/181/orig 2025-12-04T08:54:18.5870507Z * [new branch] gh/IvanKobzarev/182/base -> origin/gh/IvanKobzarev/182/base 2025-12-04T08:54:18.5871240Z * [new branch] gh/IvanKobzarev/182/head -> origin/gh/IvanKobzarev/182/head 2025-12-04T08:54:18.5871884Z * [new branch] gh/IvanKobzarev/182/orig -> origin/gh/IvanKobzarev/182/orig 2025-12-04T08:54:18.5872530Z * [new branch] gh/IvanKobzarev/183/base -> origin/gh/IvanKobzarev/183/base 2025-12-04T08:54:18.5873177Z * [new branch] gh/IvanKobzarev/183/head -> origin/gh/IvanKobzarev/183/head 2025-12-04T08:54:18.5873818Z * [new branch] gh/IvanKobzarev/183/orig -> origin/gh/IvanKobzarev/183/orig 2025-12-04T08:54:18.5874458Z * [new branch] gh/IvanKobzarev/184/base -> origin/gh/IvanKobzarev/184/base 2025-12-04T08:54:18.5875095Z * [new branch] gh/IvanKobzarev/184/head -> origin/gh/IvanKobzarev/184/head 2025-12-04T08:54:18.5875740Z * [new branch] gh/IvanKobzarev/184/orig -> origin/gh/IvanKobzarev/184/orig 2025-12-04T08:54:18.5876390Z * [new branch] gh/NikhilAPatel/1/base -> origin/gh/NikhilAPatel/1/base 2025-12-04T08:54:18.5877036Z * [new branch] gh/NikhilAPatel/1/head -> origin/gh/NikhilAPatel/1/head 2025-12-04T08:54:18.5877672Z * [new branch] gh/NikhilAPatel/2/base -> origin/gh/NikhilAPatel/2/base 2025-12-04T08:54:18.5878369Z * [new branch] gh/NikhilAPatel/2/head -> origin/gh/NikhilAPatel/2/head 2025-12-04T08:54:18.5879081Z * [new branch] gh/NikhilAPatel/4/base -> origin/gh/NikhilAPatel/4/base 2025-12-04T08:54:18.5879716Z * [new branch] gh/NikhilAPatel/4/head -> origin/gh/NikhilAPatel/4/head 2025-12-04T08:54:18.5880349Z * [new branch] gh/NikhilAPatel/5/base -> origin/gh/NikhilAPatel/5/base 2025-12-04T08:54:18.5880975Z * [new branch] gh/NikhilAPatel/5/head -> origin/gh/NikhilAPatel/5/head 2025-12-04T08:54:18.5881598Z * [new branch] gh/NikhilAPatel/5/orig -> origin/gh/NikhilAPatel/5/orig 2025-12-04T08:54:18.5882212Z * [new branch] gh/PaliC/17/base -> origin/gh/PaliC/17/base 2025-12-04T08:54:18.5882791Z * [new branch] gh/PaliC/17/head -> origin/gh/PaliC/17/head 2025-12-04T08:54:18.5883364Z * [new branch] gh/PaliC/17/orig -> origin/gh/PaliC/17/orig 2025-12-04T08:54:18.5883929Z * [new branch] gh/PaliC/18/base -> origin/gh/PaliC/18/base 2025-12-04T08:54:18.5884490Z * [new branch] gh/PaliC/18/head -> origin/gh/PaliC/18/head 2025-12-04T08:54:18.5885046Z * [new branch] gh/PaliC/18/orig -> origin/gh/PaliC/18/orig 2025-12-04T08:54:18.5885607Z * [new branch] gh/PaliC/20/base -> origin/gh/PaliC/20/base 2025-12-04T08:54:18.5886163Z * [new branch] gh/PaliC/20/head -> origin/gh/PaliC/20/head 2025-12-04T08:54:18.5886718Z * [new branch] gh/PaliC/20/orig -> origin/gh/PaliC/20/orig 2025-12-04T08:54:18.5887280Z * [new branch] gh/PaliC/21/base -> origin/gh/PaliC/21/base 2025-12-04T08:54:18.5887840Z * [new branch] gh/PaliC/21/head -> origin/gh/PaliC/21/head 2025-12-04T08:54:18.5888445Z * [new branch] gh/PaliC/21/orig -> origin/gh/PaliC/21/orig 2025-12-04T08:54:18.5889007Z * [new branch] gh/PaliC/23/base -> origin/gh/PaliC/23/base 2025-12-04T08:54:18.5889569Z * [new branch] gh/PaliC/23/head -> origin/gh/PaliC/23/head 2025-12-04T08:54:18.5890129Z * [new branch] gh/PaliC/23/orig -> origin/gh/PaliC/23/orig 2025-12-04T08:54:18.5890691Z * [new branch] gh/PaliC/24/base -> origin/gh/PaliC/24/base 2025-12-04T08:54:18.5891244Z * [new branch] gh/PaliC/24/head -> origin/gh/PaliC/24/head 2025-12-04T08:54:18.5891807Z * [new branch] gh/PaliC/24/orig -> origin/gh/PaliC/24/orig 2025-12-04T08:54:18.5892371Z * [new branch] gh/PaliC/25/head -> origin/gh/PaliC/25/head 2025-12-04T08:54:18.5893014Z * [new branch] gh/PaliC/25/next -> origin/gh/PaliC/25/next 2025-12-04T08:54:18.5893573Z * [new branch] gh/PaliC/25/orig -> origin/gh/PaliC/25/orig 2025-12-04T08:54:18.5894137Z * [new branch] gh/PaliC/26/head -> origin/gh/PaliC/26/head 2025-12-04T08:54:18.5894702Z * [new branch] gh/PaliC/26/next -> origin/gh/PaliC/26/next 2025-12-04T08:54:18.5895264Z * [new branch] gh/PaliC/26/orig -> origin/gh/PaliC/26/orig 2025-12-04T08:54:18.5895825Z * [new branch] gh/PaliC/27/next -> origin/gh/PaliC/27/next 2025-12-04T08:54:18.5896381Z * [new branch] gh/PaliC/28/head -> origin/gh/PaliC/28/head 2025-12-04T08:54:18.5896943Z * [new branch] gh/PaliC/28/next -> origin/gh/PaliC/28/next 2025-12-04T08:54:18.5897502Z * [new branch] gh/PaliC/28/orig -> origin/gh/PaliC/28/orig 2025-12-04T08:54:18.5898118Z * [new branch] gh/PaliC/29/head -> origin/gh/PaliC/29/head 2025-12-04T08:54:18.5898684Z * [new branch] gh/PaliC/29/next -> origin/gh/PaliC/29/next 2025-12-04T08:54:18.5899247Z * [new branch] gh/PaliC/29/orig -> origin/gh/PaliC/29/orig 2025-12-04T08:54:18.5899881Z * [new branch] gh/PaliC/30/head -> origin/gh/PaliC/30/head 2025-12-04T08:54:18.5900441Z * [new branch] gh/PaliC/30/next -> origin/gh/PaliC/30/next 2025-12-04T08:54:18.5900999Z * [new branch] gh/PaliC/30/orig -> origin/gh/PaliC/30/orig 2025-12-04T08:54:18.5901560Z * [new branch] gh/PaliC/31/head -> origin/gh/PaliC/31/head 2025-12-04T08:54:18.5902120Z * [new branch] gh/PaliC/31/next -> origin/gh/PaliC/31/next 2025-12-04T08:54:18.5902679Z * [new branch] gh/PaliC/31/orig -> origin/gh/PaliC/31/orig 2025-12-04T08:54:18.5903290Z * [new branch] gh/PaulZhang12/25/base -> origin/gh/PaulZhang12/25/base 2025-12-04T08:54:18.5903922Z * [new branch] gh/PaulZhang12/25/head -> origin/gh/PaulZhang12/25/head 2025-12-04T08:54:18.5904540Z * [new branch] gh/PaulZhang12/25/orig -> origin/gh/PaulZhang12/25/orig 2025-12-04T08:54:18.5905171Z * [new branch] gh/PaulZhang12/28/base -> origin/gh/PaulZhang12/28/base 2025-12-04T08:54:18.5905788Z * [new branch] gh/PaulZhang12/28/head -> origin/gh/PaulZhang12/28/head 2025-12-04T08:54:18.5906402Z * [new branch] gh/PaulZhang12/28/orig -> origin/gh/PaulZhang12/28/orig 2025-12-04T08:54:18.5907020Z * [new branch] gh/PaulZhang12/31/base -> origin/gh/PaulZhang12/31/base 2025-12-04T08:54:18.5907633Z * [new branch] gh/PaulZhang12/31/head -> origin/gh/PaulZhang12/31/head 2025-12-04T08:54:18.5908300Z * [new branch] gh/PaulZhang12/31/orig -> origin/gh/PaulZhang12/31/orig 2025-12-04T08:54:18.5908930Z * [new branch] gh/PaulZhang12/37/base -> origin/gh/PaulZhang12/37/base 2025-12-04T08:54:18.5909548Z * [new branch] gh/PaulZhang12/37/head -> origin/gh/PaulZhang12/37/head 2025-12-04T08:54:18.5910163Z * [new branch] gh/PaulZhang12/37/orig -> origin/gh/PaulZhang12/37/orig 2025-12-04T08:54:18.5910783Z * [new branch] gh/PaulZhang12/40/base -> origin/gh/PaulZhang12/40/base 2025-12-04T08:54:18.5911401Z * [new branch] gh/PaulZhang12/40/head -> origin/gh/PaulZhang12/40/head 2025-12-04T08:54:18.5912014Z * [new branch] gh/PaulZhang12/40/orig -> origin/gh/PaulZhang12/40/orig 2025-12-04T08:54:18.5912631Z * [new branch] gh/PaulZhang12/42/base -> origin/gh/PaulZhang12/42/base 2025-12-04T08:54:18.5913244Z * [new branch] gh/PaulZhang12/42/head -> origin/gh/PaulZhang12/42/head 2025-12-04T08:54:18.5913858Z * [new branch] gh/PaulZhang12/43/base -> origin/gh/PaulZhang12/43/base 2025-12-04T08:54:18.5914556Z * [new branch] gh/PaulZhang12/43/head -> origin/gh/PaulZhang12/43/head 2025-12-04T08:54:18.5915178Z * [new branch] gh/PaulZhang12/43/orig -> origin/gh/PaulZhang12/43/orig 2025-12-04T08:54:18.5915794Z * [new branch] gh/PaulZhang12/44/base -> origin/gh/PaulZhang12/44/base 2025-12-04T08:54:18.5916411Z * [new branch] gh/PaulZhang12/44/head -> origin/gh/PaulZhang12/44/head 2025-12-04T08:54:18.5917025Z * [new branch] gh/PaulZhang12/45/base -> origin/gh/PaulZhang12/45/base 2025-12-04T08:54:18.5917641Z * [new branch] gh/PaulZhang12/45/head -> origin/gh/PaulZhang12/45/head 2025-12-04T08:54:18.5918308Z * [new branch] gh/PaulZhang12/45/orig -> origin/gh/PaulZhang12/45/orig 2025-12-04T08:54:18.5918922Z * [new branch] gh/PaulZhang12/46/base -> origin/gh/PaulZhang12/46/base 2025-12-04T08:54:18.5919541Z * [new branch] gh/PaulZhang12/46/head -> origin/gh/PaulZhang12/46/head 2025-12-04T08:54:18.5920156Z * [new branch] gh/PaulZhang12/46/orig -> origin/gh/PaulZhang12/46/orig 2025-12-04T08:54:18.5920769Z * [new branch] gh/PaulZhang12/47/base -> origin/gh/PaulZhang12/47/base 2025-12-04T08:54:18.5921384Z * [new branch] gh/PaulZhang12/47/head -> origin/gh/PaulZhang12/47/head 2025-12-04T08:54:18.5922101Z * [new branch] gh/PaulZhang12/47/orig -> origin/gh/PaulZhang12/47/orig 2025-12-04T08:54:18.5922717Z * [new branch] gh/PaulZhang12/48/base -> origin/gh/PaulZhang12/48/base 2025-12-04T08:54:18.5923332Z * [new branch] gh/PaulZhang12/48/head -> origin/gh/PaulZhang12/48/head 2025-12-04T08:54:18.5923946Z * [new branch] gh/PaulZhang12/48/orig -> origin/gh/PaulZhang12/48/orig 2025-12-04T08:54:18.5924564Z * [new branch] gh/SamGinzburg/11/base -> origin/gh/SamGinzburg/11/base 2025-12-04T08:54:18.5925190Z * [new branch] gh/SamGinzburg/11/head -> origin/gh/SamGinzburg/11/head 2025-12-04T08:54:18.5925829Z * [new branch] gh/SherlockNoMad/1/base -> origin/gh/SherlockNoMad/1/base 2025-12-04T08:54:18.5926470Z * [new branch] gh/SherlockNoMad/1/head -> origin/gh/SherlockNoMad/1/head 2025-12-04T08:54:18.5927126Z * [new branch] gh/SherlockNoMad/10/base -> origin/gh/SherlockNoMad/10/base 2025-12-04T08:54:18.5927791Z * [new branch] gh/SherlockNoMad/10/head -> origin/gh/SherlockNoMad/10/head 2025-12-04T08:54:18.5928497Z * [new branch] gh/SherlockNoMad/10/orig -> origin/gh/SherlockNoMad/10/orig 2025-12-04T08:54:18.5929143Z * [new branch] gh/SherlockNoMad/11/base -> origin/gh/SherlockNoMad/11/base 2025-12-04T08:54:18.5929789Z * [new branch] gh/SherlockNoMad/11/head -> origin/gh/SherlockNoMad/11/head 2025-12-04T08:54:18.5930438Z * [new branch] gh/SherlockNoMad/11/orig -> origin/gh/SherlockNoMad/11/orig 2025-12-04T08:54:18.5931089Z * [new branch] gh/SherlockNoMad/12/base -> origin/gh/SherlockNoMad/12/base 2025-12-04T08:54:18.5931737Z * [new branch] gh/SherlockNoMad/12/head -> origin/gh/SherlockNoMad/12/head 2025-12-04T08:54:18.5932386Z * [new branch] gh/SherlockNoMad/12/orig -> origin/gh/SherlockNoMad/12/orig 2025-12-04T08:54:18.5933040Z * [new branch] gh/SherlockNoMad/15/base -> origin/gh/SherlockNoMad/15/base 2025-12-04T08:54:18.5933687Z * [new branch] gh/SherlockNoMad/15/head -> origin/gh/SherlockNoMad/15/head 2025-12-04T08:54:18.5934335Z * [new branch] gh/SherlockNoMad/15/orig -> origin/gh/SherlockNoMad/15/orig 2025-12-04T08:54:18.5934983Z * [new branch] gh/SherlockNoMad/17/base -> origin/gh/SherlockNoMad/17/base 2025-12-04T08:54:18.5935627Z * [new branch] gh/SherlockNoMad/17/head -> origin/gh/SherlockNoMad/17/head 2025-12-04T08:54:18.5936358Z * [new branch] gh/SherlockNoMad/17/orig -> origin/gh/SherlockNoMad/17/orig 2025-12-04T08:54:18.5937006Z * [new branch] gh/SherlockNoMad/18/base -> origin/gh/SherlockNoMad/18/base 2025-12-04T08:54:18.5937647Z * [new branch] gh/SherlockNoMad/18/head -> origin/gh/SherlockNoMad/18/head 2025-12-04T08:54:18.5938342Z * [new branch] gh/SherlockNoMad/18/orig -> origin/gh/SherlockNoMad/18/orig 2025-12-04T08:54:18.5938992Z * [new branch] gh/SherlockNoMad/19/base -> origin/gh/SherlockNoMad/19/base 2025-12-04T08:54:18.5939635Z * [new branch] gh/SherlockNoMad/19/head -> origin/gh/SherlockNoMad/19/head 2025-12-04T08:54:18.5940285Z * [new branch] gh/SherlockNoMad/19/orig -> origin/gh/SherlockNoMad/19/orig 2025-12-04T08:54:18.5940928Z * [new branch] gh/SherlockNoMad/2/base -> origin/gh/SherlockNoMad/2/base 2025-12-04T08:54:18.5941566Z * [new branch] gh/SherlockNoMad/2/head -> origin/gh/SherlockNoMad/2/head 2025-12-04T08:54:18.5942223Z * [new branch] gh/SherlockNoMad/20/base -> origin/gh/SherlockNoMad/20/base 2025-12-04T08:54:18.5942868Z * [new branch] gh/SherlockNoMad/20/head -> origin/gh/SherlockNoMad/20/head 2025-12-04T08:54:18.5943512Z * [new branch] gh/SherlockNoMad/20/orig -> origin/gh/SherlockNoMad/20/orig 2025-12-04T08:54:18.5944259Z * [new branch] gh/SherlockNoMad/21/base -> origin/gh/SherlockNoMad/21/base 2025-12-04T08:54:18.5944906Z * [new branch] gh/SherlockNoMad/21/head -> origin/gh/SherlockNoMad/21/head 2025-12-04T08:54:18.5978385Z * [new branch] gh/SherlockNoMad/21/orig -> origin/gh/SherlockNoMad/21/orig 2025-12-04T08:54:18.5978844Z * [new branch] gh/SherlockNoMad/3/base -> origin/gh/SherlockNoMad/3/base 2025-12-04T08:54:18.5979274Z * [new branch] gh/SherlockNoMad/3/head -> origin/gh/SherlockNoMad/3/head 2025-12-04T08:54:18.5979685Z * [new branch] gh/SherlockNoMad/4/base -> origin/gh/SherlockNoMad/4/base 2025-12-04T08:54:18.5980090Z * [new branch] gh/SherlockNoMad/4/head -> origin/gh/SherlockNoMad/4/head 2025-12-04T08:54:18.5980498Z * [new branch] gh/SherlockNoMad/5/base -> origin/gh/SherlockNoMad/5/base 2025-12-04T08:54:18.5980898Z * [new branch] gh/SherlockNoMad/5/head -> origin/gh/SherlockNoMad/5/head 2025-12-04T08:54:18.5981338Z * [new branch] gh/Sidharth123-cpu/24/base -> origin/gh/Sidharth123-cpu/24/base 2025-12-04T08:54:18.5981792Z * [new branch] gh/Sidharth123-cpu/25/base -> origin/gh/Sidharth123-cpu/25/base 2025-12-04T08:54:18.5982230Z * [new branch] gh/Sidharth123-cpu/26/base -> origin/gh/Sidharth123-cpu/26/base 2025-12-04T08:54:18.5982668Z * [new branch] gh/Sidharth123-cpu/27/base -> origin/gh/Sidharth123-cpu/27/base 2025-12-04T08:54:18.5983095Z * [new branch] gh/StrongerXi/1/base -> origin/gh/StrongerXi/1/base 2025-12-04T08:54:18.5983509Z * [new branch] gh/StrongerXi/1/head -> origin/gh/StrongerXi/1/head 2025-12-04T08:54:18.5983919Z * [new branch] gh/StrongerXi/71/base -> origin/gh/StrongerXi/71/base 2025-12-04T08:54:18.5984322Z * [new branch] gh/StrongerXi/71/head -> origin/gh/StrongerXi/71/head 2025-12-04T08:54:18.5984718Z * [new branch] gh/StrongerXi/72/base -> origin/gh/StrongerXi/72/base 2025-12-04T08:54:18.5985110Z * [new branch] gh/StrongerXi/72/head -> origin/gh/StrongerXi/72/head 2025-12-04T08:54:18.5985497Z * [new branch] gh/StrongerXi/73/base -> origin/gh/StrongerXi/73/base 2025-12-04T08:54:18.5985883Z * [new branch] gh/StrongerXi/73/head -> origin/gh/StrongerXi/73/head 2025-12-04T08:54:18.5986270Z * [new branch] gh/StrongerXi/73/orig -> origin/gh/StrongerXi/73/orig 2025-12-04T08:54:18.5986656Z * [new branch] gh/XilunWu/160/base -> origin/gh/XilunWu/160/base 2025-12-04T08:54:18.5987168Z * [new branch] gh/XilunWu/160/head -> origin/gh/XilunWu/160/head 2025-12-04T08:54:18.5987536Z * [new branch] gh/XilunWu/160/orig -> origin/gh/XilunWu/160/orig 2025-12-04T08:54:18.5987905Z * [new branch] gh/XilunWu/163/base -> origin/gh/XilunWu/163/base 2025-12-04T08:54:18.5988320Z * [new branch] gh/XilunWu/163/head -> origin/gh/XilunWu/163/head 2025-12-04T08:54:18.5988685Z * [new branch] gh/XilunWu/163/orig -> origin/gh/XilunWu/163/orig 2025-12-04T08:54:18.5989053Z * [new branch] gh/XilunWu/168/base -> origin/gh/XilunWu/168/base 2025-12-04T08:54:18.5989423Z * [new branch] gh/XilunWu/168/head -> origin/gh/XilunWu/168/head 2025-12-04T08:54:18.5989788Z * [new branch] gh/XilunWu/168/orig -> origin/gh/XilunWu/168/orig 2025-12-04T08:54:18.5990157Z * [new branch] gh/XilunWu/169/base -> origin/gh/XilunWu/169/base 2025-12-04T08:54:18.5990526Z * [new branch] gh/XilunWu/169/head -> origin/gh/XilunWu/169/head 2025-12-04T08:54:18.5990893Z * [new branch] gh/XilunWu/169/orig -> origin/gh/XilunWu/169/orig 2025-12-04T08:54:18.5991259Z * [new branch] gh/XilunWu/170/base -> origin/gh/XilunWu/170/base 2025-12-04T08:54:18.5991679Z * [new branch] gh/XilunWu/170/head -> origin/gh/XilunWu/170/head 2025-12-04T08:54:18.5992051Z * [new branch] gh/XilunWu/170/orig -> origin/gh/XilunWu/170/orig 2025-12-04T08:54:18.5992417Z * [new branch] gh/XilunWu/171/base -> origin/gh/XilunWu/171/base 2025-12-04T08:54:18.5992782Z * [new branch] gh/XilunWu/171/head -> origin/gh/XilunWu/171/head 2025-12-04T08:54:18.5993148Z * [new branch] gh/XilunWu/171/orig -> origin/gh/XilunWu/171/orig 2025-12-04T08:54:18.5993513Z * [new branch] gh/XilunWu/173/base -> origin/gh/XilunWu/173/base 2025-12-04T08:54:18.5993881Z * [new branch] gh/XilunWu/173/head -> origin/gh/XilunWu/173/head 2025-12-04T08:54:18.5994246Z * [new branch] gh/XilunWu/173/orig -> origin/gh/XilunWu/173/orig 2025-12-04T08:54:18.5994615Z * [new branch] gh/XilunWu/175/base -> origin/gh/XilunWu/175/base 2025-12-04T08:54:18.5994986Z * [new branch] gh/XilunWu/175/head -> origin/gh/XilunWu/175/head 2025-12-04T08:54:18.5995354Z * [new branch] gh/XilunWu/175/orig -> origin/gh/XilunWu/175/orig 2025-12-04T08:54:18.5995718Z * [new branch] gh/XilunWu/176/base -> origin/gh/XilunWu/176/base 2025-12-04T08:54:18.5996082Z * [new branch] gh/XilunWu/176/head -> origin/gh/XilunWu/176/head 2025-12-04T08:54:18.5996450Z * [new branch] gh/XilunWu/176/orig -> origin/gh/XilunWu/176/orig 2025-12-04T08:54:18.5996830Z * [new branch] gh/XuehaiPan/14/base -> origin/gh/XuehaiPan/14/base 2025-12-04T08:54:18.5997220Z * [new branch] gh/XuehaiPan/14/head -> origin/gh/XuehaiPan/14/head 2025-12-04T08:54:18.5997604Z * [new branch] gh/XuehaiPan/14/orig -> origin/gh/XuehaiPan/14/orig 2025-12-04T08:54:18.5997990Z * [new branch] gh/XuehaiPan/179/base -> origin/gh/XuehaiPan/179/base 2025-12-04T08:54:18.5998448Z * [new branch] gh/XuehaiPan/179/head -> origin/gh/XuehaiPan/179/head 2025-12-04T08:54:18.5998839Z * [new branch] gh/XuehaiPan/179/orig -> origin/gh/XuehaiPan/179/orig 2025-12-04T08:54:18.5999224Z * [new branch] gh/XuehaiPan/249/base -> origin/gh/XuehaiPan/249/base 2025-12-04T08:54:18.5999610Z * [new branch] gh/XuehaiPan/249/head -> origin/gh/XuehaiPan/249/head 2025-12-04T08:54:18.5999996Z * [new branch] gh/XuehaiPan/249/orig -> origin/gh/XuehaiPan/249/orig 2025-12-04T08:54:18.6000449Z * [new branch] gh/XuehaiPan/253/base -> origin/gh/XuehaiPan/253/base 2025-12-04T08:54:18.6000834Z * [new branch] gh/XuehaiPan/253/head -> origin/gh/XuehaiPan/253/head 2025-12-04T08:54:18.6001221Z * [new branch] gh/XuehaiPan/253/orig -> origin/gh/XuehaiPan/253/orig 2025-12-04T08:54:18.6001603Z * [new branch] gh/XuehaiPan/254/base -> origin/gh/XuehaiPan/254/base 2025-12-04T08:54:18.6001992Z * [new branch] gh/XuehaiPan/254/head -> origin/gh/XuehaiPan/254/head 2025-12-04T08:54:18.6002375Z * [new branch] gh/XuehaiPan/254/orig -> origin/gh/XuehaiPan/254/orig 2025-12-04T08:54:18.6002758Z * [new branch] gh/XuehaiPan/255/base -> origin/gh/XuehaiPan/255/base 2025-12-04T08:54:18.6003140Z * [new branch] gh/XuehaiPan/255/head -> origin/gh/XuehaiPan/255/head 2025-12-04T08:54:18.6003525Z * [new branch] gh/XuehaiPan/255/orig -> origin/gh/XuehaiPan/255/orig 2025-12-04T08:54:18.6003912Z * [new branch] gh/XuehaiPan/271/base -> origin/gh/XuehaiPan/271/base 2025-12-04T08:54:18.6004297Z * [new branch] gh/XuehaiPan/271/head -> origin/gh/XuehaiPan/271/head 2025-12-04T08:54:18.6004680Z * [new branch] gh/XuehaiPan/271/orig -> origin/gh/XuehaiPan/271/orig 2025-12-04T08:54:18.6005065Z * [new branch] gh/XuehaiPan/343/base -> origin/gh/XuehaiPan/343/base 2025-12-04T08:54:18.6005506Z * [new branch] gh/XuehaiPan/343/head -> origin/gh/XuehaiPan/343/head 2025-12-04T08:54:18.6005899Z * [new branch] gh/XuehaiPan/343/orig -> origin/gh/XuehaiPan/343/orig 2025-12-04T08:54:18.6006282Z * [new branch] gh/XuehaiPan/347/base -> origin/gh/XuehaiPan/347/base 2025-12-04T08:54:18.6006668Z * [new branch] gh/XuehaiPan/347/head -> origin/gh/XuehaiPan/347/head 2025-12-04T08:54:18.6007056Z * [new branch] gh/XuehaiPan/347/orig -> origin/gh/XuehaiPan/347/orig 2025-12-04T08:54:18.6007442Z * [new branch] gh/XuehaiPan/348/base -> origin/gh/XuehaiPan/348/base 2025-12-04T08:54:18.6007828Z * [new branch] gh/XuehaiPan/348/head -> origin/gh/XuehaiPan/348/head 2025-12-04T08:54:18.6008258Z * [new branch] gh/XuehaiPan/348/orig -> origin/gh/XuehaiPan/348/orig 2025-12-04T08:54:18.6008649Z * [new branch] gh/XuehaiPan/350/base -> origin/gh/XuehaiPan/350/base 2025-12-04T08:54:18.6009033Z * [new branch] gh/XuehaiPan/350/head -> origin/gh/XuehaiPan/350/head 2025-12-04T08:54:18.6009418Z * [new branch] gh/XuehaiPan/350/orig -> origin/gh/XuehaiPan/350/orig 2025-12-04T08:54:18.6009801Z * [new branch] gh/XuehaiPan/365/base -> origin/gh/XuehaiPan/365/base 2025-12-04T08:54:18.6010185Z * [new branch] gh/XuehaiPan/365/head -> origin/gh/XuehaiPan/365/head 2025-12-04T08:54:18.6010566Z * [new branch] gh/XuehaiPan/365/orig -> origin/gh/XuehaiPan/365/orig 2025-12-04T08:54:18.6010955Z * [new branch] gh/XuehaiPan/366/base -> origin/gh/XuehaiPan/366/base 2025-12-04T08:54:18.6011340Z * [new branch] gh/XuehaiPan/366/head -> origin/gh/XuehaiPan/366/head 2025-12-04T08:54:18.6011722Z * [new branch] gh/XuehaiPan/370/base -> origin/gh/XuehaiPan/370/base 2025-12-04T08:54:18.6012109Z * [new branch] gh/XuehaiPan/370/head -> origin/gh/XuehaiPan/370/head 2025-12-04T08:54:18.6012496Z * [new branch] gh/XuehaiPan/370/orig -> origin/gh/XuehaiPan/370/orig 2025-12-04T08:54:18.6012876Z * [new branch] gh/XuehaiPan/390/base -> origin/gh/XuehaiPan/390/base 2025-12-04T08:54:18.6013261Z * [new branch] gh/XuehaiPan/390/head -> origin/gh/XuehaiPan/390/head 2025-12-04T08:54:18.6013646Z * [new branch] gh/XuehaiPan/390/orig -> origin/gh/XuehaiPan/390/orig 2025-12-04T08:54:18.6014029Z * [new branch] gh/XuehaiPan/391/base -> origin/gh/XuehaiPan/391/base 2025-12-04T08:54:18.6014476Z * [new branch] gh/XuehaiPan/391/head -> origin/gh/XuehaiPan/391/head 2025-12-04T08:54:18.6014860Z * [new branch] gh/XuehaiPan/391/orig -> origin/gh/XuehaiPan/391/orig 2025-12-04T08:54:18.6015243Z * [new branch] gh/XuehaiPan/392/base -> origin/gh/XuehaiPan/392/base 2025-12-04T08:54:18.6015632Z * [new branch] gh/XuehaiPan/392/head -> origin/gh/XuehaiPan/392/head 2025-12-04T08:54:18.6016013Z * [new branch] gh/XuehaiPan/392/orig -> origin/gh/XuehaiPan/392/orig 2025-12-04T08:54:18.6016404Z * [new branch] gh/XuehaiPan/394/base -> origin/gh/XuehaiPan/394/base 2025-12-04T08:54:18.6016788Z * [new branch] gh/XuehaiPan/394/head -> origin/gh/XuehaiPan/394/head 2025-12-04T08:54:18.6017170Z * [new branch] gh/XuehaiPan/394/orig -> origin/gh/XuehaiPan/394/orig 2025-12-04T08:54:18.6017556Z * [new branch] gh/XuehaiPan/397/base -> origin/gh/XuehaiPan/397/base 2025-12-04T08:54:18.6017945Z * [new branch] gh/XuehaiPan/397/head -> origin/gh/XuehaiPan/397/head 2025-12-04T08:54:18.6018368Z * [new branch] gh/XuehaiPan/397/orig -> origin/gh/XuehaiPan/397/orig 2025-12-04T08:54:18.6018755Z * [new branch] gh/XuehaiPan/398/base -> origin/gh/XuehaiPan/398/base 2025-12-04T08:54:18.6019198Z * [new branch] gh/XuehaiPan/398/head -> origin/gh/XuehaiPan/398/head 2025-12-04T08:54:18.6019580Z * [new branch] gh/XuehaiPan/398/orig -> origin/gh/XuehaiPan/398/orig 2025-12-04T08:54:18.6019963Z * [new branch] gh/XuehaiPan/399/base -> origin/gh/XuehaiPan/399/base 2025-12-04T08:54:18.6020353Z * [new branch] gh/XuehaiPan/399/head -> origin/gh/XuehaiPan/399/head 2025-12-04T08:54:18.6020735Z * [new branch] gh/XuehaiPan/399/orig -> origin/gh/XuehaiPan/399/orig 2025-12-04T08:54:18.6021122Z * [new branch] gh/XuehaiPan/400/base -> origin/gh/XuehaiPan/400/base 2025-12-04T08:54:18.6021508Z * [new branch] gh/XuehaiPan/400/head -> origin/gh/XuehaiPan/400/head 2025-12-04T08:54:18.6021888Z * [new branch] gh/XuehaiPan/400/orig -> origin/gh/XuehaiPan/400/orig 2025-12-04T08:54:18.6022289Z * [new branch] gh/ZhiweiYan-96/39/base -> origin/gh/ZhiweiYan-96/39/base 2025-12-04T08:54:18.6022695Z * [new branch] gh/ZhiweiYan-96/39/head -> origin/gh/ZhiweiYan-96/39/head 2025-12-04T08:54:18.6023086Z * [new branch] gh/ZhiweiYan-96/39/orig -> origin/gh/ZhiweiYan-96/39/orig 2025-12-04T08:54:18.6023478Z * [new branch] gh/ZhiweiYan-96/44/base -> origin/gh/ZhiweiYan-96/44/base 2025-12-04T08:54:18.6023875Z * [new branch] gh/ZhiweiYan-96/44/head -> origin/gh/ZhiweiYan-96/44/head 2025-12-04T08:54:18.6024265Z * [new branch] gh/ZhiweiYan-96/45/base -> origin/gh/ZhiweiYan-96/45/base 2025-12-04T08:54:18.6024658Z * [new branch] gh/ZhiweiYan-96/45/head -> origin/gh/ZhiweiYan-96/45/head 2025-12-04T08:54:18.6025050Z * [new branch] gh/ZhiweiYan-96/49/base -> origin/gh/ZhiweiYan-96/49/base 2025-12-04T08:54:18.6025435Z * [new branch] gh/ZhiweiYan-96/49/head -> origin/gh/ZhiweiYan-96/49/head 2025-12-04T08:54:18.6025825Z * [new branch] gh/ZhiweiYan-96/62/base -> origin/gh/ZhiweiYan-96/62/base 2025-12-04T08:54:18.6026210Z * [new branch] gh/ZhiweiYan-96/62/head -> origin/gh/ZhiweiYan-96/62/head 2025-12-04T08:54:18.6026601Z * [new branch] gh/ZhiweiYan-96/66/base -> origin/gh/ZhiweiYan-96/66/base 2025-12-04T08:54:18.6026990Z * [new branch] gh/ZhiweiYan-96/66/head -> origin/gh/ZhiweiYan-96/66/head 2025-12-04T08:54:18.6027376Z * [new branch] gh/ZhiweiYan-96/67/base -> origin/gh/ZhiweiYan-96/67/base 2025-12-04T08:54:18.6027764Z * [new branch] gh/ZhiweiYan-96/67/head -> origin/gh/ZhiweiYan-96/67/head 2025-12-04T08:54:18.6028256Z * [new branch] gh/ZhiweiYan-96/68/base -> origin/gh/ZhiweiYan-96/68/base 2025-12-04T08:54:18.6028647Z * [new branch] gh/ZhiweiYan-96/68/head -> origin/gh/ZhiweiYan-96/68/head 2025-12-04T08:54:18.6029039Z * [new branch] gh/ZhiweiYan-96/68/orig -> origin/gh/ZhiweiYan-96/68/orig 2025-12-04T08:54:18.6029427Z * [new branch] gh/aakhundov/1/base -> origin/gh/aakhundov/1/base 2025-12-04T08:54:18.6029805Z * [new branch] gh/aakhundov/1/head -> origin/gh/aakhundov/1/head 2025-12-04T08:54:18.6030181Z * [new branch] gh/aakhundov/2/base -> origin/gh/aakhundov/2/base 2025-12-04T08:54:18.6030557Z * [new branch] gh/aakhundov/2/head -> origin/gh/aakhundov/2/head 2025-12-04T08:54:18.6030940Z * [new branch] gh/aditew01/openblas -> origin/gh/aditew01/openblas 2025-12-04T08:54:18.6031330Z * [new branch] gh/aditew01/sbgemm -> origin/gh/aditew01/sbgemm 2025-12-04T08:54:18.6031713Z * [new branch] gh/aditew01/vecbf16 -> origin/gh/aditew01/vecbf16 2025-12-04T08:54:18.6032086Z * [new branch] gh/albanD/4/base -> origin/gh/albanD/4/base 2025-12-04T08:54:18.6032452Z * [new branch] gh/albanD/4/head -> origin/gh/albanD/4/head 2025-12-04T08:54:18.6032878Z * [new branch] gh/albanD/4/orig -> origin/gh/albanD/4/orig 2025-12-04T08:54:18.6033424Z * [new branch] gh/alexbrauckmann/paddedtensor_faketensor_init -> origin/gh/alexbrauckmann/paddedtensor_faketensor_init 2025-12-04T08:54:18.6033985Z * [new branch] gh/alexsamardzic/12/base -> origin/gh/alexsamardzic/12/base 2025-12-04T08:54:18.6034409Z * [new branch] gh/alexsamardzic/12/head -> origin/gh/alexsamardzic/12/head 2025-12-04T08:54:18.6034818Z * [new branch] gh/alexsamardzic/12/orig -> origin/gh/alexsamardzic/12/orig 2025-12-04T08:54:18.6035233Z * [new branch] gh/alexsamardzic/14/base -> origin/gh/alexsamardzic/14/base 2025-12-04T08:54:18.6035636Z * [new branch] gh/alexsamardzic/14/head -> origin/gh/alexsamardzic/14/head 2025-12-04T08:54:18.6036046Z * [new branch] gh/alexsamardzic/14/orig -> origin/gh/alexsamardzic/14/orig 2025-12-04T08:54:18.6036457Z * [new branch] gh/alexsamardzic/15/base -> origin/gh/alexsamardzic/15/base 2025-12-04T08:54:18.6036860Z * [new branch] gh/alexsamardzic/15/head -> origin/gh/alexsamardzic/15/head 2025-12-04T08:54:18.6037268Z * [new branch] gh/alexsamardzic/15/orig -> origin/gh/alexsamardzic/15/orig 2025-12-04T08:54:18.6037663Z * [new branch] gh/amjames/18/base -> origin/gh/amjames/18/base 2025-12-04T08:54:18.6038037Z * [new branch] gh/amjames/18/head -> origin/gh/amjames/18/head 2025-12-04T08:54:18.6038463Z * [new branch] gh/amjames/18/orig -> origin/gh/amjames/18/orig 2025-12-04T08:54:18.6038850Z * [new branch] gh/andrewor14/35/base -> origin/gh/andrewor14/35/base 2025-12-04T08:54:18.6039239Z * [new branch] gh/andrewor14/35/head -> origin/gh/andrewor14/35/head 2025-12-04T08:54:18.6039623Z * [new branch] gh/andrewor14/35/orig -> origin/gh/andrewor14/35/orig 2025-12-04T08:54:18.6040012Z * [new branch] gh/andrewor14/50/base -> origin/gh/andrewor14/50/base 2025-12-04T08:54:18.6040390Z * [new branch] gh/andrewor14/50/head -> origin/gh/andrewor14/50/head 2025-12-04T08:54:18.6040770Z * [new branch] gh/andrewor14/50/orig -> origin/gh/andrewor14/50/orig 2025-12-04T08:54:18.6041151Z * [new branch] gh/andyanwang/30/base -> origin/gh/andyanwang/30/base 2025-12-04T08:54:18.6041532Z * [new branch] gh/andyanwang/30/orig -> origin/gh/andyanwang/30/orig 2025-12-04T08:54:18.6041914Z * [new branch] gh/andyanwang/31/base -> origin/gh/andyanwang/31/base 2025-12-04T08:54:18.6042354Z * [new branch] gh/andyanwang/31/orig -> origin/gh/andyanwang/31/orig 2025-12-04T08:54:18.6042733Z * [new branch] gh/andyanwang/39/base -> origin/gh/andyanwang/39/base 2025-12-04T08:54:18.6043120Z * [new branch] gh/andyanwang/39/head -> origin/gh/andyanwang/39/head 2025-12-04T08:54:18.6043507Z * [new branch] gh/andyanwang/39/orig -> origin/gh/andyanwang/39/orig 2025-12-04T08:54:18.6043891Z * [new branch] gh/andyanwang/42/base -> origin/gh/andyanwang/42/base 2025-12-04T08:54:18.6044274Z * [new branch] gh/andyanwang/42/head -> origin/gh/andyanwang/42/head 2025-12-04T08:54:18.6044661Z * [new branch] gh/andyanwang/42/orig -> origin/gh/andyanwang/42/orig 2025-12-04T08:54:18.6045041Z * [new branch] gh/andyanwang/45/base -> origin/gh/andyanwang/45/base 2025-12-04T08:54:18.6045427Z * [new branch] gh/andyanwang/45/head -> origin/gh/andyanwang/45/head 2025-12-04T08:54:18.6045807Z * [new branch] gh/andyanwang/45/orig -> origin/gh/andyanwang/45/orig 2025-12-04T08:54:18.6046192Z * [new branch] gh/angelayi/107/base -> origin/gh/angelayi/107/base 2025-12-04T08:54:18.6046572Z * [new branch] gh/angelayi/107/head -> origin/gh/angelayi/107/head 2025-12-04T08:54:18.6047011Z * [new branch] gh/angelayi/114/base -> origin/gh/angelayi/114/base 2025-12-04T08:54:18.6047392Z * [new branch] gh/angelayi/114/head -> origin/gh/angelayi/114/head 2025-12-04T08:54:18.6047765Z * [new branch] gh/angelayi/114/orig -> origin/gh/angelayi/114/orig 2025-12-04T08:54:18.6048182Z * [new branch] gh/angelayi/116/base -> origin/gh/angelayi/116/base 2025-12-04T08:54:18.6048561Z * [new branch] gh/angelayi/116/head -> origin/gh/angelayi/116/head 2025-12-04T08:54:18.6048937Z * [new branch] gh/angelayi/116/orig -> origin/gh/angelayi/116/orig 2025-12-04T08:54:18.6049309Z * [new branch] gh/angelayi/122/base -> origin/gh/angelayi/122/base 2025-12-04T08:54:18.6049684Z * [new branch] gh/angelayi/122/head -> origin/gh/angelayi/122/head 2025-12-04T08:54:18.6050060Z * [new branch] gh/angelayi/122/orig -> origin/gh/angelayi/122/orig 2025-12-04T08:54:18.6050436Z * [new branch] gh/angelayi/124/base -> origin/gh/angelayi/124/base 2025-12-04T08:54:18.6050812Z * [new branch] gh/angelayi/124/head -> origin/gh/angelayi/124/head 2025-12-04T08:54:18.6051185Z * [new branch] gh/angelayi/124/orig -> origin/gh/angelayi/124/orig 2025-12-04T08:54:18.6051556Z * [new branch] gh/angelayi/128/base -> origin/gh/angelayi/128/base 2025-12-04T08:54:18.6051932Z * [new branch] gh/angelayi/128/head -> origin/gh/angelayi/128/head 2025-12-04T08:54:18.6052311Z * [new branch] gh/angelayi/128/orig -> origin/gh/angelayi/128/orig 2025-12-04T08:54:18.6052680Z * [new branch] gh/angelayi/131/base -> origin/gh/angelayi/131/base 2025-12-04T08:54:18.6053055Z * [new branch] gh/angelayi/131/head -> origin/gh/angelayi/131/head 2025-12-04T08:54:18.6053431Z * [new branch] gh/angelayi/131/orig -> origin/gh/angelayi/131/orig 2025-12-04T08:54:18.6053804Z * [new branch] gh/angelayi/132/base -> origin/gh/angelayi/132/base 2025-12-04T08:54:18.6054178Z * [new branch] gh/angelayi/132/head -> origin/gh/angelayi/132/head 2025-12-04T08:54:18.6054552Z * [new branch] gh/angelayi/132/orig -> origin/gh/angelayi/132/orig 2025-12-04T08:54:18.6054925Z * [new branch] gh/angelayi/133/base -> origin/gh/angelayi/133/base 2025-12-04T08:54:18.6055299Z * [new branch] gh/angelayi/133/head -> origin/gh/angelayi/133/head 2025-12-04T08:54:18.6055728Z * [new branch] gh/angelayi/133/orig -> origin/gh/angelayi/133/orig 2025-12-04T08:54:18.6056104Z * [new branch] gh/angelayi/134/base -> origin/gh/angelayi/134/base 2025-12-04T08:54:18.6056479Z * [new branch] gh/angelayi/134/head -> origin/gh/angelayi/134/head 2025-12-04T08:54:18.6056855Z * [new branch] gh/angelayi/134/orig -> origin/gh/angelayi/134/orig 2025-12-04T08:54:18.6057233Z * [new branch] gh/angelayi/135/base -> origin/gh/angelayi/135/base 2025-12-04T08:54:18.6057609Z * [new branch] gh/angelayi/135/head -> origin/gh/angelayi/135/head 2025-12-04T08:54:18.6057984Z * [new branch] gh/angelayi/135/orig -> origin/gh/angelayi/135/orig 2025-12-04T08:54:18.6058410Z * [new branch] gh/angelayi/136/base -> origin/gh/angelayi/136/base 2025-12-04T08:54:18.6058790Z * [new branch] gh/angelayi/136/head -> origin/gh/angelayi/136/head 2025-12-04T08:54:18.6059171Z * [new branch] gh/angelayi/136/orig -> origin/gh/angelayi/136/orig 2025-12-04T08:54:18.6059547Z * [new branch] gh/angelayi/137/base -> origin/gh/angelayi/137/base 2025-12-04T08:54:18.6059923Z * [new branch] gh/angelayi/137/head -> origin/gh/angelayi/137/head 2025-12-04T08:54:18.6060362Z * [new branch] gh/angelayi/137/orig -> origin/gh/angelayi/137/orig 2025-12-04T08:54:18.6060744Z * [new branch] gh/angelayi/138/base -> origin/gh/angelayi/138/base 2025-12-04T08:54:18.6061121Z * [new branch] gh/angelayi/138/head -> origin/gh/angelayi/138/head 2025-12-04T08:54:18.6061494Z * [new branch] gh/angelayi/138/orig -> origin/gh/angelayi/138/orig 2025-12-04T08:54:18.6061869Z * [new branch] gh/angelayi/139/base -> origin/gh/angelayi/139/base 2025-12-04T08:54:18.6062244Z * [new branch] gh/angelayi/139/head -> origin/gh/angelayi/139/head 2025-12-04T08:54:18.6062622Z * [new branch] gh/angelayi/139/orig -> origin/gh/angelayi/139/orig 2025-12-04T08:54:18.6062999Z * [new branch] gh/angelayi/140/base -> origin/gh/angelayi/140/base 2025-12-04T08:54:18.6063374Z * [new branch] gh/angelayi/140/head -> origin/gh/angelayi/140/head 2025-12-04T08:54:18.6063751Z * [new branch] gh/angelayi/140/orig -> origin/gh/angelayi/140/orig 2025-12-04T08:54:18.6064133Z * [new branch] gh/angelayi/141/base -> origin/gh/angelayi/141/base 2025-12-04T08:54:18.6064506Z * [new branch] gh/angelayi/141/head -> origin/gh/angelayi/141/head 2025-12-04T08:54:18.6064884Z * [new branch] gh/angelayi/141/orig -> origin/gh/angelayi/141/orig 2025-12-04T08:54:18.6065267Z * [new branch] gh/angelayi/142/base -> origin/gh/angelayi/142/base 2025-12-04T08:54:18.6065640Z * [new branch] gh/angelayi/142/head -> origin/gh/angelayi/142/head 2025-12-04T08:54:18.6066025Z * [new branch] gh/angelayi/142/orig -> origin/gh/angelayi/142/orig 2025-12-04T08:54:18.6066404Z * [new branch] gh/angelayi/143/base -> origin/gh/angelayi/143/base 2025-12-04T08:54:18.6066780Z * [new branch] gh/angelayi/143/head -> origin/gh/angelayi/143/head 2025-12-04T08:54:18.6067162Z * [new branch] gh/angelayi/143/orig -> origin/gh/angelayi/143/orig 2025-12-04T08:54:18.6067695Z * [new branch] gh/angelayi/144/base -> origin/gh/angelayi/144/base 2025-12-04T08:54:18.6068117Z * [new branch] gh/angelayi/144/head -> origin/gh/angelayi/144/head 2025-12-04T08:54:18.6068495Z * [new branch] gh/angelayi/144/orig -> origin/gh/angelayi/144/orig 2025-12-04T08:54:18.6068886Z * [new branch] gh/anijain2305/753/base -> origin/gh/anijain2305/753/base 2025-12-04T08:54:18.6069278Z * [new branch] gh/anijain2305/753/head -> origin/gh/anijain2305/753/head 2025-12-04T08:54:18.6069729Z * [new branch] gh/anijain2305/753/orig -> origin/gh/anijain2305/753/orig 2025-12-04T08:54:18.6070119Z * [new branch] gh/anijain2305/810/base -> origin/gh/anijain2305/810/base 2025-12-04T08:54:18.6070503Z * [new branch] gh/anijain2305/810/head -> origin/gh/anijain2305/810/head 2025-12-04T08:54:18.6070914Z * [new branch] gh/anijain2305/810/orig -> origin/gh/anijain2305/810/orig 2025-12-04T08:54:18.6071301Z * [new branch] gh/anijain2305/854/base -> origin/gh/anijain2305/854/base 2025-12-04T08:54:18.6071683Z * [new branch] gh/anijain2305/854/head -> origin/gh/anijain2305/854/head 2025-12-04T08:54:18.6072070Z * [new branch] gh/anijain2305/854/orig -> origin/gh/anijain2305/854/orig 2025-12-04T08:54:18.6072456Z * [new branch] gh/anijain2305/864/base -> origin/gh/anijain2305/864/base 2025-12-04T08:54:18.6072846Z * [new branch] gh/anijain2305/864/head -> origin/gh/anijain2305/864/head 2025-12-04T08:54:18.6073232Z * [new branch] gh/anijain2305/864/orig -> origin/gh/anijain2305/864/orig 2025-12-04T08:54:18.6073616Z * [new branch] gh/anijain2305/870/base -> origin/gh/anijain2305/870/base 2025-12-04T08:54:18.6073997Z * [new branch] gh/anijain2305/870/head -> origin/gh/anijain2305/870/head 2025-12-04T08:54:18.6074442Z * [new branch] gh/anijain2305/870/orig -> origin/gh/anijain2305/870/orig 2025-12-04T08:54:18.6074828Z * [new branch] gh/anijain2305/873/base -> origin/gh/anijain2305/873/base 2025-12-04T08:54:18.6075216Z * [new branch] gh/anijain2305/873/head -> origin/gh/anijain2305/873/head 2025-12-04T08:54:18.6075601Z * [new branch] gh/anijain2305/873/orig -> origin/gh/anijain2305/873/orig 2025-12-04T08:54:18.6075984Z * [new branch] gh/anijain2305/894/base -> origin/gh/anijain2305/894/base 2025-12-04T08:54:18.6076375Z * [new branch] gh/anijain2305/894/head -> origin/gh/anijain2305/894/head 2025-12-04T08:54:18.6076758Z * [new branch] gh/anijain2305/894/orig -> origin/gh/anijain2305/894/orig 2025-12-04T08:54:18.6077141Z * [new branch] gh/anijain2305/895/base -> origin/gh/anijain2305/895/base 2025-12-04T08:54:18.6077529Z * [new branch] gh/anijain2305/895/head -> origin/gh/anijain2305/895/head 2025-12-04T08:54:18.6077913Z * [new branch] gh/anijain2305/895/orig -> origin/gh/anijain2305/895/orig 2025-12-04T08:54:18.6078350Z * [new branch] gh/anijain2305/910/base -> origin/gh/anijain2305/910/base 2025-12-04T08:54:18.6078735Z * [new branch] gh/anijain2305/910/head -> origin/gh/anijain2305/910/head 2025-12-04T08:54:18.6079119Z * [new branch] gh/anijain2305/910/orig -> origin/gh/anijain2305/910/orig 2025-12-04T08:54:18.6079504Z * [new branch] gh/anijain2305/919/base -> origin/gh/anijain2305/919/base 2025-12-04T08:54:18.6079891Z * [new branch] gh/anijain2305/919/head -> origin/gh/anijain2305/919/head 2025-12-04T08:54:18.6080277Z * [new branch] gh/anijain2305/919/orig -> origin/gh/anijain2305/919/orig 2025-12-04T08:54:18.6080655Z * [new branch] gh/anijain2305/922/base -> origin/gh/anijain2305/922/base 2025-12-04T08:54:18.6081042Z * [new branch] gh/anijain2305/922/head -> origin/gh/anijain2305/922/head 2025-12-04T08:54:18.6081425Z * [new branch] gh/anijain2305/922/orig -> origin/gh/anijain2305/922/orig 2025-12-04T08:54:18.6081808Z * [new branch] gh/anijain2305/932/base -> origin/gh/anijain2305/932/base 2025-12-04T08:54:18.6082192Z * [new branch] gh/anijain2305/932/head -> origin/gh/anijain2305/932/head 2025-12-04T08:54:18.6082577Z * [new branch] gh/anijain2305/932/orig -> origin/gh/anijain2305/932/orig 2025-12-04T08:54:18.6083032Z * [new branch] gh/anijain2305/940/base -> origin/gh/anijain2305/940/base 2025-12-04T08:54:18.6083415Z * [new branch] gh/anijain2305/940/head -> origin/gh/anijain2305/940/head 2025-12-04T08:54:18.6083800Z * [new branch] gh/anijain2305/940/orig -> origin/gh/anijain2305/940/orig 2025-12-04T08:54:18.6084183Z * [new branch] gh/anijain2305/941/base -> origin/gh/anijain2305/941/base 2025-12-04T08:54:18.6084571Z * [new branch] gh/anijain2305/941/head -> origin/gh/anijain2305/941/head 2025-12-04T08:54:18.6084952Z * [new branch] gh/anijain2305/941/orig -> origin/gh/anijain2305/941/orig 2025-12-04T08:54:18.6085340Z * [new branch] gh/anijain2305/942/base -> origin/gh/anijain2305/942/base 2025-12-04T08:54:18.6085729Z * [new branch] gh/anijain2305/942/head -> origin/gh/anijain2305/942/head 2025-12-04T08:54:18.6086113Z * [new branch] gh/anijain2305/942/orig -> origin/gh/anijain2305/942/orig 2025-12-04T08:54:18.6086500Z * [new branch] gh/anijain2305/943/base -> origin/gh/anijain2305/943/base 2025-12-04T08:54:18.6086889Z * [new branch] gh/anijain2305/943/head -> origin/gh/anijain2305/943/head 2025-12-04T08:54:18.6087270Z * [new branch] gh/anijain2305/943/orig -> origin/gh/anijain2305/943/orig 2025-12-04T08:54:18.6087713Z * [new branch] gh/anijain2305/944/base -> origin/gh/anijain2305/944/base 2025-12-04T08:54:18.6088140Z * [new branch] gh/anijain2305/944/head -> origin/gh/anijain2305/944/head 2025-12-04T08:54:18.6088527Z * [new branch] gh/anijain2305/944/orig -> origin/gh/anijain2305/944/orig 2025-12-04T08:54:18.6088910Z * [new branch] gh/anijain2305/945/base -> origin/gh/anijain2305/945/base 2025-12-04T08:54:18.6089294Z * [new branch] gh/anijain2305/945/head -> origin/gh/anijain2305/945/head 2025-12-04T08:54:18.6089675Z * [new branch] gh/anijain2305/945/orig -> origin/gh/anijain2305/945/orig 2025-12-04T08:54:18.6090065Z * [new branch] gh/anijain2305/946/base -> origin/gh/anijain2305/946/base 2025-12-04T08:54:18.6090454Z * [new branch] gh/anijain2305/946/head -> origin/gh/anijain2305/946/head 2025-12-04T08:54:18.6090839Z * [new branch] gh/anijain2305/946/orig -> origin/gh/anijain2305/946/orig 2025-12-04T08:54:18.6091229Z * [new branch] gh/anijain2305/947/base -> origin/gh/anijain2305/947/base 2025-12-04T08:54:18.6091619Z * [new branch] gh/anijain2305/947/head -> origin/gh/anijain2305/947/head 2025-12-04T08:54:18.6092007Z * [new branch] gh/anijain2305/947/orig -> origin/gh/anijain2305/947/orig 2025-12-04T08:54:18.6092392Z * [new branch] gh/anijain2305/948/base -> origin/gh/anijain2305/948/base 2025-12-04T08:54:18.6092776Z * [new branch] gh/anijain2305/948/head -> origin/gh/anijain2305/948/head 2025-12-04T08:54:18.6093167Z * [new branch] gh/anijain2305/948/orig -> origin/gh/anijain2305/948/orig 2025-12-04T08:54:18.6093551Z * [new branch] gh/anijain2305/949/base -> origin/gh/anijain2305/949/base 2025-12-04T08:54:18.6093935Z * [new branch] gh/anijain2305/949/head -> origin/gh/anijain2305/949/head 2025-12-04T08:54:18.6094323Z * [new branch] gh/anijain2305/949/orig -> origin/gh/anijain2305/949/orig 2025-12-04T08:54:18.6094716Z * [new branch] gh/anijain2305/950/base -> origin/gh/anijain2305/950/base 2025-12-04T08:54:18.6095099Z * [new branch] gh/anijain2305/950/head -> origin/gh/anijain2305/950/head 2025-12-04T08:54:18.6095486Z * [new branch] gh/anijain2305/950/orig -> origin/gh/anijain2305/950/orig 2025-12-04T08:54:18.6095871Z * [new branch] gh/anijain2305/951/base -> origin/gh/anijain2305/951/base 2025-12-04T08:54:18.6096257Z * [new branch] gh/anijain2305/951/head -> origin/gh/anijain2305/951/head 2025-12-04T08:54:18.6096700Z * [new branch] gh/anijain2305/951/orig -> origin/gh/anijain2305/951/orig 2025-12-04T08:54:18.6097082Z * [new branch] gh/anijain2305/952/base -> origin/gh/anijain2305/952/base 2025-12-04T08:54:18.6097468Z * [new branch] gh/anijain2305/952/head -> origin/gh/anijain2305/952/head 2025-12-04T08:54:18.6097857Z * [new branch] gh/anijain2305/952/orig -> origin/gh/anijain2305/952/orig 2025-12-04T08:54:18.6098307Z * [new branch] gh/anijain2305/953/base -> origin/gh/anijain2305/953/base 2025-12-04T08:54:18.6098690Z * [new branch] gh/anijain2305/953/head -> origin/gh/anijain2305/953/head 2025-12-04T08:54:18.6099076Z * [new branch] gh/anijain2305/953/orig -> origin/gh/anijain2305/953/orig 2025-12-04T08:54:18.6099461Z * [new branch] gh/anijain2305/954/base -> origin/gh/anijain2305/954/base 2025-12-04T08:54:18.6099843Z * [new branch] gh/anijain2305/954/head -> origin/gh/anijain2305/954/head 2025-12-04T08:54:18.6100234Z * [new branch] gh/anijain2305/954/orig -> origin/gh/anijain2305/954/orig 2025-12-04T08:54:18.6100617Z * [new branch] gh/anijain2305/955/base -> origin/gh/anijain2305/955/base 2025-12-04T08:54:18.6101001Z * [new branch] gh/anijain2305/955/head -> origin/gh/anijain2305/955/head 2025-12-04T08:54:18.6101443Z * [new branch] gh/anijain2305/955/orig -> origin/gh/anijain2305/955/orig 2025-12-04T08:54:18.6101834Z * [new branch] gh/anijain2305/956/base -> origin/gh/anijain2305/956/base 2025-12-04T08:54:18.6102215Z * [new branch] gh/anijain2305/956/head -> origin/gh/anijain2305/956/head 2025-12-04T08:54:18.6102601Z * [new branch] gh/anijain2305/956/orig -> origin/gh/anijain2305/956/orig 2025-12-04T08:54:18.6102983Z * [new branch] gh/anijain2305/957/base -> origin/gh/anijain2305/957/base 2025-12-04T08:54:18.6103369Z * [new branch] gh/anijain2305/957/head -> origin/gh/anijain2305/957/head 2025-12-04T08:54:18.6103756Z * [new branch] gh/anijain2305/957/orig -> origin/gh/anijain2305/957/orig 2025-12-04T08:54:18.6104140Z * [new branch] gh/anijain2305/958/base -> origin/gh/anijain2305/958/base 2025-12-04T08:54:18.6104525Z * [new branch] gh/anijain2305/958/head -> origin/gh/anijain2305/958/head 2025-12-04T08:54:18.6104916Z * [new branch] gh/anijain2305/958/orig -> origin/gh/anijain2305/958/orig 2025-12-04T08:54:18.6105296Z * [new branch] gh/anijain2305/959/base -> origin/gh/anijain2305/959/base 2025-12-04T08:54:18.6105682Z * [new branch] gh/anijain2305/959/head -> origin/gh/anijain2305/959/head 2025-12-04T08:54:18.6106069Z * [new branch] gh/anijain2305/959/orig -> origin/gh/anijain2305/959/orig 2025-12-04T08:54:18.6106452Z * [new branch] gh/anijain2305/960/base -> origin/gh/anijain2305/960/base 2025-12-04T08:54:18.6106837Z * [new branch] gh/anijain2305/960/head -> origin/gh/anijain2305/960/head 2025-12-04T08:54:18.6107223Z * [new branch] gh/anijain2305/960/orig -> origin/gh/anijain2305/960/orig 2025-12-04T08:54:18.6107605Z * [new branch] gh/anijain2305/961/base -> origin/gh/anijain2305/961/base 2025-12-04T08:54:18.6107992Z * [new branch] gh/anijain2305/961/head -> origin/gh/anijain2305/961/head 2025-12-04T08:54:18.6108423Z * [new branch] gh/anijain2305/961/orig -> origin/gh/anijain2305/961/orig 2025-12-04T08:54:18.6108809Z * [new branch] gh/anijain2305/962/base -> origin/gh/anijain2305/962/base 2025-12-04T08:54:18.6109190Z * [new branch] gh/anijain2305/962/head -> origin/gh/anijain2305/962/head 2025-12-04T08:54:18.6109576Z * [new branch] gh/anijain2305/962/orig -> origin/gh/anijain2305/962/orig 2025-12-04T08:54:18.6109960Z * [new branch] gh/anijain2305/963/base -> origin/gh/anijain2305/963/base 2025-12-04T08:54:18.6110399Z * [new branch] gh/anijain2305/963/head -> origin/gh/anijain2305/963/head 2025-12-04T08:54:18.6110783Z * [new branch] gh/anijain2305/963/orig -> origin/gh/anijain2305/963/orig 2025-12-04T08:54:18.6111166Z * [new branch] gh/anijain2305/964/base -> origin/gh/anijain2305/964/base 2025-12-04T08:54:18.6111555Z * [new branch] gh/anijain2305/964/head -> origin/gh/anijain2305/964/head 2025-12-04T08:54:18.6111937Z * [new branch] gh/anijain2305/964/orig -> origin/gh/anijain2305/964/orig 2025-12-04T08:54:18.6112324Z * [new branch] gh/anijain2305/965/base -> origin/gh/anijain2305/965/base 2025-12-04T08:54:18.6112707Z * [new branch] gh/anijain2305/965/head -> origin/gh/anijain2305/965/head 2025-12-04T08:54:18.6113089Z * [new branch] gh/anijain2305/965/orig -> origin/gh/anijain2305/965/orig 2025-12-04T08:54:18.6113475Z * [new branch] gh/anijain2305/966/base -> origin/gh/anijain2305/966/base 2025-12-04T08:54:18.6113864Z * [new branch] gh/anijain2305/966/head -> origin/gh/anijain2305/966/head 2025-12-04T08:54:18.6114244Z * [new branch] gh/anijain2305/966/orig -> origin/gh/anijain2305/966/orig 2025-12-04T08:54:18.6114629Z * [new branch] gh/anijain2305/967/base -> origin/gh/anijain2305/967/base 2025-12-04T08:54:18.6115064Z * [new branch] gh/anijain2305/967/head -> origin/gh/anijain2305/967/head 2025-12-04T08:54:18.6115454Z * [new branch] gh/anijain2305/967/orig -> origin/gh/anijain2305/967/orig 2025-12-04T08:54:18.6115838Z * [new branch] gh/anijain2305/968/base -> origin/gh/anijain2305/968/base 2025-12-04T08:54:18.6116221Z * [new branch] gh/anijain2305/968/head -> origin/gh/anijain2305/968/head 2025-12-04T08:54:18.6116603Z * [new branch] gh/anijain2305/968/orig -> origin/gh/anijain2305/968/orig 2025-12-04T08:54:18.6116992Z * [new branch] gh/anijain2305/969/base -> origin/gh/anijain2305/969/base 2025-12-04T08:54:18.6117378Z * [new branch] gh/anijain2305/969/head -> origin/gh/anijain2305/969/head 2025-12-04T08:54:18.6117761Z * [new branch] gh/anijain2305/969/orig -> origin/gh/anijain2305/969/orig 2025-12-04T08:54:18.6118236Z * [new branch] gh/anijain2305/970/base -> origin/gh/anijain2305/970/base 2025-12-04T08:54:18.6118621Z * [new branch] gh/anijain2305/970/head -> origin/gh/anijain2305/970/head 2025-12-04T08:54:18.6119003Z * [new branch] gh/anijain2305/970/orig -> origin/gh/anijain2305/970/orig 2025-12-04T08:54:18.6119386Z * [new branch] gh/anjali411/216/base -> origin/gh/anjali411/216/base 2025-12-04T08:54:18.6119772Z * [new branch] gh/anjali411/216/head -> origin/gh/anjali411/216/head 2025-12-04T08:54:18.6120153Z * [new branch] gh/anjali411/216/orig -> origin/gh/anjali411/216/orig 2025-12-04T08:54:18.6120538Z * [new branch] gh/anshul-si/1/base -> origin/gh/anshul-si/1/base 2025-12-04T08:54:18.6120909Z * [new branch] gh/anshul-si/1/head -> origin/gh/anshul-si/1/head 2025-12-04T08:54:18.6121281Z * [new branch] gh/anshul-si/2/base -> origin/gh/anshul-si/2/base 2025-12-04T08:54:18.6121656Z * [new branch] gh/anshul-si/2/head -> origin/gh/anshul-si/2/head 2025-12-04T08:54:18.6122022Z * [new branch] gh/anshul-si/3/base -> origin/gh/anshul-si/3/base 2025-12-04T08:54:18.6122391Z * [new branch] gh/anshul-si/3/head -> origin/gh/anshul-si/3/head 2025-12-04T08:54:18.6122759Z * [new branch] gh/anshul-si/4/base -> origin/gh/anshul-si/4/base 2025-12-04T08:54:18.6123125Z * [new branch] gh/anshul-si/4/head -> origin/gh/anshul-si/4/head 2025-12-04T08:54:18.6123496Z * [new branch] gh/anshul-si/5/base -> origin/gh/anshul-si/5/base 2025-12-04T08:54:18.6123923Z * [new branch] gh/anshul-si/5/head -> origin/gh/anshul-si/5/head 2025-12-04T08:54:18.6124298Z * [new branch] gh/anshul-si/53/base -> origin/gh/anshul-si/53/base 2025-12-04T08:54:18.6124677Z * [new branch] gh/anshul-si/53/head -> origin/gh/anshul-si/53/head 2025-12-04T08:54:18.6125056Z * [new branch] gh/anshul-si/58/base -> origin/gh/anshul-si/58/base 2025-12-04T08:54:18.6125428Z * [new branch] gh/anshul-si/58/head -> origin/gh/anshul-si/58/head 2025-12-04T08:54:18.6125801Z * [new branch] gh/anshul-si/66/base -> origin/gh/anshul-si/66/base 2025-12-04T08:54:18.6126171Z * [new branch] gh/anshul-si/66/head -> origin/gh/anshul-si/66/head 2025-12-04T08:54:18.6126540Z * [new branch] gh/anshul-si/66/orig -> origin/gh/anshul-si/66/orig 2025-12-04T08:54:18.6126909Z * [new branch] gh/anshul-si/67/base -> origin/gh/anshul-si/67/base 2025-12-04T08:54:18.6127283Z * [new branch] gh/anshul-si/67/head -> origin/gh/anshul-si/67/head 2025-12-04T08:54:18.6127653Z * [new branch] gh/anshul-si/67/orig -> origin/gh/anshul-si/67/orig 2025-12-04T08:54:18.6128027Z * [new branch] gh/anshul-si/68/base -> origin/gh/anshul-si/68/base 2025-12-04T08:54:18.6128509Z * [new branch] gh/anshul-si/68/head -> origin/gh/anshul-si/68/head 2025-12-04T08:54:18.6128881Z * [new branch] gh/anshul-si/68/orig -> origin/gh/anshul-si/68/orig 2025-12-04T08:54:18.6129247Z * [new branch] gh/anshul-si/69/base -> origin/gh/anshul-si/69/base 2025-12-04T08:54:18.6129616Z * [new branch] gh/anshul-si/69/head -> origin/gh/anshul-si/69/head 2025-12-04T08:54:18.6129985Z * [new branch] gh/anshul-si/69/orig -> origin/gh/anshul-si/69/orig 2025-12-04T08:54:18.6130353Z * [new branch] gh/anshul-si/70/base -> origin/gh/anshul-si/70/base 2025-12-04T08:54:18.6130725Z * [new branch] gh/anshul-si/70/head -> origin/gh/anshul-si/70/head 2025-12-04T08:54:18.6131092Z * [new branch] gh/anshul-si/70/orig -> origin/gh/anshul-si/70/orig 2025-12-04T08:54:18.6131462Z * [new branch] gh/anshul-si/71/base -> origin/gh/anshul-si/71/base 2025-12-04T08:54:18.6131833Z * [new branch] gh/anshul-si/71/head -> origin/gh/anshul-si/71/head 2025-12-04T08:54:18.6132201Z * [new branch] gh/anshul-si/71/orig -> origin/gh/anshul-si/71/orig 2025-12-04T08:54:18.6132570Z * [new branch] gh/anshul-si/72/base -> origin/gh/anshul-si/72/base 2025-12-04T08:54:18.6132940Z * [new branch] gh/anshul-si/72/head -> origin/gh/anshul-si/72/head 2025-12-04T08:54:18.6133309Z * [new branch] gh/anshul-si/72/orig -> origin/gh/anshul-si/72/orig 2025-12-04T08:54:18.6133682Z * [new branch] gh/anshul-si/73/base -> origin/gh/anshul-si/73/base 2025-12-04T08:54:18.6134055Z * [new branch] gh/anshul-si/73/head -> origin/gh/anshul-si/73/head 2025-12-04T08:54:18.6134425Z * [new branch] gh/anshul-si/73/orig -> origin/gh/anshul-si/73/orig 2025-12-04T08:54:18.6134798Z * [new branch] gh/aorenste/132/base -> origin/gh/aorenste/132/base 2025-12-04T08:54:18.6135179Z * [new branch] gh/aorenste/132/head -> origin/gh/aorenste/132/head 2025-12-04T08:54:18.6135550Z * [new branch] gh/aorenste/134/base -> origin/gh/aorenste/134/base 2025-12-04T08:54:18.6135926Z * [new branch] gh/aorenste/134/head -> origin/gh/aorenste/134/head 2025-12-04T08:54:18.6136302Z * [new branch] gh/aorenste/134/orig -> origin/gh/aorenste/134/orig 2025-12-04T08:54:18.6136673Z * [new branch] gh/aorenste/139/base -> origin/gh/aorenste/139/base 2025-12-04T08:54:18.6137048Z * [new branch] gh/aorenste/139/head -> origin/gh/aorenste/139/head 2025-12-04T08:54:18.6137491Z * [new branch] gh/aorenste/139/orig -> origin/gh/aorenste/139/orig 2025-12-04T08:54:18.6137862Z * [new branch] gh/aorenste/141/base -> origin/gh/aorenste/141/base 2025-12-04T08:54:18.6138283Z * [new branch] gh/aorenste/141/head -> origin/gh/aorenste/141/head 2025-12-04T08:54:18.6138660Z * [new branch] gh/aorenste/145/base -> origin/gh/aorenste/145/base 2025-12-04T08:54:18.6139037Z * [new branch] gh/aorenste/145/head -> origin/gh/aorenste/145/head 2025-12-04T08:54:18.6139413Z * [new branch] gh/aorenste/145/orig -> origin/gh/aorenste/145/orig 2025-12-04T08:54:18.6139790Z * [new branch] gh/aorenste/146/base -> origin/gh/aorenste/146/base 2025-12-04T08:54:18.6140162Z * [new branch] gh/aorenste/146/head -> origin/gh/aorenste/146/head 2025-12-04T08:54:18.6140540Z * [new branch] gh/aorenste/146/orig -> origin/gh/aorenste/146/orig 2025-12-04T08:54:18.6140912Z * [new branch] gh/aorenste/147/base -> origin/gh/aorenste/147/base 2025-12-04T08:54:18.6141286Z * [new branch] gh/aorenste/147/head -> origin/gh/aorenste/147/head 2025-12-04T08:54:18.6141662Z * [new branch] gh/aorenste/147/orig -> origin/gh/aorenste/147/orig 2025-12-04T08:54:18.6142088Z * [new branch] gh/aorenste/148/base -> origin/gh/aorenste/148/base 2025-12-04T08:54:18.6142464Z * [new branch] gh/aorenste/148/head -> origin/gh/aorenste/148/head 2025-12-04T08:54:18.6142838Z * [new branch] gh/aorenste/148/orig -> origin/gh/aorenste/148/orig 2025-12-04T08:54:18.6143210Z * [new branch] gh/aorenste/149/base -> origin/gh/aorenste/149/base 2025-12-04T08:54:18.6143586Z * [new branch] gh/aorenste/149/head -> origin/gh/aorenste/149/head 2025-12-04T08:54:18.6143964Z * [new branch] gh/aorenste/149/orig -> origin/gh/aorenste/149/orig 2025-12-04T08:54:18.6144337Z * [new branch] gh/aorenste/150/base -> origin/gh/aorenste/150/base 2025-12-04T08:54:18.6144710Z * [new branch] gh/aorenste/150/head -> origin/gh/aorenste/150/head 2025-12-04T08:54:18.6145087Z * [new branch] gh/aorenste/150/orig -> origin/gh/aorenste/150/orig 2025-12-04T08:54:18.6145463Z * [new branch] gh/aorenste/151/base -> origin/gh/aorenste/151/base 2025-12-04T08:54:18.6146362Z * [new branch] gh/aorenste/151/head -> origin/gh/aorenste/151/head 2025-12-04T08:54:18.6146738Z * [new branch] gh/aorenste/151/orig -> origin/gh/aorenste/151/orig 2025-12-04T08:54:18.6147113Z * [new branch] gh/aorenste/152/base -> origin/gh/aorenste/152/base 2025-12-04T08:54:18.6147486Z * [new branch] gh/aorenste/152/head -> origin/gh/aorenste/152/head 2025-12-04T08:54:18.6147862Z * [new branch] gh/aorenste/152/orig -> origin/gh/aorenste/152/orig 2025-12-04T08:54:18.6148299Z * [new branch] gh/aorenste/153/base -> origin/gh/aorenste/153/base 2025-12-04T08:54:18.6148674Z * [new branch] gh/aorenste/153/head -> origin/gh/aorenste/153/head 2025-12-04T08:54:18.6149048Z * [new branch] gh/aorenste/153/orig -> origin/gh/aorenste/153/orig 2025-12-04T08:54:18.6149427Z * [new branch] gh/aorenste/154/base -> origin/gh/aorenste/154/base 2025-12-04T08:54:18.6149800Z * [new branch] gh/aorenste/154/head -> origin/gh/aorenste/154/head 2025-12-04T08:54:18.6150172Z * [new branch] gh/aorenste/154/orig -> origin/gh/aorenste/154/orig 2025-12-04T08:54:18.6150547Z * [new branch] gh/aorenste/155/base -> origin/gh/aorenste/155/base 2025-12-04T08:54:18.6150923Z * [new branch] gh/aorenste/155/head -> origin/gh/aorenste/155/head 2025-12-04T08:54:18.6151360Z * [new branch] gh/aorenste/155/orig -> origin/gh/aorenste/155/orig 2025-12-04T08:54:18.6151736Z * [new branch] gh/aorenste/156/base -> origin/gh/aorenste/156/base 2025-12-04T08:54:18.6152110Z * [new branch] gh/aorenste/156/head -> origin/gh/aorenste/156/head 2025-12-04T08:54:18.6152484Z * [new branch] gh/aorenste/156/orig -> origin/gh/aorenste/156/orig 2025-12-04T08:54:18.6152863Z * [new branch] gh/aorenste/157/base -> origin/gh/aorenste/157/base 2025-12-04T08:54:18.6153238Z * [new branch] gh/aorenste/157/head -> origin/gh/aorenste/157/head 2025-12-04T08:54:18.6153611Z * [new branch] gh/aorenste/157/orig -> origin/gh/aorenste/157/orig 2025-12-04T08:54:18.6153986Z * [new branch] gh/aorenste/158/base -> origin/gh/aorenste/158/base 2025-12-04T08:54:18.6154359Z * [new branch] gh/aorenste/158/head -> origin/gh/aorenste/158/head 2025-12-04T08:54:18.6154737Z * [new branch] gh/aorenste/158/orig -> origin/gh/aorenste/158/orig 2025-12-04T08:54:18.6155111Z * [new branch] gh/aorenste/159/base -> origin/gh/aorenste/159/base 2025-12-04T08:54:18.6155326Z * [new branch] gh/aorenste/159/head -> origin/gh/aorenste/159/head 2025-12-04T08:54:18.6155547Z * [new branch] gh/aorenste/159/orig -> origin/gh/aorenste/159/orig 2025-12-04T08:54:18.6155741Z * [new branch] gh/avikchaudhuri/1/base -> origin/gh/avikchaudhuri/1/base 2025-12-04T08:54:18.6155941Z * [new branch] gh/avikchaudhuri/1/head -> origin/gh/avikchaudhuri/1/head 2025-12-04T08:54:18.6156133Z * [new branch] gh/avikchaudhuri/2/base -> origin/gh/avikchaudhuri/2/base 2025-12-04T08:54:18.6156328Z * [new branch] gh/avikchaudhuri/2/head -> origin/gh/avikchaudhuri/2/head 2025-12-04T08:54:18.6156522Z * [new branch] gh/avikchaudhuri/2/orig -> origin/gh/avikchaudhuri/2/orig 2025-12-04T08:54:18.6156713Z * [new branch] gh/bdhirsh/666/base -> origin/gh/bdhirsh/666/base 2025-12-04T08:54:18.6156897Z * [new branch] gh/bdhirsh/666/head -> origin/gh/bdhirsh/666/head 2025-12-04T08:54:18.6157075Z * [new branch] gh/bdhirsh/666/orig -> origin/gh/bdhirsh/666/orig 2025-12-04T08:54:18.6157259Z * [new branch] gh/bdhirsh/668/base -> origin/gh/bdhirsh/668/base 2025-12-04T08:54:18.6157441Z * [new branch] gh/bdhirsh/668/head -> origin/gh/bdhirsh/668/head 2025-12-04T08:54:18.6157620Z * [new branch] gh/bdhirsh/668/orig -> origin/gh/bdhirsh/668/orig 2025-12-04T08:54:18.6157802Z * [new branch] gh/bdhirsh/669/base -> origin/gh/bdhirsh/669/base 2025-12-04T08:54:18.6157982Z * [new branch] gh/bdhirsh/669/head -> origin/gh/bdhirsh/669/head 2025-12-04T08:54:18.6158218Z * [new branch] gh/bdhirsh/669/orig -> origin/gh/bdhirsh/669/orig 2025-12-04T08:54:18.6158401Z * [new branch] gh/bdhirsh/670/base -> origin/gh/bdhirsh/670/base 2025-12-04T08:54:18.6158582Z * [new branch] gh/bdhirsh/670/head -> origin/gh/bdhirsh/670/head 2025-12-04T08:54:18.6158759Z * [new branch] gh/bdhirsh/670/orig -> origin/gh/bdhirsh/670/orig 2025-12-04T08:54:18.6158941Z * [new branch] gh/bdhirsh/672/base -> origin/gh/bdhirsh/672/base 2025-12-04T08:54:18.6159121Z * [new branch] gh/bdhirsh/672/head -> origin/gh/bdhirsh/672/head 2025-12-04T08:54:18.6159307Z * [new branch] gh/bdhirsh/672/orig -> origin/gh/bdhirsh/672/orig 2025-12-04T08:54:18.6159489Z * [new branch] gh/bdhirsh/675/base -> origin/gh/bdhirsh/675/base 2025-12-04T08:54:18.6159671Z * [new branch] gh/bdhirsh/675/head -> origin/gh/bdhirsh/675/head 2025-12-04T08:54:18.6159850Z * [new branch] gh/bdhirsh/675/orig -> origin/gh/bdhirsh/675/orig 2025-12-04T08:54:18.6160074Z * [new branch] gh/bdhirsh/676/base -> origin/gh/bdhirsh/676/base 2025-12-04T08:54:18.6160254Z * [new branch] gh/bdhirsh/676/head -> origin/gh/bdhirsh/676/head 2025-12-04T08:54:18.6160434Z * [new branch] gh/bdhirsh/676/orig -> origin/gh/bdhirsh/676/orig 2025-12-04T08:54:18.6160615Z * [new branch] gh/bdhirsh/677/base -> origin/gh/bdhirsh/677/base 2025-12-04T08:54:18.6160795Z * [new branch] gh/bdhirsh/677/head -> origin/gh/bdhirsh/677/head 2025-12-04T08:54:18.6160871Z * [new branch] gh/bdhirsh/677/orig -> origin/gh/bdhirsh/677/orig 2025-12-04T08:54:18.6160942Z * [new branch] gh/bdhirsh/678/base -> origin/gh/bdhirsh/678/base 2025-12-04T08:54:18.6161012Z * [new branch] gh/bdhirsh/678/head -> origin/gh/bdhirsh/678/head 2025-12-04T08:54:18.6161084Z * [new branch] gh/bdhirsh/678/orig -> origin/gh/bdhirsh/678/orig 2025-12-04T08:54:18.6161155Z * [new branch] gh/bdhirsh/679/base -> origin/gh/bdhirsh/679/base 2025-12-04T08:54:18.6161227Z * [new branch] gh/bdhirsh/679/head -> origin/gh/bdhirsh/679/head 2025-12-04T08:54:18.6161297Z * [new branch] gh/bdhirsh/679/orig -> origin/gh/bdhirsh/679/orig 2025-12-04T08:54:18.6161399Z * [new branch] gh/bdhirsh/680/base -> origin/gh/bdhirsh/680/base 2025-12-04T08:54:18.6161473Z * [new branch] gh/bdhirsh/680/head -> origin/gh/bdhirsh/680/head 2025-12-04T08:54:18.6161544Z * [new branch] gh/bdhirsh/680/orig -> origin/gh/bdhirsh/680/orig 2025-12-04T08:54:18.6161613Z * [new branch] gh/bdhirsh/681/base -> origin/gh/bdhirsh/681/base 2025-12-04T08:54:18.6161687Z * [new branch] gh/bdhirsh/681/head -> origin/gh/bdhirsh/681/head 2025-12-04T08:54:18.6161757Z * [new branch] gh/bdhirsh/681/orig -> origin/gh/bdhirsh/681/orig 2025-12-04T08:54:18.6161853Z * [new branch] gh/benjaminglass1/101/base -> origin/gh/benjaminglass1/101/base 2025-12-04T08:54:18.6161946Z * [new branch] gh/benjaminglass1/101/head -> origin/gh/benjaminglass1/101/head 2025-12-04T08:54:18.6162033Z * [new branch] gh/benjaminglass1/101/orig -> origin/gh/benjaminglass1/101/orig 2025-12-04T08:54:18.6162121Z * [new branch] gh/benjaminglass1/102/base -> origin/gh/benjaminglass1/102/base 2025-12-04T08:54:18.6162211Z * [new branch] gh/benjaminglass1/102/head -> origin/gh/benjaminglass1/102/head 2025-12-04T08:54:18.6162297Z * [new branch] gh/benjaminglass1/102/orig -> origin/gh/benjaminglass1/102/orig 2025-12-04T08:54:18.6162382Z * [new branch] gh/benjaminglass1/106/base -> origin/gh/benjaminglass1/106/base 2025-12-04T08:54:18.6162471Z * [new branch] gh/benjaminglass1/106/head -> origin/gh/benjaminglass1/106/head 2025-12-04T08:54:18.6162559Z * [new branch] gh/benjaminglass1/106/orig -> origin/gh/benjaminglass1/106/orig 2025-12-04T08:54:18.6162644Z * [new branch] gh/benjaminglass1/107/base -> origin/gh/benjaminglass1/107/base 2025-12-04T08:54:18.6162733Z * [new branch] gh/benjaminglass1/107/head -> origin/gh/benjaminglass1/107/head 2025-12-04T08:54:18.6162820Z * [new branch] gh/benjaminglass1/107/orig -> origin/gh/benjaminglass1/107/orig 2025-12-04T08:54:18.6162909Z * [new branch] gh/benjaminglass1/108/base -> origin/gh/benjaminglass1/108/base 2025-12-04T08:54:18.6162995Z * [new branch] gh/benjaminglass1/108/head -> origin/gh/benjaminglass1/108/head 2025-12-04T08:54:18.6163080Z * [new branch] gh/benjaminglass1/108/orig -> origin/gh/benjaminglass1/108/orig 2025-12-04T08:54:18.6163169Z * [new branch] gh/benjaminglass1/109/base -> origin/gh/benjaminglass1/109/base 2025-12-04T08:54:18.6163281Z * [new branch] gh/benjaminglass1/109/head -> origin/gh/benjaminglass1/109/head 2025-12-04T08:54:18.6163366Z * [new branch] gh/benjaminglass1/109/orig -> origin/gh/benjaminglass1/109/orig 2025-12-04T08:54:18.6163454Z * [new branch] gh/benjaminglass1/97/base -> origin/gh/benjaminglass1/97/base 2025-12-04T08:54:18.6163539Z * [new branch] gh/benjaminglass1/97/head -> origin/gh/benjaminglass1/97/head 2025-12-04T08:54:18.6163623Z * [new branch] gh/benjaminglass1/97/orig -> origin/gh/benjaminglass1/97/orig 2025-12-04T08:54:18.6163705Z * [new branch] gh/bobrenjc93/570/base -> origin/gh/bobrenjc93/570/base 2025-12-04T08:54:18.6163782Z * [new branch] gh/bobrenjc93/570/head -> origin/gh/bobrenjc93/570/head 2025-12-04T08:54:18.6163858Z * [new branch] gh/bobrenjc93/570/orig -> origin/gh/bobrenjc93/570/orig 2025-12-04T08:54:18.6163935Z * [new branch] gh/bobrenjc93/604/base -> origin/gh/bobrenjc93/604/base 2025-12-04T08:54:18.6164011Z * [new branch] gh/bobrenjc93/604/head -> origin/gh/bobrenjc93/604/head 2025-12-04T08:54:18.6164085Z * [new branch] gh/bobrenjc93/604/orig -> origin/gh/bobrenjc93/604/orig 2025-12-04T08:54:18.6164161Z * [new branch] gh/bobrenjc93/638/base -> origin/gh/bobrenjc93/638/base 2025-12-04T08:54:18.6164261Z * [new branch] gh/bobrenjc93/638/head -> origin/gh/bobrenjc93/638/head 2025-12-04T08:54:18.6164336Z * [new branch] gh/bobrenjc93/638/orig -> origin/gh/bobrenjc93/638/orig 2025-12-04T08:54:18.6164413Z * [new branch] gh/bobrenjc93/653/base -> origin/gh/bobrenjc93/653/base 2025-12-04T08:54:18.6164489Z * [new branch] gh/bobrenjc93/653/head -> origin/gh/bobrenjc93/653/head 2025-12-04T08:54:18.6164564Z * [new branch] gh/bobrenjc93/653/orig -> origin/gh/bobrenjc93/653/orig 2025-12-04T08:54:18.6164639Z * [new branch] gh/bobrenjc93/654/base -> origin/gh/bobrenjc93/654/base 2025-12-04T08:54:18.6164714Z * [new branch] gh/bobrenjc93/654/head -> origin/gh/bobrenjc93/654/head 2025-12-04T08:54:18.6164790Z * [new branch] gh/bobrenjc93/654/orig -> origin/gh/bobrenjc93/654/orig 2025-12-04T08:54:18.6164866Z * [new branch] gh/bobrenjc93/657/base -> origin/gh/bobrenjc93/657/base 2025-12-04T08:54:18.6164941Z * [new branch] gh/bobrenjc93/657/head -> origin/gh/bobrenjc93/657/head 2025-12-04T08:54:18.6165018Z * [new branch] gh/bobrenjc93/657/orig -> origin/gh/bobrenjc93/657/orig 2025-12-04T08:54:18.6165093Z * [new branch] gh/bobrenjc93/672/base -> origin/gh/bobrenjc93/672/base 2025-12-04T08:54:18.6165167Z * [new branch] gh/bobrenjc93/672/head -> origin/gh/bobrenjc93/672/head 2025-12-04T08:54:18.6165244Z * [new branch] gh/bobrenjc93/672/orig -> origin/gh/bobrenjc93/672/orig 2025-12-04T08:54:18.6165318Z * [new branch] gh/bobrenjc93/679/base -> origin/gh/bobrenjc93/679/base 2025-12-04T08:54:18.6165392Z * [new branch] gh/bobrenjc93/679/head -> origin/gh/bobrenjc93/679/head 2025-12-04T08:54:18.6165468Z * [new branch] gh/bobrenjc93/679/orig -> origin/gh/bobrenjc93/679/orig 2025-12-04T08:54:18.6165542Z * [new branch] gh/bobrenjc93/680/base -> origin/gh/bobrenjc93/680/base 2025-12-04T08:54:18.6165617Z * [new branch] gh/bobrenjc93/680/head -> origin/gh/bobrenjc93/680/head 2025-12-04T08:54:18.6165694Z * [new branch] gh/bobrenjc93/680/orig -> origin/gh/bobrenjc93/680/orig 2025-12-04T08:54:18.6165767Z * [new branch] gh/bobrenjc93/681/base -> origin/gh/bobrenjc93/681/base 2025-12-04T08:54:18.6165841Z * [new branch] gh/bobrenjc93/681/head -> origin/gh/bobrenjc93/681/head 2025-12-04T08:54:18.6165918Z * [new branch] gh/bobrenjc93/681/orig -> origin/gh/bobrenjc93/681/orig 2025-12-04T08:54:18.6166019Z * [new branch] gh/bobrenjc93/682/base -> origin/gh/bobrenjc93/682/base 2025-12-04T08:54:18.6166095Z * [new branch] gh/bobrenjc93/682/head -> origin/gh/bobrenjc93/682/head 2025-12-04T08:54:18.6166169Z * [new branch] gh/bobrenjc93/682/orig -> origin/gh/bobrenjc93/682/orig 2025-12-04T08:54:18.6166244Z * [new branch] gh/bobrenjc93/683/base -> origin/gh/bobrenjc93/683/base 2025-12-04T08:54:18.6166319Z * [new branch] gh/bobrenjc93/683/head -> origin/gh/bobrenjc93/683/head 2025-12-04T08:54:18.6166393Z * [new branch] gh/bobrenjc93/683/orig -> origin/gh/bobrenjc93/683/orig 2025-12-04T08:54:18.6166467Z * [new branch] gh/bobrenjc93/684/base -> origin/gh/bobrenjc93/684/base 2025-12-04T08:54:18.6166543Z * [new branch] gh/bobrenjc93/684/head -> origin/gh/bobrenjc93/684/head 2025-12-04T08:54:18.6166617Z * [new branch] gh/bobrenjc93/684/orig -> origin/gh/bobrenjc93/684/orig 2025-12-04T08:54:18.6166692Z * [new branch] gh/bobrenjc93/685/base -> origin/gh/bobrenjc93/685/base 2025-12-04T08:54:18.6166768Z * [new branch] gh/bobrenjc93/685/head -> origin/gh/bobrenjc93/685/head 2025-12-04T08:54:18.6166841Z * [new branch] gh/bobrenjc93/685/orig -> origin/gh/bobrenjc93/685/orig 2025-12-04T08:54:18.6166939Z * [new branch] gh/bobrenjc93/686/base -> origin/gh/bobrenjc93/686/base 2025-12-04T08:54:18.6167016Z * [new branch] gh/bobrenjc93/686/head -> origin/gh/bobrenjc93/686/head 2025-12-04T08:54:18.6167090Z * [new branch] gh/bobrenjc93/686/orig -> origin/gh/bobrenjc93/686/orig 2025-12-04T08:54:18.6167166Z * [new branch] gh/bobrenjc93/687/base -> origin/gh/bobrenjc93/687/base 2025-12-04T08:54:18.6167242Z * [new branch] gh/bobrenjc93/687/head -> origin/gh/bobrenjc93/687/head 2025-12-04T08:54:18.6167316Z * [new branch] gh/bobrenjc93/687/orig -> origin/gh/bobrenjc93/687/orig 2025-12-04T08:54:18.6167391Z * [new branch] gh/bobrenjc93/688/base -> origin/gh/bobrenjc93/688/base 2025-12-04T08:54:18.6167467Z * [new branch] gh/bobrenjc93/688/head -> origin/gh/bobrenjc93/688/head 2025-12-04T08:54:18.6167541Z * [new branch] gh/bobrenjc93/688/orig -> origin/gh/bobrenjc93/688/orig 2025-12-04T08:54:18.6167619Z * [new branch] gh/bobrenjc93/689/base -> origin/gh/bobrenjc93/689/base 2025-12-04T08:54:18.6167692Z * [new branch] gh/bobrenjc93/689/head -> origin/gh/bobrenjc93/689/head 2025-12-04T08:54:18.6167765Z * [new branch] gh/bobrenjc93/689/orig -> origin/gh/bobrenjc93/689/orig 2025-12-04T08:54:18.6167842Z * [new branch] gh/bobrenjc93/690/base -> origin/gh/bobrenjc93/690/base 2025-12-04T08:54:18.6167917Z * [new branch] gh/bobrenjc93/690/head -> origin/gh/bobrenjc93/690/head 2025-12-04T08:54:18.6167991Z * [new branch] gh/bobrenjc93/690/orig -> origin/gh/bobrenjc93/690/orig 2025-12-04T08:54:18.6168118Z * [new branch] gh/bobrenjc93/691/base -> origin/gh/bobrenjc93/691/base 2025-12-04T08:54:18.6168194Z * [new branch] gh/bobrenjc93/691/head -> origin/gh/bobrenjc93/691/head 2025-12-04T08:54:18.6168268Z * [new branch] gh/bobrenjc93/691/orig -> origin/gh/bobrenjc93/691/orig 2025-12-04T08:54:18.6168344Z * [new branch] gh/bobrenjc93/692/base -> origin/gh/bobrenjc93/692/base 2025-12-04T08:54:18.6168418Z * [new branch] gh/bobrenjc93/692/head -> origin/gh/bobrenjc93/692/head 2025-12-04T08:54:18.6168491Z * [new branch] gh/bobrenjc93/692/orig -> origin/gh/bobrenjc93/692/orig 2025-12-04T08:54:18.6168568Z * [new branch] gh/bobrenjc93/693/base -> origin/gh/bobrenjc93/693/base 2025-12-04T08:54:18.6168641Z * [new branch] gh/bobrenjc93/693/head -> origin/gh/bobrenjc93/693/head 2025-12-04T08:54:18.6168748Z * [new branch] gh/bobrenjc93/693/orig -> origin/gh/bobrenjc93/693/orig 2025-12-04T08:54:18.6168823Z * [new branch] gh/bobrenjc93/694/base -> origin/gh/bobrenjc93/694/base 2025-12-04T08:54:18.6168898Z * [new branch] gh/bobrenjc93/694/head -> origin/gh/bobrenjc93/694/head 2025-12-04T08:54:18.6168972Z * [new branch] gh/bobrenjc93/694/orig -> origin/gh/bobrenjc93/694/orig 2025-12-04T08:54:18.6169049Z * [new branch] gh/bobrenjc93/695/base -> origin/gh/bobrenjc93/695/base 2025-12-04T08:54:18.6169122Z * [new branch] gh/bobrenjc93/695/head -> origin/gh/bobrenjc93/695/head 2025-12-04T08:54:18.6169199Z * [new branch] gh/bobrenjc93/695/orig -> origin/gh/bobrenjc93/695/orig 2025-12-04T08:54:18.6169268Z * [new branch] gh/c00w/23/base -> origin/gh/c00w/23/base 2025-12-04T08:54:18.6169335Z * [new branch] gh/c00w/23/head -> origin/gh/c00w/23/head 2025-12-04T08:54:18.6169405Z * [new branch] gh/c00w/53/base -> origin/gh/c00w/53/base 2025-12-04T08:54:18.6169469Z * [new branch] gh/c00w/53/head -> origin/gh/c00w/53/head 2025-12-04T08:54:18.6169534Z * [new branch] gh/c00w/53/orig -> origin/gh/c00w/53/orig 2025-12-04T08:54:18.6169627Z * [new branch] gh/c00w/54/base -> origin/gh/c00w/54/base 2025-12-04T08:54:18.6169693Z * [new branch] gh/c00w/54/head -> origin/gh/c00w/54/head 2025-12-04T08:54:18.6169756Z * [new branch] gh/c00w/54/orig -> origin/gh/c00w/54/orig 2025-12-04T08:54:18.6169823Z * [new branch] gh/c00w/56/base -> origin/gh/c00w/56/base 2025-12-04T08:54:18.6169886Z * [new branch] gh/c00w/56/head -> origin/gh/c00w/56/head 2025-12-04T08:54:18.6169950Z * [new branch] gh/c00w/56/orig -> origin/gh/c00w/56/orig 2025-12-04T08:54:18.6170018Z * [new branch] gh/c00w/57/base -> origin/gh/c00w/57/base 2025-12-04T08:54:18.6170082Z * [new branch] gh/c00w/57/head -> origin/gh/c00w/57/head 2025-12-04T08:54:18.6170146Z * [new branch] gh/c00w/57/orig -> origin/gh/c00w/57/orig 2025-12-04T08:54:18.6170213Z * [new branch] gh/c00w/58/base -> origin/gh/c00w/58/base 2025-12-04T08:54:18.6170277Z * [new branch] gh/c00w/58/head -> origin/gh/c00w/58/head 2025-12-04T08:54:18.6170342Z * [new branch] gh/c00w/58/orig -> origin/gh/c00w/58/orig 2025-12-04T08:54:18.6170419Z * [new branch] gh/clee2000/1/base -> origin/gh/clee2000/1/base 2025-12-04T08:54:18.6170491Z * [new branch] gh/clee2000/1/head -> origin/gh/clee2000/1/head 2025-12-04T08:54:18.6170561Z * [new branch] gh/clee2000/1/orig -> origin/gh/clee2000/1/orig 2025-12-04T08:54:18.6170645Z * [new branch] gh/coconutruben/1/base -> origin/gh/coconutruben/1/base 2025-12-04T08:54:18.6170722Z * [new branch] gh/coconutruben/1/head -> origin/gh/coconutruben/1/head 2025-12-04T08:54:18.6170803Z * [new branch] gh/coconutruben/55/base -> origin/gh/coconutruben/55/base 2025-12-04T08:54:18.6170885Z * [new branch] gh/coconutruben/55/head -> origin/gh/coconutruben/55/head 2025-12-04T08:54:18.6170964Z * [new branch] gh/coconutruben/55/orig -> origin/gh/coconutruben/55/orig 2025-12-04T08:54:18.6171044Z * [new branch] gh/coconutruben/57/base -> origin/gh/coconutruben/57/base 2025-12-04T08:54:18.6171121Z * [new branch] gh/coconutruben/57/head -> origin/gh/coconutruben/57/head 2025-12-04T08:54:18.6171198Z * [new branch] gh/coconutruben/57/orig -> origin/gh/coconutruben/57/orig 2025-12-04T08:54:18.6171277Z * [new branch] gh/coconutruben/70/base -> origin/gh/coconutruben/70/base 2025-12-04T08:54:18.6171380Z * [new branch] gh/coconutruben/70/head -> origin/gh/coconutruben/70/head 2025-12-04T08:54:18.6171457Z * [new branch] gh/coconutruben/70/orig -> origin/gh/coconutruben/70/orig 2025-12-04T08:54:18.6171535Z * [new branch] gh/coconutruben/71/base -> origin/gh/coconutruben/71/base 2025-12-04T08:54:18.6171613Z * [new branch] gh/coconutruben/71/head -> origin/gh/coconutruben/71/head 2025-12-04T08:54:18.6171690Z * [new branch] gh/coconutruben/71/orig -> origin/gh/coconutruben/71/orig 2025-12-04T08:54:18.6171768Z * [new branch] gh/coconutruben/72/base -> origin/gh/coconutruben/72/base 2025-12-04T08:54:18.6171844Z * [new branch] gh/coconutruben/72/head -> origin/gh/coconutruben/72/head 2025-12-04T08:54:18.6171920Z * [new branch] gh/coconutruben/72/orig -> origin/gh/coconutruben/72/orig 2025-12-04T08:54:18.6171998Z * [new branch] gh/coconutruben/73/base -> origin/gh/coconutruben/73/base 2025-12-04T08:54:18.6172077Z * [new branch] gh/coconutruben/73/head -> origin/gh/coconutruben/73/head 2025-12-04T08:54:18.6172154Z * [new branch] gh/coconutruben/73/orig -> origin/gh/coconutruben/73/orig 2025-12-04T08:54:18.6172233Z * [new branch] gh/coconutruben/74/base -> origin/gh/coconutruben/74/base 2025-12-04T08:54:18.6172331Z * [new branch] gh/coconutruben/74/head -> origin/gh/coconutruben/74/head 2025-12-04T08:54:18.6172409Z * [new branch] gh/coconutruben/74/orig -> origin/gh/coconutruben/74/orig 2025-12-04T08:54:18.6172489Z * [new branch] gh/coconutruben/79/base -> origin/gh/coconutruben/79/base 2025-12-04T08:54:18.6172566Z * [new branch] gh/coconutruben/79/head -> origin/gh/coconutruben/79/head 2025-12-04T08:54:18.6172642Z * [new branch] gh/coconutruben/79/orig -> origin/gh/coconutruben/79/orig 2025-12-04T08:54:18.6172723Z * [new branch] gh/coconutruben/80/base -> origin/gh/coconutruben/80/base 2025-12-04T08:54:18.6172801Z * [new branch] gh/coconutruben/80/head -> origin/gh/coconutruben/80/head 2025-12-04T08:54:18.6172880Z * [new branch] gh/coconutruben/80/orig -> origin/gh/coconutruben/80/orig 2025-12-04T08:54:18.6172955Z * [new branch] gh/coconutruben/82/base -> origin/gh/coconutruben/82/base 2025-12-04T08:54:18.6173033Z * [new branch] gh/coconutruben/82/head -> origin/gh/coconutruben/82/head 2025-12-04T08:54:18.6173111Z * [new branch] gh/coconutruben/82/orig -> origin/gh/coconutruben/82/orig 2025-12-04T08:54:18.6173187Z * [new branch] gh/coconutruben/83/base -> origin/gh/coconutruben/83/base 2025-12-04T08:54:18.6173263Z * [new branch] gh/coconutruben/83/head -> origin/gh/coconutruben/83/head 2025-12-04T08:54:18.6173342Z * [new branch] gh/coconutruben/83/orig -> origin/gh/coconutruben/83/orig 2025-12-04T08:54:18.6173419Z * [new branch] gh/coconutruben/84/base -> origin/gh/coconutruben/84/base 2025-12-04T08:54:18.6173495Z * [new branch] gh/coconutruben/84/head -> origin/gh/coconutruben/84/head 2025-12-04T08:54:18.6173574Z * [new branch] gh/coconutruben/84/orig -> origin/gh/coconutruben/84/orig 2025-12-04T08:54:18.6173652Z * [new branch] gh/coconutruben/85/base -> origin/gh/coconutruben/85/base 2025-12-04T08:54:18.6173729Z * [new branch] gh/coconutruben/85/head -> origin/gh/coconutruben/85/head 2025-12-04T08:54:18.6173807Z * [new branch] gh/coconutruben/85/orig -> origin/gh/coconutruben/85/orig 2025-12-04T08:54:18.6173883Z * [new branch] gh/coconutruben/86/base -> origin/gh/coconutruben/86/base 2025-12-04T08:54:18.6173958Z * [new branch] gh/coconutruben/86/head -> origin/gh/coconutruben/86/head 2025-12-04T08:54:18.6174037Z * [new branch] gh/coconutruben/86/orig -> origin/gh/coconutruben/86/orig 2025-12-04T08:54:18.6174135Z * [new branch] gh/colinchan15/1/base -> origin/gh/colinchan15/1/base 2025-12-04T08:54:18.6174212Z * [new branch] gh/colinchan15/1/head -> origin/gh/colinchan15/1/head 2025-12-04T08:54:18.6174292Z * [new branch] gh/colinchan15/2/base -> origin/gh/colinchan15/2/base 2025-12-04T08:54:18.6174368Z * [new branch] gh/colinchan15/2/head -> origin/gh/colinchan15/2/head 2025-12-04T08:54:18.6174446Z * [new branch] gh/colinchan15/3/base -> origin/gh/colinchan15/3/base 2025-12-04T08:54:18.6174520Z * [new branch] gh/colinchan15/3/head -> origin/gh/colinchan15/3/head 2025-12-04T08:54:18.6174593Z * [new branch] gh/colinchan15/6/base -> origin/gh/colinchan15/6/base 2025-12-04T08:54:18.6174670Z * [new branch] gh/colinchan15/6/head -> origin/gh/colinchan15/6/head 2025-12-04T08:54:18.6174738Z * [new branch] gh/d4l3k/1/base -> origin/gh/d4l3k/1/base 2025-12-04T08:54:18.6174805Z * [new branch] gh/d4l3k/1/head -> origin/gh/d4l3k/1/head 2025-12-04T08:54:18.6174874Z * [new branch] gh/d4l3k/2/base -> origin/gh/d4l3k/2/base 2025-12-04T08:54:18.6174939Z * [new branch] gh/d4l3k/2/head -> origin/gh/d4l3k/2/head 2025-12-04T08:54:18.6175036Z * [new branch] gh/d4l3k/2/orig -> origin/gh/d4l3k/2/orig 2025-12-04T08:54:18.6175104Z * [new branch] gh/d4l3k/3/base -> origin/gh/d4l3k/3/base 2025-12-04T08:54:18.6175167Z * [new branch] gh/d4l3k/3/head -> origin/gh/d4l3k/3/head 2025-12-04T08:54:18.6175231Z * [new branch] gh/d4l3k/3/orig -> origin/gh/d4l3k/3/orig 2025-12-04T08:54:18.6175299Z * [new branch] gh/d4l3k/4/base -> origin/gh/d4l3k/4/base 2025-12-04T08:54:18.6175363Z * [new branch] gh/d4l3k/4/head -> origin/gh/d4l3k/4/head 2025-12-04T08:54:18.6175429Z * [new branch] gh/d4l3k/4/orig -> origin/gh/d4l3k/4/orig 2025-12-04T08:54:18.6175496Z * [new branch] gh/d4l3k/5/base -> origin/gh/d4l3k/5/base 2025-12-04T08:54:18.6175561Z * [new branch] gh/d4l3k/5/orig -> origin/gh/d4l3k/5/orig 2025-12-04T08:54:18.6175650Z * [new branch] gh/davidberard98/392/base -> origin/gh/davidberard98/392/base 2025-12-04T08:54:18.6175740Z * [new branch] gh/davidberard98/392/head -> origin/gh/davidberard98/392/head 2025-12-04T08:54:18.6175824Z * [new branch] gh/davidberard98/392/orig -> origin/gh/davidberard98/392/orig 2025-12-04T08:54:18.6175906Z * [new branch] gh/davidberard98/399/base -> origin/gh/davidberard98/399/base 2025-12-04T08:54:18.6175991Z * [new branch] gh/davidberard98/399/head -> origin/gh/davidberard98/399/head 2025-12-04T08:54:18.6176073Z * [new branch] gh/davidberard98/399/orig -> origin/gh/davidberard98/399/orig 2025-12-04T08:54:18.6176154Z * [new branch] gh/desertfire/605/base -> origin/gh/desertfire/605/base 2025-12-04T08:54:18.6176230Z * [new branch] gh/desertfire/605/head -> origin/gh/desertfire/605/head 2025-12-04T08:54:18.6176305Z * [new branch] gh/desertfire/605/orig -> origin/gh/desertfire/605/orig 2025-12-04T08:54:18.6176384Z * [new branch] gh/desertfire/606/base -> origin/gh/desertfire/606/base 2025-12-04T08:54:18.6176458Z * [new branch] gh/desertfire/606/head -> origin/gh/desertfire/606/head 2025-12-04T08:54:18.6176531Z * [new branch] gh/desertfire/606/orig -> origin/gh/desertfire/606/orig 2025-12-04T08:54:18.6176608Z * [new branch] gh/desertfire/607/base -> origin/gh/desertfire/607/base 2025-12-04T08:54:18.6176682Z * [new branch] gh/desertfire/607/head -> origin/gh/desertfire/607/head 2025-12-04T08:54:18.6176779Z * [new branch] gh/desertfire/607/orig -> origin/gh/desertfire/607/orig 2025-12-04T08:54:18.6176856Z * [new branch] gh/desertfire/608/base -> origin/gh/desertfire/608/base 2025-12-04T08:54:18.6176929Z * [new branch] gh/desertfire/608/head -> origin/gh/desertfire/608/head 2025-12-04T08:54:18.6177003Z * [new branch] gh/desertfire/608/orig -> origin/gh/desertfire/608/orig 2025-12-04T08:54:18.6177082Z * [new branch] gh/desertfire/609/base -> origin/gh/desertfire/609/base 2025-12-04T08:54:18.6177156Z * [new branch] gh/desertfire/609/head -> origin/gh/desertfire/609/head 2025-12-04T08:54:18.6177229Z * [new branch] gh/desertfire/609/orig -> origin/gh/desertfire/609/orig 2025-12-04T08:54:18.6177306Z * [new branch] gh/desertfire/610/base -> origin/gh/desertfire/610/base 2025-12-04T08:54:18.6177380Z * [new branch] gh/desertfire/610/head -> origin/gh/desertfire/610/head 2025-12-04T08:54:18.6177455Z * [new branch] gh/desertfire/610/orig -> origin/gh/desertfire/610/orig 2025-12-04T08:54:18.6177532Z * [new branch] gh/desertfire/611/base -> origin/gh/desertfire/611/base 2025-12-04T08:54:18.6177605Z * [new branch] gh/desertfire/611/head -> origin/gh/desertfire/611/head 2025-12-04T08:54:18.6177681Z * [new branch] gh/desertfire/611/orig -> origin/gh/desertfire/611/orig 2025-12-04T08:54:18.6177778Z * [new branch] gh/desertfire/612/base -> origin/gh/desertfire/612/base 2025-12-04T08:54:18.6177853Z * [new branch] gh/desertfire/612/head -> origin/gh/desertfire/612/head 2025-12-04T08:54:18.6177929Z * [new branch] gh/desertfire/612/orig -> origin/gh/desertfire/612/orig 2025-12-04T08:54:18.6178003Z * [new branch] gh/desertfire/613/base -> origin/gh/desertfire/613/base 2025-12-04T08:54:18.6178130Z * [new branch] gh/desertfire/613/head -> origin/gh/desertfire/613/head 2025-12-04T08:54:18.6178211Z * [new branch] gh/desertfire/613/orig -> origin/gh/desertfire/613/orig 2025-12-04T08:54:18.6178285Z * [new branch] gh/desertfire/614/base -> origin/gh/desertfire/614/base 2025-12-04T08:54:18.6178359Z * [new branch] gh/desertfire/614/head -> origin/gh/desertfire/614/head 2025-12-04T08:54:18.6178437Z * [new branch] gh/desertfire/614/orig -> origin/gh/desertfire/614/orig 2025-12-04T08:54:18.6178510Z * [new branch] gh/desertfire/615/base -> origin/gh/desertfire/615/base 2025-12-04T08:54:18.6178584Z * [new branch] gh/desertfire/615/head -> origin/gh/desertfire/615/head 2025-12-04T08:54:18.6178660Z * [new branch] gh/desertfire/615/orig -> origin/gh/desertfire/615/orig 2025-12-04T08:54:18.6178734Z * [new branch] gh/desertfire/616/base -> origin/gh/desertfire/616/base 2025-12-04T08:54:18.6178810Z * [new branch] gh/desertfire/616/head -> origin/gh/desertfire/616/head 2025-12-04T08:54:18.6178889Z * [new branch] gh/desertfire/616/orig -> origin/gh/desertfire/616/orig 2025-12-04T08:54:18.6178963Z * [new branch] gh/desertfire/617/base -> origin/gh/desertfire/617/base 2025-12-04T08:54:18.6179037Z * [new branch] gh/desertfire/617/head -> origin/gh/desertfire/617/head 2025-12-04T08:54:18.6179115Z * [new branch] gh/desertfire/617/orig -> origin/gh/desertfire/617/orig 2025-12-04T08:54:18.6179187Z * [new branch] gh/dharakk/1/base -> origin/gh/dharakk/1/base 2025-12-04T08:54:18.6179258Z * [new branch] gh/dharakk/1/head -> origin/gh/dharakk/1/head 2025-12-04T08:54:18.6179334Z * [new branch] gh/drisspg/170/base -> origin/gh/drisspg/170/base 2025-12-04T08:54:18.6179405Z * [new branch] gh/drisspg/170/head -> origin/gh/drisspg/170/head 2025-12-04T08:54:18.6179479Z * [new branch] gh/drisspg/170/orig -> origin/gh/drisspg/170/orig 2025-12-04T08:54:18.6179580Z * [new branch] gh/drisspg/182/base -> origin/gh/drisspg/182/base 2025-12-04T08:54:18.6179651Z * [new branch] gh/drisspg/182/head -> origin/gh/drisspg/182/head 2025-12-04T08:54:18.6179723Z * [new branch] gh/drisspg/183/base -> origin/gh/drisspg/183/base 2025-12-04T08:54:18.6179795Z * [new branch] gh/drisspg/183/head -> origin/gh/drisspg/183/head 2025-12-04T08:54:18.6179865Z * [new branch] gh/drisspg/184/base -> origin/gh/drisspg/184/base 2025-12-04T08:54:18.6179938Z * [new branch] gh/drisspg/184/head -> origin/gh/drisspg/184/head 2025-12-04T08:54:18.6180008Z * [new branch] gh/drisspg/185/base -> origin/gh/drisspg/185/base 2025-12-04T08:54:18.6180078Z * [new branch] gh/drisspg/185/head -> origin/gh/drisspg/185/head 2025-12-04T08:54:18.6180151Z * [new branch] gh/drisspg/194/base -> origin/gh/drisspg/194/base 2025-12-04T08:54:18.6180222Z * [new branch] gh/drisspg/194/head -> origin/gh/drisspg/194/head 2025-12-04T08:54:18.6180292Z * [new branch] gh/drisspg/194/orig -> origin/gh/drisspg/194/orig 2025-12-04T08:54:18.6180366Z * [new branch] gh/drisspg/200/base -> origin/gh/drisspg/200/base 2025-12-04T08:54:18.6180470Z * [new branch] gh/drisspg/200/head -> origin/gh/drisspg/200/head 2025-12-04T08:54:18.6180541Z * [new branch] gh/drisspg/200/orig -> origin/gh/drisspg/200/orig 2025-12-04T08:54:18.6180612Z * [new branch] gh/drisspg/218/base -> origin/gh/drisspg/218/base 2025-12-04T08:54:18.6180682Z * [new branch] gh/drisspg/218/head -> origin/gh/drisspg/218/head 2025-12-04T08:54:18.6180752Z * [new branch] gh/drisspg/218/orig -> origin/gh/drisspg/218/orig 2025-12-04T08:54:18.6180824Z * [new branch] gh/drisspg/219/base -> origin/gh/drisspg/219/base 2025-12-04T08:54:18.6180895Z * [new branch] gh/drisspg/219/head -> origin/gh/drisspg/219/head 2025-12-04T08:54:18.6180965Z * [new branch] gh/drisspg/219/orig -> origin/gh/drisspg/219/orig 2025-12-04T08:54:18.6181037Z * [new branch] gh/drisspg/220/base -> origin/gh/drisspg/220/base 2025-12-04T08:54:18.6181108Z * [new branch] gh/drisspg/220/head -> origin/gh/drisspg/220/head 2025-12-04T08:54:18.6181179Z * [new branch] gh/drisspg/220/orig -> origin/gh/drisspg/220/orig 2025-12-04T08:54:18.6181250Z * [new branch] gh/drisspg/221/base -> origin/gh/drisspg/221/base 2025-12-04T08:54:18.6181319Z * [new branch] gh/drisspg/221/head -> origin/gh/drisspg/221/head 2025-12-04T08:54:18.6181391Z * [new branch] gh/drisspg/221/orig -> origin/gh/drisspg/221/orig 2025-12-04T08:54:18.6181460Z * [new branch] gh/drisspg/222/base -> origin/gh/drisspg/222/base 2025-12-04T08:54:18.6181531Z * [new branch] gh/drisspg/222/head -> origin/gh/drisspg/222/head 2025-12-04T08:54:18.6181602Z * [new branch] gh/drisspg/222/orig -> origin/gh/drisspg/222/orig 2025-12-04T08:54:18.6181671Z * [new branch] gh/drisspg/223/base -> origin/gh/drisspg/223/base 2025-12-04T08:54:18.6181742Z * [new branch] gh/drisspg/223/head -> origin/gh/drisspg/223/head 2025-12-04T08:54:18.6181814Z * [new branch] gh/drisspg/223/orig -> origin/gh/drisspg/223/orig 2025-12-04T08:54:18.6181886Z * [new branch] gh/drisspg/224/base -> origin/gh/drisspg/224/base 2025-12-04T08:54:18.6181957Z * [new branch] gh/drisspg/224/head -> origin/gh/drisspg/224/head 2025-12-04T08:54:18.6182028Z * [new branch] gh/drisspg/224/orig -> origin/gh/drisspg/224/orig 2025-12-04T08:54:18.6182098Z * [new branch] gh/drisspg/225/base -> origin/gh/drisspg/225/base 2025-12-04T08:54:18.6182190Z * [new branch] gh/drisspg/225/head -> origin/gh/drisspg/225/head 2025-12-04T08:54:18.6182263Z * [new branch] gh/drisspg/225/orig -> origin/gh/drisspg/225/orig 2025-12-04T08:54:18.6182333Z * [new branch] gh/drisspg/226/base -> origin/gh/drisspg/226/base 2025-12-04T08:54:18.6182404Z * [new branch] gh/drisspg/226/head -> origin/gh/drisspg/226/head 2025-12-04T08:54:18.6182475Z * [new branch] gh/drisspg/226/orig -> origin/gh/drisspg/226/orig 2025-12-04T08:54:18.6182545Z * [new branch] gh/drisspg/227/base -> origin/gh/drisspg/227/base 2025-12-04T08:54:18.6182616Z * [new branch] gh/drisspg/227/head -> origin/gh/drisspg/227/head 2025-12-04T08:54:18.6182688Z * [new branch] gh/drisspg/227/orig -> origin/gh/drisspg/227/orig 2025-12-04T08:54:18.6182759Z * [new branch] gh/drisspg/228/base -> origin/gh/drisspg/228/base 2025-12-04T08:54:18.6182832Z * [new branch] gh/drisspg/228/head -> origin/gh/drisspg/228/head 2025-12-04T08:54:18.6182903Z * [new branch] gh/drisspg/228/orig -> origin/gh/drisspg/228/orig 2025-12-04T08:54:18.6182972Z * [new branch] gh/drisspg/229/base -> origin/gh/drisspg/229/base 2025-12-04T08:54:18.6183065Z * [new branch] gh/drisspg/229/head -> origin/gh/drisspg/229/head 2025-12-04T08:54:18.6183136Z * [new branch] gh/drisspg/229/orig -> origin/gh/drisspg/229/orig 2025-12-04T08:54:18.6183205Z * [new branch] gh/drisspg/230/base -> origin/gh/drisspg/230/base 2025-12-04T08:54:18.6183277Z * [new branch] gh/drisspg/230/head -> origin/gh/drisspg/230/head 2025-12-04T08:54:18.6183347Z * [new branch] gh/drisspg/230/orig -> origin/gh/drisspg/230/orig 2025-12-04T08:54:18.6183422Z * [new branch] gh/dsjohns2/1/base -> origin/gh/dsjohns2/1/base 2025-12-04T08:54:18.6183497Z * [new branch] gh/dsjohns2/1/head -> origin/gh/dsjohns2/1/head 2025-12-04T08:54:18.6183577Z * [new branch] gh/dzmitry-huba/1/base -> origin/gh/dzmitry-huba/1/base 2025-12-04T08:54:18.6183654Z * [new branch] gh/dzmitry-huba/1/head -> origin/gh/dzmitry-huba/1/head 2025-12-04T08:54:18.6183735Z * [new branch] gh/dzmitry-huba/12/base -> origin/gh/dzmitry-huba/12/base 2025-12-04T08:54:18.6183812Z * [new branch] gh/dzmitry-huba/12/head -> origin/gh/dzmitry-huba/12/head 2025-12-04T08:54:18.6183888Z * [new branch] gh/dzmitry-huba/12/orig -> origin/gh/dzmitry-huba/12/orig 2025-12-04T08:54:18.6183967Z * [new branch] gh/dzmitry-huba/13/base -> origin/gh/dzmitry-huba/13/base 2025-12-04T08:54:18.6184043Z * [new branch] gh/dzmitry-huba/13/head -> origin/gh/dzmitry-huba/13/head 2025-12-04T08:54:18.6184119Z * [new branch] gh/dzmitry-huba/13/orig -> origin/gh/dzmitry-huba/13/orig 2025-12-04T08:54:18.6184196Z * [new branch] gh/dzmitry-huba/14/base -> origin/gh/dzmitry-huba/14/base 2025-12-04T08:54:18.6184271Z * [new branch] gh/dzmitry-huba/14/head -> origin/gh/dzmitry-huba/14/head 2025-12-04T08:54:18.6184347Z * [new branch] gh/dzmitry-huba/14/orig -> origin/gh/dzmitry-huba/14/orig 2025-12-04T08:54:18.6184426Z * [new branch] gh/dzmitry-huba/15/base -> origin/gh/dzmitry-huba/15/base 2025-12-04T08:54:18.6184501Z * [new branch] gh/dzmitry-huba/15/head -> origin/gh/dzmitry-huba/15/head 2025-12-04T08:54:18.6184578Z * [new branch] gh/dzmitry-huba/15/orig -> origin/gh/dzmitry-huba/15/orig 2025-12-04T08:54:18.6184654Z * [new branch] gh/dzmitry-huba/16/base -> origin/gh/dzmitry-huba/16/base 2025-12-04T08:54:18.6184730Z * [new branch] gh/dzmitry-huba/16/head -> origin/gh/dzmitry-huba/16/head 2025-12-04T08:54:18.6184836Z * [new branch] gh/dzmitry-huba/16/orig -> origin/gh/dzmitry-huba/16/orig 2025-12-04T08:54:18.6184912Z * [new branch] gh/dzmitry-huba/17/base -> origin/gh/dzmitry-huba/17/base 2025-12-04T08:54:18.6184987Z * [new branch] gh/dzmitry-huba/17/head -> origin/gh/dzmitry-huba/17/head 2025-12-04T08:54:18.6185065Z * [new branch] gh/dzmitry-huba/17/orig -> origin/gh/dzmitry-huba/17/orig 2025-12-04T08:54:18.6185142Z * [new branch] gh/dzmitry-huba/2/base -> origin/gh/dzmitry-huba/2/base 2025-12-04T08:54:18.6185218Z * [new branch] gh/dzmitry-huba/2/head -> origin/gh/dzmitry-huba/2/head 2025-12-04T08:54:18.6185296Z * [new branch] gh/dzmitry-huba/3/base -> origin/gh/dzmitry-huba/3/base 2025-12-04T08:54:18.6185371Z * [new branch] gh/dzmitry-huba/3/head -> origin/gh/dzmitry-huba/3/head 2025-12-04T08:54:18.6185446Z * [new branch] gh/eellison/808/base -> origin/gh/eellison/808/base 2025-12-04T08:54:18.6185526Z * [new branch] gh/eellison/808/head -> origin/gh/eellison/808/head 2025-12-04T08:54:18.6185598Z * [new branch] gh/eellison/808/orig -> origin/gh/eellison/808/orig 2025-12-04T08:54:18.6185671Z * [new branch] gh/eellison/822/base -> origin/gh/eellison/822/base 2025-12-04T08:54:18.6185765Z * [new branch] gh/eellison/822/head -> origin/gh/eellison/822/head 2025-12-04T08:54:18.6185836Z * [new branch] gh/eellison/822/orig -> origin/gh/eellison/822/orig 2025-12-04T08:54:18.6185907Z * [new branch] gh/eellison/823/base -> origin/gh/eellison/823/base 2025-12-04T08:54:18.6185980Z * [new branch] gh/eellison/823/head -> origin/gh/eellison/823/head 2025-12-04T08:54:18.6186050Z * [new branch] gh/eellison/823/orig -> origin/gh/eellison/823/orig 2025-12-04T08:54:18.6186123Z * [new branch] gh/eellison/862/base -> origin/gh/eellison/862/base 2025-12-04T08:54:18.6186195Z * [new branch] gh/eellison/862/head -> origin/gh/eellison/862/head 2025-12-04T08:54:18.6186267Z * [new branch] gh/eellison/862/orig -> origin/gh/eellison/862/orig 2025-12-04T08:54:18.6186340Z * [new branch] gh/eellison/863/base -> origin/gh/eellison/863/base 2025-12-04T08:54:18.6186413Z * [new branch] gh/eellison/863/head -> origin/gh/eellison/863/head 2025-12-04T08:54:18.6186485Z * [new branch] gh/eellison/863/orig -> origin/gh/eellison/863/orig 2025-12-04T08:54:18.6186558Z * [new branch] gh/eellison/864/base -> origin/gh/eellison/864/base 2025-12-04T08:54:18.6186629Z * [new branch] gh/eellison/864/head -> origin/gh/eellison/864/head 2025-12-04T08:54:18.6186700Z * [new branch] gh/eellison/864/orig -> origin/gh/eellison/864/orig 2025-12-04T08:54:18.6186773Z * [new branch] gh/eellison/865/base -> origin/gh/eellison/865/base 2025-12-04T08:54:18.6186846Z * [new branch] gh/eellison/865/head -> origin/gh/eellison/865/head 2025-12-04T08:54:18.6186917Z * [new branch] gh/eellison/865/orig -> origin/gh/eellison/865/orig 2025-12-04T08:54:18.6186990Z * [new branch] gh/eellison/866/base -> origin/gh/eellison/866/base 2025-12-04T08:54:18.6187062Z * [new branch] gh/eellison/866/head -> origin/gh/eellison/866/head 2025-12-04T08:54:18.6187133Z * [new branch] gh/eellison/866/orig -> origin/gh/eellison/866/orig 2025-12-04T08:54:18.6187207Z * [new branch] gh/eellison/867/base -> origin/gh/eellison/867/base 2025-12-04T08:54:18.6187278Z * [new branch] gh/eellison/867/head -> origin/gh/eellison/867/head 2025-12-04T08:54:18.6187348Z * [new branch] gh/eellison/867/orig -> origin/gh/eellison/867/orig 2025-12-04T08:54:18.6187445Z * [new branch] gh/eellison/868/base -> origin/gh/eellison/868/base 2025-12-04T08:54:18.6187516Z * [new branch] gh/eellison/868/head -> origin/gh/eellison/868/head 2025-12-04T08:54:18.6187587Z * [new branch] gh/eellison/868/orig -> origin/gh/eellison/868/orig 2025-12-04T08:54:18.6187661Z * [new branch] gh/eellison/869/base -> origin/gh/eellison/869/base 2025-12-04T08:54:18.6187733Z * [new branch] gh/eellison/869/head -> origin/gh/eellison/869/head 2025-12-04T08:54:18.6187807Z * [new branch] gh/eellison/869/orig -> origin/gh/eellison/869/orig 2025-12-04T08:54:18.6187878Z * [new branch] gh/eellison/870/base -> origin/gh/eellison/870/base 2025-12-04T08:54:18.6187950Z * [new branch] gh/eellison/870/head -> origin/gh/eellison/870/head 2025-12-04T08:54:18.6188023Z * [new branch] gh/eellison/870/orig -> origin/gh/eellison/870/orig 2025-12-04T08:54:18.6188151Z * [new branch] gh/eellison/871/base -> origin/gh/eellison/871/base 2025-12-04T08:54:18.6188224Z * [new branch] gh/eellison/871/head -> origin/gh/eellison/871/head 2025-12-04T08:54:18.6188298Z * [new branch] gh/eellison/871/orig -> origin/gh/eellison/871/orig 2025-12-04T08:54:18.6188368Z * [new branch] gh/eellison/872/base -> origin/gh/eellison/872/base 2025-12-04T08:54:18.6188467Z * [new branch] gh/eellison/872/head -> origin/gh/eellison/872/head 2025-12-04T08:54:18.6188542Z * [new branch] gh/eellison/872/orig -> origin/gh/eellison/872/orig 2025-12-04T08:54:18.6188612Z * [new branch] gh/eellison/873/base -> origin/gh/eellison/873/base 2025-12-04T08:54:18.6188684Z * [new branch] gh/eellison/873/head -> origin/gh/eellison/873/head 2025-12-04T08:54:18.6188757Z * [new branch] gh/eellison/873/orig -> origin/gh/eellison/873/orig 2025-12-04T08:54:18.6188831Z * [new branch] gh/eellison/874/base -> origin/gh/eellison/874/base 2025-12-04T08:54:18.6188903Z * [new branch] gh/eellison/874/head -> origin/gh/eellison/874/head 2025-12-04T08:54:18.6188978Z * [new branch] gh/eellison/874/orig -> origin/gh/eellison/874/orig 2025-12-04T08:54:18.6189049Z * [new branch] gh/eellison/875/base -> origin/gh/eellison/875/base 2025-12-04T08:54:18.6189122Z * [new branch] gh/eellison/875/head -> origin/gh/eellison/875/head 2025-12-04T08:54:18.6189195Z * [new branch] gh/eellison/875/orig -> origin/gh/eellison/875/orig 2025-12-04T08:54:18.6189266Z * [new branch] gh/eellison/876/base -> origin/gh/eellison/876/base 2025-12-04T08:54:18.6189339Z * [new branch] gh/eellison/876/head -> origin/gh/eellison/876/head 2025-12-04T08:54:18.6189411Z * [new branch] gh/eellison/876/orig -> origin/gh/eellison/876/orig 2025-12-04T08:54:18.6189485Z * [new branch] gh/eellison/877/base -> origin/gh/eellison/877/base 2025-12-04T08:54:18.6189559Z * [new branch] gh/eellison/877/head -> origin/gh/eellison/877/head 2025-12-04T08:54:18.6189630Z * [new branch] gh/eellison/877/orig -> origin/gh/eellison/877/orig 2025-12-04T08:54:18.6189703Z * [new branch] gh/eellison/878/base -> origin/gh/eellison/878/base 2025-12-04T08:54:18.6189776Z * [new branch] gh/eellison/878/head -> origin/gh/eellison/878/head 2025-12-04T08:54:18.6189846Z * [new branch] gh/eellison/878/orig -> origin/gh/eellison/878/orig 2025-12-04T08:54:18.6189918Z * [new branch] gh/eellison/879/base -> origin/gh/eellison/879/base 2025-12-04T08:54:18.6189991Z * [new branch] gh/eellison/879/head -> origin/gh/eellison/879/head 2025-12-04T08:54:18.6190061Z * [new branch] gh/eellison/879/orig -> origin/gh/eellison/879/orig 2025-12-04T08:54:18.6190161Z * [new branch] gh/eellison/880/base -> origin/gh/eellison/880/base 2025-12-04T08:54:18.6190235Z * [new branch] gh/eellison/880/head -> origin/gh/eellison/880/head 2025-12-04T08:54:18.6190305Z * [new branch] gh/eellison/880/orig -> origin/gh/eellison/880/orig 2025-12-04T08:54:18.6190378Z * [new branch] gh/eellison/881/base -> origin/gh/eellison/881/base 2025-12-04T08:54:18.6190451Z * [new branch] gh/eellison/881/head -> origin/gh/eellison/881/head 2025-12-04T08:54:18.6190524Z * [new branch] gh/eellison/881/orig -> origin/gh/eellison/881/orig 2025-12-04T08:54:18.6190594Z * [new branch] gh/eellison/882/base -> origin/gh/eellison/882/base 2025-12-04T08:54:18.6190667Z * [new branch] gh/eellison/882/head -> origin/gh/eellison/882/head 2025-12-04T08:54:18.6190739Z * [new branch] gh/eellison/882/orig -> origin/gh/eellison/882/orig 2025-12-04T08:54:18.6190813Z * [new branch] gh/eellison/883/base -> origin/gh/eellison/883/base 2025-12-04T08:54:18.6190883Z * [new branch] gh/eellison/883/head -> origin/gh/eellison/883/head 2025-12-04T08:54:18.6190954Z * [new branch] gh/eellison/883/orig -> origin/gh/eellison/883/orig 2025-12-04T08:54:18.6191049Z * [new branch] gh/eellison/884/base -> origin/gh/eellison/884/base 2025-12-04T08:54:18.6191121Z * [new branch] gh/eellison/884/head -> origin/gh/eellison/884/head 2025-12-04T08:54:18.6191193Z * [new branch] gh/eellison/884/orig -> origin/gh/eellison/884/orig 2025-12-04T08:54:18.6191264Z * [new branch] gh/etaf/147/base -> origin/gh/etaf/147/base 2025-12-04T08:54:18.6191331Z * [new branch] gh/etaf/147/head -> origin/gh/etaf/147/head 2025-12-04T08:54:18.6191397Z * [new branch] gh/etaf/154/base -> origin/gh/etaf/154/base 2025-12-04T08:54:18.6191466Z * [new branch] gh/etaf/154/head -> origin/gh/etaf/154/head 2025-12-04T08:54:18.6191531Z * [new branch] gh/etaf/154/orig -> origin/gh/etaf/154/orig 2025-12-04T08:54:18.6191595Z * [new branch] gh/etaf/156/base -> origin/gh/etaf/156/base 2025-12-04T08:54:18.6191666Z * [new branch] gh/etaf/156/head -> origin/gh/etaf/156/head 2025-12-04T08:54:18.6191731Z * [new branch] gh/etaf/156/orig -> origin/gh/etaf/156/orig 2025-12-04T08:54:18.6191796Z * [new branch] gh/etaf/157/base -> origin/gh/etaf/157/base 2025-12-04T08:54:18.6191863Z * [new branch] gh/etaf/157/head -> origin/gh/etaf/157/head 2025-12-04T08:54:18.6191927Z * [new branch] gh/etaf/157/orig -> origin/gh/etaf/157/orig 2025-12-04T08:54:18.6191992Z * [new branch] gh/etaf/158/base -> origin/gh/etaf/158/base 2025-12-04T08:54:18.6192061Z * [new branch] gh/etaf/158/head -> origin/gh/etaf/158/head 2025-12-04T08:54:18.6192126Z * [new branch] gh/etaf/158/orig -> origin/gh/etaf/158/orig 2025-12-04T08:54:18.6192190Z * [new branch] gh/etaf/159/base -> origin/gh/etaf/159/base 2025-12-04T08:54:18.6192258Z * [new branch] gh/etaf/159/head -> origin/gh/etaf/159/head 2025-12-04T08:54:18.6192323Z * [new branch] gh/etaf/159/orig -> origin/gh/etaf/159/orig 2025-12-04T08:54:18.6192392Z * [new branch] gh/etaf/160/base -> origin/gh/etaf/160/base 2025-12-04T08:54:18.6192458Z * [new branch] gh/etaf/160/head -> origin/gh/etaf/160/head 2025-12-04T08:54:18.6192524Z * [new branch] gh/etaf/160/orig -> origin/gh/etaf/160/orig 2025-12-04T08:54:18.6192591Z * [new branch] gh/etaf/161/base -> origin/gh/etaf/161/base 2025-12-04T08:54:18.6192679Z * [new branch] gh/etaf/161/head -> origin/gh/etaf/161/head 2025-12-04T08:54:18.6192744Z * [new branch] gh/etaf/161/orig -> origin/gh/etaf/161/orig 2025-12-04T08:54:18.6192813Z * [new branch] gh/etaf/166/base -> origin/gh/etaf/166/base 2025-12-04T08:54:18.6192877Z * [new branch] gh/etaf/166/head -> origin/gh/etaf/166/head 2025-12-04T08:54:18.6192946Z * [new branch] gh/etaf/166/orig -> origin/gh/etaf/166/orig 2025-12-04T08:54:18.6193013Z * [new branch] gh/etaf/167/base -> origin/gh/etaf/167/base 2025-12-04T08:54:18.6193078Z * [new branch] gh/etaf/167/head -> origin/gh/etaf/167/head 2025-12-04T08:54:18.6193143Z * [new branch] gh/etaf/167/orig -> origin/gh/etaf/167/orig 2025-12-04T08:54:18.6193210Z * [new branch] gh/etaf/168/base -> origin/gh/etaf/168/base 2025-12-04T08:54:18.6193275Z * [new branch] gh/etaf/168/head -> origin/gh/etaf/168/head 2025-12-04T08:54:18.6193341Z * [new branch] gh/etaf/168/orig -> origin/gh/etaf/168/orig 2025-12-04T08:54:18.6193409Z * [new branch] gh/etaf/172/base -> origin/gh/etaf/172/base 2025-12-04T08:54:18.6193475Z * [new branch] gh/etaf/172/head -> origin/gh/etaf/172/head 2025-12-04T08:54:18.6193561Z * [new branch] gh/etaf/172/orig -> origin/gh/etaf/172/orig 2025-12-04T08:54:18.6193629Z * [new branch] gh/etaf/173/base -> origin/gh/etaf/173/base 2025-12-04T08:54:18.6193694Z * [new branch] gh/etaf/173/head -> origin/gh/etaf/173/head 2025-12-04T08:54:18.6193759Z * [new branch] gh/etaf/173/orig -> origin/gh/etaf/173/orig 2025-12-04T08:54:18.6193826Z * [new branch] gh/etaf/174/base -> origin/gh/etaf/174/base 2025-12-04T08:54:18.6193891Z * [new branch] gh/etaf/174/head -> origin/gh/etaf/174/head 2025-12-04T08:54:18.6193961Z * [new branch] gh/etaf/175/base -> origin/gh/etaf/175/base 2025-12-04T08:54:18.6194027Z * [new branch] gh/etaf/175/head -> origin/gh/etaf/175/head 2025-12-04T08:54:18.6194092Z * [new branch] gh/etaf/175/orig -> origin/gh/etaf/175/orig 2025-12-04T08:54:18.6194160Z * [new branch] gh/etaf/176/base -> origin/gh/etaf/176/base 2025-12-04T08:54:18.6194225Z * [new branch] gh/etaf/176/head -> origin/gh/etaf/176/head 2025-12-04T08:54:18.6194291Z * [new branch] gh/etaf/176/orig -> origin/gh/etaf/176/orig 2025-12-04T08:54:18.6194358Z * [new branch] gh/etaf/177/base -> origin/gh/etaf/177/base 2025-12-04T08:54:18.6194423Z * [new branch] gh/etaf/177/head -> origin/gh/etaf/177/head 2025-12-04T08:54:18.6194489Z * [new branch] gh/etaf/177/orig -> origin/gh/etaf/177/orig 2025-12-04T08:54:18.6194557Z * [new branch] gh/etaf/178/base -> origin/gh/etaf/178/base 2025-12-04T08:54:18.6194622Z * [new branch] gh/etaf/178/head -> origin/gh/etaf/178/head 2025-12-04T08:54:18.6194687Z * [new branch] gh/etaf/178/orig -> origin/gh/etaf/178/orig 2025-12-04T08:54:18.6194754Z * [new branch] gh/etaf/179/base -> origin/gh/etaf/179/base 2025-12-04T08:54:18.6194820Z * [new branch] gh/etaf/179/head -> origin/gh/etaf/179/head 2025-12-04T08:54:18.6194884Z * [new branch] gh/etaf/179/orig -> origin/gh/etaf/179/orig 2025-12-04T08:54:18.6194953Z * [new branch] gh/etaf/180/base -> origin/gh/etaf/180/base 2025-12-04T08:54:18.6195017Z * [new branch] gh/etaf/180/head -> origin/gh/etaf/180/head 2025-12-04T08:54:18.6195082Z * [new branch] gh/etaf/180/orig -> origin/gh/etaf/180/orig 2025-12-04T08:54:18.6195194Z * [new branch] gh/exclamaforte/1/base -> origin/gh/exclamaforte/1/base 2025-12-04T08:54:18.6195272Z * [new branch] gh/exclamaforte/1/head -> origin/gh/exclamaforte/1/head 2025-12-04T08:54:18.6195350Z * [new branch] gh/exclamaforte/2/base -> origin/gh/exclamaforte/2/base 2025-12-04T08:54:18.6195429Z * [new branch] gh/exclamaforte/2/head -> origin/gh/exclamaforte/2/head 2025-12-04T08:54:18.6195505Z * [new branch] gh/exclamaforte/3/base -> origin/gh/exclamaforte/3/base 2025-12-04T08:54:18.6195583Z * [new branch] gh/exclamaforte/3/head -> origin/gh/exclamaforte/3/head 2025-12-04T08:54:18.6195659Z * [new branch] gh/exclamaforte/4/base -> origin/gh/exclamaforte/4/base 2025-12-04T08:54:18.6195734Z * [new branch] gh/exclamaforte/4/head -> origin/gh/exclamaforte/4/head 2025-12-04T08:54:18.6195807Z * [new branch] gh/ezyang/2374/base -> origin/gh/ezyang/2374/base 2025-12-04T08:54:18.6195879Z * [new branch] gh/ezyang/2374/head -> origin/gh/ezyang/2374/head 2025-12-04T08:54:18.6195949Z * [new branch] gh/ezyang/2374/orig -> origin/gh/ezyang/2374/orig 2025-12-04T08:54:18.6196020Z * [new branch] gh/ezyang/2973/base -> origin/gh/ezyang/2973/base 2025-12-04T08:54:18.6196111Z * [new branch] gh/ezyang/2973/head -> origin/gh/ezyang/2973/head 2025-12-04T08:54:18.6196181Z * [new branch] gh/ezyang/2973/orig -> origin/gh/ezyang/2973/orig 2025-12-04T08:54:18.6196252Z * [new branch] gh/ezyang/2974/base -> origin/gh/ezyang/2974/base 2025-12-04T08:54:18.6196322Z * [new branch] gh/ezyang/2974/head -> origin/gh/ezyang/2974/head 2025-12-04T08:54:18.6196391Z * [new branch] gh/ezyang/2974/orig -> origin/gh/ezyang/2974/orig 2025-12-04T08:54:18.6196461Z * [new branch] gh/ezyang/3131/base -> origin/gh/ezyang/3131/base 2025-12-04T08:54:18.6196531Z * [new branch] gh/ezyang/3131/head -> origin/gh/ezyang/3131/head 2025-12-04T08:54:18.6196600Z * [new branch] gh/ezyang/3131/orig -> origin/gh/ezyang/3131/orig 2025-12-04T08:54:18.6196673Z * [new branch] gh/ezyang/3139/base -> origin/gh/ezyang/3139/base 2025-12-04T08:54:18.6196856Z * [new branch] gh/ezyang/3139/head -> origin/gh/ezyang/3139/head 2025-12-04T08:54:18.6197038Z * [new branch] gh/ezyang/3139/orig -> origin/gh/ezyang/3139/orig 2025-12-04T08:54:18.6197220Z * [new branch] gh/ezyang/3140/base -> origin/gh/ezyang/3140/base 2025-12-04T08:54:18.6197396Z * [new branch] gh/ezyang/3140/head -> origin/gh/ezyang/3140/head 2025-12-04T08:54:18.6197575Z * [new branch] gh/ezyang/3140/orig -> origin/gh/ezyang/3140/orig 2025-12-04T08:54:18.6197752Z * [new branch] gh/ezyang/3143/base -> origin/gh/ezyang/3143/base 2025-12-04T08:54:18.6197929Z * [new branch] gh/ezyang/3143/head -> origin/gh/ezyang/3143/head 2025-12-04T08:54:18.6198180Z * [new branch] gh/ezyang/3143/orig -> origin/gh/ezyang/3143/orig 2025-12-04T08:54:18.6198360Z * [new branch] gh/ezyang/3144/base -> origin/gh/ezyang/3144/base 2025-12-04T08:54:18.6198536Z * [new branch] gh/ezyang/3144/head -> origin/gh/ezyang/3144/head 2025-12-04T08:54:18.6198713Z * [new branch] gh/ezyang/3144/orig -> origin/gh/ezyang/3144/orig 2025-12-04T08:54:18.6198890Z * [new branch] gh/ezyang/3167/base -> origin/gh/ezyang/3167/base 2025-12-04T08:54:18.6199065Z * [new branch] gh/ezyang/3167/head -> origin/gh/ezyang/3167/head 2025-12-04T08:54:18.6199242Z * [new branch] gh/ezyang/3167/orig -> origin/gh/ezyang/3167/orig 2025-12-04T08:54:18.6199420Z * [new branch] gh/ezyang/3173/base -> origin/gh/ezyang/3173/base 2025-12-04T08:54:18.6199790Z * [new branch] gh/ezyang/3173/head -> origin/gh/ezyang/3173/head 2025-12-04T08:54:18.6199968Z * [new branch] gh/ezyang/3173/orig -> origin/gh/ezyang/3173/orig 2025-12-04T08:54:18.6200146Z * [new branch] gh/ezyang/3175/base -> origin/gh/ezyang/3175/base 2025-12-04T08:54:18.6200329Z * [new branch] gh/ezyang/3175/head -> origin/gh/ezyang/3175/head 2025-12-04T08:54:18.6200507Z * [new branch] gh/ezyang/3175/orig -> origin/gh/ezyang/3175/orig 2025-12-04T08:54:18.6200682Z * [new branch] gh/ezyang/3182/base -> origin/gh/ezyang/3182/base 2025-12-04T08:54:18.6200860Z * [new branch] gh/ezyang/3182/head -> origin/gh/ezyang/3182/head 2025-12-04T08:54:18.6201040Z * [new branch] gh/ezyang/3182/orig -> origin/gh/ezyang/3182/orig 2025-12-04T08:54:18.6201215Z * [new branch] gh/ezyang/3185/base -> origin/gh/ezyang/3185/base 2025-12-04T08:54:18.6201397Z * [new branch] gh/ezyang/3185/head -> origin/gh/ezyang/3185/head 2025-12-04T08:54:18.6201574Z * [new branch] gh/ezyang/3185/orig -> origin/gh/ezyang/3185/orig 2025-12-04T08:54:18.6201749Z * [new branch] gh/ezyang/3189/base -> origin/gh/ezyang/3189/base 2025-12-04T08:54:18.6201977Z * [new branch] gh/ezyang/3189/head -> origin/gh/ezyang/3189/head 2025-12-04T08:54:18.6202156Z * [new branch] gh/ezyang/3189/orig -> origin/gh/ezyang/3189/orig 2025-12-04T08:54:18.6202330Z * [new branch] gh/ezyang/3191/base -> origin/gh/ezyang/3191/base 2025-12-04T08:54:18.6202508Z * [new branch] gh/ezyang/3191/head -> origin/gh/ezyang/3191/head 2025-12-04T08:54:18.6202685Z * [new branch] gh/ezyang/3191/orig -> origin/gh/ezyang/3191/orig 2025-12-04T08:54:18.6223472Z * [new branch] gh/ezyang/3192/base -> origin/gh/ezyang/3192/base 2025-12-04T08:54:18.6223674Z * [new branch] gh/ezyang/3192/head -> origin/gh/ezyang/3192/head 2025-12-04T08:54:18.6223855Z * [new branch] gh/ezyang/3192/orig -> origin/gh/ezyang/3192/orig 2025-12-04T08:54:18.6224034Z * [new branch] gh/ezyang/3193/base -> origin/gh/ezyang/3193/base 2025-12-04T08:54:18.6224213Z * [new branch] gh/ezyang/3193/head -> origin/gh/ezyang/3193/head 2025-12-04T08:54:18.6224392Z * [new branch] gh/ezyang/3193/orig -> origin/gh/ezyang/3193/orig 2025-12-04T08:54:18.6224572Z * [new branch] gh/ezyang/3194/base -> origin/gh/ezyang/3194/base 2025-12-04T08:54:18.6224748Z * [new branch] gh/ezyang/3194/head -> origin/gh/ezyang/3194/head 2025-12-04T08:54:18.6224927Z * [new branch] gh/ezyang/3194/orig -> origin/gh/ezyang/3194/orig 2025-12-04T08:54:18.6225110Z * [new branch] gh/ezyang/3195/base -> origin/gh/ezyang/3195/base 2025-12-04T08:54:18.6225296Z * [new branch] gh/ezyang/3195/head -> origin/gh/ezyang/3195/head 2025-12-04T08:54:18.6225477Z * [new branch] gh/ezyang/3195/orig -> origin/gh/ezyang/3195/orig 2025-12-04T08:54:18.6225656Z * [new branch] gh/ezyang/3196/base -> origin/gh/ezyang/3196/base 2025-12-04T08:54:18.6225835Z * [new branch] gh/ezyang/3196/head -> origin/gh/ezyang/3196/head 2025-12-04T08:54:18.6226013Z * [new branch] gh/ezyang/3196/orig -> origin/gh/ezyang/3196/orig 2025-12-04T08:54:18.6226190Z * [new branch] gh/ezyang/3197/base -> origin/gh/ezyang/3197/base 2025-12-04T08:54:18.6226366Z * [new branch] gh/ezyang/3197/head -> origin/gh/ezyang/3197/head 2025-12-04T08:54:18.6226541Z * [new branch] gh/ezyang/3197/orig -> origin/gh/ezyang/3197/orig 2025-12-04T08:54:18.6226718Z * [new branch] gh/ezyang/3198/base -> origin/gh/ezyang/3198/base 2025-12-04T08:54:18.6226960Z * [new branch] gh/ezyang/3198/head -> origin/gh/ezyang/3198/head 2025-12-04T08:54:18.6227136Z * [new branch] gh/ezyang/3198/orig -> origin/gh/ezyang/3198/orig 2025-12-04T08:54:18.6227311Z * [new branch] gh/ezyang/3199/base -> origin/gh/ezyang/3199/base 2025-12-04T08:54:18.6227490Z * [new branch] gh/ezyang/3199/head -> origin/gh/ezyang/3199/head 2025-12-04T08:54:18.6227669Z * [new branch] gh/ezyang/3199/orig -> origin/gh/ezyang/3199/orig 2025-12-04T08:54:18.6227844Z * [new branch] gh/ezyang/3200/base -> origin/gh/ezyang/3200/base 2025-12-04T08:54:18.6228020Z * [new branch] gh/ezyang/3200/head -> origin/gh/ezyang/3200/head 2025-12-04T08:54:18.6228238Z * [new branch] gh/ezyang/3200/orig -> origin/gh/ezyang/3200/orig 2025-12-04T08:54:18.6228412Z * [new branch] gh/ezyang/3201/base -> origin/gh/ezyang/3201/base 2025-12-04T08:54:18.6228589Z * [new branch] gh/ezyang/3201/head -> origin/gh/ezyang/3201/head 2025-12-04T08:54:18.6228766Z * [new branch] gh/ezyang/3201/orig -> origin/gh/ezyang/3201/orig 2025-12-04T08:54:18.6228940Z * [new branch] gh/ezyang/3202/base -> origin/gh/ezyang/3202/base 2025-12-04T08:54:18.6229154Z * [new branch] gh/ezyang/3202/head -> origin/gh/ezyang/3202/head 2025-12-04T08:54:18.6229331Z * [new branch] gh/ezyang/3202/orig -> origin/gh/ezyang/3202/orig 2025-12-04T08:54:18.6229505Z * [new branch] gh/ezyang/3203/base -> origin/gh/ezyang/3203/base 2025-12-04T08:54:18.6229680Z * [new branch] gh/ezyang/3203/head -> origin/gh/ezyang/3203/head 2025-12-04T08:54:18.6229882Z * [new branch] gh/ezyang/3203/orig -> origin/gh/ezyang/3203/orig 2025-12-04T08:54:18.6230071Z * [new branch] gh/ezyang/3204/base -> origin/gh/ezyang/3204/base 2025-12-04T08:54:18.6230255Z * [new branch] gh/ezyang/3204/head -> origin/gh/ezyang/3204/head 2025-12-04T08:54:18.6230438Z * [new branch] gh/ezyang/3204/orig -> origin/gh/ezyang/3204/orig 2025-12-04T08:54:18.6230614Z * [new branch] gh/ezyang/3205/base -> origin/gh/ezyang/3205/base 2025-12-04T08:54:18.6230790Z * [new branch] gh/ezyang/3205/head -> origin/gh/ezyang/3205/head 2025-12-04T08:54:18.6230969Z * [new branch] gh/ezyang/3205/orig -> origin/gh/ezyang/3205/orig 2025-12-04T08:54:18.6231146Z * [new branch] gh/ezyang/3206/base -> origin/gh/ezyang/3206/base 2025-12-04T08:54:18.6231320Z * [new branch] gh/ezyang/3206/head -> origin/gh/ezyang/3206/head 2025-12-04T08:54:18.6231496Z * [new branch] gh/ezyang/3206/orig -> origin/gh/ezyang/3206/orig 2025-12-04T08:54:18.6231670Z * [new branch] gh/ezyang/3207/base -> origin/gh/ezyang/3207/base 2025-12-04T08:54:18.6231848Z * [new branch] gh/ezyang/3207/head -> origin/gh/ezyang/3207/head 2025-12-04T08:54:18.6232023Z * [new branch] gh/ezyang/3207/orig -> origin/gh/ezyang/3207/orig 2025-12-04T08:54:18.6232196Z * [new branch] gh/ezyang/3208/base -> origin/gh/ezyang/3208/base 2025-12-04T08:54:18.6232375Z * [new branch] gh/ezyang/3208/head -> origin/gh/ezyang/3208/head 2025-12-04T08:54:18.6232550Z * [new branch] gh/ezyang/3208/orig -> origin/gh/ezyang/3208/orig 2025-12-04T08:54:18.6232723Z * [new branch] gh/ezyang/3209/base -> origin/gh/ezyang/3209/base 2025-12-04T08:54:18.6232900Z * [new branch] gh/ezyang/3209/head -> origin/gh/ezyang/3209/head 2025-12-04T08:54:18.6233076Z * [new branch] gh/ezyang/3209/orig -> origin/gh/ezyang/3209/orig 2025-12-04T08:54:18.6233257Z * [new branch] gh/fadara01/3/base -> origin/gh/fadara01/3/base 2025-12-04T08:54:18.6233472Z * [new branch] gh/fadara01/3/head -> origin/gh/fadara01/3/head 2025-12-04T08:54:18.6233650Z * [new branch] gh/fadara01/3/orig -> origin/gh/fadara01/3/orig 2025-12-04T08:54:18.6233824Z * [new branch] gh/fadara01/5/base -> origin/gh/fadara01/5/base 2025-12-04T08:54:18.6234000Z * [new branch] gh/fadara01/5/head -> origin/gh/fadara01/5/head 2025-12-04T08:54:18.6234174Z * [new branch] gh/fadara01/5/orig -> origin/gh/fadara01/5/orig 2025-12-04T08:54:18.6234347Z * [new branch] gh/fadara01/6/base -> origin/gh/fadara01/6/base 2025-12-04T08:54:18.6234523Z * [new branch] gh/fadara01/6/head -> origin/gh/fadara01/6/head 2025-12-04T08:54:18.6234696Z * [new branch] gh/fadara01/6/orig -> origin/gh/fadara01/6/orig 2025-12-04T08:54:18.6234869Z * [new branch] gh/fadara01/7/base -> origin/gh/fadara01/7/base 2025-12-04T08:54:18.6235044Z * [new branch] gh/fadara01/7/head -> origin/gh/fadara01/7/head 2025-12-04T08:54:18.6235216Z * [new branch] gh/fadara01/7/orig -> origin/gh/fadara01/7/orig 2025-12-04T08:54:18.6235389Z * [new branch] gh/fadara01/8/base -> origin/gh/fadara01/8/base 2025-12-04T08:54:18.6235583Z * [new branch] gh/fadara01/8/head -> origin/gh/fadara01/8/head 2025-12-04T08:54:18.6235756Z * [new branch] gh/fadara01/8/orig -> origin/gh/fadara01/8/orig 2025-12-04T08:54:18.6235931Z * [new branch] gh/fadara01/9/base -> origin/gh/fadara01/9/base 2025-12-04T08:54:18.6236106Z * [new branch] gh/fadara01/9/head -> origin/gh/fadara01/9/head 2025-12-04T08:54:18.6236279Z * [new branch] gh/fadara01/9/orig -> origin/gh/fadara01/9/orig 2025-12-04T08:54:18.6236453Z * [new branch] gh/fduwjj/182/base -> origin/gh/fduwjj/182/base 2025-12-04T08:54:18.6236634Z * [new branch] gh/fduwjj/182/head -> origin/gh/fduwjj/182/head 2025-12-04T08:54:18.6236806Z * [new branch] gh/fduwjj/182/orig -> origin/gh/fduwjj/182/orig 2025-12-04T08:54:18.6236981Z * [new branch] gh/fduwjj/211/base -> origin/gh/fduwjj/211/base 2025-12-04T08:54:18.6237156Z * [new branch] gh/fduwjj/211/head -> origin/gh/fduwjj/211/head 2025-12-04T08:54:18.6237329Z * [new branch] gh/fduwjj/211/orig -> origin/gh/fduwjj/211/orig 2025-12-04T08:54:18.6237502Z * [new branch] gh/fduwjj/212/base -> origin/gh/fduwjj/212/base 2025-12-04T08:54:18.6237677Z * [new branch] gh/fduwjj/212/head -> origin/gh/fduwjj/212/head 2025-12-04T08:54:18.6237849Z * [new branch] gh/fduwjj/212/orig -> origin/gh/fduwjj/212/orig 2025-12-04T08:54:18.6238023Z * [new branch] gh/fduwjj/213/base -> origin/gh/fduwjj/213/base 2025-12-04T08:54:18.6238247Z * [new branch] gh/fduwjj/213/head -> origin/gh/fduwjj/213/head 2025-12-04T08:54:18.6238420Z * [new branch] gh/fduwjj/213/orig -> origin/gh/fduwjj/213/orig 2025-12-04T08:54:18.6238593Z * [new branch] gh/fduwjj/226/base -> origin/gh/fduwjj/226/base 2025-12-04T08:54:18.6238768Z * [new branch] gh/fduwjj/226/head -> origin/gh/fduwjj/226/head 2025-12-04T08:54:18.6238942Z * [new branch] gh/fduwjj/226/orig -> origin/gh/fduwjj/226/orig 2025-12-04T08:54:18.6239116Z * [new branch] gh/fduwjj/229/base -> origin/gh/fduwjj/229/base 2025-12-04T08:54:18.6239287Z * [new branch] gh/fduwjj/229/head -> origin/gh/fduwjj/229/head 2025-12-04T08:54:18.6239463Z * [new branch] gh/fduwjj/229/orig -> origin/gh/fduwjj/229/orig 2025-12-04T08:54:18.6239636Z * [new branch] gh/fduwjj/233/base -> origin/gh/fduwjj/233/base 2025-12-04T08:54:18.6239843Z * [new branch] gh/fduwjj/233/head -> origin/gh/fduwjj/233/head 2025-12-04T08:54:18.6240018Z * [new branch] gh/fduwjj/233/orig -> origin/gh/fduwjj/233/orig 2025-12-04T08:54:18.6240192Z * [new branch] gh/fduwjj/234/base -> origin/gh/fduwjj/234/base 2025-12-04T08:54:18.6240364Z * [new branch] gh/fduwjj/234/head -> origin/gh/fduwjj/234/head 2025-12-04T08:54:18.6240539Z * [new branch] gh/fduwjj/234/orig -> origin/gh/fduwjj/234/orig 2025-12-04T08:54:18.6240713Z * [new branch] gh/fduwjj/235/base -> origin/gh/fduwjj/235/base 2025-12-04T08:54:18.6240901Z * [new branch] gh/fduwjj/235/head -> origin/gh/fduwjj/235/head 2025-12-04T08:54:18.6241075Z * [new branch] gh/fduwjj/235/orig -> origin/gh/fduwjj/235/orig 2025-12-04T08:54:18.6241251Z * [new branch] gh/fduwjj/236/base -> origin/gh/fduwjj/236/base 2025-12-04T08:54:18.6241427Z * [new branch] gh/fduwjj/236/head -> origin/gh/fduwjj/236/head 2025-12-04T08:54:18.6241602Z * [new branch] gh/fduwjj/236/orig -> origin/gh/fduwjj/236/orig 2025-12-04T08:54:18.6241778Z * [new branch] gh/fduwjj/237/base -> origin/gh/fduwjj/237/base 2025-12-04T08:54:18.6241951Z * [new branch] gh/fduwjj/237/head -> origin/gh/fduwjj/237/head 2025-12-04T08:54:18.6242163Z * [new branch] gh/fduwjj/237/orig -> origin/gh/fduwjj/237/orig 2025-12-04T08:54:18.6242338Z * [new branch] gh/fduwjj/238/base -> origin/gh/fduwjj/238/base 2025-12-04T08:54:18.6242512Z * [new branch] gh/fduwjj/238/head -> origin/gh/fduwjj/238/head 2025-12-04T08:54:18.6242686Z * [new branch] gh/fduwjj/238/orig -> origin/gh/fduwjj/238/orig 2025-12-04T08:54:18.6242859Z * [new branch] gh/fduwjj/239/base -> origin/gh/fduwjj/239/base 2025-12-04T08:54:18.6243038Z * [new branch] gh/fduwjj/239/head -> origin/gh/fduwjj/239/head 2025-12-04T08:54:18.6243211Z * [new branch] gh/fduwjj/239/orig -> origin/gh/fduwjj/239/orig 2025-12-04T08:54:18.6243386Z * [new branch] gh/fegin/332/base -> origin/gh/fegin/332/base 2025-12-04T08:54:18.6243562Z * [new branch] gh/fegin/332/head -> origin/gh/fegin/332/head 2025-12-04T08:54:18.6243738Z * [new branch] gh/fegin/332/orig -> origin/gh/fegin/332/orig 2025-12-04T08:54:18.6243907Z * [new branch] gh/fegin/333/base -> origin/gh/fegin/333/base 2025-12-04T08:54:18.6244076Z * [new branch] gh/fegin/333/head -> origin/gh/fegin/333/head 2025-12-04T08:54:18.6244247Z * [new branch] gh/fegin/333/orig -> origin/gh/fegin/333/orig 2025-12-04T08:54:18.6244416Z * [new branch] gh/fegin/334/base -> origin/gh/fegin/334/base 2025-12-04T08:54:18.6244587Z * [new branch] gh/fegin/334/head -> origin/gh/fegin/334/head 2025-12-04T08:54:18.6244757Z * [new branch] gh/fegin/334/orig -> origin/gh/fegin/334/orig 2025-12-04T08:54:18.6244927Z * [new branch] gh/fegin/335/base -> origin/gh/fegin/335/base 2025-12-04T08:54:18.6245096Z * [new branch] gh/fegin/335/head -> origin/gh/fegin/335/head 2025-12-04T08:54:18.6245269Z * [new branch] gh/fegin/335/orig -> origin/gh/fegin/335/orig 2025-12-04T08:54:18.6245441Z * [new branch] gh/fffrog/160/base -> origin/gh/fffrog/160/base 2025-12-04T08:54:18.6245618Z * [new branch] gh/fffrog/160/head -> origin/gh/fffrog/160/head 2025-12-04T08:54:18.6245791Z * [new branch] gh/fffrog/177/base -> origin/gh/fffrog/177/base 2025-12-04T08:54:18.6245964Z * [new branch] gh/fffrog/177/head -> origin/gh/fffrog/177/head 2025-12-04T08:54:18.6246171Z * [new branch] gh/fffrog/177/orig -> origin/gh/fffrog/177/orig 2025-12-04T08:54:18.6246341Z * [new branch] gh/fffrog/178/base -> origin/gh/fffrog/178/base 2025-12-04T08:54:18.6246519Z * [new branch] gh/fffrog/178/head -> origin/gh/fffrog/178/head 2025-12-04T08:54:18.6246693Z * [new branch] gh/fffrog/178/orig -> origin/gh/fffrog/178/orig 2025-12-04T08:54:18.6246867Z * [new branch] gh/fffrog/181/base -> origin/gh/fffrog/181/base 2025-12-04T08:54:18.6247039Z * [new branch] gh/fffrog/181/head -> origin/gh/fffrog/181/head 2025-12-04T08:54:18.6247215Z * [new branch] gh/fffrog/181/orig -> origin/gh/fffrog/181/orig 2025-12-04T08:54:18.6247386Z * [new branch] gh/fffrog/183/base -> origin/gh/fffrog/183/base 2025-12-04T08:54:18.6247559Z * [new branch] gh/fffrog/183/head -> origin/gh/fffrog/183/head 2025-12-04T08:54:18.6247734Z * [new branch] gh/fffrog/183/orig -> origin/gh/fffrog/183/orig 2025-12-04T08:54:18.6247908Z * [new branch] gh/fxdawnn/10/base -> origin/gh/fxdawnn/10/base 2025-12-04T08:54:18.6248122Z * [new branch] gh/fxdawnn/10/head -> origin/gh/fxdawnn/10/head 2025-12-04T08:54:18.6248299Z * [new branch] gh/fxdawnn/10/orig -> origin/gh/fxdawnn/10/orig 2025-12-04T08:54:18.6248499Z * [new branch] gh/fxdawnn/11/base -> origin/gh/fxdawnn/11/base 2025-12-04T08:54:18.6248674Z * [new branch] gh/fxdawnn/11/head -> origin/gh/fxdawnn/11/head 2025-12-04T08:54:18.6248848Z * [new branch] gh/fxdawnn/11/orig -> origin/gh/fxdawnn/11/orig 2025-12-04T08:54:18.6249023Z * [new branch] gh/fxdawnn/12/base -> origin/gh/fxdawnn/12/base 2025-12-04T08:54:18.6249198Z * [new branch] gh/fxdawnn/12/head -> origin/gh/fxdawnn/12/head 2025-12-04T08:54:18.6249371Z * [new branch] gh/fxdawnn/12/orig -> origin/gh/fxdawnn/12/orig 2025-12-04T08:54:18.6249548Z * [new branch] gh/fxdawnn/13/base -> origin/gh/fxdawnn/13/base 2025-12-04T08:54:18.6249722Z * [new branch] gh/fxdawnn/13/head -> origin/gh/fxdawnn/13/head 2025-12-04T08:54:18.6249896Z * [new branch] gh/fxdawnn/13/orig -> origin/gh/fxdawnn/13/orig 2025-12-04T08:54:18.6250074Z * [new branch] gh/fxdawnn/14/base -> origin/gh/fxdawnn/14/base 2025-12-04T08:54:18.6250248Z * [new branch] gh/fxdawnn/14/head -> origin/gh/fxdawnn/14/head 2025-12-04T08:54:18.6250420Z * [new branch] gh/fxdawnn/14/orig -> origin/gh/fxdawnn/14/orig 2025-12-04T08:54:18.6250595Z * [new branch] gh/fxdawnn/15/base -> origin/gh/fxdawnn/15/base 2025-12-04T08:54:18.6250771Z * [new branch] gh/fxdawnn/15/head -> origin/gh/fxdawnn/15/head 2025-12-04T08:54:18.6250945Z * [new branch] gh/fxdawnn/15/orig -> origin/gh/fxdawnn/15/orig 2025-12-04T08:54:18.6251126Z * [new branch] gh/fxdawnn/6/base -> origin/gh/fxdawnn/6/base 2025-12-04T08:54:18.6251301Z * [new branch] gh/fxdawnn/6/head -> origin/gh/fxdawnn/6/head 2025-12-04T08:54:18.6251472Z * [new branch] gh/fxdawnn/6/orig -> origin/gh/fxdawnn/6/orig 2025-12-04T08:54:18.6251647Z * [new branch] gh/fxdawnn/7/base -> origin/gh/fxdawnn/7/base 2025-12-04T08:54:18.6251819Z * [new branch] gh/fxdawnn/7/head -> origin/gh/fxdawnn/7/head 2025-12-04T08:54:18.6251990Z * [new branch] gh/fxdawnn/7/orig -> origin/gh/fxdawnn/7/orig 2025-12-04T08:54:18.6252162Z * [new branch] gh/fxdawnn/9/base -> origin/gh/fxdawnn/9/base 2025-12-04T08:54:18.6252334Z * [new branch] gh/fxdawnn/9/head -> origin/gh/fxdawnn/9/head 2025-12-04T08:54:18.6252503Z * [new branch] gh/fxdawnn/9/orig -> origin/gh/fxdawnn/9/orig 2025-12-04T08:54:18.6252709Z * [new branch] gh/galv/1/base -> origin/gh/galv/1/base 2025-12-04T08:54:18.6252877Z * [new branch] gh/galv/1/head -> origin/gh/galv/1/head 2025-12-04T08:54:18.6253045Z * [new branch] gh/galv/1/orig -> origin/gh/galv/1/orig 2025-12-04T08:54:18.6253211Z * [new branch] gh/galv/2/base -> origin/gh/galv/2/base 2025-12-04T08:54:18.6253373Z * [new branch] gh/galv/2/head -> origin/gh/galv/2/head 2025-12-04T08:54:18.6253537Z * [new branch] gh/galv/2/orig -> origin/gh/galv/2/orig 2025-12-04T08:54:18.6253701Z * [new branch] gh/galv/3/base -> origin/gh/galv/3/base 2025-12-04T08:54:18.6253863Z * [new branch] gh/galv/3/head -> origin/gh/galv/3/head 2025-12-04T08:54:18.6254027Z * [new branch] gh/galv/3/orig -> origin/gh/galv/3/orig 2025-12-04T08:54:18.6254207Z * [new branch] gh/guangyey/134/base -> origin/gh/guangyey/134/base 2025-12-04T08:54:18.6254399Z * [new branch] gh/guangyey/134/head -> origin/gh/guangyey/134/head 2025-12-04T08:54:18.6254581Z * [new branch] gh/guangyey/134/orig -> origin/gh/guangyey/134/orig 2025-12-04T08:54:18.6254765Z * [new branch] gh/guangyey/163/base -> origin/gh/guangyey/163/base 2025-12-04T08:54:18.6254971Z * [new branch] gh/guangyey/163/head -> origin/gh/guangyey/163/head 2025-12-04T08:54:18.6255151Z * [new branch] gh/guangyey/163/orig -> origin/gh/guangyey/163/orig 2025-12-04T08:54:18.6255334Z * [new branch] gh/guangyey/168/base -> origin/gh/guangyey/168/base 2025-12-04T08:54:18.6255517Z * [new branch] gh/guangyey/168/head -> origin/gh/guangyey/168/head 2025-12-04T08:54:18.6255697Z * [new branch] gh/guangyey/168/orig -> origin/gh/guangyey/168/orig 2025-12-04T08:54:18.6255881Z * [new branch] gh/guangyey/169/base -> origin/gh/guangyey/169/base 2025-12-04T08:54:18.6256064Z * [new branch] gh/guangyey/169/head -> origin/gh/guangyey/169/head 2025-12-04T08:54:18.6256243Z * [new branch] gh/guangyey/169/orig -> origin/gh/guangyey/169/orig 2025-12-04T08:54:18.6256427Z * [new branch] gh/guangyey/170/base -> origin/gh/guangyey/170/base 2025-12-04T08:54:18.6256607Z * [new branch] gh/guangyey/170/head -> origin/gh/guangyey/170/head 2025-12-04T08:54:18.6256788Z * [new branch] gh/guangyey/170/orig -> origin/gh/guangyey/170/orig 2025-12-04T08:54:18.6256970Z * [new branch] gh/guangyey/171/base -> origin/gh/guangyey/171/base 2025-12-04T08:54:18.6257150Z * [new branch] gh/guangyey/171/head -> origin/gh/guangyey/171/head 2025-12-04T08:54:18.6257333Z * [new branch] gh/guangyey/171/orig -> origin/gh/guangyey/171/orig 2025-12-04T08:54:18.6257520Z * [new branch] gh/guangyey/178/base -> origin/gh/guangyey/178/base 2025-12-04T08:54:18.6257701Z * [new branch] gh/guangyey/178/head -> origin/gh/guangyey/178/head 2025-12-04T08:54:18.6257884Z * [new branch] gh/guangyey/178/orig -> origin/gh/guangyey/178/orig 2025-12-04T08:54:18.6258117Z * [new branch] gh/guangyey/182/base -> origin/gh/guangyey/182/base 2025-12-04T08:54:18.6258296Z * [new branch] gh/guangyey/182/head -> origin/gh/guangyey/182/head 2025-12-04T08:54:18.6258479Z * [new branch] gh/guangyey/182/orig -> origin/gh/guangyey/182/orig 2025-12-04T08:54:18.6258661Z * [new branch] gh/guangyey/183/base -> origin/gh/guangyey/183/base 2025-12-04T08:54:18.6258839Z * [new branch] gh/guangyey/183/head -> origin/gh/guangyey/183/head 2025-12-04T08:54:18.6259023Z * [new branch] gh/guangyey/183/orig -> origin/gh/guangyey/183/orig 2025-12-04T08:54:18.6259283Z * [new branch] gh/guangyey/185/base -> origin/gh/guangyey/185/base 2025-12-04T08:54:18.6259462Z * [new branch] gh/guangyey/185/head -> origin/gh/guangyey/185/head 2025-12-04T08:54:18.6259643Z * [new branch] gh/guangyey/185/orig -> origin/gh/guangyey/185/orig 2025-12-04T08:54:18.6259825Z * [new branch] gh/guangyey/186/base -> origin/gh/guangyey/186/base 2025-12-04T08:54:18.6260005Z * [new branch] gh/guangyey/186/head -> origin/gh/guangyey/186/head 2025-12-04T08:54:18.6260187Z * [new branch] gh/guangyey/186/orig -> origin/gh/guangyey/186/orig 2025-12-04T08:54:18.6260368Z * [new branch] gh/guangyey/187/base -> origin/gh/guangyey/187/base 2025-12-04T08:54:18.6260547Z * [new branch] gh/guangyey/187/head -> origin/gh/guangyey/187/head 2025-12-04T08:54:18.6260727Z * [new branch] gh/guangyey/187/orig -> origin/gh/guangyey/187/orig 2025-12-04T08:54:18.6260908Z * [new branch] gh/guangyey/188/base -> origin/gh/guangyey/188/base 2025-12-04T08:54:18.6261091Z * [new branch] gh/guangyey/188/head -> origin/gh/guangyey/188/head 2025-12-04T08:54:18.6261272Z * [new branch] gh/guangyey/188/orig -> origin/gh/guangyey/188/orig 2025-12-04T08:54:18.6261483Z * [new branch] gh/guangyey/190/base -> origin/gh/guangyey/190/base 2025-12-04T08:54:18.6261667Z * [new branch] gh/guangyey/190/head -> origin/gh/guangyey/190/head 2025-12-04T08:54:18.6261848Z * [new branch] gh/guangyey/190/orig -> origin/gh/guangyey/190/orig 2025-12-04T08:54:18.6262027Z * [new branch] gh/guangyey/208/base -> origin/gh/guangyey/208/base 2025-12-04T08:54:18.6262210Z * [new branch] gh/guangyey/208/head -> origin/gh/guangyey/208/head 2025-12-04T08:54:18.6262394Z * [new branch] gh/guangyey/208/orig -> origin/gh/guangyey/208/orig 2025-12-04T08:54:18.6262578Z * [new branch] gh/guangyey/228/base -> origin/gh/guangyey/228/base 2025-12-04T08:54:18.6262759Z * [new branch] gh/guangyey/228/head -> origin/gh/guangyey/228/head 2025-12-04T08:54:18.6262942Z * [new branch] gh/guangyey/228/orig -> origin/gh/guangyey/228/orig 2025-12-04T08:54:18.6263123Z * [new branch] gh/guangyey/230/base -> origin/gh/guangyey/230/base 2025-12-04T08:54:18.6263304Z * [new branch] gh/guangyey/230/head -> origin/gh/guangyey/230/head 2025-12-04T08:54:18.6263487Z * [new branch] gh/guangyey/230/orig -> origin/gh/guangyey/230/orig 2025-12-04T08:54:18.6263666Z * [new branch] gh/guangyey/231/base -> origin/gh/guangyey/231/base 2025-12-04T08:54:18.6263848Z * [new branch] gh/guangyey/231/head -> origin/gh/guangyey/231/head 2025-12-04T08:54:18.6264030Z * [new branch] gh/guangyey/231/orig -> origin/gh/guangyey/231/orig 2025-12-04T08:54:18.6264211Z * [new branch] gh/guangyey/232/base -> origin/gh/guangyey/232/base 2025-12-04T08:54:18.6264393Z * [new branch] gh/guangyey/232/head -> origin/gh/guangyey/232/head 2025-12-04T08:54:18.6264574Z * [new branch] gh/guangyey/232/orig -> origin/gh/guangyey/232/orig 2025-12-04T08:54:18.6264755Z * [new branch] gh/guangyey/233/base -> origin/gh/guangyey/233/base 2025-12-04T08:54:18.6264938Z * [new branch] gh/guangyey/233/head -> origin/gh/guangyey/233/head 2025-12-04T08:54:18.6265118Z * [new branch] gh/guangyey/233/orig -> origin/gh/guangyey/233/orig 2025-12-04T08:54:18.6265300Z * [new branch] gh/guangyey/234/base -> origin/gh/guangyey/234/base 2025-12-04T08:54:18.6265483Z * [new branch] gh/guangyey/234/head -> origin/gh/guangyey/234/head 2025-12-04T08:54:18.6265663Z * [new branch] gh/guangyey/234/orig -> origin/gh/guangyey/234/orig 2025-12-04T08:54:18.6265874Z * [new branch] gh/guangyey/235/base -> origin/gh/guangyey/235/base 2025-12-04T08:54:18.6266055Z * [new branch] gh/guangyey/235/head -> origin/gh/guangyey/235/head 2025-12-04T08:54:18.6266235Z * [new branch] gh/guangyey/235/orig -> origin/gh/guangyey/235/orig 2025-12-04T08:54:18.6266417Z * [new branch] gh/guangyey/236/base -> origin/gh/guangyey/236/base 2025-12-04T08:54:18.6266599Z * [new branch] gh/guangyey/236/head -> origin/gh/guangyey/236/head 2025-12-04T08:54:18.6266780Z * [new branch] gh/guangyey/236/orig -> origin/gh/guangyey/236/orig 2025-12-04T08:54:18.6266961Z * [new branch] gh/guangyey/237/base -> origin/gh/guangyey/237/base 2025-12-04T08:54:18.6267142Z * [new branch] gh/guangyey/237/head -> origin/gh/guangyey/237/head 2025-12-04T08:54:18.6267323Z * [new branch] gh/guangyey/237/orig -> origin/gh/guangyey/237/orig 2025-12-04T08:54:18.6267505Z * [new branch] gh/guangyey/238/base -> origin/gh/guangyey/238/base 2025-12-04T08:54:18.6267687Z * [new branch] gh/guangyey/238/head -> origin/gh/guangyey/238/head 2025-12-04T08:54:18.6267867Z * [new branch] gh/guangyey/239/base -> origin/gh/guangyey/239/base 2025-12-04T08:54:18.6268145Z * [new branch] gh/guangyey/239/head -> origin/gh/guangyey/239/head 2025-12-04T08:54:18.6268329Z * [new branch] gh/guangyey/239/orig -> origin/gh/guangyey/239/orig 2025-12-04T08:54:18.6268508Z * [new branch] gh/guangyey/240/base -> origin/gh/guangyey/240/base 2025-12-04T08:54:18.6268692Z * [new branch] gh/guangyey/240/head -> origin/gh/guangyey/240/head 2025-12-04T08:54:18.6268871Z * [new branch] gh/guangyey/240/orig -> origin/gh/guangyey/240/orig 2025-12-04T08:54:18.6269057Z * [new branch] gh/guangyey/241/base -> origin/gh/guangyey/241/base 2025-12-04T08:54:18.6269242Z * [new branch] gh/guangyey/241/head -> origin/gh/guangyey/241/head 2025-12-04T08:54:18.6269422Z * [new branch] gh/guangyey/241/orig -> origin/gh/guangyey/241/orig 2025-12-04T08:54:18.6269604Z * [new branch] gh/guangyey/242/base -> origin/gh/guangyey/242/base 2025-12-04T08:54:18.6269790Z * [new branch] gh/guangyey/242/head -> origin/gh/guangyey/242/head 2025-12-04T08:54:18.6269969Z * [new branch] gh/guangyey/242/orig -> origin/gh/guangyey/242/orig 2025-12-04T08:54:18.6270150Z * [new branch] gh/guangyey/243/base -> origin/gh/guangyey/243/base 2025-12-04T08:54:18.6270331Z * [new branch] gh/guangyey/243/head -> origin/gh/guangyey/243/head 2025-12-04T08:54:18.6270510Z * [new branch] gh/guangyey/243/orig -> origin/gh/guangyey/243/orig 2025-12-04T08:54:18.6270694Z * [new branch] gh/guangyey/244/base -> origin/gh/guangyey/244/base 2025-12-04T08:54:18.6270878Z * [new branch] gh/guangyey/244/head -> origin/gh/guangyey/244/head 2025-12-04T08:54:18.6271060Z * [new branch] gh/guangyey/244/orig -> origin/gh/guangyey/244/orig 2025-12-04T08:54:18.6271243Z * [new branch] gh/guangyey/245/base -> origin/gh/guangyey/245/base 2025-12-04T08:54:18.6271430Z * [new branch] gh/guangyey/245/head -> origin/gh/guangyey/245/head 2025-12-04T08:54:18.6271608Z * [new branch] gh/guangyey/245/orig -> origin/gh/guangyey/245/orig 2025-12-04T08:54:18.6271789Z * [new branch] gh/guangyey/246/base -> origin/gh/guangyey/246/base 2025-12-04T08:54:18.6271969Z * [new branch] gh/guangyey/246/head -> origin/gh/guangyey/246/head 2025-12-04T08:54:18.6272148Z * [new branch] gh/guangyey/246/orig -> origin/gh/guangyey/246/orig 2025-12-04T08:54:18.6272366Z * [new branch] gh/guangyey/247/base -> origin/gh/guangyey/247/base 2025-12-04T08:54:18.6272548Z * [new branch] gh/guangyey/247/head -> origin/gh/guangyey/247/head 2025-12-04T08:54:18.6272727Z * [new branch] gh/guangyey/247/orig -> origin/gh/guangyey/247/orig 2025-12-04T08:54:18.6272912Z * [new branch] gh/guangyey/248/base -> origin/gh/guangyey/248/base 2025-12-04T08:54:18.6273093Z * [new branch] gh/guangyey/248/head -> origin/gh/guangyey/248/head 2025-12-04T08:54:18.6273275Z * [new branch] gh/guangyey/248/orig -> origin/gh/guangyey/248/orig 2025-12-04T08:54:18.6273456Z * [new branch] gh/guangyey/249/base -> origin/gh/guangyey/249/base 2025-12-04T08:54:18.6273635Z * [new branch] gh/guangyey/249/head -> origin/gh/guangyey/249/head 2025-12-04T08:54:18.6273818Z * [new branch] gh/guangyey/249/orig -> origin/gh/guangyey/249/orig 2025-12-04T08:54:18.6274003Z * [new branch] gh/guangyey/250/base -> origin/gh/guangyey/250/base 2025-12-04T08:54:18.6274181Z * [new branch] gh/guangyey/250/head -> origin/gh/guangyey/250/head 2025-12-04T08:54:18.6274361Z * [new branch] gh/guangyey/250/orig -> origin/gh/guangyey/250/orig 2025-12-04T08:54:18.6274568Z * [new branch] gh/guangyey/251/base -> origin/gh/guangyey/251/base 2025-12-04T08:54:18.6274748Z * [new branch] gh/guangyey/251/head -> origin/gh/guangyey/251/head 2025-12-04T08:54:18.6274929Z * [new branch] gh/guangyey/251/orig -> origin/gh/guangyey/251/orig 2025-12-04T08:54:18.6275110Z * [new branch] gh/guangyey/252/base -> origin/gh/guangyey/252/base 2025-12-04T08:54:18.6275289Z * [new branch] gh/guangyey/252/head -> origin/gh/guangyey/252/head 2025-12-04T08:54:18.6275472Z * [new branch] gh/guangyey/252/orig -> origin/gh/guangyey/252/orig 2025-12-04T08:54:18.6275654Z * [new branch] gh/guangyey/253/base -> origin/gh/guangyey/253/base 2025-12-04T08:54:18.6275832Z * [new branch] gh/guangyey/253/head -> origin/gh/guangyey/253/head 2025-12-04T08:54:18.6276015Z * [new branch] gh/guangyey/253/orig -> origin/gh/guangyey/253/orig 2025-12-04T08:54:18.6276199Z * [new branch] gh/guangyey/254/base -> origin/gh/guangyey/254/base 2025-12-04T08:54:18.6276377Z * [new branch] gh/guangyey/254/head -> origin/gh/guangyey/254/head 2025-12-04T08:54:18.6276558Z * [new branch] gh/guangyey/254/orig -> origin/gh/guangyey/254/orig 2025-12-04T08:54:18.6276740Z * [new branch] gh/guangyey/255/base -> origin/gh/guangyey/255/base 2025-12-04T08:54:18.6276918Z * [new branch] gh/guangyey/255/head -> origin/gh/guangyey/255/head 2025-12-04T08:54:18.6277101Z * [new branch] gh/guangyey/255/orig -> origin/gh/guangyey/255/orig 2025-12-04T08:54:18.6277309Z * [new branch] gh/guilhermeleobas/107/base -> origin/gh/guilhermeleobas/107/base 2025-12-04T08:54:18.6277531Z * [new branch] gh/guilhermeleobas/107/head -> origin/gh/guilhermeleobas/107/head 2025-12-04T08:54:18.6277751Z * [new branch] gh/guilhermeleobas/107/orig -> origin/gh/guilhermeleobas/107/orig 2025-12-04T08:54:18.6277969Z * [new branch] gh/guilhermeleobas/108/base -> origin/gh/guilhermeleobas/108/base 2025-12-04T08:54:18.6278223Z * [new branch] gh/guilhermeleobas/108/head -> origin/gh/guilhermeleobas/108/head 2025-12-04T08:54:18.6278443Z * [new branch] gh/guilhermeleobas/108/orig -> origin/gh/guilhermeleobas/108/orig 2025-12-04T08:54:18.6278657Z * [new branch] gh/guilhermeleobas/150/base -> origin/gh/guilhermeleobas/150/base 2025-12-04T08:54:18.6278873Z * [new branch] gh/guilhermeleobas/150/head -> origin/gh/guilhermeleobas/150/head 2025-12-04T08:54:18.6279126Z * [new branch] gh/guilhermeleobas/150/orig -> origin/gh/guilhermeleobas/150/orig 2025-12-04T08:54:18.6279338Z * [new branch] gh/guilhermeleobas/168/base -> origin/gh/guilhermeleobas/168/base 2025-12-04T08:54:18.6279553Z * [new branch] gh/guilhermeleobas/168/head -> origin/gh/guilhermeleobas/168/head 2025-12-04T08:54:18.6279771Z * [new branch] gh/guilhermeleobas/168/orig -> origin/gh/guilhermeleobas/168/orig 2025-12-04T08:54:18.6279984Z * [new branch] gh/guilhermeleobas/169/base -> origin/gh/guilhermeleobas/169/base 2025-12-04T08:54:18.6280202Z * [new branch] gh/guilhermeleobas/169/head -> origin/gh/guilhermeleobas/169/head 2025-12-04T08:54:18.6280418Z * [new branch] gh/guilhermeleobas/169/orig -> origin/gh/guilhermeleobas/169/orig 2025-12-04T08:54:18.6280630Z * [new branch] gh/guilhermeleobas/170/base -> origin/gh/guilhermeleobas/170/base 2025-12-04T08:54:18.6280847Z * [new branch] gh/guilhermeleobas/170/head -> origin/gh/guilhermeleobas/170/head 2025-12-04T08:54:18.6281063Z * [new branch] gh/guilhermeleobas/170/orig -> origin/gh/guilhermeleobas/170/orig 2025-12-04T08:54:18.6281276Z * [new branch] gh/guilhermeleobas/171/base -> origin/gh/guilhermeleobas/171/base 2025-12-04T08:54:18.6281525Z * [new branch] gh/guilhermeleobas/171/head -> origin/gh/guilhermeleobas/171/head 2025-12-04T08:54:18.6281743Z * [new branch] gh/guilhermeleobas/171/orig -> origin/gh/guilhermeleobas/171/orig 2025-12-04T08:54:18.6281956Z * [new branch] gh/guilhermeleobas/173/base -> origin/gh/guilhermeleobas/173/base 2025-12-04T08:54:18.6282172Z * [new branch] gh/guilhermeleobas/173/head -> origin/gh/guilhermeleobas/173/head 2025-12-04T08:54:18.6282388Z * [new branch] gh/guilhermeleobas/173/orig -> origin/gh/guilhermeleobas/173/orig 2025-12-04T08:54:18.6282605Z * [new branch] gh/guilhermeleobas/193/base -> origin/gh/guilhermeleobas/193/base 2025-12-04T08:54:18.6282823Z * [new branch] gh/guilhermeleobas/193/head -> origin/gh/guilhermeleobas/193/head 2025-12-04T08:54:18.6283040Z * [new branch] gh/guilhermeleobas/193/orig -> origin/gh/guilhermeleobas/193/orig 2025-12-04T08:54:18.6283255Z * [new branch] gh/guilhermeleobas/204/base -> origin/gh/guilhermeleobas/204/base 2025-12-04T08:54:18.6283471Z * [new branch] gh/guilhermeleobas/204/head -> origin/gh/guilhermeleobas/204/head 2025-12-04T08:54:18.6283687Z * [new branch] gh/guilhermeleobas/204/orig -> origin/gh/guilhermeleobas/204/orig 2025-12-04T08:54:18.6283901Z * [new branch] gh/guilhermeleobas/211/base -> origin/gh/guilhermeleobas/211/base 2025-12-04T08:54:18.6284118Z * [new branch] gh/guilhermeleobas/211/head -> origin/gh/guilhermeleobas/211/head 2025-12-04T08:54:18.6284337Z * [new branch] gh/guilhermeleobas/211/orig -> origin/gh/guilhermeleobas/211/orig 2025-12-04T08:54:18.6284550Z * [new branch] gh/guilhermeleobas/226/base -> origin/gh/guilhermeleobas/226/base 2025-12-04T08:54:18.6284764Z * [new branch] gh/guilhermeleobas/226/head -> origin/gh/guilhermeleobas/226/head 2025-12-04T08:54:18.6284983Z * [new branch] gh/guilhermeleobas/226/orig -> origin/gh/guilhermeleobas/226/orig 2025-12-04T08:54:18.6285198Z * [new branch] gh/guilhermeleobas/236/base -> origin/gh/guilhermeleobas/236/base 2025-12-04T08:54:18.6285413Z * [new branch] gh/guilhermeleobas/236/head -> origin/gh/guilhermeleobas/236/head 2025-12-04T08:54:18.6285626Z * [new branch] gh/guilhermeleobas/236/orig -> origin/gh/guilhermeleobas/236/orig 2025-12-04T08:54:18.6285843Z * [new branch] gh/guilhermeleobas/247/base -> origin/gh/guilhermeleobas/247/base 2025-12-04T08:54:18.6286058Z * [new branch] gh/guilhermeleobas/247/head -> origin/gh/guilhermeleobas/247/head 2025-12-04T08:54:18.6286296Z * [new branch] gh/guilhermeleobas/247/orig -> origin/gh/guilhermeleobas/247/orig 2025-12-04T08:54:18.6286511Z * [new branch] gh/guilhermeleobas/248/base -> origin/gh/guilhermeleobas/248/base 2025-12-04T08:54:18.6286726Z * [new branch] gh/guilhermeleobas/248/head -> origin/gh/guilhermeleobas/248/head 2025-12-04T08:54:18.6286941Z * [new branch] gh/guilhermeleobas/248/orig -> origin/gh/guilhermeleobas/248/orig 2025-12-04T08:54:18.6287156Z * [new branch] gh/guilhermeleobas/250/base -> origin/gh/guilhermeleobas/250/base 2025-12-04T08:54:18.6287371Z * [new branch] gh/guilhermeleobas/250/head -> origin/gh/guilhermeleobas/250/head 2025-12-04T08:54:18.6287584Z * [new branch] gh/guilhermeleobas/250/orig -> origin/gh/guilhermeleobas/250/orig 2025-12-04T08:54:18.6287799Z * [new branch] gh/guilhermeleobas/253/base -> origin/gh/guilhermeleobas/253/base 2025-12-04T08:54:18.6288020Z * [new branch] gh/guilhermeleobas/253/head -> origin/gh/guilhermeleobas/253/head 2025-12-04T08:54:18.6288279Z * [new branch] gh/guilhermeleobas/253/orig -> origin/gh/guilhermeleobas/253/orig 2025-12-04T08:54:18.6288495Z * [new branch] gh/guilhermeleobas/254/base -> origin/gh/guilhermeleobas/254/base 2025-12-04T08:54:18.6288737Z * [new branch] gh/guilhermeleobas/254/head -> origin/gh/guilhermeleobas/254/head 2025-12-04T08:54:18.6288950Z * [new branch] gh/guilhermeleobas/254/orig -> origin/gh/guilhermeleobas/254/orig 2025-12-04T08:54:18.6289168Z * [new branch] gh/guilhermeleobas/255/base -> origin/gh/guilhermeleobas/255/base 2025-12-04T08:54:18.6289382Z * [new branch] gh/guilhermeleobas/255/head -> origin/gh/guilhermeleobas/255/head 2025-12-04T08:54:18.6289597Z * [new branch] gh/guilhermeleobas/255/orig -> origin/gh/guilhermeleobas/255/orig 2025-12-04T08:54:18.6289815Z * [new branch] gh/guilhermeleobas/256/base -> origin/gh/guilhermeleobas/256/base 2025-12-04T08:54:18.6290030Z * [new branch] gh/guilhermeleobas/256/head -> origin/gh/guilhermeleobas/256/head 2025-12-04T08:54:18.6290241Z * [new branch] gh/guilhermeleobas/256/orig -> origin/gh/guilhermeleobas/256/orig 2025-12-04T08:54:18.6290457Z * [new branch] gh/guilhermeleobas/257/base -> origin/gh/guilhermeleobas/257/base 2025-12-04T08:54:18.6290672Z * [new branch] gh/guilhermeleobas/257/head -> origin/gh/guilhermeleobas/257/head 2025-12-04T08:54:18.6290884Z * [new branch] gh/guilhermeleobas/257/orig -> origin/gh/guilhermeleobas/257/orig 2025-12-04T08:54:18.6291098Z * [new branch] gh/guilhermeleobas/258/base -> origin/gh/guilhermeleobas/258/base 2025-12-04T08:54:18.6291312Z * [new branch] gh/guilhermeleobas/258/head -> origin/gh/guilhermeleobas/258/head 2025-12-04T08:54:18.6291529Z * [new branch] gh/guilhermeleobas/258/orig -> origin/gh/guilhermeleobas/258/orig 2025-12-04T08:54:18.6291744Z * [new branch] gh/guilhermeleobas/259/base -> origin/gh/guilhermeleobas/259/base 2025-12-04T08:54:18.6291960Z * [new branch] gh/guilhermeleobas/259/head -> origin/gh/guilhermeleobas/259/head 2025-12-04T08:54:18.6292176Z * [new branch] gh/guilhermeleobas/259/orig -> origin/gh/guilhermeleobas/259/orig 2025-12-04T08:54:18.6292389Z * [new branch] gh/guilhermeleobas/260/base -> origin/gh/guilhermeleobas/260/base 2025-12-04T08:54:18.6292601Z * [new branch] gh/guilhermeleobas/260/head -> origin/gh/guilhermeleobas/260/head 2025-12-04T08:54:18.6292813Z * [new branch] gh/guilhermeleobas/260/orig -> origin/gh/guilhermeleobas/260/orig 2025-12-04T08:54:18.6293028Z * [new branch] gh/guilhermeleobas/261/base -> origin/gh/guilhermeleobas/261/base 2025-12-04T08:54:18.6293275Z * [new branch] gh/guilhermeleobas/261/head -> origin/gh/guilhermeleobas/261/head 2025-12-04T08:54:18.6293486Z * [new branch] gh/guilhermeleobas/261/orig -> origin/gh/guilhermeleobas/261/orig 2025-12-04T08:54:18.6293699Z * [new branch] gh/guilhermeleobas/262/base -> origin/gh/guilhermeleobas/262/base 2025-12-04T08:54:18.6293912Z * [new branch] gh/guilhermeleobas/262/head -> origin/gh/guilhermeleobas/262/head 2025-12-04T08:54:18.6294123Z * [new branch] gh/guilhermeleobas/262/orig -> origin/gh/guilhermeleobas/262/orig 2025-12-04T08:54:18.6294335Z * [new branch] gh/guilhermeleobas/263/base -> origin/gh/guilhermeleobas/263/base 2025-12-04T08:54:18.6294548Z * [new branch] gh/guilhermeleobas/263/head -> origin/gh/guilhermeleobas/263/head 2025-12-04T08:54:18.6294761Z * [new branch] gh/guilhermeleobas/263/orig -> origin/gh/guilhermeleobas/263/orig 2025-12-04T08:54:18.6294978Z * [new branch] gh/guilhermeleobas/264/base -> origin/gh/guilhermeleobas/264/base 2025-12-04T08:54:18.6295191Z * [new branch] gh/guilhermeleobas/264/head -> origin/gh/guilhermeleobas/264/head 2025-12-04T08:54:18.6295404Z * [new branch] gh/guilhermeleobas/264/orig -> origin/gh/guilhermeleobas/264/orig 2025-12-04T08:54:18.6295649Z * [new branch] gh/guilhermeleobas/265/base -> origin/gh/guilhermeleobas/265/base 2025-12-04T08:54:18.6295863Z * [new branch] gh/guilhermeleobas/265/head -> origin/gh/guilhermeleobas/265/head 2025-12-04T08:54:18.6296077Z * [new branch] gh/guilhermeleobas/265/orig -> origin/gh/guilhermeleobas/265/orig 2025-12-04T08:54:18.6296290Z * [new branch] gh/guilhermeleobas/266/base -> origin/gh/guilhermeleobas/266/base 2025-12-04T08:54:18.6296505Z * [new branch] gh/guilhermeleobas/266/head -> origin/gh/guilhermeleobas/266/head 2025-12-04T08:54:18.6296723Z * [new branch] gh/guilhermeleobas/266/orig -> origin/gh/guilhermeleobas/266/orig 2025-12-04T08:54:18.6296936Z * [new branch] gh/guilhermeleobas/267/base -> origin/gh/guilhermeleobas/267/base 2025-12-04T08:54:18.6297160Z * [new branch] gh/guilhermeleobas/267/head -> origin/gh/guilhermeleobas/267/head 2025-12-04T08:54:18.6297394Z * [new branch] gh/guilhermeleobas/267/orig -> origin/gh/guilhermeleobas/267/orig 2025-12-04T08:54:18.6297599Z * [new branch] gh/hameerabbasi/1/base -> origin/gh/hameerabbasi/1/base 2025-12-04T08:54:18.6297793Z * [new branch] gh/hameerabbasi/1/head -> origin/gh/hameerabbasi/1/head 2025-12-04T08:54:18.6297984Z * [new branch] gh/hameerabbasi/2/base -> origin/gh/hameerabbasi/2/base 2025-12-04T08:54:18.6298240Z * [new branch] gh/hameerabbasi/2/head -> origin/gh/hameerabbasi/2/head 2025-12-04T08:54:18.6298430Z * [new branch] gh/hameerabbasi/2/orig -> origin/gh/hameerabbasi/2/orig 2025-12-04T08:54:18.6298618Z * [new branch] gh/hameerabbasi/3/base -> origin/gh/hameerabbasi/3/base 2025-12-04T08:54:18.6298807Z * [new branch] gh/hameerabbasi/3/head -> origin/gh/hameerabbasi/3/head 2025-12-04T08:54:18.6298993Z * [new branch] gh/hameerabbasi/3/orig -> origin/gh/hameerabbasi/3/orig 2025-12-04T08:54:18.6299181Z * [new branch] gh/hameerabbasi/4/base -> origin/gh/hameerabbasi/4/base 2025-12-04T08:54:18.6299369Z * [new branch] gh/hameerabbasi/4/head -> origin/gh/hameerabbasi/4/head 2025-12-04T08:54:18.6299556Z * [new branch] gh/hameerabbasi/4/orig -> origin/gh/hameerabbasi/4/orig 2025-12-04T08:54:18.6299739Z * [new branch] gh/huydhn/1/next -> origin/gh/huydhn/1/next 2025-12-04T08:54:18.6299915Z * [new branch] gh/huydhn/2/next -> origin/gh/huydhn/2/next 2025-12-04T08:54:18.6300085Z * [new branch] gh/huydhn/3/next -> origin/gh/huydhn/3/next 2025-12-04T08:54:18.6300285Z * [new branch] gh/huydhn/4/next -> origin/gh/huydhn/4/next 2025-12-04T08:54:18.6300452Z * [new branch] gh/huydhn/5/next -> origin/gh/huydhn/5/next 2025-12-04T08:54:18.6300622Z * [new branch] gh/huydhn/6/next -> origin/gh/huydhn/6/next 2025-12-04T08:54:18.6300792Z * [new branch] gh/int3/97/base -> origin/gh/int3/97/base 2025-12-04T08:54:18.6300958Z * [new branch] gh/int3/97/head -> origin/gh/int3/97/head 2025-12-04T08:54:18.6301130Z * [new branch] gh/isuruf/101/base -> origin/gh/isuruf/101/base 2025-12-04T08:54:18.6301309Z * [new branch] gh/isuruf/101/head -> origin/gh/isuruf/101/head 2025-12-04T08:54:18.6301482Z * [new branch] gh/isuruf/146/base -> origin/gh/isuruf/146/base 2025-12-04T08:54:18.6301656Z * [new branch] gh/isuruf/146/head -> origin/gh/isuruf/146/head 2025-12-04T08:54:18.6301831Z * [new branch] gh/isuruf/146/orig -> origin/gh/isuruf/146/orig 2025-12-04T08:54:18.6302003Z * [new branch] gh/isuruf/158/base -> origin/gh/isuruf/158/base 2025-12-04T08:54:18.6302175Z * [new branch] gh/isuruf/158/head -> origin/gh/isuruf/158/head 2025-12-04T08:54:18.6302379Z * [new branch] gh/isuruf/159/base -> origin/gh/isuruf/159/base 2025-12-04T08:54:18.6302551Z * [new branch] gh/isuruf/159/head -> origin/gh/isuruf/159/head 2025-12-04T08:54:18.6302723Z * [new branch] gh/isuruf/160/base -> origin/gh/isuruf/160/base 2025-12-04T08:54:18.6302896Z * [new branch] gh/isuruf/160/head -> origin/gh/isuruf/160/head 2025-12-04T08:54:18.6303067Z * [new branch] gh/isuruf/160/orig -> origin/gh/isuruf/160/orig 2025-12-04T08:54:18.6303241Z * [new branch] gh/isuruf/81/base -> origin/gh/isuruf/81/base 2025-12-04T08:54:18.6303419Z * [new branch] gh/isuruf/81/head -> origin/gh/isuruf/81/head 2025-12-04T08:54:18.6303589Z * [new branch] gh/isuruf/81/orig -> origin/gh/isuruf/81/orig 2025-12-04T08:54:18.6303767Z * [new branch] gh/jamesjwu/176/base -> origin/gh/jamesjwu/176/base 2025-12-04T08:54:18.6303951Z * [new branch] gh/jamesjwu/176/head -> origin/gh/jamesjwu/176/head 2025-12-04T08:54:18.6304133Z * [new branch] gh/jamesjwu/176/orig -> origin/gh/jamesjwu/176/orig 2025-12-04T08:54:18.6304314Z * [new branch] gh/jamesjwu/187/base -> origin/gh/jamesjwu/187/base 2025-12-04T08:54:18.6304494Z * [new branch] gh/jamesjwu/187/head -> origin/gh/jamesjwu/187/head 2025-12-04T08:54:18.6304674Z * [new branch] gh/jamesjwu/187/orig -> origin/gh/jamesjwu/187/orig 2025-12-04T08:54:18.6304852Z * [new branch] gh/jamesjwu/196/base -> origin/gh/jamesjwu/196/base 2025-12-04T08:54:18.6305032Z * [new branch] gh/jamesjwu/196/head -> origin/gh/jamesjwu/196/head 2025-12-04T08:54:18.6305209Z * [new branch] gh/jamesjwu/196/orig -> origin/gh/jamesjwu/196/orig 2025-12-04T08:54:18.6305388Z * [new branch] gh/jamesjwu/198/base -> origin/gh/jamesjwu/198/base 2025-12-04T08:54:18.6305567Z * [new branch] gh/jamesjwu/198/head -> origin/gh/jamesjwu/198/head 2025-12-04T08:54:18.6305745Z * [new branch] gh/jamesjwu/198/orig -> origin/gh/jamesjwu/198/orig 2025-12-04T08:54:18.6305924Z * [new branch] gh/jamesjwu/207/base -> origin/gh/jamesjwu/207/base 2025-12-04T08:54:18.6306101Z * [new branch] gh/jamesjwu/207/head -> origin/gh/jamesjwu/207/head 2025-12-04T08:54:18.6306280Z * [new branch] gh/jamesjwu/207/orig -> origin/gh/jamesjwu/207/orig 2025-12-04T08:54:18.6306457Z * [new branch] gh/jamesjwu/208/base -> origin/gh/jamesjwu/208/base 2025-12-04T08:54:18.6306983Z * [new branch] gh/jamesjwu/208/head -> origin/gh/jamesjwu/208/head 2025-12-04T08:54:18.6307161Z * [new branch] gh/jamesjwu/208/orig -> origin/gh/jamesjwu/208/orig 2025-12-04T08:54:18.6307342Z * [new branch] gh/jamesjwu/52/base -> origin/gh/jamesjwu/52/base 2025-12-04T08:54:18.6307521Z * [new branch] gh/jamesjwu/52/head -> origin/gh/jamesjwu/52/head 2025-12-04T08:54:18.6307699Z * [new branch] gh/jamesjwu/53/base -> origin/gh/jamesjwu/53/base 2025-12-04T08:54:18.6307877Z * [new branch] gh/jamesjwu/53/head -> origin/gh/jamesjwu/53/head 2025-12-04T08:54:18.6308052Z * [new branch] gh/jamesjwu/54/base -> origin/gh/jamesjwu/54/base 2025-12-04T08:54:18.6308278Z * [new branch] gh/jamesjwu/54/head -> origin/gh/jamesjwu/54/head 2025-12-04T08:54:18.6308454Z * [new branch] gh/jamesjwu/55/base -> origin/gh/jamesjwu/55/base 2025-12-04T08:54:18.6308635Z * [new branch] gh/jamesjwu/55/head -> origin/gh/jamesjwu/55/head 2025-12-04T08:54:18.6308813Z * [new branch] gh/jamesjwu/56/base -> origin/gh/jamesjwu/56/base 2025-12-04T08:54:18.6308987Z * [new branch] gh/jamesjwu/56/head -> origin/gh/jamesjwu/56/head 2025-12-04T08:54:18.6309196Z * [new branch] gh/jamesjwu/57/base -> origin/gh/jamesjwu/57/base 2025-12-04T08:54:18.6309375Z * [new branch] gh/jamesjwu/57/head -> origin/gh/jamesjwu/57/head 2025-12-04T08:54:18.6309549Z * [new branch] gh/jamesjwu/58/base -> origin/gh/jamesjwu/58/base 2025-12-04T08:54:18.6309727Z * [new branch] gh/jamesjwu/58/head -> origin/gh/jamesjwu/58/head 2025-12-04T08:54:18.6309903Z * [new branch] gh/jamesjwu/59/base -> origin/gh/jamesjwu/59/base 2025-12-04T08:54:18.6310077Z * [new branch] gh/jamesjwu/59/head -> origin/gh/jamesjwu/59/head 2025-12-04T08:54:18.6310259Z * [new branch] gh/jamesjwu/60/base -> origin/gh/jamesjwu/60/base 2025-12-04T08:54:18.6310435Z * [new branch] gh/jamesjwu/60/head -> origin/gh/jamesjwu/60/head 2025-12-04T08:54:18.6310610Z * [new branch] gh/jamesjwu/61/base -> origin/gh/jamesjwu/61/base 2025-12-04T08:54:18.6310788Z * [new branch] gh/jamesjwu/61/head -> origin/gh/jamesjwu/61/head 2025-12-04T08:54:18.6310965Z * [new branch] gh/jamesjwu/62/base -> origin/gh/jamesjwu/62/base 2025-12-04T08:54:18.6311141Z * [new branch] gh/jamesjwu/62/head -> origin/gh/jamesjwu/62/head 2025-12-04T08:54:18.6311317Z * [new branch] gh/jamesjwu/63/base -> origin/gh/jamesjwu/63/base 2025-12-04T08:54:18.6311494Z * [new branch] gh/jamesjwu/63/head -> origin/gh/jamesjwu/63/head 2025-12-04T08:54:18.6311670Z * [new branch] gh/jamesjwu/64/base -> origin/gh/jamesjwu/64/base 2025-12-04T08:54:18.6311849Z * [new branch] gh/jamesjwu/64/head -> origin/gh/jamesjwu/64/head 2025-12-04T08:54:18.6311918Z * [new branch] gh/jamesjwu/65/base -> origin/gh/jamesjwu/65/base 2025-12-04T08:54:18.6311987Z * [new branch] gh/jamesjwu/65/head -> origin/gh/jamesjwu/65/head 2025-12-04T08:54:18.6312062Z * [new branch] gh/janeyx99/165/base -> origin/gh/janeyx99/165/base 2025-12-04T08:54:18.6312133Z * [new branch] gh/janeyx99/165/head -> origin/gh/janeyx99/165/head 2025-12-04T08:54:18.6312205Z * [new branch] gh/janeyx99/165/orig -> origin/gh/janeyx99/165/orig 2025-12-04T08:54:18.6312274Z * [new branch] gh/janeyx99/201/base -> origin/gh/janeyx99/201/base 2025-12-04T08:54:18.6312344Z * [new branch] gh/janeyx99/201/head -> origin/gh/janeyx99/201/head 2025-12-04T08:54:18.6312414Z * [new branch] gh/janeyx99/201/orig -> origin/gh/janeyx99/201/orig 2025-12-04T08:54:18.6312519Z * [new branch] gh/janeyx99/225/base -> origin/gh/janeyx99/225/base 2025-12-04T08:54:18.6312588Z * [new branch] gh/janeyx99/225/head -> origin/gh/janeyx99/225/head 2025-12-04T08:54:18.6312659Z * [new branch] gh/janeyx99/225/orig -> origin/gh/janeyx99/225/orig 2025-12-04T08:54:18.6312730Z * [new branch] gh/janeyx99/299/base -> origin/gh/janeyx99/299/base 2025-12-04T08:54:18.6312800Z * [new branch] gh/janeyx99/299/head -> origin/gh/janeyx99/299/head 2025-12-04T08:54:18.6312872Z * [new branch] gh/janeyx99/299/orig -> origin/gh/janeyx99/299/orig 2025-12-04T08:54:18.6312941Z * [new branch] gh/janeyx99/302/base -> origin/gh/janeyx99/302/base 2025-12-04T08:54:18.6313010Z * [new branch] gh/janeyx99/302/head -> origin/gh/janeyx99/302/head 2025-12-04T08:54:18.6313082Z * [new branch] gh/janeyx99/303/base -> origin/gh/janeyx99/303/base 2025-12-04T08:54:18.6313151Z * [new branch] gh/janeyx99/303/head -> origin/gh/janeyx99/303/head 2025-12-04T08:54:18.6313221Z * [new branch] gh/janeyx99/305/base -> origin/gh/janeyx99/305/base 2025-12-04T08:54:18.6313292Z * [new branch] gh/janeyx99/305/head -> origin/gh/janeyx99/305/head 2025-12-04T08:54:18.6313383Z * [new branch] gh/janeyx99/306/base -> origin/gh/janeyx99/306/base 2025-12-04T08:54:18.6313453Z * [new branch] gh/janeyx99/306/head -> origin/gh/janeyx99/306/head 2025-12-04T08:54:18.6313523Z * [new branch] gh/janeyx99/314/base -> origin/gh/janeyx99/314/base 2025-12-04T08:54:18.6313592Z * [new branch] gh/janeyx99/314/head -> origin/gh/janeyx99/314/head 2025-12-04T08:54:18.6313663Z * [new branch] gh/janeyx99/314/orig -> origin/gh/janeyx99/314/orig 2025-12-04T08:54:18.6313733Z * [new branch] gh/janeyx99/315/base -> origin/gh/janeyx99/315/base 2025-12-04T08:54:18.6313802Z * [new branch] gh/janeyx99/315/head -> origin/gh/janeyx99/315/head 2025-12-04T08:54:18.6313873Z * [new branch] gh/janeyx99/315/orig -> origin/gh/janeyx99/315/orig 2025-12-04T08:54:18.6313942Z * [new branch] gh/janeyx99/316/base -> origin/gh/janeyx99/316/base 2025-12-04T08:54:18.6314012Z * [new branch] gh/janeyx99/316/head -> origin/gh/janeyx99/316/head 2025-12-04T08:54:18.6314083Z * [new branch] gh/janeyx99/316/orig -> origin/gh/janeyx99/316/orig 2025-12-04T08:54:18.6314152Z * [new branch] gh/janeyx99/317/base -> origin/gh/janeyx99/317/base 2025-12-04T08:54:18.6314221Z * [new branch] gh/janeyx99/317/head -> origin/gh/janeyx99/317/head 2025-12-04T08:54:18.6314291Z * [new branch] gh/janeyx99/317/orig -> origin/gh/janeyx99/317/orig 2025-12-04T08:54:18.6314365Z * [new branch] gh/janeyx99/325/base -> origin/gh/janeyx99/325/base 2025-12-04T08:54:18.6314434Z * [new branch] gh/janeyx99/325/head -> origin/gh/janeyx99/325/head 2025-12-04T08:54:18.6314505Z * [new branch] gh/janeyx99/325/orig -> origin/gh/janeyx99/325/orig 2025-12-04T08:54:18.6314574Z * [new branch] gh/janeyx99/327/base -> origin/gh/janeyx99/327/base 2025-12-04T08:54:18.6314644Z * [new branch] gh/janeyx99/327/head -> origin/gh/janeyx99/327/head 2025-12-04T08:54:18.6314714Z * [new branch] gh/janeyx99/327/orig -> origin/gh/janeyx99/327/orig 2025-12-04T08:54:18.6314783Z * [new branch] gh/janeyx99/328/base -> origin/gh/janeyx99/328/base 2025-12-04T08:54:18.6314852Z * [new branch] gh/janeyx99/328/head -> origin/gh/janeyx99/328/head 2025-12-04T08:54:18.6314923Z * [new branch] gh/janeyx99/328/orig -> origin/gh/janeyx99/328/orig 2025-12-04T08:54:18.6315018Z * [new branch] gh/janeyx99/329/base -> origin/gh/janeyx99/329/base 2025-12-04T08:54:18.6315088Z * [new branch] gh/janeyx99/329/head -> origin/gh/janeyx99/329/head 2025-12-04T08:54:18.6315157Z * [new branch] gh/janeyx99/329/orig -> origin/gh/janeyx99/329/orig 2025-12-04T08:54:18.6315358Z * [new branch] gh/janeyx99/330/base -> origin/gh/janeyx99/330/base 2025-12-04T08:54:18.6315433Z * [new branch] gh/janeyx99/330/head -> origin/gh/janeyx99/330/head 2025-12-04T08:54:18.6315502Z * [new branch] gh/janeyx99/330/orig -> origin/gh/janeyx99/330/orig 2025-12-04T08:54:18.6315571Z * [new branch] gh/janeyx99/331/base -> origin/gh/janeyx99/331/base 2025-12-04T08:54:18.6315642Z * [new branch] gh/janeyx99/331/head -> origin/gh/janeyx99/331/head 2025-12-04T08:54:18.6315711Z * [new branch] gh/janeyx99/331/orig -> origin/gh/janeyx99/331/orig 2025-12-04T08:54:18.6315781Z * [new branch] gh/janeyx99/332/base -> origin/gh/janeyx99/332/base 2025-12-04T08:54:18.6315852Z * [new branch] gh/janeyx99/332/head -> origin/gh/janeyx99/332/head 2025-12-04T08:54:18.6315921Z * [new branch] gh/janeyx99/332/orig -> origin/gh/janeyx99/332/orig 2025-12-04T08:54:18.6316013Z * [new branch] gh/janeyx99/333/base -> origin/gh/janeyx99/333/base 2025-12-04T08:54:18.6316085Z * [new branch] gh/janeyx99/333/head -> origin/gh/janeyx99/333/head 2025-12-04T08:54:18.6316154Z * [new branch] gh/janeyx99/333/orig -> origin/gh/janeyx99/333/orig 2025-12-04T08:54:18.6316223Z * [new branch] gh/janeyx99/88/base -> origin/gh/janeyx99/88/base 2025-12-04T08:54:18.6316294Z * [new branch] gh/janeyx99/88/head -> origin/gh/janeyx99/88/head 2025-12-04T08:54:18.6316362Z * [new branch] gh/janeyx99/88/orig -> origin/gh/janeyx99/88/orig 2025-12-04T08:54:18.6316433Z * [new branch] gh/jansel/360/base -> origin/gh/jansel/360/base 2025-12-04T08:54:18.6316503Z * [new branch] gh/jansel/360/head -> origin/gh/jansel/360/head 2025-12-04T08:54:18.6316570Z * [new branch] gh/jansel/451/base -> origin/gh/jansel/451/base 2025-12-04T08:54:18.6316639Z * [new branch] gh/jansel/451/head -> origin/gh/jansel/451/head 2025-12-04T08:54:18.6316708Z * [new branch] gh/jansel/451/orig -> origin/gh/jansel/451/orig 2025-12-04T08:54:18.6316774Z * [new branch] gh/jansel/462/base -> origin/gh/jansel/462/base 2025-12-04T08:54:18.6316842Z * [new branch] gh/jansel/462/head -> origin/gh/jansel/462/head 2025-12-04T08:54:18.6316908Z * [new branch] gh/jansel/462/orig -> origin/gh/jansel/462/orig 2025-12-04T08:54:18.6316975Z * [new branch] gh/jansel/533/base -> origin/gh/jansel/533/base 2025-12-04T08:54:18.6317046Z * [new branch] gh/jansel/533/head -> origin/gh/jansel/533/head 2025-12-04T08:54:18.6317113Z * [new branch] gh/jansel/533/orig -> origin/gh/jansel/533/orig 2025-12-04T08:54:18.6317179Z * [new branch] gh/jansel/552/base -> origin/gh/jansel/552/base 2025-12-04T08:54:18.6317247Z * [new branch] gh/jansel/552/head -> origin/gh/jansel/552/head 2025-12-04T08:54:18.6317315Z * [new branch] gh/jansel/552/orig -> origin/gh/jansel/552/orig 2025-12-04T08:54:18.6317383Z * [new branch] gh/jansel/553/base -> origin/gh/jansel/553/base 2025-12-04T08:54:18.6317451Z * [new branch] gh/jansel/553/head -> origin/gh/jansel/553/head 2025-12-04T08:54:18.6317518Z * [new branch] gh/jansel/553/orig -> origin/gh/jansel/553/orig 2025-12-04T08:54:18.6317583Z * [new branch] gh/jansel/554/base -> origin/gh/jansel/554/base 2025-12-04T08:54:18.6317671Z * [new branch] gh/jansel/554/head -> origin/gh/jansel/554/head 2025-12-04T08:54:18.6317738Z * [new branch] gh/jansel/554/orig -> origin/gh/jansel/554/orig 2025-12-04T08:54:18.6317804Z * [new branch] gh/jansel/555/base -> origin/gh/jansel/555/base 2025-12-04T08:54:18.6317872Z * [new branch] gh/jansel/555/head -> origin/gh/jansel/555/head 2025-12-04T08:54:18.6317939Z * [new branch] gh/jansel/555/orig -> origin/gh/jansel/555/orig 2025-12-04T08:54:18.6318007Z * [new branch] gh/jansel/556/base -> origin/gh/jansel/556/base 2025-12-04T08:54:18.6318131Z * [new branch] gh/jansel/556/head -> origin/gh/jansel/556/head 2025-12-04T08:54:18.6318200Z * [new branch] gh/jansel/556/orig -> origin/gh/jansel/556/orig 2025-12-04T08:54:18.6318266Z * [new branch] gh/jansel/557/base -> origin/gh/jansel/557/base 2025-12-04T08:54:18.6318336Z * [new branch] gh/jansel/557/head -> origin/gh/jansel/557/head 2025-12-04T08:54:18.6318403Z * [new branch] gh/jansel/557/orig -> origin/gh/jansel/557/orig 2025-12-04T08:54:18.6318470Z * [new branch] gh/jansel/558/base -> origin/gh/jansel/558/base 2025-12-04T08:54:18.6318537Z * [new branch] gh/jansel/558/head -> origin/gh/jansel/558/head 2025-12-04T08:54:18.6318632Z * [new branch] gh/jansel/558/orig -> origin/gh/jansel/558/orig 2025-12-04T08:54:18.6318701Z * [new branch] gh/jansel/559/base -> origin/gh/jansel/559/base 2025-12-04T08:54:18.6318767Z * [new branch] gh/jansel/559/head -> origin/gh/jansel/559/head 2025-12-04T08:54:18.6318834Z * [new branch] gh/jansel/559/orig -> origin/gh/jansel/559/orig 2025-12-04T08:54:18.6318902Z * [new branch] gh/jansel/560/base -> origin/gh/jansel/560/base 2025-12-04T08:54:18.6318970Z * [new branch] gh/jansel/560/head -> origin/gh/jansel/560/head 2025-12-04T08:54:18.6319036Z * [new branch] gh/jansel/560/orig -> origin/gh/jansel/560/orig 2025-12-04T08:54:18.6319104Z * [new branch] gh/jansel/561/base -> origin/gh/jansel/561/base 2025-12-04T08:54:18.6319169Z * [new branch] gh/jansel/561/head -> origin/gh/jansel/561/head 2025-12-04T08:54:18.6319237Z * [new branch] gh/jansel/561/orig -> origin/gh/jansel/561/orig 2025-12-04T08:54:18.6319306Z * [new branch] gh/jansel/562/base -> origin/gh/jansel/562/base 2025-12-04T08:54:18.6319372Z * [new branch] gh/jansel/562/head -> origin/gh/jansel/562/head 2025-12-04T08:54:18.6319439Z * [new branch] gh/jansel/562/orig -> origin/gh/jansel/562/orig 2025-12-04T08:54:18.6319507Z * [new branch] gh/jansel/563/base -> origin/gh/jansel/563/base 2025-12-04T08:54:18.6319575Z * [new branch] gh/jansel/563/head -> origin/gh/jansel/563/head 2025-12-04T08:54:18.6319641Z * [new branch] gh/jansel/563/orig -> origin/gh/jansel/563/orig 2025-12-04T08:54:18.6319708Z * [new branch] gh/jansel/564/base -> origin/gh/jansel/564/base 2025-12-04T08:54:18.6319774Z * [new branch] gh/jansel/564/head -> origin/gh/jansel/564/head 2025-12-04T08:54:18.6319842Z * [new branch] gh/jansel/564/orig -> origin/gh/jansel/564/orig 2025-12-04T08:54:18.6319910Z * [new branch] gh/jansel/565/base -> origin/gh/jansel/565/base 2025-12-04T08:54:18.6319977Z * [new branch] gh/jansel/565/head -> origin/gh/jansel/565/head 2025-12-04T08:54:18.6320045Z * [new branch] gh/jansel/565/orig -> origin/gh/jansel/565/orig 2025-12-04T08:54:18.6320112Z * [new branch] gh/jansel/566/base -> origin/gh/jansel/566/base 2025-12-04T08:54:18.6320210Z * [new branch] gh/jansel/566/head -> origin/gh/jansel/566/head 2025-12-04T08:54:18.6320280Z * [new branch] gh/jansel/566/orig -> origin/gh/jansel/566/orig 2025-12-04T08:54:18.6320347Z * [new branch] gh/jansel/567/base -> origin/gh/jansel/567/base 2025-12-04T08:54:18.6320413Z * [new branch] gh/jansel/567/head -> origin/gh/jansel/567/head 2025-12-04T08:54:18.6320483Z * [new branch] gh/jansel/567/orig -> origin/gh/jansel/567/orig 2025-12-04T08:54:18.6320550Z * [new branch] gh/jansel/568/base -> origin/gh/jansel/568/base 2025-12-04T08:54:18.6320617Z * [new branch] gh/jansel/568/head -> origin/gh/jansel/568/head 2025-12-04T08:54:18.6320685Z * [new branch] gh/jansel/568/orig -> origin/gh/jansel/568/orig 2025-12-04T08:54:18.6320752Z * [new branch] gh/jansel/569/base -> origin/gh/jansel/569/base 2025-12-04T08:54:18.6320820Z * [new branch] gh/jansel/569/head -> origin/gh/jansel/569/head 2025-12-04T08:54:18.6320890Z * [new branch] gh/jansel/569/orig -> origin/gh/jansel/569/orig 2025-12-04T08:54:18.6320957Z * [new branch] gh/jansel/570/base -> origin/gh/jansel/570/base 2025-12-04T08:54:18.6321024Z * [new branch] gh/jansel/570/head -> origin/gh/jansel/570/head 2025-12-04T08:54:18.6321110Z * [new branch] gh/jansel/570/orig -> origin/gh/jansel/570/orig 2025-12-04T08:54:18.6321178Z * [new branch] gh/jansel/571/base -> origin/gh/jansel/571/base 2025-12-04T08:54:18.6321244Z * [new branch] gh/jansel/571/head -> origin/gh/jansel/571/head 2025-12-04T08:54:18.6321313Z * [new branch] gh/jansel/571/orig -> origin/gh/jansel/571/orig 2025-12-04T08:54:18.6321379Z * [new branch] gh/jansel/572/base -> origin/gh/jansel/572/base 2025-12-04T08:54:18.6321446Z * [new branch] gh/jansel/572/head -> origin/gh/jansel/572/head 2025-12-04T08:54:18.6321517Z * [new branch] gh/jansel/572/orig -> origin/gh/jansel/572/orig 2025-12-04T08:54:18.6321584Z * [new branch] gh/jansel/573/base -> origin/gh/jansel/573/base 2025-12-04T08:54:18.6321652Z * [new branch] gh/jansel/573/head -> origin/gh/jansel/573/head 2025-12-04T08:54:18.6321721Z * [new branch] gh/jansel/573/orig -> origin/gh/jansel/573/orig 2025-12-04T08:54:18.6321788Z * [new branch] gh/jansel/574/base -> origin/gh/jansel/574/base 2025-12-04T08:54:18.6321858Z * [new branch] gh/jansel/574/head -> origin/gh/jansel/574/head 2025-12-04T08:54:18.6321924Z * [new branch] gh/jansel/574/orig -> origin/gh/jansel/574/orig 2025-12-04T08:54:18.6321991Z * [new branch] gh/jansel/575/base -> origin/gh/jansel/575/base 2025-12-04T08:54:18.6322058Z * [new branch] gh/jansel/575/head -> origin/gh/jansel/575/head 2025-12-04T08:54:18.6322125Z * [new branch] gh/jansel/575/orig -> origin/gh/jansel/575/orig 2025-12-04T08:54:18.6322191Z * [new branch] gh/jansel/576/base -> origin/gh/jansel/576/base 2025-12-04T08:54:18.6322259Z * [new branch] gh/jansel/576/head -> origin/gh/jansel/576/head 2025-12-04T08:54:18.6322327Z * [new branch] gh/jansel/576/orig -> origin/gh/jansel/576/orig 2025-12-04T08:54:18.6322408Z * [new branch] gh/jbschlosser/247/base -> origin/gh/jbschlosser/247/base 2025-12-04T08:54:18.6322488Z * [new branch] gh/jbschlosser/247/head -> origin/gh/jbschlosser/247/head 2025-12-04T08:54:18.6322564Z * [new branch] gh/jbschlosser/247/orig -> origin/gh/jbschlosser/247/orig 2025-12-04T08:54:18.6322639Z * [new branch] gh/jbschlosser/250/base -> origin/gh/jbschlosser/250/base 2025-12-04T08:54:18.6322735Z * [new branch] gh/jbschlosser/250/head -> origin/gh/jbschlosser/250/head 2025-12-04T08:54:18.6322809Z * [new branch] gh/jbschlosser/250/orig -> origin/gh/jbschlosser/250/orig 2025-12-04T08:54:18.6322881Z * [new branch] gh/jerryzh168/1/base -> origin/gh/jerryzh168/1/base 2025-12-04T08:54:18.6322955Z * [new branch] gh/jerryzh168/1/head -> origin/gh/jerryzh168/1/head 2025-12-04T08:54:18.6323027Z * [new branch] gh/jerryzh168/1/orig -> origin/gh/jerryzh168/1/orig 2025-12-04T08:54:18.6323100Z * [new branch] gh/jiayisunx/59/base -> origin/gh/jiayisunx/59/base 2025-12-04T08:54:18.6323171Z * [new branch] gh/jiayisunx/59/head -> origin/gh/jiayisunx/59/head 2025-12-04T08:54:18.6323242Z * [new branch] gh/jiayisunx/59/orig -> origin/gh/jiayisunx/59/orig 2025-12-04T08:54:18.6323314Z * [new branch] gh/jiayisunx/61/base -> origin/gh/jiayisunx/61/base 2025-12-04T08:54:18.6323385Z * [new branch] gh/jiayisunx/61/head -> origin/gh/jiayisunx/61/head 2025-12-04T08:54:18.6323456Z * [new branch] gh/jiayisunx/61/orig -> origin/gh/jiayisunx/61/orig 2025-12-04T08:54:18.6323528Z * [new branch] gh/jiayisunx/68/base -> origin/gh/jiayisunx/68/base 2025-12-04T08:54:18.6323598Z * [new branch] gh/jiayisunx/68/head -> origin/gh/jiayisunx/68/head 2025-12-04T08:54:18.6323688Z * [new branch] gh/jiayisunx/68/orig -> origin/gh/jiayisunx/68/orig 2025-12-04T08:54:18.6323760Z * [new branch] gh/jiayisunx/77/base -> origin/gh/jiayisunx/77/base 2025-12-04T08:54:18.6323833Z * [new branch] gh/jiayisunx/77/head -> origin/gh/jiayisunx/77/head 2025-12-04T08:54:18.6323904Z * [new branch] gh/jiayisunx/77/orig -> origin/gh/jiayisunx/77/orig 2025-12-04T08:54:18.6323975Z * [new branch] gh/jiayisunx/78/base -> origin/gh/jiayisunx/78/base 2025-12-04T08:54:18.6324047Z * [new branch] gh/jiayisunx/78/head -> origin/gh/jiayisunx/78/head 2025-12-04T08:54:18.6324118Z * [new branch] gh/jiayisunx/78/orig -> origin/gh/jiayisunx/78/orig 2025-12-04T08:54:18.6324190Z * [new branch] gh/jiayisunx/79/base -> origin/gh/jiayisunx/79/base 2025-12-04T08:54:18.6324260Z * [new branch] gh/jiayisunx/79/head -> origin/gh/jiayisunx/79/head 2025-12-04T08:54:18.6324333Z * [new branch] gh/jiayisunx/79/orig -> origin/gh/jiayisunx/79/orig 2025-12-04T08:54:18.6324405Z * [new branch] gh/jiayisunx/82/base -> origin/gh/jiayisunx/82/base 2025-12-04T08:54:18.6324474Z * [new branch] gh/jiayisunx/82/head -> origin/gh/jiayisunx/82/head 2025-12-04T08:54:18.6324545Z * [new branch] gh/jiayisunx/82/orig -> origin/gh/jiayisunx/82/orig 2025-12-04T08:54:18.6324619Z * [new branch] gh/jiayisunx/83/base -> origin/gh/jiayisunx/83/base 2025-12-04T08:54:18.6324691Z * [new branch] gh/jiayisunx/83/head -> origin/gh/jiayisunx/83/head 2025-12-04T08:54:18.6324762Z * [new branch] gh/jiayisunx/83/orig -> origin/gh/jiayisunx/83/orig 2025-12-04T08:54:18.6324834Z * [new branch] gh/jiayisunx/84/base -> origin/gh/jiayisunx/84/base 2025-12-04T08:54:18.6324906Z * [new branch] gh/jiayisunx/84/head -> origin/gh/jiayisunx/84/head 2025-12-04T08:54:18.6324978Z * [new branch] gh/jiayisunx/84/orig -> origin/gh/jiayisunx/84/orig 2025-12-04T08:54:18.6325050Z * [new branch] gh/jiayisunx/85/base -> origin/gh/jiayisunx/85/base 2025-12-04T08:54:18.6325120Z * [new branch] gh/jiayisunx/85/head -> origin/gh/jiayisunx/85/head 2025-12-04T08:54:18.6325191Z * [new branch] gh/jiayisunx/85/orig -> origin/gh/jiayisunx/85/orig 2025-12-04T08:54:18.6325261Z * [new branch] gh/jiayisunx/86/base -> origin/gh/jiayisunx/86/base 2025-12-04T08:54:18.6325358Z * [new branch] gh/jiayisunx/86/head -> origin/gh/jiayisunx/86/head 2025-12-04T08:54:18.6325432Z * [new branch] gh/jiayisunx/86/orig -> origin/gh/jiayisunx/86/orig 2025-12-04T08:54:18.6325502Z * [new branch] gh/jiayisunx/87/base -> origin/gh/jiayisunx/87/base 2025-12-04T08:54:18.6325574Z * [new branch] gh/jiayisunx/87/head -> origin/gh/jiayisunx/87/head 2025-12-04T08:54:18.6325646Z * [new branch] gh/jiayisunx/87/orig -> origin/gh/jiayisunx/87/orig 2025-12-04T08:54:18.6325718Z * [new branch] gh/jiayisunx/88/base -> origin/gh/jiayisunx/88/base 2025-12-04T08:54:18.6325788Z * [new branch] gh/jiayisunx/88/head -> origin/gh/jiayisunx/88/head 2025-12-04T08:54:18.6325860Z * [new branch] gh/jiayisunx/88/orig -> origin/gh/jiayisunx/88/orig 2025-12-04T08:54:18.6325930Z * [new branch] gh/jiayisunx/89/base -> origin/gh/jiayisunx/89/base 2025-12-04T08:54:18.6326002Z * [new branch] gh/jiayisunx/89/head -> origin/gh/jiayisunx/89/head 2025-12-04T08:54:18.6326073Z * [new branch] gh/jiayisunx/89/orig -> origin/gh/jiayisunx/89/orig 2025-12-04T08:54:18.6326144Z * [new branch] gh/jiayisunx/90/base -> origin/gh/jiayisunx/90/base 2025-12-04T08:54:18.6326239Z * [new branch] gh/jiayisunx/90/head -> origin/gh/jiayisunx/90/head 2025-12-04T08:54:18.6326313Z * [new branch] gh/jiayisunx/90/orig -> origin/gh/jiayisunx/90/orig 2025-12-04T08:54:18.6326389Z * [new branch] gh/jjwu@meta.com/1/base -> origin/gh/jjwu@meta.com/1/base 2025-12-04T08:54:18.6326465Z * [new branch] gh/jjwu@meta.com/1/head -> origin/gh/jjwu@meta.com/1/head 2025-12-04T08:54:18.6326536Z * [new branch] gh/jturney/1/base -> origin/gh/jturney/1/base 2025-12-04T08:54:18.6326604Z * [new branch] gh/jturney/1/head -> origin/gh/jturney/1/head 2025-12-04T08:54:18.6326675Z * [new branch] gh/jturney/1/orig -> origin/gh/jturney/1/orig 2025-12-04T08:54:18.6326743Z * [new branch] gh/jturney/2/base -> origin/gh/jturney/2/base 2025-12-04T08:54:18.6326810Z * [new branch] gh/jturney/2/head -> origin/gh/jturney/2/head 2025-12-04T08:54:18.6326878Z * [new branch] gh/jturney/2/orig -> origin/gh/jturney/2/orig 2025-12-04T08:54:18.6326955Z * [new branch] gh/karthickai/10/base -> origin/gh/karthickai/10/base 2025-12-04T08:54:18.6327030Z * [new branch] gh/karthickai/10/head -> origin/gh/karthickai/10/head 2025-12-04T08:54:18.6327106Z * [new branch] gh/karthickai/10/orig -> origin/gh/karthickai/10/orig 2025-12-04T08:54:18.6327178Z * [new branch] gh/karthickai/11/base -> origin/gh/karthickai/11/base 2025-12-04T08:54:18.6327251Z * [new branch] gh/karthickai/11/head -> origin/gh/karthickai/11/head 2025-12-04T08:54:18.6327329Z * [new branch] gh/karthickai/11/orig -> origin/gh/karthickai/11/orig 2025-12-04T08:54:18.6327402Z * [new branch] gh/karthickai/12/base -> origin/gh/karthickai/12/base 2025-12-04T08:54:18.6327474Z * [new branch] gh/karthickai/12/head -> origin/gh/karthickai/12/head 2025-12-04T08:54:18.6327548Z * [new branch] gh/karthickai/12/orig -> origin/gh/karthickai/12/orig 2025-12-04T08:54:18.6327620Z * [new branch] gh/karthickai/13/base -> origin/gh/karthickai/13/base 2025-12-04T08:54:18.6327693Z * [new branch] gh/karthickai/13/head -> origin/gh/karthickai/13/head 2025-12-04T08:54:18.6327767Z * [new branch] gh/karthickai/13/orig -> origin/gh/karthickai/13/orig 2025-12-04T08:54:18.6327840Z * [new branch] gh/karthickai/14/base -> origin/gh/karthickai/14/base 2025-12-04T08:54:18.6327914Z * [new branch] gh/karthickai/14/head -> origin/gh/karthickai/14/head 2025-12-04T08:54:18.6328006Z * [new branch] gh/karthickai/14/orig -> origin/gh/karthickai/14/orig 2025-12-04T08:54:18.6328121Z * [new branch] gh/karthickai/15/base -> origin/gh/karthickai/15/base 2025-12-04T08:54:18.6328197Z * [new branch] gh/karthickai/15/head -> origin/gh/karthickai/15/head 2025-12-04T08:54:18.6328270Z * [new branch] gh/karthickai/15/orig -> origin/gh/karthickai/15/orig 2025-12-04T08:54:18.6328342Z * [new branch] gh/karthickai/16/base -> origin/gh/karthickai/16/base 2025-12-04T08:54:18.6328415Z * [new branch] gh/karthickai/16/head -> origin/gh/karthickai/16/head 2025-12-04T08:54:18.6328487Z * [new branch] gh/karthickai/16/orig -> origin/gh/karthickai/16/orig 2025-12-04T08:54:18.6328558Z * [new branch] gh/karthickai/17/base -> origin/gh/karthickai/17/base 2025-12-04T08:54:18.6328635Z * [new branch] gh/karthickai/17/head -> origin/gh/karthickai/17/head 2025-12-04T08:54:18.6328707Z * [new branch] gh/karthickai/17/orig -> origin/gh/karthickai/17/orig 2025-12-04T08:54:18.6328778Z * [new branch] gh/karthickai/18/base -> origin/gh/karthickai/18/base 2025-12-04T08:54:18.6328852Z * [new branch] gh/karthickai/18/head -> origin/gh/karthickai/18/head 2025-12-04T08:54:18.6328950Z * [new branch] gh/karthickai/18/orig -> origin/gh/karthickai/18/orig 2025-12-04T08:54:18.6329025Z * [new branch] gh/karthickai/19/base -> origin/gh/karthickai/19/base 2025-12-04T08:54:18.6329099Z * [new branch] gh/karthickai/19/head -> origin/gh/karthickai/19/head 2025-12-04T08:54:18.6329172Z * [new branch] gh/karthickai/19/orig -> origin/gh/karthickai/19/orig 2025-12-04T08:54:18.6329245Z * [new branch] gh/karthickai/20/base -> origin/gh/karthickai/20/base 2025-12-04T08:54:18.6329322Z * [new branch] gh/karthickai/20/head -> origin/gh/karthickai/20/head 2025-12-04T08:54:18.6329395Z * [new branch] gh/karthickai/20/orig -> origin/gh/karthickai/20/orig 2025-12-04T08:54:18.6329467Z * [new branch] gh/karthickai/21/base -> origin/gh/karthickai/21/base 2025-12-04T08:54:18.6329541Z * [new branch] gh/karthickai/21/head -> origin/gh/karthickai/21/head 2025-12-04T08:54:18.6329615Z * [new branch] gh/karthickai/21/orig -> origin/gh/karthickai/21/orig 2025-12-04T08:54:18.6329688Z * [new branch] gh/karthickai/22/base -> origin/gh/karthickai/22/base 2025-12-04T08:54:18.6329760Z * [new branch] gh/karthickai/22/head -> origin/gh/karthickai/22/head 2025-12-04T08:54:18.6329831Z * [new branch] gh/karthickai/22/orig -> origin/gh/karthickai/22/orig 2025-12-04T08:54:18.6329905Z * [new branch] gh/karthickai/23/base -> origin/gh/karthickai/23/base 2025-12-04T08:54:18.6329981Z * [new branch] gh/karthickai/23/head -> origin/gh/karthickai/23/head 2025-12-04T08:54:18.6330054Z * [new branch] gh/karthickai/23/orig -> origin/gh/karthickai/23/orig 2025-12-04T08:54:18.6330128Z * [new branch] gh/karthickai/24/base -> origin/gh/karthickai/24/base 2025-12-04T08:54:18.6330202Z * [new branch] gh/karthickai/24/head -> origin/gh/karthickai/24/head 2025-12-04T08:54:18.6330274Z * [new branch] gh/karthickai/24/orig -> origin/gh/karthickai/24/orig 2025-12-04T08:54:18.6330348Z * [new branch] gh/karthickai/25/base -> origin/gh/karthickai/25/base 2025-12-04T08:54:18.6330420Z * [new branch] gh/karthickai/25/head -> origin/gh/karthickai/25/head 2025-12-04T08:54:18.6330492Z * [new branch] gh/karthickai/25/orig -> origin/gh/karthickai/25/orig 2025-12-04T08:54:18.6330566Z * [new branch] gh/karthickai/26/base -> origin/gh/karthickai/26/base 2025-12-04T08:54:18.6330670Z * [new branch] gh/karthickai/26/head -> origin/gh/karthickai/26/head 2025-12-04T08:54:18.6330741Z * [new branch] gh/karthickai/26/orig -> origin/gh/karthickai/26/orig 2025-12-04T08:54:18.6330816Z * [new branch] gh/karthickai/6/base -> origin/gh/karthickai/6/base 2025-12-04T08:54:18.6330890Z * [new branch] gh/karthickai/6/head -> origin/gh/karthickai/6/head 2025-12-04T08:54:18.6330961Z * [new branch] gh/karthickai/6/orig -> origin/gh/karthickai/6/orig 2025-12-04T08:54:18.6331032Z * [new branch] gh/krocki/1/base -> origin/gh/krocki/1/base 2025-12-04T08:54:18.6331100Z * [new branch] gh/krocki/1/head -> origin/gh/krocki/1/head 2025-12-04T08:54:18.6331169Z * [new branch] gh/krocki/1/orig -> origin/gh/krocki/1/orig 2025-12-04T08:54:18.6331235Z * [new branch] gh/krocki/2/base -> origin/gh/krocki/2/base 2025-12-04T08:54:18.6331302Z * [new branch] gh/krocki/2/head -> origin/gh/krocki/2/head 2025-12-04T08:54:18.6331369Z * [new branch] gh/krocki/2/orig -> origin/gh/krocki/2/orig 2025-12-04T08:54:18.6331448Z * [new branch] gh/kurtamohler/60/base -> origin/gh/kurtamohler/60/base 2025-12-04T08:54:18.6331543Z * [new branch] gh/kurtamohler/60/head -> origin/gh/kurtamohler/60/head 2025-12-04T08:54:18.6331621Z * [new branch] gh/kurtamohler/60/orig -> origin/gh/kurtamohler/60/orig 2025-12-04T08:54:18.6331694Z * [new branch] gh/kurtamohler/61/base -> origin/gh/kurtamohler/61/base 2025-12-04T08:54:18.6331767Z * [new branch] gh/kurtamohler/61/head -> origin/gh/kurtamohler/61/head 2025-12-04T08:54:18.6331840Z * [new branch] gh/kurtamohler/61/orig -> origin/gh/kurtamohler/61/orig 2025-12-04T08:54:18.6331911Z * [new branch] gh/kurtamohler/62/base -> origin/gh/kurtamohler/62/base 2025-12-04T08:54:18.6331985Z * [new branch] gh/kurtamohler/62/head -> origin/gh/kurtamohler/62/head 2025-12-04T08:54:18.6332062Z * [new branch] gh/kurtamohler/62/orig -> origin/gh/kurtamohler/62/orig 2025-12-04T08:54:18.6332135Z * [new branch] gh/kurtamohler/63/base -> origin/gh/kurtamohler/63/base 2025-12-04T08:54:18.6332210Z * [new branch] gh/kurtamohler/63/head -> origin/gh/kurtamohler/63/head 2025-12-04T08:54:18.6332285Z * [new branch] gh/kurtamohler/63/orig -> origin/gh/kurtamohler/63/orig 2025-12-04T08:54:18.6332358Z * [new branch] gh/kurtamohler/64/base -> origin/gh/kurtamohler/64/base 2025-12-04T08:54:18.6332431Z * [new branch] gh/kurtamohler/64/head -> origin/gh/kurtamohler/64/head 2025-12-04T08:54:18.6332506Z * [new branch] gh/kurtamohler/64/orig -> origin/gh/kurtamohler/64/orig 2025-12-04T08:54:18.6332579Z * [new branch] gh/kurtamohler/65/base -> origin/gh/kurtamohler/65/base 2025-12-04T08:54:18.6332654Z * [new branch] gh/kurtamohler/65/head -> origin/gh/kurtamohler/65/head 2025-12-04T08:54:18.6332730Z * [new branch] gh/kurtamohler/65/orig -> origin/gh/kurtamohler/65/orig 2025-12-04T08:54:18.6332804Z * [new branch] gh/kurtamohler/66/base -> origin/gh/kurtamohler/66/base 2025-12-04T08:54:18.6332882Z * [new branch] gh/kurtamohler/66/head -> origin/gh/kurtamohler/66/head 2025-12-04T08:54:18.6332955Z * [new branch] gh/kurtamohler/66/orig -> origin/gh/kurtamohler/66/orig 2025-12-04T08:54:18.6333028Z * [new branch] gh/kurtamohler/67/base -> origin/gh/kurtamohler/67/base 2025-12-04T08:54:18.6333102Z * [new branch] gh/kurtamohler/67/head -> origin/gh/kurtamohler/67/head 2025-12-04T08:54:18.6333175Z * [new branch] gh/kurtamohler/67/orig -> origin/gh/kurtamohler/67/orig 2025-12-04T08:54:18.6333272Z * [new branch] gh/kwen2501/130/base -> origin/gh/kwen2501/130/base 2025-12-04T08:54:18.6333344Z * [new branch] gh/kwen2501/130/head -> origin/gh/kwen2501/130/head 2025-12-04T08:54:18.6333412Z * [new branch] gh/kwen2501/130/orig -> origin/gh/kwen2501/130/orig 2025-12-04T08:54:18.6333481Z * [new branch] gh/kwen2501/170/base -> origin/gh/kwen2501/170/base 2025-12-04T08:54:18.6333553Z * [new branch] gh/kwen2501/170/head -> origin/gh/kwen2501/170/head 2025-12-04T08:54:18.6333621Z * [new branch] gh/kwen2501/187/base -> origin/gh/kwen2501/187/base 2025-12-04T08:54:18.6333689Z * [new branch] gh/kwen2501/187/head -> origin/gh/kwen2501/187/head 2025-12-04T08:54:18.6333759Z * [new branch] gh/kwen2501/187/orig -> origin/gh/kwen2501/187/orig 2025-12-04T08:54:18.6333827Z * [new branch] gh/kwen2501/188/base -> origin/gh/kwen2501/188/base 2025-12-04T08:54:18.6333897Z * [new branch] gh/kwen2501/188/head -> origin/gh/kwen2501/188/head 2025-12-04T08:54:18.6333968Z * [new branch] gh/kwen2501/188/orig -> origin/gh/kwen2501/188/orig 2025-12-04T08:54:18.6334037Z * [new branch] gh/kwen2501/211/base -> origin/gh/kwen2501/211/base 2025-12-04T08:54:18.6334106Z * [new branch] gh/kwen2501/211/head -> origin/gh/kwen2501/211/head 2025-12-04T08:54:18.6334207Z * [new branch] gh/kwen2501/224/base -> origin/gh/kwen2501/224/base 2025-12-04T08:54:18.6334276Z * [new branch] gh/kwen2501/224/head -> origin/gh/kwen2501/224/head 2025-12-04T08:54:18.6334347Z * [new branch] gh/kwen2501/224/orig -> origin/gh/kwen2501/224/orig 2025-12-04T08:54:18.6334415Z * [new branch] gh/kwen2501/228/base -> origin/gh/kwen2501/228/base 2025-12-04T08:54:18.6334484Z * [new branch] gh/kwen2501/228/head -> origin/gh/kwen2501/228/head 2025-12-04T08:54:18.6334556Z * [new branch] gh/kwen2501/228/orig -> origin/gh/kwen2501/228/orig 2025-12-04T08:54:18.6334624Z * [new branch] gh/kwen2501/234/base -> origin/gh/kwen2501/234/base 2025-12-04T08:54:18.6334692Z * [new branch] gh/kwen2501/234/head -> origin/gh/kwen2501/234/head 2025-12-04T08:54:18.6334763Z * [new branch] gh/kwen2501/234/orig -> origin/gh/kwen2501/234/orig 2025-12-04T08:54:18.6334831Z * [new branch] gh/kwen2501/235/base -> origin/gh/kwen2501/235/base 2025-12-04T08:54:18.6334900Z * [new branch] gh/kwen2501/235/head -> origin/gh/kwen2501/235/head 2025-12-04T08:54:18.6334970Z * [new branch] gh/kwen2501/235/orig -> origin/gh/kwen2501/235/orig 2025-12-04T08:54:18.6335039Z * [new branch] gh/kwen2501/236/base -> origin/gh/kwen2501/236/base 2025-12-04T08:54:18.6335108Z * [new branch] gh/kwen2501/236/head -> origin/gh/kwen2501/236/head 2025-12-04T08:54:18.6335180Z * [new branch] gh/kwen2501/236/orig -> origin/gh/kwen2501/236/orig 2025-12-04T08:54:18.6335249Z * [new branch] gh/kwen2501/237/base -> origin/gh/kwen2501/237/base 2025-12-04T08:54:18.6335316Z * [new branch] gh/kwen2501/237/head -> origin/gh/kwen2501/237/head 2025-12-04T08:54:18.6335389Z * [new branch] gh/kwen2501/237/orig -> origin/gh/kwen2501/237/orig 2025-12-04T08:54:18.6335457Z * [new branch] gh/kwen2501/238/base -> origin/gh/kwen2501/238/base 2025-12-04T08:54:18.6335525Z * [new branch] gh/kwen2501/238/head -> origin/gh/kwen2501/238/head 2025-12-04T08:54:18.6335597Z * [new branch] gh/kwen2501/238/orig -> origin/gh/kwen2501/238/orig 2025-12-04T08:54:18.6335665Z * [new branch] gh/kwen2501/240/base -> origin/gh/kwen2501/240/base 2025-12-04T08:54:18.6335736Z * [new branch] gh/kwen2501/240/head -> origin/gh/kwen2501/240/head 2025-12-04T08:54:18.6335825Z * [new branch] gh/kwen2501/240/orig -> origin/gh/kwen2501/240/orig 2025-12-04T08:54:18.6335894Z * [new branch] gh/kwen2501/241/base -> origin/gh/kwen2501/241/base 2025-12-04T08:54:18.6335963Z * [new branch] gh/kwen2501/241/head -> origin/gh/kwen2501/241/head 2025-12-04T08:54:18.6336033Z * [new branch] gh/kwen2501/241/orig -> origin/gh/kwen2501/241/orig 2025-12-04T08:54:18.6336101Z * [new branch] gh/kwen2501/247/base -> origin/gh/kwen2501/247/base 2025-12-04T08:54:18.6336171Z * [new branch] gh/kwen2501/247/head -> origin/gh/kwen2501/247/head 2025-12-04T08:54:18.6336238Z * [new branch] gh/kwen2501/247/orig -> origin/gh/kwen2501/247/orig 2025-12-04T08:54:18.6336306Z * [new branch] gh/kwen2501/252/base -> origin/gh/kwen2501/252/base 2025-12-04T08:54:18.6336376Z * [new branch] gh/kwen2501/252/head -> origin/gh/kwen2501/252/head 2025-12-04T08:54:18.6336446Z * [new branch] gh/kwen2501/252/orig -> origin/gh/kwen2501/252/orig 2025-12-04T08:54:18.6336515Z * [new branch] gh/kwen2501/259/base -> origin/gh/kwen2501/259/base 2025-12-04T08:54:18.6336586Z * [new branch] gh/kwen2501/259/head -> origin/gh/kwen2501/259/head 2025-12-04T08:54:18.6336675Z * [new branch] gh/kwen2501/259/orig -> origin/gh/kwen2501/259/orig 2025-12-04T08:54:18.6336745Z * [new branch] gh/kwen2501/260/base -> origin/gh/kwen2501/260/base 2025-12-04T08:54:18.6336814Z * [new branch] gh/kwen2501/260/head -> origin/gh/kwen2501/260/head 2025-12-04T08:54:18.6336882Z * [new branch] gh/kwen2501/260/orig -> origin/gh/kwen2501/260/orig 2025-12-04T08:54:18.6336951Z * [new branch] gh/kwen2501/268/base -> origin/gh/kwen2501/268/base 2025-12-04T08:54:18.6337023Z * [new branch] gh/kwen2501/268/head -> origin/gh/kwen2501/268/head 2025-12-04T08:54:18.6337093Z * [new branch] gh/kwen2501/268/orig -> origin/gh/kwen2501/268/orig 2025-12-04T08:54:18.6337161Z * [new branch] gh/kwen2501/269/base -> origin/gh/kwen2501/269/base 2025-12-04T08:54:18.6337232Z * [new branch] gh/kwen2501/269/head -> origin/gh/kwen2501/269/head 2025-12-04T08:54:18.6337301Z * [new branch] gh/kwen2501/269/orig -> origin/gh/kwen2501/269/orig 2025-12-04T08:54:18.6337371Z * [new branch] gh/kwen2501/270/base -> origin/gh/kwen2501/270/base 2025-12-04T08:54:18.6337439Z * [new branch] gh/kwen2501/270/head -> origin/gh/kwen2501/270/head 2025-12-04T08:54:18.6337507Z * [new branch] gh/kwen2501/270/orig -> origin/gh/kwen2501/270/orig 2025-12-04T08:54:18.6337579Z * [new branch] gh/kwen2501/271/base -> origin/gh/kwen2501/271/base 2025-12-04T08:54:18.6337647Z * [new branch] gh/kwen2501/271/head -> origin/gh/kwen2501/271/head 2025-12-04T08:54:18.6337718Z * [new branch] gh/kwen2501/271/orig -> origin/gh/kwen2501/271/orig 2025-12-04T08:54:18.6337790Z * [new branch] gh/kwen2501/274/base -> origin/gh/kwen2501/274/base 2025-12-04T08:54:18.6337858Z * [new branch] gh/kwen2501/274/head -> origin/gh/kwen2501/274/head 2025-12-04T08:54:18.6337928Z * [new branch] gh/kwen2501/274/orig -> origin/gh/kwen2501/274/orig 2025-12-04T08:54:18.6338001Z * [new branch] gh/kwen2501/275/base -> origin/gh/kwen2501/275/base 2025-12-04T08:54:18.6338117Z * [new branch] gh/kwen2501/275/head -> origin/gh/kwen2501/275/head 2025-12-04T08:54:18.6338189Z * [new branch] gh/kwen2501/275/orig -> origin/gh/kwen2501/275/orig 2025-12-04T08:54:18.6338260Z * [new branch] gh/kwen2501/276/base -> origin/gh/kwen2501/276/base 2025-12-04T08:54:18.6338328Z * [new branch] gh/kwen2501/276/head -> origin/gh/kwen2501/276/head 2025-12-04T08:54:18.6338428Z * [new branch] gh/kwen2501/276/orig -> origin/gh/kwen2501/276/orig 2025-12-04T08:54:18.6338499Z * [new branch] gh/kwen2501/277/base -> origin/gh/kwen2501/277/base 2025-12-04T08:54:18.6338568Z * [new branch] gh/kwen2501/277/head -> origin/gh/kwen2501/277/head 2025-12-04T08:54:18.6338638Z * [new branch] gh/kwen2501/277/orig -> origin/gh/kwen2501/277/orig 2025-12-04T08:54:18.6338708Z * [new branch] gh/kwen2501/278/base -> origin/gh/kwen2501/278/base 2025-12-04T08:54:18.6338777Z * [new branch] gh/kwen2501/278/head -> origin/gh/kwen2501/278/head 2025-12-04T08:54:18.6338848Z * [new branch] gh/kwen2501/278/orig -> origin/gh/kwen2501/278/orig 2025-12-04T08:54:18.6338917Z * [new branch] gh/kwen2501/279/base -> origin/gh/kwen2501/279/base 2025-12-04T08:54:18.6338987Z * [new branch] gh/kwen2501/279/head -> origin/gh/kwen2501/279/head 2025-12-04T08:54:18.6339058Z * [new branch] gh/kwen2501/279/orig -> origin/gh/kwen2501/279/orig 2025-12-04T08:54:18.6339127Z * [new branch] gh/kwen2501/280/base -> origin/gh/kwen2501/280/base 2025-12-04T08:54:18.6339195Z * [new branch] gh/kwen2501/280/head -> origin/gh/kwen2501/280/head 2025-12-04T08:54:18.6339289Z * [new branch] gh/kwen2501/280/orig -> origin/gh/kwen2501/280/orig 2025-12-04T08:54:18.6339358Z * [new branch] gh/kwen2501/281/base -> origin/gh/kwen2501/281/base 2025-12-04T08:54:18.6339426Z * [new branch] gh/kwen2501/281/head -> origin/gh/kwen2501/281/head 2025-12-04T08:54:18.6339498Z * [new branch] gh/kwen2501/281/orig -> origin/gh/kwen2501/281/orig 2025-12-04T08:54:18.6339566Z * [new branch] gh/kwen2501/282/base -> origin/gh/kwen2501/282/base 2025-12-04T08:54:18.6339636Z * [new branch] gh/kwen2501/282/head -> origin/gh/kwen2501/282/head 2025-12-04T08:54:18.6339707Z * [new branch] gh/kwen2501/282/orig -> origin/gh/kwen2501/282/orig 2025-12-04T08:54:18.6339775Z * [new branch] gh/kwen2501/283/base -> origin/gh/kwen2501/283/base 2025-12-04T08:54:18.6339844Z * [new branch] gh/kwen2501/283/head -> origin/gh/kwen2501/283/head 2025-12-04T08:54:18.6339915Z * [new branch] gh/kwen2501/283/orig -> origin/gh/kwen2501/283/orig 2025-12-04T08:54:18.6339984Z * [new branch] gh/kwen2501/284/base -> origin/gh/kwen2501/284/base 2025-12-04T08:54:18.6340053Z * [new branch] gh/kwen2501/284/head -> origin/gh/kwen2501/284/head 2025-12-04T08:54:18.6340124Z * [new branch] gh/kwen2501/284/orig -> origin/gh/kwen2501/284/orig 2025-12-04T08:54:18.6340191Z * [new branch] gh/kwen2501/285/base -> origin/gh/kwen2501/285/base 2025-12-04T08:54:18.6340263Z * [new branch] gh/kwen2501/285/head -> origin/gh/kwen2501/285/head 2025-12-04T08:54:18.6340331Z * [new branch] gh/kwen2501/285/orig -> origin/gh/kwen2501/285/orig 2025-12-04T08:54:18.6340399Z * [new branch] gh/kwen2501/286/base -> origin/gh/kwen2501/286/base 2025-12-04T08:54:18.6340471Z * [new branch] gh/kwen2501/286/head -> origin/gh/kwen2501/286/head 2025-12-04T08:54:18.6340541Z * [new branch] gh/kwen2501/286/orig -> origin/gh/kwen2501/286/orig 2025-12-04T08:54:18.6340611Z * [new branch] gh/kwen2501/287/base -> origin/gh/kwen2501/287/base 2025-12-04T08:54:18.6340681Z * [new branch] gh/kwen2501/287/head -> origin/gh/kwen2501/287/head 2025-12-04T08:54:18.6340749Z * [new branch] gh/kwen2501/287/orig -> origin/gh/kwen2501/287/orig 2025-12-04T08:54:18.6340817Z * [new branch] gh/kwen2501/288/base -> origin/gh/kwen2501/288/base 2025-12-04T08:54:18.6340909Z * [new branch] gh/kwen2501/288/head -> origin/gh/kwen2501/288/head 2025-12-04T08:54:18.6340978Z * [new branch] gh/kwen2501/288/orig -> origin/gh/kwen2501/288/orig 2025-12-04T08:54:18.6341053Z * [new branch] gh/laithsakka/251/base -> origin/gh/laithsakka/251/base 2025-12-04T08:54:18.6341132Z * [new branch] gh/laithsakka/251/head -> origin/gh/laithsakka/251/head 2025-12-04T08:54:18.6341206Z * [new branch] gh/laithsakka/251/orig -> origin/gh/laithsakka/251/orig 2025-12-04T08:54:18.6341280Z * [new branch] gh/laithsakka/276/base -> origin/gh/laithsakka/276/base 2025-12-04T08:54:18.6341355Z * [new branch] gh/laithsakka/276/head -> origin/gh/laithsakka/276/head 2025-12-04T08:54:18.6341428Z * [new branch] gh/laithsakka/276/orig -> origin/gh/laithsakka/276/orig 2025-12-04T08:54:18.6341502Z * [new branch] gh/laithsakka/28/base -> origin/gh/laithsakka/28/base 2025-12-04T08:54:18.6341579Z * [new branch] gh/laithsakka/29/base -> origin/gh/laithsakka/29/base 2025-12-04T08:54:18.6341653Z * [new branch] gh/laithsakka/30/base -> origin/gh/laithsakka/30/base 2025-12-04T08:54:18.6341728Z * [new branch] gh/laithsakka/30/head -> origin/gh/laithsakka/30/head 2025-12-04T08:54:18.6341821Z * [new branch] gh/laithsakka/31/base -> origin/gh/laithsakka/31/base 2025-12-04T08:54:18.6341894Z * [new branch] gh/laithsakka/31/head -> origin/gh/laithsakka/31/head 2025-12-04T08:54:18.6341971Z * [new branch] gh/laithsakka/313/base -> origin/gh/laithsakka/313/base 2025-12-04T08:54:18.6342045Z * [new branch] gh/laithsakka/313/head -> origin/gh/laithsakka/313/head 2025-12-04T08:54:18.6342118Z * [new branch] gh/laithsakka/313/orig -> origin/gh/laithsakka/313/orig 2025-12-04T08:54:18.6342193Z * [new branch] gh/laithsakka/316/base -> origin/gh/laithsakka/316/base 2025-12-04T08:54:18.6342267Z * [new branch] gh/laithsakka/316/head -> origin/gh/laithsakka/316/head 2025-12-04T08:54:18.6342339Z * [new branch] gh/laithsakka/316/orig -> origin/gh/laithsakka/316/orig 2025-12-04T08:54:18.6342414Z * [new branch] gh/laithsakka/317/base -> origin/gh/laithsakka/317/base 2025-12-04T08:54:18.6342488Z * [new branch] gh/laithsakka/317/head -> origin/gh/laithsakka/317/head 2025-12-04T08:54:18.6342561Z * [new branch] gh/laithsakka/317/orig -> origin/gh/laithsakka/317/orig 2025-12-04T08:54:18.6342636Z * [new branch] gh/laithsakka/319/base -> origin/gh/laithsakka/319/base 2025-12-04T08:54:18.6342708Z * [new branch] gh/laithsakka/319/head -> origin/gh/laithsakka/319/head 2025-12-04T08:54:18.6342780Z * [new branch] gh/laithsakka/319/orig -> origin/gh/laithsakka/319/orig 2025-12-04T08:54:18.6342855Z * [new branch] gh/laithsakka/32/base -> origin/gh/laithsakka/32/base 2025-12-04T08:54:18.6342929Z * [new branch] gh/laithsakka/32/head -> origin/gh/laithsakka/32/head 2025-12-04T08:54:18.6343002Z * [new branch] gh/laithsakka/320/base -> origin/gh/laithsakka/320/base 2025-12-04T08:54:18.6343076Z * [new branch] gh/laithsakka/320/head -> origin/gh/laithsakka/320/head 2025-12-04T08:54:18.6343150Z * [new branch] gh/laithsakka/320/orig -> origin/gh/laithsakka/320/orig 2025-12-04T08:54:18.6343223Z * [new branch] gh/laithsakka/321/base -> origin/gh/laithsakka/321/base 2025-12-04T08:54:18.6343299Z * [new branch] gh/laithsakka/321/head -> origin/gh/laithsakka/321/head 2025-12-04T08:54:18.6343372Z * [new branch] gh/laithsakka/321/orig -> origin/gh/laithsakka/321/orig 2025-12-04T08:54:18.6343446Z * [new branch] gh/laithsakka/322/base -> origin/gh/laithsakka/322/base 2025-12-04T08:54:18.6343538Z * [new branch] gh/laithsakka/322/head -> origin/gh/laithsakka/322/head 2025-12-04T08:54:18.6343610Z * [new branch] gh/laithsakka/322/orig -> origin/gh/laithsakka/322/orig 2025-12-04T08:54:18.6343685Z * [new branch] gh/laithsakka/323/base -> origin/gh/laithsakka/323/base 2025-12-04T08:54:18.6343757Z * [new branch] gh/laithsakka/323/head -> origin/gh/laithsakka/323/head 2025-12-04T08:54:18.6343830Z * [new branch] gh/laithsakka/323/orig -> origin/gh/laithsakka/323/orig 2025-12-04T08:54:18.6343905Z * [new branch] gh/laithsakka/324/base -> origin/gh/laithsakka/324/base 2025-12-04T08:54:18.6343976Z * [new branch] gh/laithsakka/324/head -> origin/gh/laithsakka/324/head 2025-12-04T08:54:18.6344048Z * [new branch] gh/laithsakka/324/orig -> origin/gh/laithsakka/324/orig 2025-12-04T08:54:18.6344123Z * [new branch] gh/laithsakka/325/base -> origin/gh/laithsakka/325/base 2025-12-04T08:54:18.6344196Z * [new branch] gh/laithsakka/325/head -> origin/gh/laithsakka/325/head 2025-12-04T08:54:18.6344268Z * [new branch] gh/laithsakka/325/orig -> origin/gh/laithsakka/325/orig 2025-12-04T08:54:18.6344343Z * [new branch] gh/laithsakka/326/base -> origin/gh/laithsakka/326/base 2025-12-04T08:54:18.6344439Z * [new branch] gh/laithsakka/326/head -> origin/gh/laithsakka/326/head 2025-12-04T08:54:18.6344512Z * [new branch] gh/laithsakka/326/orig -> origin/gh/laithsakka/326/orig 2025-12-04T08:54:18.6344589Z * [new branch] gh/laithsakka/327/base -> origin/gh/laithsakka/327/base 2025-12-04T08:54:18.6344661Z * [new branch] gh/laithsakka/327/head -> origin/gh/laithsakka/327/head 2025-12-04T08:54:18.6344734Z * [new branch] gh/laithsakka/327/orig -> origin/gh/laithsakka/327/orig 2025-12-04T08:54:18.6344809Z * [new branch] gh/laithsakka/328/base -> origin/gh/laithsakka/328/base 2025-12-04T08:54:18.6344885Z * [new branch] gh/laithsakka/328/head -> origin/gh/laithsakka/328/head 2025-12-04T08:54:18.6344959Z * [new branch] gh/laithsakka/328/orig -> origin/gh/laithsakka/328/orig 2025-12-04T08:54:18.6345029Z * [new branch] gh/liangel/4/base -> origin/gh/liangel/4/base 2025-12-04T08:54:18.6345099Z * [new branch] gh/liangel/4/head -> origin/gh/liangel/4/head 2025-12-04T08:54:18.6345170Z * [new branch] gh/liangel/4/orig -> origin/gh/liangel/4/orig 2025-12-04T08:54:18.6345245Z * [new branch] gh/lucaskabela/1/base -> origin/gh/lucaskabela/1/base 2025-12-04T08:54:18.6345320Z * [new branch] gh/lucaskabela/1/head -> origin/gh/lucaskabela/1/head 2025-12-04T08:54:18.6345387Z * [new branch] gh/lw/4/base -> origin/gh/lw/4/base 2025-12-04T08:54:18.6345450Z * [new branch] gh/lw/4/head -> origin/gh/lw/4/head 2025-12-04T08:54:18.6345515Z * [new branch] gh/lw/4/orig -> origin/gh/lw/4/orig 2025-12-04T08:54:18.6345578Z * [new branch] gh/lw/5/base -> origin/gh/lw/5/base 2025-12-04T08:54:18.6345638Z * [new branch] gh/lw/5/head -> origin/gh/lw/5/head 2025-12-04T08:54:18.6345701Z * [new branch] gh/lw/5/orig -> origin/gh/lw/5/orig 2025-12-04T08:54:18.6345763Z * [new branch] gh/lw/6/base -> origin/gh/lw/6/base 2025-12-04T08:54:18.6345823Z * [new branch] gh/lw/6/head -> origin/gh/lw/6/head 2025-12-04T08:54:18.6345883Z * [new branch] gh/lw/6/orig -> origin/gh/lw/6/orig 2025-12-04T08:54:18.6345953Z * [new branch] gh/malfet/14/base -> origin/gh/malfet/14/base 2025-12-04T08:54:18.6346023Z * [new branch] gh/malfet/417/base -> origin/gh/malfet/417/base 2025-12-04T08:54:18.6346114Z * [new branch] gh/malfet/417/head -> origin/gh/malfet/417/head 2025-12-04T08:54:18.6346187Z * [new branch] gh/malfet/417/orig -> origin/gh/malfet/417/orig 2025-12-04T08:54:18.6346256Z * [new branch] gh/malfet/506/base -> origin/gh/malfet/506/base 2025-12-04T08:54:18.6346324Z * [new branch] gh/malfet/506/head -> origin/gh/malfet/506/head 2025-12-04T08:54:18.6346394Z * [new branch] gh/malfet/506/orig -> origin/gh/malfet/506/orig 2025-12-04T08:54:18.6346461Z * [new branch] gh/malfet/517/base -> origin/gh/malfet/517/base 2025-12-04T08:54:18.6346527Z * [new branch] gh/malfet/517/head -> origin/gh/malfet/517/head 2025-12-04T08:54:18.6346597Z * [new branch] gh/malfet/528/base -> origin/gh/malfet/528/base 2025-12-04T08:54:18.6346664Z * [new branch] gh/malfet/528/head -> origin/gh/malfet/528/head 2025-12-04T08:54:18.6346734Z * [new branch] gh/malfet/528/orig -> origin/gh/malfet/528/orig 2025-12-04T08:54:18.6346802Z * [new branch] gh/malfet/537/base -> origin/gh/malfet/537/base 2025-12-04T08:54:18.6346870Z * [new branch] gh/malfet/537/head -> origin/gh/malfet/537/head 2025-12-04T08:54:18.6346939Z * [new branch] gh/malfet/537/orig -> origin/gh/malfet/537/orig 2025-12-04T08:54:18.6347027Z * [new branch] gh/malfet/546/base -> origin/gh/malfet/546/base 2025-12-04T08:54:18.6347095Z * [new branch] gh/malfet/546/head -> origin/gh/malfet/546/head 2025-12-04T08:54:18.6347164Z * [new branch] gh/malfet/546/orig -> origin/gh/malfet/546/orig 2025-12-04T08:54:18.6347232Z * [new branch] gh/malfet/565/base -> origin/gh/malfet/565/base 2025-12-04T08:54:18.6347300Z * [new branch] gh/malfet/565/head -> origin/gh/malfet/565/head 2025-12-04T08:54:18.6347370Z * [new branch] gh/malfet/565/orig -> origin/gh/malfet/565/orig 2025-12-04T08:54:18.6347437Z * [new branch] gh/malfet/575/base -> origin/gh/malfet/575/base 2025-12-04T08:54:18.6347504Z * [new branch] gh/malfet/575/head -> origin/gh/malfet/575/head 2025-12-04T08:54:18.6347573Z * [new branch] gh/malfet/575/orig -> origin/gh/malfet/575/orig 2025-12-04T08:54:18.6347642Z * [new branch] gh/malfet/580/base -> origin/gh/malfet/580/base 2025-12-04T08:54:18.6347709Z * [new branch] gh/malfet/580/head -> origin/gh/malfet/580/head 2025-12-04T08:54:18.6347779Z * [new branch] gh/malfet/580/orig -> origin/gh/malfet/580/orig 2025-12-04T08:54:18.6347845Z * [new branch] gh/malfet/581/base -> origin/gh/malfet/581/base 2025-12-04T08:54:18.6347912Z * [new branch] gh/malfet/581/head -> origin/gh/malfet/581/head 2025-12-04T08:54:18.6347982Z * [new branch] gh/malfet/581/orig -> origin/gh/malfet/581/orig 2025-12-04T08:54:18.6348049Z * [new branch] gh/malfet/583/base -> origin/gh/malfet/583/base 2025-12-04T08:54:18.6348162Z * [new branch] gh/malfet/583/head -> origin/gh/malfet/583/head 2025-12-04T08:54:18.6348234Z * [new branch] gh/malfet/583/orig -> origin/gh/malfet/583/orig 2025-12-04T08:54:18.6348302Z * [new branch] gh/malfet/586/base -> origin/gh/malfet/586/base 2025-12-04T08:54:18.6348370Z * [new branch] gh/malfet/586/head -> origin/gh/malfet/586/head 2025-12-04T08:54:18.6348437Z * [new branch] gh/malfet/586/orig -> origin/gh/malfet/586/orig 2025-12-04T08:54:18.6348504Z * [new branch] gh/malfet/587/base -> origin/gh/malfet/587/base 2025-12-04T08:54:18.6348573Z * [new branch] gh/malfet/587/head -> origin/gh/malfet/587/head 2025-12-04T08:54:18.6348641Z * [new branch] gh/malfet/587/orig -> origin/gh/malfet/587/orig 2025-12-04T08:54:18.6348737Z * [new branch] gh/malfet/588/base -> origin/gh/malfet/588/base 2025-12-04T08:54:18.6348806Z * [new branch] gh/malfet/588/head -> origin/gh/malfet/588/head 2025-12-04T08:54:18.6348873Z * [new branch] gh/malfet/588/orig -> origin/gh/malfet/588/orig 2025-12-04T08:54:18.6348940Z * [new branch] gh/malfet/589/base -> origin/gh/malfet/589/base 2025-12-04T08:54:18.6349010Z * [new branch] gh/malfet/589/head -> origin/gh/malfet/589/head 2025-12-04T08:54:18.6349078Z * [new branch] gh/malfet/589/orig -> origin/gh/malfet/589/orig 2025-12-04T08:54:18.6349145Z * [new branch] gh/malfet/590/base -> origin/gh/malfet/590/base 2025-12-04T08:54:18.6349213Z * [new branch] gh/malfet/590/head -> origin/gh/malfet/590/head 2025-12-04T08:54:18.6349281Z * [new branch] gh/malfet/590/orig -> origin/gh/malfet/590/orig 2025-12-04T08:54:18.6349348Z * [new branch] gh/malfet/591/base -> origin/gh/malfet/591/base 2025-12-04T08:54:18.6349417Z * [new branch] gh/malfet/591/head -> origin/gh/malfet/591/head 2025-12-04T08:54:18.6349491Z * [new branch] gh/malfet/591/orig -> origin/gh/malfet/591/orig 2025-12-04T08:54:18.6349664Z * [new branch] gh/malfet/592/base -> origin/gh/malfet/592/base 2025-12-04T08:54:18.6349761Z * [new branch] gh/malfet/592/head -> origin/gh/malfet/592/head 2025-12-04T08:54:18.6349830Z * [new branch] gh/malfet/592/orig -> origin/gh/malfet/592/orig 2025-12-04T08:54:18.6349897Z * [new branch] gh/malfet/593/base -> origin/gh/malfet/593/base 2025-12-04T08:54:18.6349966Z * [new branch] gh/malfet/593/head -> origin/gh/malfet/593/head 2025-12-04T08:54:18.6350033Z * [new branch] gh/malfet/593/orig -> origin/gh/malfet/593/orig 2025-12-04T08:54:18.6350102Z * [new branch] gh/malfet/594/base -> origin/gh/malfet/594/base 2025-12-04T08:54:18.6350169Z * [new branch] gh/malfet/594/head -> origin/gh/malfet/594/head 2025-12-04T08:54:18.6350237Z * [new branch] gh/malfet/594/orig -> origin/gh/malfet/594/orig 2025-12-04T08:54:18.6350307Z * [new branch] gh/malfet/595/base -> origin/gh/malfet/595/base 2025-12-04T08:54:18.6350375Z * [new branch] gh/malfet/595/head -> origin/gh/malfet/595/head 2025-12-04T08:54:18.6350442Z * [new branch] gh/malfet/595/orig -> origin/gh/malfet/595/orig 2025-12-04T08:54:18.6350511Z * [new branch] gh/malfet/596/base -> origin/gh/malfet/596/base 2025-12-04T08:54:18.6350578Z * [new branch] gh/malfet/596/head -> origin/gh/malfet/596/head 2025-12-04T08:54:18.6350645Z * [new branch] gh/malfet/596/orig -> origin/gh/malfet/596/orig 2025-12-04T08:54:18.6350715Z * [new branch] gh/malfet/597/base -> origin/gh/malfet/597/base 2025-12-04T08:54:18.6350782Z * [new branch] gh/malfet/597/head -> origin/gh/malfet/597/head 2025-12-04T08:54:18.6350849Z * [new branch] gh/malfet/597/orig -> origin/gh/malfet/597/orig 2025-12-04T08:54:18.6350920Z * [new branch] gh/malfet/598/base -> origin/gh/malfet/598/base 2025-12-04T08:54:18.6350988Z * [new branch] gh/malfet/598/head -> origin/gh/malfet/598/head 2025-12-04T08:54:18.6351054Z * [new branch] gh/malfet/598/orig -> origin/gh/malfet/598/orig 2025-12-04T08:54:18.6351123Z * [new branch] gh/malfet/599/base -> origin/gh/malfet/599/base 2025-12-04T08:54:18.6351190Z * [new branch] gh/malfet/599/head -> origin/gh/malfet/599/head 2025-12-04T08:54:18.6351257Z * [new branch] gh/malfet/599/orig -> origin/gh/malfet/599/orig 2025-12-04T08:54:18.6351347Z * [new branch] gh/malfet/600/base -> origin/gh/malfet/600/base 2025-12-04T08:54:18.6351414Z * [new branch] gh/malfet/600/head -> origin/gh/malfet/600/head 2025-12-04T08:54:18.6351480Z * [new branch] gh/malfet/600/orig -> origin/gh/malfet/600/orig 2025-12-04T08:54:18.6351551Z * [new branch] gh/malfet/601/base -> origin/gh/malfet/601/base 2025-12-04T08:54:18.6351618Z * [new branch] gh/malfet/601/head -> origin/gh/malfet/601/head 2025-12-04T08:54:18.6351688Z * [new branch] gh/malfet/601/orig -> origin/gh/malfet/601/orig 2025-12-04T08:54:18.6351755Z * [new branch] gh/malfet/602/base -> origin/gh/malfet/602/base 2025-12-04T08:54:18.6351822Z * [new branch] gh/malfet/602/head -> origin/gh/malfet/602/head 2025-12-04T08:54:18.6351890Z * [new branch] gh/malfet/602/orig -> origin/gh/malfet/602/orig 2025-12-04T08:54:18.6351959Z * [new branch] gh/malfet/603/base -> origin/gh/malfet/603/base 2025-12-04T08:54:18.6352027Z * [new branch] gh/malfet/603/head -> origin/gh/malfet/603/head 2025-12-04T08:54:18.6356578Z * [new branch] gh/malfet/603/orig -> origin/gh/malfet/603/orig 2025-12-04T08:54:18.6356715Z * [new branch] gh/malfet/604/base -> origin/gh/malfet/604/base 2025-12-04T08:54:18.6356788Z * [new branch] gh/malfet/604/head -> origin/gh/malfet/604/head 2025-12-04T08:54:18.6356863Z * [new branch] gh/malfet/604/orig -> origin/gh/malfet/604/orig 2025-12-04T08:54:18.6356931Z * [new branch] gh/malfet/605/base -> origin/gh/malfet/605/base 2025-12-04T08:54:18.6356999Z * [new branch] gh/malfet/605/head -> origin/gh/malfet/605/head 2025-12-04T08:54:18.6357068Z * [new branch] gh/malfet/605/orig -> origin/gh/malfet/605/orig 2025-12-04T08:54:18.6357140Z * [new branch] gh/malfet/606/base -> origin/gh/malfet/606/base 2025-12-04T08:54:18.6357209Z * [new branch] gh/malfet/606/head -> origin/gh/malfet/606/head 2025-12-04T08:54:18.6357279Z * [new branch] gh/malfet/606/orig -> origin/gh/malfet/606/orig 2025-12-04T08:54:18.6357352Z * [new branch] gh/malfet/607/base -> origin/gh/malfet/607/base 2025-12-04T08:54:18.6357420Z * [new branch] gh/malfet/607/head -> origin/gh/malfet/607/head 2025-12-04T08:54:18.6357490Z * [new branch] gh/malfet/607/orig -> origin/gh/malfet/607/orig 2025-12-04T08:54:18.6357558Z * [new branch] gh/malfet/608/base -> origin/gh/malfet/608/base 2025-12-04T08:54:18.6357625Z * [new branch] gh/malfet/608/head -> origin/gh/malfet/608/head 2025-12-04T08:54:18.6357693Z * [new branch] gh/malfet/608/orig -> origin/gh/malfet/608/orig 2025-12-04T08:54:18.6357762Z * [new branch] gh/malfet/609/base -> origin/gh/malfet/609/base 2025-12-04T08:54:18.6357830Z * [new branch] gh/malfet/609/head -> origin/gh/malfet/609/head 2025-12-04T08:54:18.6357901Z * [new branch] gh/malfet/609/orig -> origin/gh/malfet/609/orig 2025-12-04T08:54:18.6357971Z * [new branch] gh/malfet/610/base -> origin/gh/malfet/610/base 2025-12-04T08:54:18.6358044Z * [new branch] gh/malfet/610/head -> origin/gh/malfet/610/head 2025-12-04T08:54:18.6358167Z * [new branch] gh/malfet/610/orig -> origin/gh/malfet/610/orig 2025-12-04T08:54:18.6358238Z * [new branch] gh/malfet/611/base -> origin/gh/malfet/611/base 2025-12-04T08:54:18.6358311Z * [new branch] gh/malfet/611/head -> origin/gh/malfet/611/head 2025-12-04T08:54:18.6358382Z * [new branch] gh/malfet/611/orig -> origin/gh/malfet/611/orig 2025-12-04T08:54:18.6358495Z * [new branch] gh/malfet/612/base -> origin/gh/malfet/612/base 2025-12-04T08:54:18.6358569Z * [new branch] gh/malfet/612/head -> origin/gh/malfet/612/head 2025-12-04T08:54:18.6358638Z * [new branch] gh/malfet/612/orig -> origin/gh/malfet/612/orig 2025-12-04T08:54:18.6358713Z * [new branch] gh/malfet/64/base -> origin/gh/malfet/64/base 2025-12-04T08:54:18.6358786Z * [new branch] gh/malfet/64/head -> origin/gh/malfet/64/head 2025-12-04T08:54:18.6358876Z * [new branch] gh/manuelcandales/11/base -> origin/gh/manuelcandales/11/base 2025-12-04T08:54:18.6358961Z * [new branch] gh/manuelcandales/11/head -> origin/gh/manuelcandales/11/head 2025-12-04T08:54:18.6359047Z * [new branch] gh/manuelcandales/11/orig -> origin/gh/manuelcandales/11/orig 2025-12-04T08:54:18.6359119Z * [new branch] gh/markkm/1/base -> origin/gh/markkm/1/base 2025-12-04T08:54:18.6359197Z * [new branch] gh/masnesral/1/base -> origin/gh/masnesral/1/base 2025-12-04T08:54:18.6359273Z * [new branch] gh/masnesral/1/head -> origin/gh/masnesral/1/head 2025-12-04T08:54:18.6359344Z * [new branch] gh/masnesral/1/orig -> origin/gh/masnesral/1/orig 2025-12-04T08:54:18.6359458Z * [new branch] gh/mhorowitz/0/base -> origin/gh/mhorowitz/0/base 2025-12-04T08:54:18.6359529Z * [new branch] gh/mhorowitz/0/head -> origin/gh/mhorowitz/0/head 2025-12-04T08:54:18.6359599Z * [new branch] gh/mhorowitz/1/base -> origin/gh/mhorowitz/1/base 2025-12-04T08:54:18.6359670Z * [new branch] gh/mhorowitz/1/head -> origin/gh/mhorowitz/1/head 2025-12-04T08:54:18.6359740Z * [new branch] gh/mhorowitz/2/base -> origin/gh/mhorowitz/2/base 2025-12-04T08:54:18.6359810Z * [new branch] gh/mhorowitz/2/head -> origin/gh/mhorowitz/2/head 2025-12-04T08:54:18.6359882Z * [new branch] gh/mhorowitz/3/base -> origin/gh/mhorowitz/3/base 2025-12-04T08:54:18.6359953Z * [new branch] gh/mhorowitz/3/head -> origin/gh/mhorowitz/3/head 2025-12-04T08:54:18.6360026Z * [new branch] gh/mhorowitz/4/base -> origin/gh/mhorowitz/4/base 2025-12-04T08:54:18.6360098Z * [new branch] gh/mhorowitz/4/head -> origin/gh/mhorowitz/4/head 2025-12-04T08:54:18.6360170Z * [new branch] gh/mhorowitz/5/base -> origin/gh/mhorowitz/5/base 2025-12-04T08:54:18.6360241Z * [new branch] gh/mhorowitz/5/head -> origin/gh/mhorowitz/5/head 2025-12-04T08:54:18.6360310Z * [new branch] gh/mhorowitz/6/base -> origin/gh/mhorowitz/6/base 2025-12-04T08:54:18.6360382Z * [new branch] gh/mhorowitz/6/head -> origin/gh/mhorowitz/6/head 2025-12-04T08:54:18.6360484Z * [new branch] gh/mikaylagawarecki/234/base -> origin/gh/mikaylagawarecki/234/base 2025-12-04T08:54:18.6360582Z * [new branch] gh/mikaylagawarecki/234/head -> origin/gh/mikaylagawarecki/234/head 2025-12-04T08:54:18.6360676Z * [new branch] gh/mikaylagawarecki/235/base -> origin/gh/mikaylagawarecki/235/base 2025-12-04T08:54:18.6360767Z * [new branch] gh/mikaylagawarecki/235/head -> origin/gh/mikaylagawarecki/235/head 2025-12-04T08:54:18.6360860Z * [new branch] gh/mikaylagawarecki/236/base -> origin/gh/mikaylagawarecki/236/base 2025-12-04T08:54:18.6360953Z * [new branch] gh/mikaylagawarecki/236/head -> origin/gh/mikaylagawarecki/236/head 2025-12-04T08:54:18.6361045Z * [new branch] gh/mikaylagawarecki/237/base -> origin/gh/mikaylagawarecki/237/base 2025-12-04T08:54:18.6361136Z * [new branch] gh/mikaylagawarecki/237/head -> origin/gh/mikaylagawarecki/237/head 2025-12-04T08:54:18.6361232Z * [new branch] gh/mikaylagawarecki/238/base -> origin/gh/mikaylagawarecki/238/base 2025-12-04T08:54:18.6361345Z * [new branch] gh/mikaylagawarecki/238/head -> origin/gh/mikaylagawarecki/238/head 2025-12-04T08:54:18.6361437Z * [new branch] gh/mikaylagawarecki/336/base -> origin/gh/mikaylagawarecki/336/base 2025-12-04T08:54:18.6361531Z * [new branch] gh/mikaylagawarecki/336/head -> origin/gh/mikaylagawarecki/336/head 2025-12-04T08:54:18.6361625Z * [new branch] gh/mikaylagawarecki/336/orig -> origin/gh/mikaylagawarecki/336/orig 2025-12-04T08:54:18.6361719Z * [new branch] gh/mikaylagawarecki/341/base -> origin/gh/mikaylagawarecki/341/base 2025-12-04T08:54:18.6361812Z * [new branch] gh/mikaylagawarecki/341/head -> origin/gh/mikaylagawarecki/341/head 2025-12-04T08:54:18.6361905Z * [new branch] gh/mikaylagawarecki/341/orig -> origin/gh/mikaylagawarecki/341/orig 2025-12-04T08:54:18.6362002Z * [new branch] gh/mikaylagawarecki/342/base -> origin/gh/mikaylagawarecki/342/base 2025-12-04T08:54:18.6362094Z * [new branch] gh/mikaylagawarecki/342/head -> origin/gh/mikaylagawarecki/342/head 2025-12-04T08:54:18.6362186Z * [new branch] gh/mikaylagawarecki/342/orig -> origin/gh/mikaylagawarecki/342/orig 2025-12-04T08:54:18.6362280Z * [new branch] gh/mikaylagawarecki/345/base -> origin/gh/mikaylagawarecki/345/base 2025-12-04T08:54:18.6362392Z * [new branch] gh/mikaylagawarecki/345/head -> origin/gh/mikaylagawarecki/345/head 2025-12-04T08:54:18.6362482Z * [new branch] gh/mikaylagawarecki/345/orig -> origin/gh/mikaylagawarecki/345/orig 2025-12-04T08:54:18.6362574Z * [new branch] gh/mikaylagawarecki/346/base -> origin/gh/mikaylagawarecki/346/base 2025-12-04T08:54:18.6362665Z * [new branch] gh/mikaylagawarecki/346/head -> origin/gh/mikaylagawarecki/346/head 2025-12-04T08:54:18.6362756Z * [new branch] gh/mikaylagawarecki/346/orig -> origin/gh/mikaylagawarecki/346/orig 2025-12-04T08:54:18.6362850Z * [new branch] gh/mikaylagawarecki/347/base -> origin/gh/mikaylagawarecki/347/base 2025-12-04T08:54:18.6362941Z * [new branch] gh/mikaylagawarecki/347/head -> origin/gh/mikaylagawarecki/347/head 2025-12-04T08:54:18.6363032Z * [new branch] gh/mikaylagawarecki/347/orig -> origin/gh/mikaylagawarecki/347/orig 2025-12-04T08:54:18.6363126Z * [new branch] gh/mikaylagawarecki/350/base -> origin/gh/mikaylagawarecki/350/base 2025-12-04T08:54:18.6363216Z * [new branch] gh/mikaylagawarecki/350/head -> origin/gh/mikaylagawarecki/350/head 2025-12-04T08:54:18.6363310Z * [new branch] gh/mikaylagawarecki/350/orig -> origin/gh/mikaylagawarecki/350/orig 2025-12-04T08:54:18.6363401Z * [new branch] gh/mikaylagawarecki/351/base -> origin/gh/mikaylagawarecki/351/base 2025-12-04T08:54:18.6363491Z * [new branch] gh/mikaylagawarecki/351/head -> origin/gh/mikaylagawarecki/351/head 2025-12-04T08:54:18.6363585Z * [new branch] gh/mikaylagawarecki/351/orig -> origin/gh/mikaylagawarecki/351/orig 2025-12-04T08:54:18.6363676Z * [new branch] gh/mikaylagawarecki/352/base -> origin/gh/mikaylagawarecki/352/base 2025-12-04T08:54:18.6363767Z * [new branch] gh/mikaylagawarecki/352/head -> origin/gh/mikaylagawarecki/352/head 2025-12-04T08:54:18.6363860Z * [new branch] gh/mikaylagawarecki/352/orig -> origin/gh/mikaylagawarecki/352/orig 2025-12-04T08:54:18.6363952Z * [new branch] gh/mikaylagawarecki/353/base -> origin/gh/mikaylagawarecki/353/base 2025-12-04T08:54:18.6364043Z * [new branch] gh/mikaylagawarecki/353/head -> origin/gh/mikaylagawarecki/353/head 2025-12-04T08:54:18.6364135Z * [new branch] gh/mikaylagawarecki/353/orig -> origin/gh/mikaylagawarecki/353/orig 2025-12-04T08:54:18.6364226Z * [new branch] gh/mikaylagawarecki/354/base -> origin/gh/mikaylagawarecki/354/base 2025-12-04T08:54:18.6364337Z * [new branch] gh/mikaylagawarecki/354/head -> origin/gh/mikaylagawarecki/354/head 2025-12-04T08:54:18.6364429Z * [new branch] gh/mikaylagawarecki/354/orig -> origin/gh/mikaylagawarecki/354/orig 2025-12-04T08:54:18.6364518Z * [new branch] gh/mikaylagawarecki/356/base -> origin/gh/mikaylagawarecki/356/base 2025-12-04T08:54:18.6364611Z * [new branch] gh/mikaylagawarecki/356/head -> origin/gh/mikaylagawarecki/356/head 2025-12-04T08:54:18.6364702Z * [new branch] gh/mikaylagawarecki/356/orig -> origin/gh/mikaylagawarecki/356/orig 2025-12-04T08:54:18.6364792Z * [new branch] gh/mikaylagawarecki/357/base -> origin/gh/mikaylagawarecki/357/base 2025-12-04T08:54:18.6364883Z * [new branch] gh/mikaylagawarecki/357/head -> origin/gh/mikaylagawarecki/357/head 2025-12-04T08:54:18.6364974Z * [new branch] gh/mikaylagawarecki/357/orig -> origin/gh/mikaylagawarecki/357/orig 2025-12-04T08:54:18.6365066Z * [new branch] gh/mikaylagawarecki/359/base -> origin/gh/mikaylagawarecki/359/base 2025-12-04T08:54:18.6365158Z * [new branch] gh/mikaylagawarecki/359/head -> origin/gh/mikaylagawarecki/359/head 2025-12-04T08:54:18.6365248Z * [new branch] gh/mikaylagawarecki/359/orig -> origin/gh/mikaylagawarecki/359/orig 2025-12-04T08:54:18.6365359Z * [new branch] gh/mikaylagawarecki/360/base -> origin/gh/mikaylagawarecki/360/base 2025-12-04T08:54:18.6365451Z * [new branch] gh/mikaylagawarecki/360/head -> origin/gh/mikaylagawarecki/360/head 2025-12-04T08:54:18.6365541Z * [new branch] gh/mikaylagawarecki/360/orig -> origin/gh/mikaylagawarecki/360/orig 2025-12-04T08:54:18.6365632Z * [new branch] gh/mikaylagawarecki/361/base -> origin/gh/mikaylagawarecki/361/base 2025-12-04T08:54:18.6365723Z * [new branch] gh/mikaylagawarecki/361/head -> origin/gh/mikaylagawarecki/361/head 2025-12-04T08:54:18.6365814Z * [new branch] gh/mikaylagawarecki/361/orig -> origin/gh/mikaylagawarecki/361/orig 2025-12-04T08:54:18.6365904Z * [new branch] gh/mikaylagawarecki/362/base -> origin/gh/mikaylagawarecki/362/base 2025-12-04T08:54:18.6365996Z * [new branch] gh/mikaylagawarecki/362/head -> origin/gh/mikaylagawarecki/362/head 2025-12-04T08:54:18.6366087Z * [new branch] gh/mikaylagawarecki/362/orig -> origin/gh/mikaylagawarecki/362/orig 2025-12-04T08:54:18.6366179Z * [new branch] gh/mikaylagawarecki/363/base -> origin/gh/mikaylagawarecki/363/base 2025-12-04T08:54:18.6366269Z * [new branch] gh/mikaylagawarecki/363/head -> origin/gh/mikaylagawarecki/363/head 2025-12-04T08:54:18.6366359Z * [new branch] gh/mikaylagawarecki/363/orig -> origin/gh/mikaylagawarecki/363/orig 2025-12-04T08:54:18.6366450Z * [new branch] gh/mikaylagawarecki/364/base -> origin/gh/mikaylagawarecki/364/base 2025-12-04T08:54:18.6366543Z * [new branch] gh/mikaylagawarecki/364/head -> origin/gh/mikaylagawarecki/364/head 2025-12-04T08:54:18.6366634Z * [new branch] gh/mikaylagawarecki/364/orig -> origin/gh/mikaylagawarecki/364/orig 2025-12-04T08:54:18.6366726Z * [new branch] gh/mikaylagawarecki/365/base -> origin/gh/mikaylagawarecki/365/base 2025-12-04T08:54:18.6366819Z * [new branch] gh/mikaylagawarecki/365/head -> origin/gh/mikaylagawarecki/365/head 2025-12-04T08:54:18.6366909Z * [new branch] gh/mikaylagawarecki/365/orig -> origin/gh/mikaylagawarecki/365/orig 2025-12-04T08:54:18.6367001Z * [new branch] gh/mikaylagawarecki/366/base -> origin/gh/mikaylagawarecki/366/base 2025-12-04T08:54:18.6367092Z * [new branch] gh/mikaylagawarecki/366/head -> origin/gh/mikaylagawarecki/366/head 2025-12-04T08:54:18.6367182Z * [new branch] gh/mikaylagawarecki/366/orig -> origin/gh/mikaylagawarecki/366/orig 2025-12-04T08:54:18.6367301Z * [new branch] gh/mikaylagawarecki/367/base -> origin/gh/mikaylagawarecki/367/base 2025-12-04T08:54:18.6367392Z * [new branch] gh/mikaylagawarecki/367/head -> origin/gh/mikaylagawarecki/367/head 2025-12-04T08:54:18.6367483Z * [new branch] gh/mikaylagawarecki/367/orig -> origin/gh/mikaylagawarecki/367/orig 2025-12-04T08:54:18.6367575Z * [new branch] gh/mikaylagawarecki/368/base -> origin/gh/mikaylagawarecki/368/base 2025-12-04T08:54:18.6367665Z * [new branch] gh/mikaylagawarecki/368/head -> origin/gh/mikaylagawarecki/368/head 2025-12-04T08:54:18.6367757Z * [new branch] gh/mikaylagawarecki/368/orig -> origin/gh/mikaylagawarecki/368/orig 2025-12-04T08:54:18.6367847Z * [new branch] gh/mikaylagawarecki/369/base -> origin/gh/mikaylagawarecki/369/base 2025-12-04T08:54:18.6367936Z * [new branch] gh/mikaylagawarecki/369/head -> origin/gh/mikaylagawarecki/369/head 2025-12-04T08:54:18.6368030Z * [new branch] gh/mikaylagawarecki/369/orig -> origin/gh/mikaylagawarecki/369/orig 2025-12-04T08:54:18.6368162Z * [new branch] gh/mikaylagawarecki/370/base -> origin/gh/mikaylagawarecki/370/base 2025-12-04T08:54:18.6368254Z * [new branch] gh/mikaylagawarecki/370/head -> origin/gh/mikaylagawarecki/370/head 2025-12-04T08:54:18.6368373Z * [new branch] gh/mikaylagawarecki/370/orig -> origin/gh/mikaylagawarecki/370/orig 2025-12-04T08:54:18.6368464Z * [new branch] gh/mikaylagawarecki/371/base -> origin/gh/mikaylagawarecki/371/base 2025-12-04T08:54:18.6368555Z * [new branch] gh/mikaylagawarecki/371/head -> origin/gh/mikaylagawarecki/371/head 2025-12-04T08:54:18.6368647Z * [new branch] gh/mikaylagawarecki/371/orig -> origin/gh/mikaylagawarecki/371/orig 2025-12-04T08:54:18.6368737Z * [new branch] gh/mikaylagawarecki/372/base -> origin/gh/mikaylagawarecki/372/base 2025-12-04T08:54:18.6368830Z * [new branch] gh/mikaylagawarecki/372/head -> origin/gh/mikaylagawarecki/372/head 2025-12-04T08:54:18.6368920Z * [new branch] gh/mikaylagawarecki/372/orig -> origin/gh/mikaylagawarecki/372/orig 2025-12-04T08:54:18.6369010Z * [new branch] gh/mikaylagawarecki/373/base -> origin/gh/mikaylagawarecki/373/base 2025-12-04T08:54:18.6369104Z * [new branch] gh/mikaylagawarecki/373/head -> origin/gh/mikaylagawarecki/373/head 2025-12-04T08:54:18.6369195Z * [new branch] gh/mikaylagawarecki/373/orig -> origin/gh/mikaylagawarecki/373/orig 2025-12-04T08:54:18.6369285Z * [new branch] gh/mikaylagawarecki/374/base -> origin/gh/mikaylagawarecki/374/base 2025-12-04T08:54:18.6369377Z * [new branch] gh/mikaylagawarecki/374/head -> origin/gh/mikaylagawarecki/374/head 2025-12-04T08:54:18.6369466Z * [new branch] gh/mikaylagawarecki/374/orig -> origin/gh/mikaylagawarecki/374/orig 2025-12-04T08:54:18.6369558Z * [new branch] gh/mikaylagawarecki/375/base -> origin/gh/mikaylagawarecki/375/base 2025-12-04T08:54:18.6369650Z * [new branch] gh/mikaylagawarecki/375/head -> origin/gh/mikaylagawarecki/375/head 2025-12-04T08:54:18.6369741Z * [new branch] gh/mikaylagawarecki/375/orig -> origin/gh/mikaylagawarecki/375/orig 2025-12-04T08:54:18.6369833Z * [new branch] gh/mikaylagawarecki/376/base -> origin/gh/mikaylagawarecki/376/base 2025-12-04T08:54:18.6369925Z * [new branch] gh/mikaylagawarecki/376/head -> origin/gh/mikaylagawarecki/376/head 2025-12-04T08:54:18.6370016Z * [new branch] gh/mikaylagawarecki/376/orig -> origin/gh/mikaylagawarecki/376/orig 2025-12-04T08:54:18.6370106Z * [new branch] gh/mikaylagawarecki/377/base -> origin/gh/mikaylagawarecki/377/base 2025-12-04T08:54:18.6370198Z * [new branch] gh/mikaylagawarecki/377/head -> origin/gh/mikaylagawarecki/377/head 2025-12-04T08:54:18.6370321Z * [new branch] gh/mikaylagawarecki/377/orig -> origin/gh/mikaylagawarecki/377/orig 2025-12-04T08:54:18.6370413Z * [new branch] gh/mikaylagawarecki/378/base -> origin/gh/mikaylagawarecki/378/base 2025-12-04T08:54:18.6370503Z * [new branch] gh/mikaylagawarecki/378/head -> origin/gh/mikaylagawarecki/378/head 2025-12-04T08:54:18.6370595Z * [new branch] gh/mikaylagawarecki/378/orig -> origin/gh/mikaylagawarecki/378/orig 2025-12-04T08:54:18.6370687Z * [new branch] gh/mikaylagawarecki/379/base -> origin/gh/mikaylagawarecki/379/base 2025-12-04T08:54:18.6370777Z * [new branch] gh/mikaylagawarecki/379/head -> origin/gh/mikaylagawarecki/379/head 2025-12-04T08:54:18.6370867Z * [new branch] gh/mikaylagawarecki/379/orig -> origin/gh/mikaylagawarecki/379/orig 2025-12-04T08:54:18.6370959Z * [new branch] gh/mikaylagawarecki/380/base -> origin/gh/mikaylagawarecki/380/base 2025-12-04T08:54:18.6371051Z * [new branch] gh/mikaylagawarecki/380/head -> origin/gh/mikaylagawarecki/380/head 2025-12-04T08:54:18.6371142Z * [new branch] gh/mikaylagawarecki/380/orig -> origin/gh/mikaylagawarecki/380/orig 2025-12-04T08:54:18.6371234Z * [new branch] gh/mikaylagawarecki/381/base -> origin/gh/mikaylagawarecki/381/base 2025-12-04T08:54:18.6371344Z * [new branch] gh/mikaylagawarecki/381/head -> origin/gh/mikaylagawarecki/381/head 2025-12-04T08:54:18.6371435Z * [new branch] gh/mikaylagawarecki/381/orig -> origin/gh/mikaylagawarecki/381/orig 2025-12-04T08:54:18.6371527Z * [new branch] gh/mikaylagawarecki/382/base -> origin/gh/mikaylagawarecki/382/base 2025-12-04T08:54:18.6371619Z * [new branch] gh/mikaylagawarecki/382/head -> origin/gh/mikaylagawarecki/382/head 2025-12-04T08:54:18.6371710Z * [new branch] gh/mikaylagawarecki/382/orig -> origin/gh/mikaylagawarecki/382/orig 2025-12-04T08:54:18.6371803Z * [new branch] gh/mikaylagawarecki/383/base -> origin/gh/mikaylagawarecki/383/base 2025-12-04T08:54:18.6371894Z * [new branch] gh/mikaylagawarecki/383/head -> origin/gh/mikaylagawarecki/383/head 2025-12-04T08:54:18.6371986Z * [new branch] gh/mikaylagawarecki/383/orig -> origin/gh/mikaylagawarecki/383/orig 2025-12-04T08:54:18.6372077Z * [new branch] gh/mikaylagawarecki/384/base -> origin/gh/mikaylagawarecki/384/base 2025-12-04T08:54:18.6372168Z * [new branch] gh/mikaylagawarecki/384/head -> origin/gh/mikaylagawarecki/384/head 2025-12-04T08:54:18.6372260Z * [new branch] gh/mikaylagawarecki/384/orig -> origin/gh/mikaylagawarecki/384/orig 2025-12-04T08:54:18.6372351Z * [new branch] gh/mikaylagawarecki/385/base -> origin/gh/mikaylagawarecki/385/base 2025-12-04T08:54:18.6372441Z * [new branch] gh/mikaylagawarecki/385/head -> origin/gh/mikaylagawarecki/385/head 2025-12-04T08:54:18.6372535Z * [new branch] gh/mikaylagawarecki/385/orig -> origin/gh/mikaylagawarecki/385/orig 2025-12-04T08:54:18.6372626Z * [new branch] gh/mikaylagawarecki/386/base -> origin/gh/mikaylagawarecki/386/base 2025-12-04T08:54:18.6372717Z * [new branch] gh/mikaylagawarecki/386/head -> origin/gh/mikaylagawarecki/386/head 2025-12-04T08:54:18.6372812Z * [new branch] gh/mikaylagawarecki/386/orig -> origin/gh/mikaylagawarecki/386/orig 2025-12-04T08:54:18.6372903Z * [new branch] gh/mikaylagawarecki/387/base -> origin/gh/mikaylagawarecki/387/base 2025-12-04T08:54:18.6372993Z * [new branch] gh/mikaylagawarecki/387/head -> origin/gh/mikaylagawarecki/387/head 2025-12-04T08:54:18.6373086Z * [new branch] gh/mikaylagawarecki/387/orig -> origin/gh/mikaylagawarecki/387/orig 2025-12-04T08:54:18.6373176Z * [new branch] gh/mikaylagawarecki/388/base -> origin/gh/mikaylagawarecki/388/base 2025-12-04T08:54:18.6373288Z * [new branch] gh/mikaylagawarecki/388/head -> origin/gh/mikaylagawarecki/388/head 2025-12-04T08:54:18.6373378Z * [new branch] gh/mikaylagawarecki/388/orig -> origin/gh/mikaylagawarecki/388/orig 2025-12-04T08:54:18.6373469Z * [new branch] gh/mikaylagawarecki/389/base -> origin/gh/mikaylagawarecki/389/base 2025-12-04T08:54:18.6373564Z * [new branch] gh/mikaylagawarecki/389/head -> origin/gh/mikaylagawarecki/389/head 2025-12-04T08:54:18.6373655Z * [new branch] gh/mikaylagawarecki/389/orig -> origin/gh/mikaylagawarecki/389/orig 2025-12-04T08:54:18.6373745Z * [new branch] gh/mikaylagawarecki/390/base -> origin/gh/mikaylagawarecki/390/base 2025-12-04T08:54:18.6373839Z * [new branch] gh/mikaylagawarecki/390/head -> origin/gh/mikaylagawarecki/390/head 2025-12-04T08:54:18.6373929Z * [new branch] gh/mikaylagawarecki/390/orig -> origin/gh/mikaylagawarecki/390/orig 2025-12-04T08:54:18.6374020Z * [new branch] gh/mikaylagawarecki/391/base -> origin/gh/mikaylagawarecki/391/base 2025-12-04T08:54:18.6374112Z * [new branch] gh/mikaylagawarecki/391/head -> origin/gh/mikaylagawarecki/391/head 2025-12-04T08:54:18.6374203Z * [new branch] gh/mikaylagawarecki/391/orig -> origin/gh/mikaylagawarecki/391/orig 2025-12-04T08:54:18.6374313Z * [new branch] gh/mikaylagawarecki/392/base -> origin/gh/mikaylagawarecki/392/base 2025-12-04T08:54:18.6374405Z * [new branch] gh/mikaylagawarecki/392/head -> origin/gh/mikaylagawarecki/392/head 2025-12-04T08:54:18.6374495Z * [new branch] gh/mikaylagawarecki/392/orig -> origin/gh/mikaylagawarecki/392/orig 2025-12-04T08:54:18.6374568Z * [new branch] gh/mlazos/41/base -> origin/gh/mlazos/41/base 2025-12-04T08:54:18.6374637Z * [new branch] gh/mlazos/41/head -> origin/gh/mlazos/41/head 2025-12-04T08:54:18.6374707Z * [new branch] gh/mlazos/41/orig -> origin/gh/mlazos/41/orig 2025-12-04T08:54:18.6374777Z * [new branch] gh/mlazos/42/base -> origin/gh/mlazos/42/base 2025-12-04T08:54:18.6374844Z * [new branch] gh/mlazos/42/head -> origin/gh/mlazos/42/head 2025-12-04T08:54:18.6374912Z * [new branch] gh/mlazos/42/orig -> origin/gh/mlazos/42/orig 2025-12-04T08:54:18.6374982Z * [new branch] gh/mlazos/43/base -> origin/gh/mlazos/43/base 2025-12-04T08:54:18.6375047Z * [new branch] gh/mlazos/43/head -> origin/gh/mlazos/43/head 2025-12-04T08:54:18.6375113Z * [new branch] gh/mlazos/43/orig -> origin/gh/mlazos/43/orig 2025-12-04T08:54:18.6375180Z * [new branch] gh/mlazos/44/base -> origin/gh/mlazos/44/base 2025-12-04T08:54:18.6375246Z * [new branch] gh/mlazos/44/head -> origin/gh/mlazos/44/head 2025-12-04T08:54:18.6375314Z * [new branch] gh/mlazos/44/orig -> origin/gh/mlazos/44/orig 2025-12-04T08:54:18.6375381Z * [new branch] gh/mlazos/47/base -> origin/gh/mlazos/47/base 2025-12-04T08:54:18.6375446Z * [new branch] gh/mlazos/47/head -> origin/gh/mlazos/47/head 2025-12-04T08:54:18.6375512Z * [new branch] gh/mlazos/47/orig -> origin/gh/mlazos/47/orig 2025-12-04T08:54:18.6375582Z * [new branch] gh/mlazos/48/base -> origin/gh/mlazos/48/base 2025-12-04T08:54:18.6375649Z * [new branch] gh/mlazos/48/head -> origin/gh/mlazos/48/head 2025-12-04T08:54:18.6375714Z * [new branch] gh/mlazos/48/orig -> origin/gh/mlazos/48/orig 2025-12-04T08:54:18.6375781Z * [new branch] gh/mlazos/49/base -> origin/gh/mlazos/49/base 2025-12-04T08:54:18.6375846Z * [new branch] gh/mlazos/49/head -> origin/gh/mlazos/49/head 2025-12-04T08:54:18.6375935Z * [new branch] gh/mlazos/49/orig -> origin/gh/mlazos/49/orig 2025-12-04T08:54:18.6376003Z * [new branch] gh/mlazos/50/base -> origin/gh/mlazos/50/base 2025-12-04T08:54:18.6376069Z * [new branch] gh/mlazos/50/head -> origin/gh/mlazos/50/head 2025-12-04T08:54:18.6376138Z * [new branch] gh/mlazos/50/orig -> origin/gh/mlazos/50/orig 2025-12-04T08:54:18.6376207Z * [new branch] gh/mlazos/51/base -> origin/gh/mlazos/51/base 2025-12-04T08:54:18.6376272Z * [new branch] gh/mlazos/51/head -> origin/gh/mlazos/51/head 2025-12-04T08:54:18.6376339Z * [new branch] gh/mlazos/51/orig -> origin/gh/mlazos/51/orig 2025-12-04T08:54:18.6376405Z * [new branch] gh/mlazos/52/base -> origin/gh/mlazos/52/base 2025-12-04T08:54:18.6376471Z * [new branch] gh/mlazos/52/head -> origin/gh/mlazos/52/head 2025-12-04T08:54:18.6376540Z * [new branch] gh/mlazos/52/orig -> origin/gh/mlazos/52/orig 2025-12-04T08:54:18.6376606Z * [new branch] gh/mlazos/53/base -> origin/gh/mlazos/53/base 2025-12-04T08:54:18.6376671Z * [new branch] gh/mlazos/53/head -> origin/gh/mlazos/53/head 2025-12-04T08:54:18.6376739Z * [new branch] gh/mlazos/53/orig -> origin/gh/mlazos/53/orig 2025-12-04T08:54:18.6376828Z * [new branch] gh/mlazos/54/base -> origin/gh/mlazos/54/base 2025-12-04T08:54:18.6376894Z * [new branch] gh/mlazos/54/head -> origin/gh/mlazos/54/head 2025-12-04T08:54:18.6376962Z * [new branch] gh/mlazos/54/orig -> origin/gh/mlazos/54/orig 2025-12-04T08:54:18.6377027Z * [new branch] gh/mlazos/55/base -> origin/gh/mlazos/55/base 2025-12-04T08:54:18.6377093Z * [new branch] gh/mlazos/55/head -> origin/gh/mlazos/55/head 2025-12-04T08:54:18.6377162Z * [new branch] gh/mlazos/55/orig -> origin/gh/mlazos/55/orig 2025-12-04T08:54:18.6377230Z * [new branch] gh/mlazos/56/base -> origin/gh/mlazos/56/base 2025-12-04T08:54:18.6377296Z * [new branch] gh/mlazos/56/head -> origin/gh/mlazos/56/head 2025-12-04T08:54:18.6377363Z * [new branch] gh/mlazos/56/orig -> origin/gh/mlazos/56/orig 2025-12-04T08:54:18.6377431Z * [new branch] gh/mlazos/57/base -> origin/gh/mlazos/57/base 2025-12-04T08:54:18.6377497Z * [new branch] gh/mlazos/57/head -> origin/gh/mlazos/57/head 2025-12-04T08:54:18.6377564Z * [new branch] gh/mlazos/57/orig -> origin/gh/mlazos/57/orig 2025-12-04T08:54:18.6377630Z * [new branch] gh/mlazos/58/base -> origin/gh/mlazos/58/base 2025-12-04T08:54:18.6377695Z * [new branch] gh/mlazos/58/head -> origin/gh/mlazos/58/head 2025-12-04T08:54:18.6377763Z * [new branch] gh/mlazos/58/orig -> origin/gh/mlazos/58/orig 2025-12-04T08:54:18.6377830Z * [new branch] gh/mlazos/59/base -> origin/gh/mlazos/59/base 2025-12-04T08:54:18.6377897Z * [new branch] gh/mlazos/59/head -> origin/gh/mlazos/59/head 2025-12-04T08:54:18.6377964Z * [new branch] gh/mlazos/59/orig -> origin/gh/mlazos/59/orig 2025-12-04T08:54:18.6378032Z * [new branch] gh/mlazos/60/base -> origin/gh/mlazos/60/base 2025-12-04T08:54:18.6378177Z * [new branch] gh/mlazos/60/head -> origin/gh/mlazos/60/head 2025-12-04T08:54:18.6378245Z * [new branch] gh/mlazos/60/orig -> origin/gh/mlazos/60/orig 2025-12-04T08:54:18.6378311Z * [new branch] gh/mlazos/61/base -> origin/gh/mlazos/61/base 2025-12-04T08:54:18.6378378Z * [new branch] gh/mlazos/61/head -> origin/gh/mlazos/61/head 2025-12-04T08:54:18.6378443Z * [new branch] gh/mlazos/61/orig -> origin/gh/mlazos/61/orig 2025-12-04T08:54:18.6378550Z * [new branch] gh/mlazos/62/base -> origin/gh/mlazos/62/base 2025-12-04T08:54:18.6378619Z * [new branch] gh/mlazos/62/head -> origin/gh/mlazos/62/head 2025-12-04T08:54:18.6378684Z * [new branch] gh/mlazos/62/orig -> origin/gh/mlazos/62/orig 2025-12-04T08:54:18.6378751Z * [new branch] gh/mlazos/63/base -> origin/gh/mlazos/63/base 2025-12-04T08:54:18.6378818Z * [new branch] gh/mlazos/63/head -> origin/gh/mlazos/63/head 2025-12-04T08:54:18.6378884Z * [new branch] gh/mlazos/63/orig -> origin/gh/mlazos/63/orig 2025-12-04T08:54:18.6378949Z * [new branch] gh/mlazos/64/base -> origin/gh/mlazos/64/base 2025-12-04T08:54:18.6379016Z * [new branch] gh/mlazos/64/head -> origin/gh/mlazos/64/head 2025-12-04T08:54:18.6379082Z * [new branch] gh/mlazos/64/orig -> origin/gh/mlazos/64/orig 2025-12-04T08:54:18.6379150Z * [new branch] gh/mlazos/65/base -> origin/gh/mlazos/65/base 2025-12-04T08:54:18.6379217Z * [new branch] gh/mlazos/65/head -> origin/gh/mlazos/65/head 2025-12-04T08:54:18.6379283Z * [new branch] gh/mlazos/65/orig -> origin/gh/mlazos/65/orig 2025-12-04T08:54:18.6379380Z * [new branch] gh/mlazos/66/base -> origin/gh/mlazos/66/base 2025-12-04T08:54:18.6379447Z * [new branch] gh/mlazos/66/head -> origin/gh/mlazos/66/head 2025-12-04T08:54:18.6379513Z * [new branch] gh/mlazos/66/orig -> origin/gh/mlazos/66/orig 2025-12-04T08:54:18.6379580Z * [new branch] gh/mlazos/67/base -> origin/gh/mlazos/67/base 2025-12-04T08:54:18.6379646Z * [new branch] gh/mlazos/67/head -> origin/gh/mlazos/67/head 2025-12-04T08:54:18.6379712Z * [new branch] gh/mlazos/67/orig -> origin/gh/mlazos/67/orig 2025-12-04T08:54:18.6379781Z * [new branch] gh/mlazos/68/base -> origin/gh/mlazos/68/base 2025-12-04T08:54:18.6379847Z * [new branch] gh/mlazos/68/head -> origin/gh/mlazos/68/head 2025-12-04T08:54:18.6379914Z * [new branch] gh/mlazos/68/orig -> origin/gh/mlazos/68/orig 2025-12-04T08:54:18.6379981Z * [new branch] gh/mlazos/69/base -> origin/gh/mlazos/69/base 2025-12-04T08:54:18.6380048Z * [new branch] gh/mlazos/69/head -> origin/gh/mlazos/69/head 2025-12-04T08:54:18.6380114Z * [new branch] gh/mlazos/69/orig -> origin/gh/mlazos/69/orig 2025-12-04T08:54:18.6380181Z * [new branch] gh/mlazos/70/base -> origin/gh/mlazos/70/base 2025-12-04T08:54:18.6380246Z * [new branch] gh/mlazos/70/head -> origin/gh/mlazos/70/head 2025-12-04T08:54:18.6380312Z * [new branch] gh/mlazos/70/orig -> origin/gh/mlazos/70/orig 2025-12-04T08:54:18.6380380Z * [new branch] gh/mlazos/71/base -> origin/gh/mlazos/71/base 2025-12-04T08:54:18.6380446Z * [new branch] gh/mlazos/71/head -> origin/gh/mlazos/71/head 2025-12-04T08:54:18.6380511Z * [new branch] gh/mlazos/71/orig -> origin/gh/mlazos/71/orig 2025-12-04T08:54:18.6380579Z * [new branch] gh/mlazos/72/base -> origin/gh/mlazos/72/base 2025-12-04T08:54:18.6380646Z * [new branch] gh/mlazos/72/head -> origin/gh/mlazos/72/head 2025-12-04T08:54:18.6380711Z * [new branch] gh/mlazos/72/orig -> origin/gh/mlazos/72/orig 2025-12-04T08:54:18.6380778Z * [new branch] gh/mlazos/73/base -> origin/gh/mlazos/73/base 2025-12-04T08:54:18.6380843Z * [new branch] gh/mlazos/73/head -> origin/gh/mlazos/73/head 2025-12-04T08:54:18.6380908Z * [new branch] gh/mlazos/73/orig -> origin/gh/mlazos/73/orig 2025-12-04T08:54:18.6381004Z * [new branch] gh/mrmiywj/1/base -> origin/gh/mrmiywj/1/base 2025-12-04T08:54:18.6381072Z * [new branch] gh/mrmiywj/1/head -> origin/gh/mrmiywj/1/head 2025-12-04T08:54:18.6381151Z * [new branch] gh/muchulee8/73/base -> origin/gh/muchulee8/73/base 2025-12-04T08:54:18.6381226Z * [new branch] gh/muchulee8/73/head -> origin/gh/muchulee8/73/head 2025-12-04T08:54:18.6381299Z * [new branch] gh/muchulee8/73/orig -> origin/gh/muchulee8/73/orig 2025-12-04T08:54:18.6381386Z * [new branch] gh/naveenthangudu/1/base -> origin/gh/naveenthangudu/1/base 2025-12-04T08:54:18.6381468Z * [new branch] gh/naveenthangudu/1/head -> origin/gh/naveenthangudu/1/head 2025-12-04T08:54:18.6381548Z * [new branch] gh/naveenthangudu/1/orig -> origin/gh/naveenthangudu/1/orig 2025-12-04T08:54:18.6381630Z * [new branch] gh/naveenthangudu/2/base -> origin/gh/naveenthangudu/2/base 2025-12-04T08:54:18.6381709Z * [new branch] gh/naveenthangudu/2/head -> origin/gh/naveenthangudu/2/head 2025-12-04T08:54:18.6381789Z * [new branch] gh/naveenthangudu/2/orig -> origin/gh/naveenthangudu/2/orig 2025-12-04T08:54:18.6381869Z * [new branch] gh/naveenthangudu/3/base -> origin/gh/naveenthangudu/3/base 2025-12-04T08:54:18.6381970Z * [new branch] gh/naveenthangudu/3/head -> origin/gh/naveenthangudu/3/head 2025-12-04T08:54:18.6382049Z * [new branch] gh/naveenthangudu/3/orig -> origin/gh/naveenthangudu/3/orig 2025-12-04T08:54:18.6382128Z * [new branch] gh/naveenthangudu/4/base -> origin/gh/naveenthangudu/4/base 2025-12-04T08:54:18.6382207Z * [new branch] gh/naveenthangudu/4/head -> origin/gh/naveenthangudu/4/head 2025-12-04T08:54:18.6382285Z * [new branch] gh/naveenthangudu/4/orig -> origin/gh/naveenthangudu/4/orig 2025-12-04T08:54:18.6382364Z * [new branch] gh/naveenthangudu/5/base -> origin/gh/naveenthangudu/5/base 2025-12-04T08:54:18.6382445Z * [new branch] gh/naveenthangudu/5/head -> origin/gh/naveenthangudu/5/head 2025-12-04T08:54:18.6382524Z * [new branch] gh/naveenthangudu/5/orig -> origin/gh/naveenthangudu/5/orig 2025-12-04T08:54:18.6382603Z * [new branch] gh/naveenthangudu/6/base -> origin/gh/naveenthangudu/6/base 2025-12-04T08:54:18.6382684Z * [new branch] gh/naveenthangudu/6/head -> origin/gh/naveenthangudu/6/head 2025-12-04T08:54:18.6382765Z * [new branch] gh/naveenthangudu/6/orig -> origin/gh/naveenthangudu/6/orig 2025-12-04T08:54:18.6382844Z * [new branch] gh/naveenthangudu/7/base -> origin/gh/naveenthangudu/7/base 2025-12-04T08:54:18.6382922Z * [new branch] gh/naveenthangudu/7/head -> origin/gh/naveenthangudu/7/head 2025-12-04T08:54:18.6383002Z * [new branch] gh/naveenthangudu/7/orig -> origin/gh/naveenthangudu/7/orig 2025-12-04T08:54:18.6383081Z * [new branch] gh/naveenthangudu/8/base -> origin/gh/naveenthangudu/8/base 2025-12-04T08:54:18.6383159Z * [new branch] gh/naveenthangudu/8/head -> origin/gh/naveenthangudu/8/head 2025-12-04T08:54:18.6383239Z * [new branch] gh/naveenthangudu/8/orig -> origin/gh/naveenthangudu/8/orig 2025-12-04T08:54:18.6383318Z * [new branch] gh/naveenthangudu/9/base -> origin/gh/naveenthangudu/9/base 2025-12-04T08:54:18.6383396Z * [new branch] gh/naveenthangudu/9/head -> origin/gh/naveenthangudu/9/head 2025-12-04T08:54:18.6383477Z * [new branch] gh/naveenthangudu/9/orig -> origin/gh/naveenthangudu/9/orig 2025-12-04T08:54:18.6383550Z * [new branch] gh/nikitaved/1/base -> origin/gh/nikitaved/1/base 2025-12-04T08:54:18.6383623Z * [new branch] gh/nikitaved/1/head -> origin/gh/nikitaved/1/head 2025-12-04T08:54:18.6383695Z * [new branch] gh/nikitaved/1/orig -> origin/gh/nikitaved/1/orig 2025-12-04T08:54:18.6383787Z * [new branch] gh/nikitaved/10/base -> origin/gh/nikitaved/10/base 2025-12-04T08:54:18.6383858Z * [new branch] gh/nikitaved/10/head -> origin/gh/nikitaved/10/head 2025-12-04T08:54:18.6383930Z * [new branch] gh/nikitaved/10/orig -> origin/gh/nikitaved/10/orig 2025-12-04T08:54:18.6384002Z * [new branch] gh/nikitaved/11/base -> origin/gh/nikitaved/11/base 2025-12-04T08:54:18.6384073Z * [new branch] gh/nikitaved/11/head -> origin/gh/nikitaved/11/head 2025-12-04T08:54:18.6384144Z * [new branch] gh/nikitaved/11/orig -> origin/gh/nikitaved/11/orig 2025-12-04T08:54:18.6384216Z * [new branch] gh/nikitaved/12/base -> origin/gh/nikitaved/12/base 2025-12-04T08:54:18.6384288Z * [new branch] gh/nikitaved/12/head -> origin/gh/nikitaved/12/head 2025-12-04T08:54:18.6384359Z * [new branch] gh/nikitaved/12/orig -> origin/gh/nikitaved/12/orig 2025-12-04T08:54:18.6384431Z * [new branch] gh/nikitaved/13/base -> origin/gh/nikitaved/13/base 2025-12-04T08:54:18.6384504Z * [new branch] gh/nikitaved/13/head -> origin/gh/nikitaved/13/head 2025-12-04T08:54:18.6384575Z * [new branch] gh/nikitaved/13/orig -> origin/gh/nikitaved/13/orig 2025-12-04T08:54:18.6384668Z * [new branch] gh/nikitaved/14/base -> origin/gh/nikitaved/14/base 2025-12-04T08:54:18.6384741Z * [new branch] gh/nikitaved/14/head -> origin/gh/nikitaved/14/head 2025-12-04T08:54:18.6384812Z * [new branch] gh/nikitaved/14/orig -> origin/gh/nikitaved/14/orig 2025-12-04T08:54:18.6384882Z * [new branch] gh/nikitaved/15/base -> origin/gh/nikitaved/15/base 2025-12-04T08:54:18.6384955Z * [new branch] gh/nikitaved/15/head -> origin/gh/nikitaved/15/head 2025-12-04T08:54:18.6385025Z * [new branch] gh/nikitaved/15/orig -> origin/gh/nikitaved/15/orig 2025-12-04T08:54:18.6385097Z * [new branch] gh/nikitaved/16/base -> origin/gh/nikitaved/16/base 2025-12-04T08:54:18.6385169Z * [new branch] gh/nikitaved/16/head -> origin/gh/nikitaved/16/head 2025-12-04T08:54:18.6385240Z * [new branch] gh/nikitaved/16/orig -> origin/gh/nikitaved/16/orig 2025-12-04T08:54:18.6385313Z * [new branch] gh/nikitaved/2/base -> origin/gh/nikitaved/2/base 2025-12-04T08:54:18.6385385Z * [new branch] gh/nikitaved/2/head -> origin/gh/nikitaved/2/head 2025-12-04T08:54:18.6385455Z * [new branch] gh/nikitaved/2/orig -> origin/gh/nikitaved/2/orig 2025-12-04T08:54:18.6385525Z * [new branch] gh/nikitaved/4/base -> origin/gh/nikitaved/4/base 2025-12-04T08:54:18.6385595Z * [new branch] gh/nikitaved/4/head -> origin/gh/nikitaved/4/head 2025-12-04T08:54:18.6385666Z * [new branch] gh/nikitaved/4/orig -> origin/gh/nikitaved/4/orig 2025-12-04T08:54:18.6385740Z * [new branch] gh/nikitaved/5/base -> origin/gh/nikitaved/5/base 2025-12-04T08:54:18.6385810Z * [new branch] gh/nikitaved/5/head -> origin/gh/nikitaved/5/head 2025-12-04T08:54:18.6385879Z * [new branch] gh/nikitaved/5/orig -> origin/gh/nikitaved/5/orig 2025-12-04T08:54:18.6385952Z * [new branch] gh/nikitaved/6/base -> origin/gh/nikitaved/6/base 2025-12-04T08:54:18.6386021Z * [new branch] gh/nikitaved/6/head -> origin/gh/nikitaved/6/head 2025-12-04T08:54:18.6386090Z * [new branch] gh/nikitaved/6/orig -> origin/gh/nikitaved/6/orig 2025-12-04T08:54:18.6386161Z * [new branch] gh/nikitaved/8/base -> origin/gh/nikitaved/8/base 2025-12-04T08:54:18.6386231Z * [new branch] gh/nikitaved/8/head -> origin/gh/nikitaved/8/head 2025-12-04T08:54:18.6386301Z * [new branch] gh/nikitaved/8/orig -> origin/gh/nikitaved/8/orig 2025-12-04T08:54:18.6386393Z * [new branch] gh/nikitaved/9/base -> origin/gh/nikitaved/9/base 2025-12-04T08:54:18.6386463Z * [new branch] gh/nikitaved/9/head -> origin/gh/nikitaved/9/head 2025-12-04T08:54:18.6386533Z * [new branch] gh/nikitaved/9/orig -> origin/gh/nikitaved/9/orig 2025-12-04T08:54:18.6386604Z * [new branch] gh/oulgen/10/base -> origin/gh/oulgen/10/base 2025-12-04T08:54:18.6386672Z * [new branch] gh/oulgen/10/head -> origin/gh/oulgen/10/head 2025-12-04T08:54:18.6386740Z * [new branch] gh/oulgen/10/orig -> origin/gh/oulgen/10/orig 2025-12-04T08:54:18.6386808Z * [new branch] gh/oulgen/11/base -> origin/gh/oulgen/11/base 2025-12-04T08:54:18.6386875Z * [new branch] gh/oulgen/11/head -> origin/gh/oulgen/11/head 2025-12-04T08:54:18.6386940Z * [new branch] gh/oulgen/11/orig -> origin/gh/oulgen/11/orig 2025-12-04T08:54:18.6387011Z * [new branch] gh/oulgen/12/base -> origin/gh/oulgen/12/base 2025-12-04T08:54:18.6387077Z * [new branch] gh/oulgen/12/head -> origin/gh/oulgen/12/head 2025-12-04T08:54:18.6387146Z * [new branch] gh/oulgen/12/orig -> origin/gh/oulgen/12/orig 2025-12-04T08:54:18.6387237Z * [new branch] gh/oulgen/13/base -> origin/gh/oulgen/13/base 2025-12-04T08:54:18.6387304Z * [new branch] gh/oulgen/13/head -> origin/gh/oulgen/13/head 2025-12-04T08:54:18.6387370Z * [new branch] gh/oulgen/13/orig -> origin/gh/oulgen/13/orig 2025-12-04T08:54:18.6387437Z * [new branch] gh/oulgen/14/base -> origin/gh/oulgen/14/base 2025-12-04T08:54:18.6387504Z * [new branch] gh/oulgen/14/head -> origin/gh/oulgen/14/head 2025-12-04T08:54:18.6387570Z * [new branch] gh/oulgen/14/orig -> origin/gh/oulgen/14/orig 2025-12-04T08:54:18.6387639Z * [new branch] gh/oulgen/15/base -> origin/gh/oulgen/15/base 2025-12-04T08:54:18.6387706Z * [new branch] gh/oulgen/15/head -> origin/gh/oulgen/15/head 2025-12-04T08:54:18.6387771Z * [new branch] gh/oulgen/15/orig -> origin/gh/oulgen/15/orig 2025-12-04T08:54:18.6387837Z * [new branch] gh/oulgen/16/base -> origin/gh/oulgen/16/base 2025-12-04T08:54:18.6387905Z * [new branch] gh/oulgen/16/head -> origin/gh/oulgen/16/head 2025-12-04T08:54:18.6387970Z * [new branch] gh/oulgen/16/orig -> origin/gh/oulgen/16/orig 2025-12-04T08:54:18.6388035Z * [new branch] gh/oulgen/17/base -> origin/gh/oulgen/17/base 2025-12-04T08:54:18.6388150Z * [new branch] gh/oulgen/17/head -> origin/gh/oulgen/17/head 2025-12-04T08:54:18.6388218Z * [new branch] gh/oulgen/17/orig -> origin/gh/oulgen/17/orig 2025-12-04T08:54:18.6388286Z * [new branch] gh/oulgen/18/base -> origin/gh/oulgen/18/base 2025-12-04T08:54:18.6388353Z * [new branch] gh/oulgen/18/head -> origin/gh/oulgen/18/head 2025-12-04T08:54:18.6388419Z * [new branch] gh/oulgen/18/orig -> origin/gh/oulgen/18/orig 2025-12-04T08:54:18.6388484Z * [new branch] gh/oulgen/19/base -> origin/gh/oulgen/19/base 2025-12-04T08:54:18.6388553Z * [new branch] gh/oulgen/19/head -> origin/gh/oulgen/19/head 2025-12-04T08:54:18.6388618Z * [new branch] gh/oulgen/19/orig -> origin/gh/oulgen/19/orig 2025-12-04T08:54:18.6388684Z * [new branch] gh/oulgen/20/base -> origin/gh/oulgen/20/base 2025-12-04T08:54:18.6388752Z * [new branch] gh/oulgen/20/head -> origin/gh/oulgen/20/head 2025-12-04T08:54:18.6388818Z * [new branch] gh/oulgen/20/orig -> origin/gh/oulgen/20/orig 2025-12-04T08:54:18.6388914Z * [new branch] gh/oulgen/21/base -> origin/gh/oulgen/21/base 2025-12-04T08:54:18.6388980Z * [new branch] gh/oulgen/21/head -> origin/gh/oulgen/21/head 2025-12-04T08:54:18.6389045Z * [new branch] gh/oulgen/21/orig -> origin/gh/oulgen/21/orig 2025-12-04T08:54:18.6389112Z * [new branch] gh/oulgen/22/base -> origin/gh/oulgen/22/base 2025-12-04T08:54:18.6389179Z * [new branch] gh/oulgen/22/head -> origin/gh/oulgen/22/head 2025-12-04T08:54:18.6389245Z * [new branch] gh/oulgen/22/orig -> origin/gh/oulgen/22/orig 2025-12-04T08:54:18.6389314Z * [new branch] gh/oulgen/23/base -> origin/gh/oulgen/23/base 2025-12-04T08:54:18.6389379Z * [new branch] gh/oulgen/23/head -> origin/gh/oulgen/23/head 2025-12-04T08:54:18.6389451Z * [new branch] gh/oulgen/23/orig -> origin/gh/oulgen/23/orig 2025-12-04T08:54:18.6389529Z * [new branch] gh/oulgen/24/base -> origin/gh/oulgen/24/base 2025-12-04T08:54:18.6389601Z * [new branch] gh/oulgen/24/head -> origin/gh/oulgen/24/head 2025-12-04T08:54:18.6389667Z * [new branch] gh/oulgen/24/orig -> origin/gh/oulgen/24/orig 2025-12-04T08:54:18.6389734Z * [new branch] gh/oulgen/25/base -> origin/gh/oulgen/25/base 2025-12-04T08:54:18.6389829Z * [new branch] gh/oulgen/25/head -> origin/gh/oulgen/25/head 2025-12-04T08:54:18.6389895Z * [new branch] gh/oulgen/25/orig -> origin/gh/oulgen/25/orig 2025-12-04T08:54:18.6389962Z * [new branch] gh/oulgen/26/base -> origin/gh/oulgen/26/base 2025-12-04T08:54:18.6390028Z * [new branch] gh/oulgen/26/head -> origin/gh/oulgen/26/head 2025-12-04T08:54:18.6390093Z * [new branch] gh/oulgen/26/orig -> origin/gh/oulgen/26/orig 2025-12-04T08:54:18.6390163Z * [new branch] gh/oulgen/4/base -> origin/gh/oulgen/4/base 2025-12-04T08:54:18.6390232Z * [new branch] gh/oulgen/4/head -> origin/gh/oulgen/4/head 2025-12-04T08:54:18.6390299Z * [new branch] gh/oulgen/4/orig -> origin/gh/oulgen/4/orig 2025-12-04T08:54:18.6390366Z * [new branch] gh/oulgen/7/base -> origin/gh/oulgen/7/base 2025-12-04T08:54:18.6390433Z * [new branch] gh/oulgen/7/head -> origin/gh/oulgen/7/head 2025-12-04T08:54:18.6390500Z * [new branch] gh/oulgen/7/orig -> origin/gh/oulgen/7/orig 2025-12-04T08:54:18.6390564Z * [new branch] gh/oulgen/8/base -> origin/gh/oulgen/8/base 2025-12-04T08:54:18.6390629Z * [new branch] gh/oulgen/8/head -> origin/gh/oulgen/8/head 2025-12-04T08:54:18.6390695Z * [new branch] gh/oulgen/8/orig -> origin/gh/oulgen/8/orig 2025-12-04T08:54:18.6390761Z * [new branch] gh/oulgen/9/base -> origin/gh/oulgen/9/base 2025-12-04T08:54:18.6390827Z * [new branch] gh/oulgen/9/head -> origin/gh/oulgen/9/head 2025-12-04T08:54:18.6390892Z * [new branch] gh/oulgen/9/orig -> origin/gh/oulgen/9/orig 2025-12-04T08:54:18.6390996Z * [new branch] gh/patvig/mtia-serialization -> origin/gh/patvig/mtia-serialization 2025-12-04T08:54:18.6391067Z * [new branch] gh/pearu/108/base -> origin/gh/pearu/108/base 2025-12-04T08:54:18.6391135Z * [new branch] gh/pearu/108/head -> origin/gh/pearu/108/head 2025-12-04T08:54:18.6391202Z * [new branch] gh/pearu/108/orig -> origin/gh/pearu/108/orig 2025-12-04T08:54:18.6391269Z * [new branch] gh/pearu/109/base -> origin/gh/pearu/109/base 2025-12-04T08:54:18.6391336Z * [new branch] gh/pearu/109/head -> origin/gh/pearu/109/head 2025-12-04T08:54:18.6391401Z * [new branch] gh/pearu/109/orig -> origin/gh/pearu/109/orig 2025-12-04T08:54:18.6391488Z * [new branch] gh/pearu/110/base -> origin/gh/pearu/110/base 2025-12-04T08:54:18.6391555Z * [new branch] gh/pearu/110/head -> origin/gh/pearu/110/head 2025-12-04T08:54:18.6391622Z * [new branch] gh/pearu/110/orig -> origin/gh/pearu/110/orig 2025-12-04T08:54:18.6391689Z * [new branch] gh/pearu/111/base -> origin/gh/pearu/111/base 2025-12-04T08:54:18.6391757Z * [new branch] gh/pearu/111/head -> origin/gh/pearu/111/head 2025-12-04T08:54:18.6391823Z * [new branch] gh/pearu/111/orig -> origin/gh/pearu/111/orig 2025-12-04T08:54:18.6391889Z * [new branch] gh/pearu/112/base -> origin/gh/pearu/112/base 2025-12-04T08:54:18.6391956Z * [new branch] gh/pearu/112/head -> origin/gh/pearu/112/head 2025-12-04T08:54:18.6392021Z * [new branch] gh/pearu/112/orig -> origin/gh/pearu/112/orig 2025-12-04T08:54:18.6392089Z * [new branch] gh/pearu/115/base -> origin/gh/pearu/115/base 2025-12-04T08:54:18.6392155Z * [new branch] gh/pearu/115/head -> origin/gh/pearu/115/head 2025-12-04T08:54:18.6392221Z * [new branch] gh/pearu/115/orig -> origin/gh/pearu/115/orig 2025-12-04T08:54:18.6392308Z * [new branch] gh/pearu/116/base -> origin/gh/pearu/116/base 2025-12-04T08:54:18.6392374Z * [new branch] gh/pearu/116/head -> origin/gh/pearu/116/head 2025-12-04T08:54:18.6392440Z * [new branch] gh/pearu/116/orig -> origin/gh/pearu/116/orig 2025-12-04T08:54:18.6392506Z * [new branch] gh/pearu/117/base -> origin/gh/pearu/117/base 2025-12-04T08:54:18.6392572Z * [new branch] gh/pearu/117/head -> origin/gh/pearu/117/head 2025-12-04T08:54:18.6392637Z * [new branch] gh/pearu/117/orig -> origin/gh/pearu/117/orig 2025-12-04T08:54:18.6392706Z * [new branch] gh/pearu/118/base -> origin/gh/pearu/118/base 2025-12-04T08:54:18.6392771Z * [new branch] gh/pearu/118/head -> origin/gh/pearu/118/head 2025-12-04T08:54:18.6392837Z * [new branch] gh/pearu/118/orig -> origin/gh/pearu/118/orig 2025-12-04T08:54:18.6392905Z * [new branch] gh/pearu/119/base -> origin/gh/pearu/119/base 2025-12-04T08:54:18.6392971Z * [new branch] gh/pearu/119/head -> origin/gh/pearu/119/head 2025-12-04T08:54:18.6393037Z * [new branch] gh/pearu/119/orig -> origin/gh/pearu/119/orig 2025-12-04T08:54:18.6393104Z * [new branch] gh/pearu/139/base -> origin/gh/pearu/139/base 2025-12-04T08:54:18.6393170Z * [new branch] gh/pearu/139/head -> origin/gh/pearu/139/head 2025-12-04T08:54:18.6393236Z * [new branch] gh/pearu/139/orig -> origin/gh/pearu/139/orig 2025-12-04T08:54:18.6393308Z * [new branch] gh/pearu/140/base -> origin/gh/pearu/140/base 2025-12-04T08:54:18.6393376Z * [new branch] gh/pearu/140/head -> origin/gh/pearu/140/head 2025-12-04T08:54:18.6393441Z * [new branch] gh/pearu/140/orig -> origin/gh/pearu/140/orig 2025-12-04T08:54:18.6393508Z * [new branch] gh/pearu/142/base -> origin/gh/pearu/142/base 2025-12-04T08:54:18.6393575Z * [new branch] gh/pearu/142/head -> origin/gh/pearu/142/head 2025-12-04T08:54:18.6393643Z * [new branch] gh/pearu/142/orig -> origin/gh/pearu/142/orig 2025-12-04T08:54:18.6393709Z * [new branch] gh/pearu/143/base -> origin/gh/pearu/143/base 2025-12-04T08:54:18.6393773Z * [new branch] gh/pearu/143/head -> origin/gh/pearu/143/head 2025-12-04T08:54:18.6393839Z * [new branch] gh/pearu/143/orig -> origin/gh/pearu/143/orig 2025-12-04T08:54:18.6393923Z * [new branch] gh/pearu/147/base -> origin/gh/pearu/147/base 2025-12-04T08:54:18.6393989Z * [new branch] gh/pearu/147/head -> origin/gh/pearu/147/head 2025-12-04T08:54:18.6394056Z * [new branch] gh/pearu/147/orig -> origin/gh/pearu/147/orig 2025-12-04T08:54:18.6394121Z * [new branch] gh/pearu/149/base -> origin/gh/pearu/149/base 2025-12-04T08:54:18.6394189Z * [new branch] gh/pearu/149/head -> origin/gh/pearu/149/head 2025-12-04T08:54:18.6394257Z * [new branch] gh/pearu/149/orig -> origin/gh/pearu/149/orig 2025-12-04T08:54:18.6394323Z * [new branch] gh/pearu/150/base -> origin/gh/pearu/150/base 2025-12-04T08:54:18.6394388Z * [new branch] gh/pearu/150/head -> origin/gh/pearu/150/head 2025-12-04T08:54:18.6394455Z * [new branch] gh/pearu/150/orig -> origin/gh/pearu/150/orig 2025-12-04T08:54:18.6394522Z * [new branch] gh/pearu/151/base -> origin/gh/pearu/151/base 2025-12-04T08:54:18.6394588Z * [new branch] gh/pearu/151/head -> origin/gh/pearu/151/head 2025-12-04T08:54:18.6394654Z * [new branch] gh/pearu/151/orig -> origin/gh/pearu/151/orig 2025-12-04T08:54:18.6394721Z * [new branch] gh/pearu/152/base -> origin/gh/pearu/152/base 2025-12-04T08:54:18.6394804Z * [new branch] gh/pearu/152/head -> origin/gh/pearu/152/head 2025-12-04T08:54:18.6394871Z * [new branch] gh/pearu/152/orig -> origin/gh/pearu/152/orig 2025-12-04T08:54:18.6394938Z * [new branch] gh/pearu/153/base -> origin/gh/pearu/153/base 2025-12-04T08:54:18.6395004Z * [new branch] gh/pearu/153/head -> origin/gh/pearu/153/head 2025-12-04T08:54:18.6395072Z * [new branch] gh/pearu/153/orig -> origin/gh/pearu/153/orig 2025-12-04T08:54:18.6395138Z * [new branch] gh/pearu/154/base -> origin/gh/pearu/154/base 2025-12-04T08:54:18.6395206Z * [new branch] gh/pearu/154/head -> origin/gh/pearu/154/head 2025-12-04T08:54:18.6395273Z * [new branch] gh/pearu/154/orig -> origin/gh/pearu/154/orig 2025-12-04T08:54:18.6395339Z * [new branch] gh/pearu/155/base -> origin/gh/pearu/155/base 2025-12-04T08:54:18.6395407Z * [new branch] gh/pearu/155/head -> origin/gh/pearu/155/head 2025-12-04T08:54:18.6395473Z * [new branch] gh/pearu/155/orig -> origin/gh/pearu/155/orig 2025-12-04T08:54:18.6395539Z * [new branch] gh/pearu/156/base -> origin/gh/pearu/156/base 2025-12-04T08:54:18.6395606Z * [new branch] gh/pearu/156/head -> origin/gh/pearu/156/head 2025-12-04T08:54:18.6395671Z * [new branch] gh/pearu/156/orig -> origin/gh/pearu/156/orig 2025-12-04T08:54:18.6395737Z * [new branch] gh/pearu/56/base -> origin/gh/pearu/56/base 2025-12-04T08:54:18.6395805Z * [new branch] gh/pearu/56/head -> origin/gh/pearu/56/head 2025-12-04T08:54:18.6395871Z * [new branch] gh/pearu/56/orig -> origin/gh/pearu/56/orig 2025-12-04T08:54:18.6395936Z * [new branch] gh/pearu/97/base -> origin/gh/pearu/97/base 2025-12-04T08:54:18.6396004Z * [new branch] gh/pearu/97/head -> origin/gh/pearu/97/head 2025-12-04T08:54:18.6396070Z * [new branch] gh/pearu/97/orig -> origin/gh/pearu/97/orig 2025-12-04T08:54:18.6396146Z * [new branch] gh/pianpwk/21/base -> origin/gh/pianpwk/21/base 2025-12-04T08:54:18.6396220Z * [new branch] gh/pianpwk/21/head -> origin/gh/pianpwk/21/head 2025-12-04T08:54:18.6396290Z * [new branch] gh/pianpwk/28/base -> origin/gh/pianpwk/28/base 2025-12-04T08:54:18.6396359Z * [new branch] gh/pianpwk/28/head -> origin/gh/pianpwk/28/head 2025-12-04T08:54:18.6396449Z * [new branch] gh/pianpwk/28/orig -> origin/gh/pianpwk/28/orig 2025-12-04T08:54:18.6396517Z * [new branch] gh/pianpwk/29/base -> origin/gh/pianpwk/29/base 2025-12-04T08:54:18.6396586Z * [new branch] gh/pianpwk/29/head -> origin/gh/pianpwk/29/head 2025-12-04T08:54:18.6396656Z * [new branch] gh/pianpwk/29/orig -> origin/gh/pianpwk/29/orig 2025-12-04T08:54:18.6396724Z * [new branch] gh/pianpwk/30/base -> origin/gh/pianpwk/30/base 2025-12-04T08:54:18.6396792Z * [new branch] gh/pianpwk/30/head -> origin/gh/pianpwk/30/head 2025-12-04T08:54:18.6396861Z * [new branch] gh/pianpwk/30/orig -> origin/gh/pianpwk/30/orig 2025-12-04T08:54:18.6396930Z * [new branch] gh/pianpwk/31/base -> origin/gh/pianpwk/31/base 2025-12-04T08:54:18.6397000Z * [new branch] gh/pianpwk/31/head -> origin/gh/pianpwk/31/head 2025-12-04T08:54:18.6397070Z * [new branch] gh/pianpwk/31/orig -> origin/gh/pianpwk/31/orig 2025-12-04T08:54:18.6397138Z * [new branch] gh/pianpwk/32/base -> origin/gh/pianpwk/32/base 2025-12-04T08:54:18.6397208Z * [new branch] gh/pianpwk/32/head -> origin/gh/pianpwk/32/head 2025-12-04T08:54:18.6397302Z * [new branch] gh/pianpwk/32/orig -> origin/gh/pianpwk/32/orig 2025-12-04T08:54:18.6397371Z * [new branch] gh/pianpwk/33/base -> origin/gh/pianpwk/33/base 2025-12-04T08:54:18.6397441Z * [new branch] gh/pianpwk/33/head -> origin/gh/pianpwk/33/head 2025-12-04T08:54:18.6397509Z * [new branch] gh/pianpwk/33/orig -> origin/gh/pianpwk/33/orig 2025-12-04T08:54:18.6397578Z * [new branch] gh/pianpwk/34/base -> origin/gh/pianpwk/34/base 2025-12-04T08:54:18.6397649Z * [new branch] gh/pianpwk/34/head -> origin/gh/pianpwk/34/head 2025-12-04T08:54:18.6397719Z * [new branch] gh/pianpwk/34/orig -> origin/gh/pianpwk/34/orig 2025-12-04T08:54:18.6397787Z * [new branch] gh/pianpwk/35/base -> origin/gh/pianpwk/35/base 2025-12-04T08:54:18.6397856Z * [new branch] gh/pianpwk/35/head -> origin/gh/pianpwk/35/head 2025-12-04T08:54:18.6397926Z * [new branch] gh/pianpwk/35/orig -> origin/gh/pianpwk/35/orig 2025-12-04T08:54:18.6397992Z * [new branch] gh/rec/141/base -> origin/gh/rec/141/base 2025-12-04T08:54:18.6398061Z * [new branch] gh/rec/141/head -> origin/gh/rec/141/head 2025-12-04T08:54:18.6398169Z * [new branch] gh/rec/153/base -> origin/gh/rec/153/base 2025-12-04T08:54:18.6398233Z * [new branch] gh/rec/153/head -> origin/gh/rec/153/head 2025-12-04T08:54:18.6398298Z * [new branch] gh/rec/153/orig -> origin/gh/rec/153/orig 2025-12-04T08:54:18.6398363Z * [new branch] gh/rec/154/base -> origin/gh/rec/154/base 2025-12-04T08:54:18.6398426Z * [new branch] gh/rec/154/head -> origin/gh/rec/154/head 2025-12-04T08:54:18.6398490Z * [new branch] gh/rec/154/orig -> origin/gh/rec/154/orig 2025-12-04T08:54:18.6398552Z * [new branch] gh/rec/164/base -> origin/gh/rec/164/base 2025-12-04T08:54:18.6398616Z * [new branch] gh/rec/164/head -> origin/gh/rec/164/head 2025-12-04T08:54:18.6398681Z * [new branch] gh/rec/164/orig -> origin/gh/rec/164/orig 2025-12-04T08:54:18.6398744Z * [new branch] gh/rec/166/base -> origin/gh/rec/166/base 2025-12-04T08:54:18.6398810Z * [new branch] gh/rec/166/head -> origin/gh/rec/166/head 2025-12-04T08:54:18.6398873Z * [new branch] gh/rec/166/orig -> origin/gh/rec/166/orig 2025-12-04T08:54:18.6398935Z * [new branch] gh/rec/167/base -> origin/gh/rec/167/base 2025-12-04T08:54:18.6399033Z * [new branch] gh/rec/167/head -> origin/gh/rec/167/head 2025-12-04T08:54:18.6399096Z * [new branch] gh/rec/167/orig -> origin/gh/rec/167/orig 2025-12-04T08:54:18.6399159Z * [new branch] gh/rec/168/base -> origin/gh/rec/168/base 2025-12-04T08:54:18.6399224Z * [new branch] gh/rec/168/head -> origin/gh/rec/168/head 2025-12-04T08:54:18.6399287Z * [new branch] gh/rec/168/orig -> origin/gh/rec/168/orig 2025-12-04T08:54:18.6399350Z * [new branch] gh/rec/169/base -> origin/gh/rec/169/base 2025-12-04T08:54:18.6399414Z * [new branch] gh/rec/169/head -> origin/gh/rec/169/head 2025-12-04T08:54:18.6399476Z * [new branch] gh/rec/169/orig -> origin/gh/rec/169/orig 2025-12-04T08:54:18.6399539Z * [new branch] gh/rec/170/base -> origin/gh/rec/170/base 2025-12-04T08:54:18.6399605Z * [new branch] gh/rec/170/head -> origin/gh/rec/170/head 2025-12-04T08:54:18.6399667Z * [new branch] gh/rec/170/orig -> origin/gh/rec/170/orig 2025-12-04T08:54:18.6399729Z * [new branch] gh/rec/171/base -> origin/gh/rec/171/base 2025-12-04T08:54:18.6399829Z * [new branch] gh/rec/171/head -> origin/gh/rec/171/head 2025-12-04T08:54:18.6399893Z * [new branch] gh/rec/171/orig -> origin/gh/rec/171/orig 2025-12-04T08:54:18.6399955Z * [new branch] gh/rec/172/base -> origin/gh/rec/172/base 2025-12-04T08:54:18.6400020Z * [new branch] gh/rec/172/head -> origin/gh/rec/172/head 2025-12-04T08:54:18.6400082Z * [new branch] gh/rec/172/orig -> origin/gh/rec/172/orig 2025-12-04T08:54:18.6400144Z * [new branch] gh/rec/173/base -> origin/gh/rec/173/base 2025-12-04T08:54:18.6400210Z * [new branch] gh/rec/173/head -> origin/gh/rec/173/head 2025-12-04T08:54:18.6400273Z * [new branch] gh/rec/173/orig -> origin/gh/rec/173/orig 2025-12-04T08:54:18.6400335Z * [new branch] gh/rec/174/base -> origin/gh/rec/174/base 2025-12-04T08:54:18.6400399Z * [new branch] gh/rec/174/head -> origin/gh/rec/174/head 2025-12-04T08:54:18.6400462Z * [new branch] gh/rec/174/orig -> origin/gh/rec/174/orig 2025-12-04T08:54:18.6400526Z * [new branch] gh/rec/175/base -> origin/gh/rec/175/base 2025-12-04T08:54:18.6400589Z * [new branch] gh/rec/175/head -> origin/gh/rec/175/head 2025-12-04T08:54:18.6400652Z * [new branch] gh/rec/175/orig -> origin/gh/rec/175/orig 2025-12-04T08:54:18.6400718Z * [new branch] gh/rec/176/base -> origin/gh/rec/176/base 2025-12-04T08:54:18.6400783Z * [new branch] gh/rec/176/head -> origin/gh/rec/176/head 2025-12-04T08:54:18.6400844Z * [new branch] gh/rec/176/orig -> origin/gh/rec/176/orig 2025-12-04T08:54:18.6400909Z * [new branch] gh/rec/177/base -> origin/gh/rec/177/base 2025-12-04T08:54:18.6400971Z * [new branch] gh/rec/177/head -> origin/gh/rec/177/head 2025-12-04T08:54:18.6401035Z * [new branch] gh/rec/177/orig -> origin/gh/rec/177/orig 2025-12-04T08:54:18.6401124Z * [new branch] gh/robert-hardwick/3/base -> origin/gh/robert-hardwick/3/base 2025-12-04T08:54:18.6401209Z * [new branch] gh/robert-hardwick/3/head -> origin/gh/robert-hardwick/3/head 2025-12-04T08:54:18.6401291Z * [new branch] gh/robert-hardwick/3/orig -> origin/gh/robert-hardwick/3/orig 2025-12-04T08:54:18.6401373Z * [new branch] gh/robert-hardwick/4/base -> origin/gh/robert-hardwick/4/base 2025-12-04T08:54:18.6401473Z * [new branch] gh/robert-hardwick/4/head -> origin/gh/robert-hardwick/4/head 2025-12-04T08:54:18.6401553Z * [new branch] gh/robert-hardwick/4/orig -> origin/gh/robert-hardwick/4/orig 2025-12-04T08:54:18.6401636Z * [new branch] gh/robert-hardwick/5/base -> origin/gh/robert-hardwick/5/base 2025-12-04T08:54:18.6401719Z * [new branch] gh/robert-hardwick/5/head -> origin/gh/robert-hardwick/5/head 2025-12-04T08:54:18.6401800Z * [new branch] gh/robert-hardwick/5/orig -> origin/gh/robert-hardwick/5/orig 2025-12-04T08:54:18.6401882Z * [new branch] gh/robert-hardwick/6/base -> origin/gh/robert-hardwick/6/base 2025-12-04T08:54:18.6401963Z * [new branch] gh/robert-hardwick/6/head -> origin/gh/robert-hardwick/6/head 2025-12-04T08:54:18.6402044Z * [new branch] gh/robert-hardwick/6/orig -> origin/gh/robert-hardwick/6/orig 2025-12-04T08:54:18.6402127Z * [new branch] gh/robert-hardwick/7/base -> origin/gh/robert-hardwick/7/base 2025-12-04T08:54:18.6402210Z * [new branch] gh/robert-hardwick/7/head -> origin/gh/robert-hardwick/7/head 2025-12-04T08:54:18.6402292Z * [new branch] gh/robert-hardwick/7/orig -> origin/gh/robert-hardwick/7/orig 2025-12-04T08:54:18.6402373Z * [new branch] gh/robert-hardwick/8/base -> origin/gh/robert-hardwick/8/base 2025-12-04T08:54:18.6402478Z * [new branch] gh/robert-hardwick/8/head -> origin/gh/robert-hardwick/8/head 2025-12-04T08:54:18.6402560Z * [new branch] gh/robert-hardwick/8/orig -> origin/gh/robert-hardwick/8/orig 2025-12-04T08:54:18.6402641Z * [new branch] gh/robert-hardwick/9/base -> origin/gh/robert-hardwick/9/base 2025-12-04T08:54:18.6402722Z * [new branch] gh/robert-hardwick/9/head -> origin/gh/robert-hardwick/9/head 2025-12-04T08:54:18.6402804Z * [new branch] gh/robert-hardwick/9/orig -> origin/gh/robert-hardwick/9/orig 2025-12-04T08:54:18.6402877Z * [new branch] gh/rtimpe/1/base -> origin/gh/rtimpe/1/base 2025-12-04T08:54:18.6402946Z * [new branch] gh/rtimpe/1/head -> origin/gh/rtimpe/1/head 2025-12-04T08:54:18.6403015Z * [new branch] gh/rtimpe/2/base -> origin/gh/rtimpe/2/base 2025-12-04T08:54:18.6403082Z * [new branch] gh/rtimpe/2/head -> origin/gh/rtimpe/2/head 2025-12-04T08:54:18.6403152Z * [new branch] gh/rtimpe/22/base -> origin/gh/rtimpe/22/base 2025-12-04T08:54:18.6403221Z * [new branch] gh/rtimpe/22/head -> origin/gh/rtimpe/22/head 2025-12-04T08:54:18.6403288Z * [new branch] gh/rtimpe/22/orig -> origin/gh/rtimpe/22/orig 2025-12-04T08:54:18.6403356Z * [new branch] gh/rtimpe/23/base -> origin/gh/rtimpe/23/base 2025-12-04T08:54:18.6403424Z * [new branch] gh/rtimpe/23/head -> origin/gh/rtimpe/23/head 2025-12-04T08:54:18.6403491Z * [new branch] gh/rtimpe/23/orig -> origin/gh/rtimpe/23/orig 2025-12-04T08:54:18.6403557Z * [new branch] gh/rtimpe/24/base -> origin/gh/rtimpe/24/base 2025-12-04T08:54:18.6403624Z * [new branch] gh/rtimpe/24/head -> origin/gh/rtimpe/24/head 2025-12-04T08:54:18.6403689Z * [new branch] gh/rtimpe/24/orig -> origin/gh/rtimpe/24/orig 2025-12-04T08:54:18.6403759Z * [new branch] gh/rtimpe/25/base -> origin/gh/rtimpe/25/base 2025-12-04T08:54:18.6403825Z * [new branch] gh/rtimpe/25/head -> origin/gh/rtimpe/25/head 2025-12-04T08:54:18.6403891Z * [new branch] gh/rtimpe/25/orig -> origin/gh/rtimpe/25/orig 2025-12-04T08:54:18.6403958Z * [new branch] gh/rtimpe/26/base -> origin/gh/rtimpe/26/base 2025-12-04T08:54:18.6404023Z * [new branch] gh/rtimpe/26/head -> origin/gh/rtimpe/26/head 2025-12-04T08:54:18.6404109Z * [new branch] gh/rtimpe/26/orig -> origin/gh/rtimpe/26/orig 2025-12-04T08:54:18.6404176Z * [new branch] gh/rtimpe/27/base -> origin/gh/rtimpe/27/base 2025-12-04T08:54:18.6404242Z * [new branch] gh/rtimpe/27/head -> origin/gh/rtimpe/27/head 2025-12-04T08:54:18.6404307Z * [new branch] gh/rtimpe/27/orig -> origin/gh/rtimpe/27/orig 2025-12-04T08:54:18.6404375Z * [new branch] gh/rtimpe/28/base -> origin/gh/rtimpe/28/base 2025-12-04T08:54:18.6404441Z * [new branch] gh/rtimpe/28/head -> origin/gh/rtimpe/28/head 2025-12-04T08:54:18.6404507Z * [new branch] gh/rtimpe/28/orig -> origin/gh/rtimpe/28/orig 2025-12-04T08:54:18.6404575Z * [new branch] gh/rtimpe/29/base -> origin/gh/rtimpe/29/base 2025-12-04T08:54:18.6404641Z * [new branch] gh/rtimpe/29/head -> origin/gh/rtimpe/29/head 2025-12-04T08:54:18.6404707Z * [new branch] gh/rtimpe/29/orig -> origin/gh/rtimpe/29/orig 2025-12-04T08:54:18.6404777Z * [new branch] gh/rtimpe/3/base -> origin/gh/rtimpe/3/base 2025-12-04T08:54:18.6404842Z * [new branch] gh/rtimpe/3/head -> origin/gh/rtimpe/3/head 2025-12-04T08:54:18.6404908Z * [new branch] gh/rtimpe/30/base -> origin/gh/rtimpe/30/base 2025-12-04T08:54:18.6404993Z * [new branch] gh/rtimpe/30/head -> origin/gh/rtimpe/30/head 2025-12-04T08:54:18.6405059Z * [new branch] gh/rtimpe/30/orig -> origin/gh/rtimpe/30/orig 2025-12-04T08:54:18.6405125Z * [new branch] gh/rtimpe/31/base -> origin/gh/rtimpe/31/base 2025-12-04T08:54:18.6405192Z * [new branch] gh/rtimpe/31/head -> origin/gh/rtimpe/31/head 2025-12-04T08:54:18.6405259Z * [new branch] gh/rtimpe/31/orig -> origin/gh/rtimpe/31/orig 2025-12-04T08:54:18.6405325Z * [new branch] gh/rtimpe/32/base -> origin/gh/rtimpe/32/base 2025-12-04T08:54:18.6405394Z * [new branch] gh/rtimpe/32/head -> origin/gh/rtimpe/32/head 2025-12-04T08:54:18.6405460Z * [new branch] gh/rtimpe/32/orig -> origin/gh/rtimpe/32/orig 2025-12-04T08:54:18.6405527Z * [new branch] gh/rtimpe/33/base -> origin/gh/rtimpe/33/base 2025-12-04T08:54:18.6405594Z * [new branch] gh/rtimpe/33/head -> origin/gh/rtimpe/33/head 2025-12-04T08:54:18.6405660Z * [new branch] gh/rtimpe/33/orig -> origin/gh/rtimpe/33/orig 2025-12-04T08:54:18.6405728Z * [new branch] gh/rtimpe/34/base -> origin/gh/rtimpe/34/base 2025-12-04T08:54:18.6405793Z * [new branch] gh/rtimpe/34/head -> origin/gh/rtimpe/34/head 2025-12-04T08:54:18.6405859Z * [new branch] gh/rtimpe/34/orig -> origin/gh/rtimpe/34/orig 2025-12-04T08:54:18.6405926Z * [new branch] gh/rtimpe/35/base -> origin/gh/rtimpe/35/base 2025-12-04T08:54:18.6405994Z * [new branch] gh/rtimpe/35/head -> origin/gh/rtimpe/35/head 2025-12-04T08:54:18.6406059Z * [new branch] gh/rtimpe/35/orig -> origin/gh/rtimpe/35/orig 2025-12-04T08:54:18.6406127Z * [new branch] gh/rtimpe/4/base -> origin/gh/rtimpe/4/base 2025-12-04T08:54:18.6406194Z * [new branch] gh/rtimpe/4/head -> origin/gh/rtimpe/4/head 2025-12-04T08:54:18.6406275Z * [new branch] gh/ruisizhang123/1/base -> origin/gh/ruisizhang123/1/base 2025-12-04T08:54:18.6406356Z * [new branch] gh/ruisizhang123/1/head -> origin/gh/ruisizhang123/1/head 2025-12-04T08:54:18.6406432Z * [new branch] gh/ruisizhang123/1/orig -> origin/gh/ruisizhang123/1/orig 2025-12-04T08:54:18.6406509Z * [new branch] gh/ruisizhang123/4/base -> origin/gh/ruisizhang123/4/base 2025-12-04T08:54:18.6406586Z * [new branch] gh/ruisizhang123/4/head -> origin/gh/ruisizhang123/4/head 2025-12-04T08:54:18.6406685Z * [new branch] gh/ruisizhang123/4/orig -> origin/gh/ruisizhang123/4/orig 2025-12-04T08:54:18.6406760Z * [new branch] gh/ruisizhang123/5/base -> origin/gh/ruisizhang123/5/base 2025-12-04T08:54:18.6406835Z * [new branch] gh/ruisizhang123/5/head -> origin/gh/ruisizhang123/5/head 2025-12-04T08:54:18.6406913Z * [new branch] gh/ruisizhang123/5/orig -> origin/gh/ruisizhang123/5/orig 2025-12-04T08:54:18.6406990Z * [new branch] gh/ruisizhang123/6/base -> origin/gh/ruisizhang123/6/base 2025-12-04T08:54:18.6407064Z * [new branch] gh/ruisizhang123/6/head -> origin/gh/ruisizhang123/6/head 2025-12-04T08:54:18.6407138Z * [new branch] gh/ruisizhang123/6/orig -> origin/gh/ruisizhang123/6/orig 2025-12-04T08:54:18.6407214Z * [new branch] gh/ruisizhang123/7/base -> origin/gh/ruisizhang123/7/base 2025-12-04T08:54:18.6407290Z * [new branch] gh/ruisizhang123/7/head -> origin/gh/ruisizhang123/7/head 2025-12-04T08:54:18.6407364Z * [new branch] gh/ruisizhang123/7/orig -> origin/gh/ruisizhang123/7/orig 2025-12-04T08:54:18.6407440Z * [new branch] gh/ruisizhang123/8/base -> origin/gh/ruisizhang123/8/base 2025-12-04T08:54:18.6407515Z * [new branch] gh/ruisizhang123/8/head -> origin/gh/ruisizhang123/8/head 2025-12-04T08:54:18.6407609Z * [new branch] gh/ruisizhang123/8/orig -> origin/gh/ruisizhang123/8/orig 2025-12-04T08:54:18.6407685Z * [new branch] gh/ruisizhang123/9/base -> origin/gh/ruisizhang123/9/base 2025-12-04T08:54:18.6407760Z * [new branch] gh/ruisizhang123/9/head -> origin/gh/ruisizhang123/9/head 2025-12-04T08:54:18.6407835Z * [new branch] gh/ruisizhang123/9/orig -> origin/gh/ruisizhang123/9/orig 2025-12-04T08:54:18.6407914Z * [new branch] gh/seemethere/52/base -> origin/gh/seemethere/52/base 2025-12-04T08:54:18.6407989Z * [new branch] gh/seemethere/52/head -> origin/gh/seemethere/52/head 2025-12-04T08:54:18.6408102Z * [new branch] gh/seemethere/52/orig -> origin/gh/seemethere/52/orig 2025-12-04T08:54:18.6408177Z * [new branch] gh/seemethere/53/base -> origin/gh/seemethere/53/base 2025-12-04T08:54:18.6408252Z * [new branch] gh/seemethere/53/head -> origin/gh/seemethere/53/head 2025-12-04T08:54:18.6408324Z * [new branch] gh/seemethere/53/orig -> origin/gh/seemethere/53/orig 2025-12-04T08:54:18.6408398Z * [new branch] gh/seemethere/54/base -> origin/gh/seemethere/54/base 2025-12-04T08:54:18.6408471Z * [new branch] gh/seemethere/54/head -> origin/gh/seemethere/54/head 2025-12-04T08:54:18.6408543Z * [new branch] gh/seemethere/54/orig -> origin/gh/seemethere/54/orig 2025-12-04T08:54:18.6408616Z * [new branch] gh/seemethere/55/base -> origin/gh/seemethere/55/base 2025-12-04T08:54:18.6408689Z * [new branch] gh/seemethere/55/head -> origin/gh/seemethere/55/head 2025-12-04T08:54:18.6408763Z * [new branch] gh/seemethere/55/orig -> origin/gh/seemethere/55/orig 2025-12-04T08:54:18.6408835Z * [new branch] gh/seemethere/59/base -> origin/gh/seemethere/59/base 2025-12-04T08:54:18.6408909Z * [new branch] gh/seemethere/59/head -> origin/gh/seemethere/59/head 2025-12-04T08:54:18.6408981Z * [new branch] gh/seemethere/59/orig -> origin/gh/seemethere/59/orig 2025-12-04T08:54:18.6409053Z * [new branch] gh/seemethere/62/base -> origin/gh/seemethere/62/base 2025-12-04T08:54:18.6409125Z * [new branch] gh/seemethere/62/head -> origin/gh/seemethere/62/head 2025-12-04T08:54:18.6409199Z * [new branch] gh/seemethere/62/orig -> origin/gh/seemethere/62/orig 2025-12-04T08:54:18.6409270Z * [new branch] gh/seemethere/63/base -> origin/gh/seemethere/63/base 2025-12-04T08:54:18.6409372Z * [new branch] gh/seemethere/63/head -> origin/gh/seemethere/63/head 2025-12-04T08:54:18.6409446Z * [new branch] gh/seemethere/63/orig -> origin/gh/seemethere/63/orig 2025-12-04T08:54:18.6409518Z * [new branch] gh/seemethere/71/base -> origin/gh/seemethere/71/base 2025-12-04T08:54:18.6409592Z * [new branch] gh/seemethere/71/head -> origin/gh/seemethere/71/head 2025-12-04T08:54:18.6409665Z * [new branch] gh/seemethere/71/orig -> origin/gh/seemethere/71/orig 2025-12-04T08:54:18.6409736Z * [new branch] gh/seemethere/72/base -> origin/gh/seemethere/72/base 2025-12-04T08:54:18.6409808Z * [new branch] gh/seemethere/72/head -> origin/gh/seemethere/72/head 2025-12-04T08:54:18.6409882Z * [new branch] gh/seemethere/72/orig -> origin/gh/seemethere/72/orig 2025-12-04T08:54:18.6409954Z * [new branch] gh/seemethere/73/base -> origin/gh/seemethere/73/base 2025-12-04T08:54:18.6410027Z * [new branch] gh/seemethere/73/head -> origin/gh/seemethere/73/head 2025-12-04T08:54:18.6410102Z * [new branch] gh/seemethere/73/orig -> origin/gh/seemethere/73/orig 2025-12-04T08:54:18.6410174Z * [new branch] gh/seemethere/74/base -> origin/gh/seemethere/74/base 2025-12-04T08:54:18.6410276Z * [new branch] gh/seemethere/74/head -> origin/gh/seemethere/74/head 2025-12-04T08:54:18.6410348Z * [new branch] gh/seemethere/74/orig -> origin/gh/seemethere/74/orig 2025-12-04T08:54:18.6410420Z * [new branch] gh/seemethere/75/base -> origin/gh/seemethere/75/base 2025-12-04T08:54:18.6410493Z * [new branch] gh/seemethere/75/head -> origin/gh/seemethere/75/head 2025-12-04T08:54:18.6410565Z * [new branch] gh/seemethere/75/orig -> origin/gh/seemethere/75/orig 2025-12-04T08:54:18.6410639Z * [new branch] gh/seemethere/76/base -> origin/gh/seemethere/76/base 2025-12-04T08:54:18.6410711Z * [new branch] gh/seemethere/76/head -> origin/gh/seemethere/76/head 2025-12-04T08:54:18.6410783Z * [new branch] gh/seemethere/76/orig -> origin/gh/seemethere/76/orig 2025-12-04T08:54:18.6410859Z * [new branch] gh/shunting314/145/base -> origin/gh/shunting314/145/base 2025-12-04T08:54:18.6410936Z * [new branch] gh/shunting314/145/head -> origin/gh/shunting314/145/head 2025-12-04T08:54:18.6411010Z * [new branch] gh/shunting314/145/orig -> origin/gh/shunting314/145/orig 2025-12-04T08:54:18.6411083Z * [new branch] gh/shunting314/176/base -> origin/gh/shunting314/176/base 2025-12-04T08:54:18.6411158Z * [new branch] gh/shunting314/176/head -> origin/gh/shunting314/176/head 2025-12-04T08:54:18.6411232Z * [new branch] gh/shunting314/176/orig -> origin/gh/shunting314/176/orig 2025-12-04T08:54:18.6411307Z * [new branch] gh/shunting314/249/base -> origin/gh/shunting314/249/base 2025-12-04T08:54:18.6411381Z * [new branch] gh/shunting314/249/head -> origin/gh/shunting314/249/head 2025-12-04T08:54:18.6411455Z * [new branch] gh/shunting314/249/orig -> origin/gh/shunting314/249/orig 2025-12-04T08:54:18.6411529Z * [new branch] gh/shunting314/253/base -> origin/gh/shunting314/253/base 2025-12-04T08:54:18.6411605Z * [new branch] gh/shunting314/253/head -> origin/gh/shunting314/253/head 2025-12-04T08:54:18.6411678Z * [new branch] gh/shunting314/253/orig -> origin/gh/shunting314/253/orig 2025-12-04T08:54:18.6411752Z * [new branch] gh/shunting314/256/base -> origin/gh/shunting314/256/base 2025-12-04T08:54:18.6411826Z * [new branch] gh/shunting314/256/head -> origin/gh/shunting314/256/head 2025-12-04T08:54:18.6411899Z * [new branch] gh/shunting314/256/orig -> origin/gh/shunting314/256/orig 2025-12-04T08:54:18.6411994Z * [new branch] gh/shunting314/257/base -> origin/gh/shunting314/257/base 2025-12-04T08:54:18.6412067Z * [new branch] gh/shunting314/257/head -> origin/gh/shunting314/257/head 2025-12-04T08:54:18.6412140Z * [new branch] gh/shunting314/257/orig -> origin/gh/shunting314/257/orig 2025-12-04T08:54:18.6412216Z * [new branch] gh/shunting314/258/base -> origin/gh/shunting314/258/base 2025-12-04T08:54:18.6412289Z * [new branch] gh/shunting314/258/head -> origin/gh/shunting314/258/head 2025-12-04T08:54:18.6412362Z * [new branch] gh/shunting314/258/orig -> origin/gh/shunting314/258/orig 2025-12-04T08:54:18.6412437Z * [new branch] gh/shunting314/259/base -> origin/gh/shunting314/259/base 2025-12-04T08:54:18.6412509Z * [new branch] gh/shunting314/259/head -> origin/gh/shunting314/259/head 2025-12-04T08:54:18.6412583Z * [new branch] gh/shunting314/259/orig -> origin/gh/shunting314/259/orig 2025-12-04T08:54:18.6412659Z * [new branch] gh/shunting314/260/base -> origin/gh/shunting314/260/base 2025-12-04T08:54:18.6412732Z * [new branch] gh/shunting314/260/head -> origin/gh/shunting314/260/head 2025-12-04T08:54:18.6412805Z * [new branch] gh/shunting314/260/orig -> origin/gh/shunting314/260/orig 2025-12-04T08:54:18.6412902Z * [new branch] gh/shunting314/261/base -> origin/gh/shunting314/261/base 2025-12-04T08:54:18.6412977Z * [new branch] gh/shunting314/261/head -> origin/gh/shunting314/261/head 2025-12-04T08:54:18.6413050Z * [new branch] gh/shunting314/261/orig -> origin/gh/shunting314/261/orig 2025-12-04T08:54:18.6413125Z * [new branch] gh/shunting314/262/base -> origin/gh/shunting314/262/base 2025-12-04T08:54:18.6413198Z * [new branch] gh/shunting314/262/head -> origin/gh/shunting314/262/head 2025-12-04T08:54:18.6413288Z * [new branch] gh/shunting314/262/orig -> origin/gh/shunting314/262/orig 2025-12-04T08:54:18.6413363Z * [new branch] gh/shunting314/263/base -> origin/gh/shunting314/263/base 2025-12-04T08:54:18.6413437Z * [new branch] gh/shunting314/263/head -> origin/gh/shunting314/263/head 2025-12-04T08:54:18.6413512Z * [new branch] gh/shunting314/263/orig -> origin/gh/shunting314/263/orig 2025-12-04T08:54:18.6413586Z * [new branch] gh/shunting314/264/base -> origin/gh/shunting314/264/base 2025-12-04T08:54:18.6413659Z * [new branch] gh/shunting314/264/head -> origin/gh/shunting314/264/head 2025-12-04T08:54:18.6413734Z * [new branch] gh/shunting314/264/orig -> origin/gh/shunting314/264/orig 2025-12-04T08:54:18.6413807Z * [new branch] gh/shunting314/265/base -> origin/gh/shunting314/265/base 2025-12-04T08:54:18.6413881Z * [new branch] gh/shunting314/265/head -> origin/gh/shunting314/265/head 2025-12-04T08:54:18.6413957Z * [new branch] gh/shunting314/265/orig -> origin/gh/shunting314/265/orig 2025-12-04T08:54:18.6414030Z * [new branch] gh/shunting314/266/base -> origin/gh/shunting314/266/base 2025-12-04T08:54:18.6414103Z * [new branch] gh/shunting314/266/head -> origin/gh/shunting314/266/head 2025-12-04T08:54:18.6414180Z * [new branch] gh/shunting314/266/orig -> origin/gh/shunting314/266/orig 2025-12-04T08:54:18.6414254Z * [new branch] gh/shunting314/267/base -> origin/gh/shunting314/267/base 2025-12-04T08:54:18.6414327Z * [new branch] gh/shunting314/267/head -> origin/gh/shunting314/267/head 2025-12-04T08:54:18.6414403Z * [new branch] gh/shunting314/267/orig -> origin/gh/shunting314/267/orig 2025-12-04T08:54:18.6414476Z * [new branch] gh/shunting314/268/base -> origin/gh/shunting314/268/base 2025-12-04T08:54:18.6414550Z * [new branch] gh/shunting314/268/head -> origin/gh/shunting314/268/head 2025-12-04T08:54:18.6414654Z * [new branch] gh/shunting314/268/orig -> origin/gh/shunting314/268/orig 2025-12-04T08:54:18.6414728Z * [new branch] gh/shunting314/269/base -> origin/gh/shunting314/269/base 2025-12-04T08:54:18.6414801Z * [new branch] gh/shunting314/269/head -> origin/gh/shunting314/269/head 2025-12-04T08:54:18.6414877Z * [new branch] gh/shunting314/269/orig -> origin/gh/shunting314/269/orig 2025-12-04T08:54:18.6414949Z * [new branch] gh/silverguo/1/base -> origin/gh/silverguo/1/base 2025-12-04T08:54:18.6415024Z * [new branch] gh/silverguo/1/head -> origin/gh/silverguo/1/head 2025-12-04T08:54:18.6415096Z * [new branch] gh/silverguo/2/base -> origin/gh/silverguo/2/base 2025-12-04T08:54:18.6415167Z * [new branch] gh/silverguo/2/head -> origin/gh/silverguo/2/head 2025-12-04T08:54:18.6415241Z * [new branch] gh/silverguo/3/base -> origin/gh/silverguo/3/base 2025-12-04T08:54:18.6415311Z * [new branch] gh/silverguo/3/head -> origin/gh/silverguo/3/head 2025-12-04T08:54:18.6415380Z * [new branch] gh/silverguo/4/base -> origin/gh/silverguo/4/base 2025-12-04T08:54:18.6415451Z * [new branch] gh/silverguo/4/head -> origin/gh/silverguo/4/head 2025-12-04T08:54:18.6415542Z * [new branch] gh/slayton58/39/base -> origin/gh/slayton58/39/base 2025-12-04T08:54:18.6415615Z * [new branch] gh/slayton58/39/head -> origin/gh/slayton58/39/head 2025-12-04T08:54:18.6415687Z * [new branch] gh/slayton58/39/orig -> origin/gh/slayton58/39/orig 2025-12-04T08:54:18.6415758Z * [new branch] gh/slayton58/42/base -> origin/gh/slayton58/42/base 2025-12-04T08:54:18.6415827Z * [new branch] gh/slayton58/42/head -> origin/gh/slayton58/42/head 2025-12-04T08:54:18.6415900Z * [new branch] gh/slayton58/42/orig -> origin/gh/slayton58/42/orig 2025-12-04T08:54:18.6415969Z * [new branch] gh/slayton58/43/base -> origin/gh/slayton58/43/base 2025-12-04T08:54:18.6416039Z * [new branch] gh/slayton58/43/head -> origin/gh/slayton58/43/head 2025-12-04T08:54:18.6416111Z * [new branch] gh/slayton58/43/orig -> origin/gh/slayton58/43/orig 2025-12-04T08:54:18.6416183Z * [new branch] gh/slayton58/44/base -> origin/gh/slayton58/44/base 2025-12-04T08:54:18.6416252Z * [new branch] gh/slayton58/44/head -> origin/gh/slayton58/44/head 2025-12-04T08:54:18.6416323Z * [new branch] gh/slayton58/44/orig -> origin/gh/slayton58/44/orig 2025-12-04T08:54:18.6416394Z * [new branch] gh/slayton58/45/base -> origin/gh/slayton58/45/base 2025-12-04T08:54:18.6416464Z * [new branch] gh/slayton58/45/head -> origin/gh/slayton58/45/head 2025-12-04T08:54:18.6416536Z * [new branch] gh/slayton58/45/orig -> origin/gh/slayton58/45/orig 2025-12-04T08:54:18.6416606Z * [new branch] gh/slayton58/46/base -> origin/gh/slayton58/46/base 2025-12-04T08:54:18.6416676Z * [new branch] gh/slayton58/46/head -> origin/gh/slayton58/46/head 2025-12-04T08:54:18.6416746Z * [new branch] gh/slayton58/46/orig -> origin/gh/slayton58/46/orig 2025-12-04T08:54:18.6416817Z * [new branch] gh/slayton58/6/base -> origin/gh/slayton58/6/base 2025-12-04T08:54:18.6416887Z * [new branch] gh/slayton58/6/head -> origin/gh/slayton58/6/head 2025-12-04T08:54:18.6416956Z * [new branch] gh/slayton58/7/base -> origin/gh/slayton58/7/base 2025-12-04T08:54:18.6417025Z * [new branch] gh/slayton58/7/head -> origin/gh/slayton58/7/head 2025-12-04T08:54:18.6417100Z * [new branch] gh/soulitzer/269/base -> origin/gh/soulitzer/269/base 2025-12-04T08:54:18.6417208Z * [new branch] gh/soulitzer/269/head -> origin/gh/soulitzer/269/head 2025-12-04T08:54:18.6417280Z * [new branch] gh/soulitzer/269/orig -> origin/gh/soulitzer/269/orig 2025-12-04T08:54:18.6417354Z * [new branch] gh/soulitzer/276/base -> origin/gh/soulitzer/276/base 2025-12-04T08:54:18.6417428Z * [new branch] gh/soulitzer/276/head -> origin/gh/soulitzer/276/head 2025-12-04T08:54:18.6417500Z * [new branch] gh/soulitzer/276/orig -> origin/gh/soulitzer/276/orig 2025-12-04T08:54:18.6417573Z * [new branch] gh/soulitzer/287/base -> origin/gh/soulitzer/287/base 2025-12-04T08:54:18.6417646Z * [new branch] gh/soulitzer/287/head -> origin/gh/soulitzer/287/head 2025-12-04T08:54:18.6417718Z * [new branch] gh/soulitzer/287/orig -> origin/gh/soulitzer/287/orig 2025-12-04T08:54:18.6417791Z * [new branch] gh/soulitzer/296/base -> origin/gh/soulitzer/296/base 2025-12-04T08:54:18.6417863Z * [new branch] gh/soulitzer/296/head -> origin/gh/soulitzer/296/head 2025-12-04T08:54:18.6417935Z * [new branch] gh/soulitzer/296/orig -> origin/gh/soulitzer/296/orig 2025-12-04T08:54:18.6418009Z * [new branch] gh/soulitzer/299/base -> origin/gh/soulitzer/299/base 2025-12-04T08:54:18.6418158Z * [new branch] gh/soulitzer/299/head -> origin/gh/soulitzer/299/head 2025-12-04T08:54:18.6418233Z * [new branch] gh/soulitzer/299/orig -> origin/gh/soulitzer/299/orig 2025-12-04T08:54:18.6418304Z * [new branch] gh/soulitzer/300/base -> origin/gh/soulitzer/300/base 2025-12-04T08:54:18.6418376Z * [new branch] gh/soulitzer/300/head -> origin/gh/soulitzer/300/head 2025-12-04T08:54:18.6418449Z * [new branch] gh/soulitzer/300/orig -> origin/gh/soulitzer/300/orig 2025-12-04T08:54:18.6418521Z * [new branch] gh/soulitzer/301/base -> origin/gh/soulitzer/301/base 2025-12-04T08:54:18.6418595Z * [new branch] gh/soulitzer/301/head -> origin/gh/soulitzer/301/head 2025-12-04T08:54:18.6418668Z * [new branch] gh/soulitzer/301/orig -> origin/gh/soulitzer/301/orig 2025-12-04T08:54:18.6418739Z * [new branch] gh/soulitzer/313/base -> origin/gh/soulitzer/313/base 2025-12-04T08:54:18.6418812Z * [new branch] gh/soulitzer/313/head -> origin/gh/soulitzer/313/head 2025-12-04T08:54:18.6418886Z * [new branch] gh/soulitzer/313/orig -> origin/gh/soulitzer/313/orig 2025-12-04T08:54:18.6418958Z * [new branch] gh/soulitzer/319/base -> origin/gh/soulitzer/319/base 2025-12-04T08:54:18.6419029Z * [new branch] gh/soulitzer/319/head -> origin/gh/soulitzer/319/head 2025-12-04T08:54:18.6419103Z * [new branch] gh/soulitzer/319/orig -> origin/gh/soulitzer/319/orig 2025-12-04T08:54:18.6419174Z * [new branch] gh/soulitzer/320/base -> origin/gh/soulitzer/320/base 2025-12-04T08:54:18.6419247Z * [new branch] gh/soulitzer/320/head -> origin/gh/soulitzer/320/head 2025-12-04T08:54:18.6419321Z * [new branch] gh/soulitzer/320/orig -> origin/gh/soulitzer/320/orig 2025-12-04T08:54:18.6419392Z * [new branch] gh/soulitzer/336/base -> origin/gh/soulitzer/336/base 2025-12-04T08:54:18.6419465Z * [new branch] gh/soulitzer/336/head -> origin/gh/soulitzer/336/head 2025-12-04T08:54:18.6419539Z * [new branch] gh/soulitzer/336/orig -> origin/gh/soulitzer/336/orig 2025-12-04T08:54:18.6419610Z * [new branch] gh/soulitzer/347/base -> origin/gh/soulitzer/347/base 2025-12-04T08:54:18.6419683Z * [new branch] gh/soulitzer/347/head -> origin/gh/soulitzer/347/head 2025-12-04T08:54:18.6419754Z * [new branch] gh/soulitzer/347/orig -> origin/gh/soulitzer/347/orig 2025-12-04T08:54:18.6419824Z * [new branch] gh/soulitzer/349/base -> origin/gh/soulitzer/349/base 2025-12-04T08:54:18.6419926Z * [new branch] gh/soulitzer/349/head -> origin/gh/soulitzer/349/head 2025-12-04T08:54:18.6419998Z * [new branch] gh/soulitzer/349/orig -> origin/gh/soulitzer/349/orig 2025-12-04T08:54:18.6420070Z * [new branch] gh/soulitzer/350/base -> origin/gh/soulitzer/350/base 2025-12-04T08:54:18.6420143Z * [new branch] gh/soulitzer/350/head -> origin/gh/soulitzer/350/head 2025-12-04T08:54:18.6420215Z * [new branch] gh/soulitzer/350/orig -> origin/gh/soulitzer/350/orig 2025-12-04T08:54:18.6420287Z * [new branch] gh/soulitzer/351/base -> origin/gh/soulitzer/351/base 2025-12-04T08:54:18.6420359Z * [new branch] gh/soulitzer/351/head -> origin/gh/soulitzer/351/head 2025-12-04T08:54:18.6420430Z * [new branch] gh/soulitzer/351/orig -> origin/gh/soulitzer/351/orig 2025-12-04T08:54:18.6420503Z * [new branch] gh/soulitzer/353/base -> origin/gh/soulitzer/353/base 2025-12-04T08:54:18.6420576Z * [new branch] gh/soulitzer/353/head -> origin/gh/soulitzer/353/head 2025-12-04T08:54:18.6420646Z * [new branch] gh/soulitzer/353/orig -> origin/gh/soulitzer/353/orig 2025-12-04T08:54:18.6420718Z * [new branch] gh/soulitzer/358/base -> origin/gh/soulitzer/358/base 2025-12-04T08:54:18.6420811Z * [new branch] gh/soulitzer/358/head -> origin/gh/soulitzer/358/head 2025-12-04T08:54:18.6420883Z * [new branch] gh/soulitzer/358/orig -> origin/gh/soulitzer/358/orig 2025-12-04T08:54:18.6420954Z * [new branch] gh/soulitzer/359/base -> origin/gh/soulitzer/359/base 2025-12-04T08:54:18.6421028Z * [new branch] gh/soulitzer/359/head -> origin/gh/soulitzer/359/head 2025-12-04T08:54:18.6421099Z * [new branch] gh/soulitzer/359/orig -> origin/gh/soulitzer/359/orig 2025-12-04T08:54:18.6421173Z * [new branch] gh/soulitzer/374/base -> origin/gh/soulitzer/374/base 2025-12-04T08:54:18.6421246Z * [new branch] gh/soulitzer/374/head -> origin/gh/soulitzer/374/head 2025-12-04T08:54:18.6421318Z * [new branch] gh/soulitzer/374/orig -> origin/gh/soulitzer/374/orig 2025-12-04T08:54:18.6421392Z * [new branch] gh/soulitzer/375/base -> origin/gh/soulitzer/375/base 2025-12-04T08:54:18.6421463Z * [new branch] gh/soulitzer/375/head -> origin/gh/soulitzer/375/head 2025-12-04T08:54:18.6421534Z * [new branch] gh/soulitzer/375/orig -> origin/gh/soulitzer/375/orig 2025-12-04T08:54:18.6421607Z * [new branch] gh/soulitzer/380/base -> origin/gh/soulitzer/380/base 2025-12-04T08:54:18.6421679Z * [new branch] gh/soulitzer/380/head -> origin/gh/soulitzer/380/head 2025-12-04T08:54:18.6421750Z * [new branch] gh/soulitzer/380/orig -> origin/gh/soulitzer/380/orig 2025-12-04T08:54:18.6421825Z * [new branch] gh/soulitzer/385/base -> origin/gh/soulitzer/385/base 2025-12-04T08:54:18.6421897Z * [new branch] gh/soulitzer/385/head -> origin/gh/soulitzer/385/head 2025-12-04T08:54:18.6421968Z * [new branch] gh/soulitzer/385/orig -> origin/gh/soulitzer/385/orig 2025-12-04T08:54:18.6422042Z * [new branch] gh/soulitzer/386/base -> origin/gh/soulitzer/386/base 2025-12-04T08:54:18.6422114Z * [new branch] gh/soulitzer/386/head -> origin/gh/soulitzer/386/head 2025-12-04T08:54:18.6422185Z * [new branch] gh/soulitzer/386/orig -> origin/gh/soulitzer/386/orig 2025-12-04T08:54:18.6422259Z * [new branch] gh/soulitzer/387/base -> origin/gh/soulitzer/387/base 2025-12-04T08:54:18.6422330Z * [new branch] gh/soulitzer/387/head -> origin/gh/soulitzer/387/head 2025-12-04T08:54:18.6422414Z * [new branch] gh/soulitzer/387/orig -> origin/gh/soulitzer/387/orig 2025-12-04T08:54:18.6422515Z * [new branch] gh/soulitzer/388/base -> origin/gh/soulitzer/388/base 2025-12-04T08:54:18.6422586Z * [new branch] gh/soulitzer/388/head -> origin/gh/soulitzer/388/head 2025-12-04T08:54:18.6422658Z * [new branch] gh/soulitzer/388/orig -> origin/gh/soulitzer/388/orig 2025-12-04T08:54:18.6422733Z * [new branch] gh/soulitzer/389/base -> origin/gh/soulitzer/389/base 2025-12-04T08:54:18.6422805Z * [new branch] gh/soulitzer/389/head -> origin/gh/soulitzer/389/head 2025-12-04T08:54:18.6422877Z * [new branch] gh/soulitzer/389/orig -> origin/gh/soulitzer/389/orig 2025-12-04T08:54:18.6422950Z * [new branch] gh/soulitzer/390/base -> origin/gh/soulitzer/390/base 2025-12-04T08:54:18.6423021Z * [new branch] gh/soulitzer/390/head -> origin/gh/soulitzer/390/head 2025-12-04T08:54:18.6423094Z * [new branch] gh/soulitzer/390/orig -> origin/gh/soulitzer/390/orig 2025-12-04T08:54:18.6423168Z * [new branch] gh/soulitzer/391/base -> origin/gh/soulitzer/391/base 2025-12-04T08:54:18.6423239Z * [new branch] gh/soulitzer/391/head -> origin/gh/soulitzer/391/head 2025-12-04T08:54:18.6423313Z * [new branch] gh/soulitzer/391/orig -> origin/gh/soulitzer/391/orig 2025-12-04T08:54:18.6423405Z * [new branch] gh/soulitzer/392/base -> origin/gh/soulitzer/392/base 2025-12-04T08:54:18.6423477Z * [new branch] gh/soulitzer/392/head -> origin/gh/soulitzer/392/head 2025-12-04T08:54:18.6423549Z * [new branch] gh/soulitzer/392/orig -> origin/gh/soulitzer/392/orig 2025-12-04T08:54:18.6423621Z * [new branch] gh/swolchok/728/next -> origin/gh/swolchok/728/next 2025-12-04T08:54:18.6423692Z * [new branch] gh/swolchok/819/base -> origin/gh/swolchok/819/base 2025-12-04T08:54:18.6423765Z * [new branch] gh/swolchok/819/head -> origin/gh/swolchok/819/head 2025-12-04T08:54:18.6423835Z * [new branch] gh/swolchok/819/orig -> origin/gh/swolchok/819/orig 2025-12-04T08:54:18.6423905Z * [new branch] gh/swolchok/824/base -> origin/gh/swolchok/824/base 2025-12-04T08:54:18.6423976Z * [new branch] gh/swolchok/824/head -> origin/gh/swolchok/824/head 2025-12-04T08:54:18.6424046Z * [new branch] gh/swolchok/824/orig -> origin/gh/swolchok/824/orig 2025-12-04T08:54:18.6424116Z * [new branch] gh/swolchok/829/base -> origin/gh/swolchok/829/base 2025-12-04T08:54:18.6424188Z * [new branch] gh/swolchok/829/head -> origin/gh/swolchok/829/head 2025-12-04T08:54:18.6424257Z * [new branch] gh/swolchok/829/orig -> origin/gh/swolchok/829/orig 2025-12-04T08:54:18.6424330Z * [new branch] gh/swolchok/839/base -> origin/gh/swolchok/839/base 2025-12-04T08:54:18.6424401Z * [new branch] gh/swolchok/839/head -> origin/gh/swolchok/839/head 2025-12-04T08:54:18.6424470Z * [new branch] gh/swolchok/839/orig -> origin/gh/swolchok/839/orig 2025-12-04T08:54:18.6424542Z * [new branch] gh/swolchok/841/base -> origin/gh/swolchok/841/base 2025-12-04T08:54:18.6424611Z * [new branch] gh/swolchok/841/head -> origin/gh/swolchok/841/head 2025-12-04T08:54:18.6424682Z * [new branch] gh/swolchok/841/orig -> origin/gh/swolchok/841/orig 2025-12-04T08:54:18.6424753Z * [new branch] gh/swolchok/842/base -> origin/gh/swolchok/842/base 2025-12-04T08:54:18.6424823Z * [new branch] gh/swolchok/842/head -> origin/gh/swolchok/842/head 2025-12-04T08:54:18.6424892Z * [new branch] gh/swolchok/842/orig -> origin/gh/swolchok/842/orig 2025-12-04T08:54:18.6424964Z * [new branch] gh/swolchok/845/base -> origin/gh/swolchok/845/base 2025-12-04T08:54:18.6425053Z * [new branch] gh/swolchok/845/head -> origin/gh/swolchok/845/head 2025-12-04T08:54:18.6425122Z * [new branch] gh/swolchok/845/orig -> origin/gh/swolchok/845/orig 2025-12-04T08:54:18.6425194Z * [new branch] gh/swolchok/848/base -> origin/gh/swolchok/848/base 2025-12-04T08:54:18.6425264Z * [new branch] gh/swolchok/848/head -> origin/gh/swolchok/848/head 2025-12-04T08:54:18.6425337Z * [new branch] gh/swolchok/848/orig -> origin/gh/swolchok/848/orig 2025-12-04T08:54:18.6425407Z * [new branch] gh/swolchok/856/base -> origin/gh/swolchok/856/base 2025-12-04T08:54:18.6425477Z * [new branch] gh/swolchok/856/head -> origin/gh/swolchok/856/head 2025-12-04T08:54:18.6425547Z * [new branch] gh/swolchok/856/orig -> origin/gh/swolchok/856/orig 2025-12-04T08:54:18.6425618Z * [new branch] gh/swolchok/860/base -> origin/gh/swolchok/860/base 2025-12-04T08:54:18.6425689Z * [new branch] gh/swolchok/860/head -> origin/gh/swolchok/860/head 2025-12-04T08:54:18.6425759Z * [new branch] gh/swolchok/860/orig -> origin/gh/swolchok/860/orig 2025-12-04T08:54:18.6425830Z * [new branch] gh/swolchok/861/base -> origin/gh/swolchok/861/base 2025-12-04T08:54:18.6425899Z * [new branch] gh/swolchok/861/head -> origin/gh/swolchok/861/head 2025-12-04T08:54:18.6425998Z * [new branch] gh/swolchok/861/orig -> origin/gh/swolchok/861/orig 2025-12-04T08:54:18.6426069Z * [new branch] gh/swolchok/862/base -> origin/gh/swolchok/862/base 2025-12-04T08:54:18.6426139Z * [new branch] gh/swolchok/862/head -> origin/gh/swolchok/862/head 2025-12-04T08:54:18.6426210Z * [new branch] gh/swolchok/862/orig -> origin/gh/swolchok/862/orig 2025-12-04T08:54:18.6426280Z * [new branch] gh/swolchok/863/base -> origin/gh/swolchok/863/base 2025-12-04T08:54:18.6426352Z * [new branch] gh/swolchok/863/head -> origin/gh/swolchok/863/head 2025-12-04T08:54:18.6426425Z * [new branch] gh/swolchok/863/orig -> origin/gh/swolchok/863/orig 2025-12-04T08:54:18.6426495Z * [new branch] gh/swolchok/864/base -> origin/gh/swolchok/864/base 2025-12-04T08:54:18.6426566Z * [new branch] gh/swolchok/864/head -> origin/gh/swolchok/864/head 2025-12-04T08:54:18.6426637Z * [new branch] gh/swolchok/864/orig -> origin/gh/swolchok/864/orig 2025-12-04T08:54:18.6426707Z * [new branch] gh/swolchok/865/base -> origin/gh/swolchok/865/base 2025-12-04T08:54:18.6426778Z * [new branch] gh/swolchok/865/head -> origin/gh/swolchok/865/head 2025-12-04T08:54:18.6426849Z * [new branch] gh/swolchok/865/orig -> origin/gh/swolchok/865/orig 2025-12-04T08:54:18.6426919Z * [new branch] gh/swolchok/866/base -> origin/gh/swolchok/866/base 2025-12-04T08:54:18.6426990Z * [new branch] gh/swolchok/866/head -> origin/gh/swolchok/866/head 2025-12-04T08:54:18.6427062Z * [new branch] gh/swolchok/866/orig -> origin/gh/swolchok/866/orig 2025-12-04T08:54:18.6427132Z * [new branch] gh/swolchok/867/base -> origin/gh/swolchok/867/base 2025-12-04T08:54:18.6427203Z * [new branch] gh/swolchok/867/head -> origin/gh/swolchok/867/head 2025-12-04T08:54:18.6427274Z * [new branch] gh/swolchok/867/orig -> origin/gh/swolchok/867/orig 2025-12-04T08:54:18.6427344Z * [new branch] gh/swolchok/868/base -> origin/gh/swolchok/868/base 2025-12-04T08:54:18.6427415Z * [new branch] gh/swolchok/868/head -> origin/gh/swolchok/868/head 2025-12-04T08:54:18.6427484Z * [new branch] gh/swolchok/868/orig -> origin/gh/swolchok/868/orig 2025-12-04T08:54:18.6427554Z * [new branch] gh/swolchok/869/base -> origin/gh/swolchok/869/base 2025-12-04T08:54:18.6427648Z * [new branch] gh/swolchok/869/head -> origin/gh/swolchok/869/head 2025-12-04T08:54:18.6427718Z * [new branch] gh/swolchok/869/orig -> origin/gh/swolchok/869/orig 2025-12-04T08:54:18.6427788Z * [new branch] gh/swolchok/870/base -> origin/gh/swolchok/870/base 2025-12-04T08:54:18.6427860Z * [new branch] gh/swolchok/870/head -> origin/gh/swolchok/870/head 2025-12-04T08:54:18.6427930Z * [new branch] gh/swolchok/870/orig -> origin/gh/swolchok/870/orig 2025-12-04T08:54:18.6428000Z * [new branch] gh/swolchok/871/base -> origin/gh/swolchok/871/base 2025-12-04T08:54:18.6428122Z * [new branch] gh/swolchok/871/head -> origin/gh/swolchok/871/head 2025-12-04T08:54:18.6428195Z * [new branch] gh/swolchok/871/orig -> origin/gh/swolchok/871/orig 2025-12-04T08:54:18.6428267Z * [new branch] gh/teja-rao/4/base -> origin/gh/teja-rao/4/base 2025-12-04T08:54:18.6428343Z * [new branch] gh/teja-rao/4/head -> origin/gh/teja-rao/4/head 2025-12-04T08:54:18.6428413Z * [new branch] gh/teja-rao/4/orig -> origin/gh/teja-rao/4/orig 2025-12-04T08:54:18.6428483Z * [new branch] gh/tianyu-l/2/base -> origin/gh/tianyu-l/2/base 2025-12-04T08:54:18.6428710Z * [new branch] gh/tianyu-l/2/head -> origin/gh/tianyu-l/2/head 2025-12-04T08:54:18.6428778Z * [new branch] gh/tianyu-l/2/orig -> origin/gh/tianyu-l/2/orig 2025-12-04T08:54:18.6428853Z * [new branch] gh/tianyu-l/3/base -> origin/gh/tianyu-l/3/base 2025-12-04T08:54:18.6428921Z * [new branch] gh/tianyu-l/3/orig -> origin/gh/tianyu-l/3/orig 2025-12-04T08:54:18.6428989Z * [new branch] gh/tianyu-l/4/base -> origin/gh/tianyu-l/4/base 2025-12-04T08:54:18.6429056Z * [new branch] gh/tianyu-l/4/head -> origin/gh/tianyu-l/4/head 2025-12-04T08:54:18.6429128Z * [new branch] gh/tianyu-l/4/orig -> origin/gh/tianyu-l/4/orig 2025-12-04T08:54:18.6429216Z * [new branch] gh/tugsbayasgalan/10/base -> origin/gh/tugsbayasgalan/10/base 2025-12-04T08:54:18.6429303Z * [new branch] gh/tugsbayasgalan/10/head -> origin/gh/tugsbayasgalan/10/head 2025-12-04T08:54:18.6429387Z * [new branch] gh/tugsbayasgalan/10/orig -> origin/gh/tugsbayasgalan/10/orig 2025-12-04T08:54:18.6429470Z * [new branch] gh/tugsbayasgalan/13/base -> origin/gh/tugsbayasgalan/13/base 2025-12-04T08:54:18.6429554Z * [new branch] gh/tugsbayasgalan/13/head -> origin/gh/tugsbayasgalan/13/head 2025-12-04T08:54:18.6429635Z * [new branch] gh/tugsbayasgalan/13/orig -> origin/gh/tugsbayasgalan/13/orig 2025-12-04T08:54:18.6429718Z * [new branch] gh/tugsbayasgalan/17/base -> origin/gh/tugsbayasgalan/17/base 2025-12-04T08:54:18.6429803Z * [new branch] gh/tugsbayasgalan/17/head -> origin/gh/tugsbayasgalan/17/head 2025-12-04T08:54:18.6429885Z * [new branch] gh/tugsbayasgalan/17/orig -> origin/gh/tugsbayasgalan/17/orig 2025-12-04T08:54:18.6429968Z * [new branch] gh/tugsbayasgalan/2/base -> origin/gh/tugsbayasgalan/2/base 2025-12-04T08:54:18.6430052Z * [new branch] gh/tugsbayasgalan/2/head -> origin/gh/tugsbayasgalan/2/head 2025-12-04T08:54:18.6430137Z * [new branch] gh/tugsbayasgalan/2/orig -> origin/gh/tugsbayasgalan/2/orig 2025-12-04T08:54:18.6430222Z * [new branch] gh/tugsbayasgalan/28/base -> origin/gh/tugsbayasgalan/28/base 2025-12-04T08:54:18.6430307Z * [new branch] gh/tugsbayasgalan/28/head -> origin/gh/tugsbayasgalan/28/head 2025-12-04T08:54:18.6430389Z * [new branch] gh/tugsbayasgalan/28/orig -> origin/gh/tugsbayasgalan/28/orig 2025-12-04T08:54:18.6430470Z * [new branch] gh/tugsbayasgalan/32/base -> origin/gh/tugsbayasgalan/32/base 2025-12-04T08:54:18.6430585Z * [new branch] gh/tugsbayasgalan/32/head -> origin/gh/tugsbayasgalan/32/head 2025-12-04T08:54:18.6430666Z * [new branch] gh/tugsbayasgalan/32/orig -> origin/gh/tugsbayasgalan/32/orig 2025-12-04T08:54:18.6430749Z * [new branch] gh/tugsbayasgalan/35/base -> origin/gh/tugsbayasgalan/35/base 2025-12-04T08:54:18.6430832Z * [new branch] gh/tugsbayasgalan/35/head -> origin/gh/tugsbayasgalan/35/head 2025-12-04T08:54:18.6430913Z * [new branch] gh/tugsbayasgalan/35/orig -> origin/gh/tugsbayasgalan/35/orig 2025-12-04T08:54:18.6430997Z * [new branch] gh/tugsbayasgalan/36/base -> origin/gh/tugsbayasgalan/36/base 2025-12-04T08:54:18.6431078Z * [new branch] gh/tugsbayasgalan/36/head -> origin/gh/tugsbayasgalan/36/head 2025-12-04T08:54:18.6431159Z * [new branch] gh/tugsbayasgalan/36/orig -> origin/gh/tugsbayasgalan/36/orig 2025-12-04T08:54:18.6431244Z * [new branch] gh/tugsbayasgalan/37/base -> origin/gh/tugsbayasgalan/37/base 2025-12-04T08:54:18.6431326Z * [new branch] gh/tugsbayasgalan/37/head -> origin/gh/tugsbayasgalan/37/head 2025-12-04T08:54:18.6431407Z * [new branch] gh/tugsbayasgalan/37/orig -> origin/gh/tugsbayasgalan/37/orig 2025-12-04T08:54:18.6431508Z * [new branch] gh/tugsbayasgalan/43/base -> origin/gh/tugsbayasgalan/43/base 2025-12-04T08:54:18.6431590Z * [new branch] gh/tugsbayasgalan/43/head -> origin/gh/tugsbayasgalan/43/head 2025-12-04T08:54:18.6431671Z * [new branch] gh/tugsbayasgalan/43/orig -> origin/gh/tugsbayasgalan/43/orig 2025-12-04T08:54:18.6431755Z * [new branch] gh/tugsbayasgalan/48/base -> origin/gh/tugsbayasgalan/48/base 2025-12-04T08:54:18.6431837Z * [new branch] gh/tugsbayasgalan/48/head -> origin/gh/tugsbayasgalan/48/head 2025-12-04T08:54:18.6431918Z * [new branch] gh/tugsbayasgalan/48/orig -> origin/gh/tugsbayasgalan/48/orig 2025-12-04T08:54:18.6432003Z * [new branch] gh/tugsbayasgalan/51/base -> origin/gh/tugsbayasgalan/51/base 2025-12-04T08:54:18.6432085Z * [new branch] gh/tugsbayasgalan/51/head -> origin/gh/tugsbayasgalan/51/head 2025-12-04T08:54:18.6432166Z * [new branch] gh/tugsbayasgalan/51/orig -> origin/gh/tugsbayasgalan/51/orig 2025-12-04T08:54:18.6432250Z * [new branch] gh/tugsbayasgalan/52/base -> origin/gh/tugsbayasgalan/52/base 2025-12-04T08:54:18.6432332Z * [new branch] gh/tugsbayasgalan/52/head -> origin/gh/tugsbayasgalan/52/head 2025-12-04T08:54:18.6432415Z * [new branch] gh/tugsbayasgalan/52/orig -> origin/gh/tugsbayasgalan/52/orig 2025-12-04T08:54:18.6432496Z * [new branch] gh/tugsbayasgalan/53/base -> origin/gh/tugsbayasgalan/53/base 2025-12-04T08:54:18.6432578Z * [new branch] gh/tugsbayasgalan/53/head -> origin/gh/tugsbayasgalan/53/head 2025-12-04T08:54:18.6432662Z * [new branch] gh/tugsbayasgalan/53/orig -> origin/gh/tugsbayasgalan/53/orig 2025-12-04T08:54:18.6432744Z * [new branch] gh/tugsbayasgalan/55/base -> origin/gh/tugsbayasgalan/55/base 2025-12-04T08:54:18.6432825Z * [new branch] gh/tugsbayasgalan/55/head -> origin/gh/tugsbayasgalan/55/head 2025-12-04T08:54:18.6432909Z * [new branch] gh/tugsbayasgalan/55/orig -> origin/gh/tugsbayasgalan/55/orig 2025-12-04T08:54:18.6432991Z * [new branch] gh/tugsbayasgalan/59/base -> origin/gh/tugsbayasgalan/59/base 2025-12-04T08:54:18.6433072Z * [new branch] gh/tugsbayasgalan/59/head -> origin/gh/tugsbayasgalan/59/head 2025-12-04T08:54:18.6433156Z * [new branch] gh/tugsbayasgalan/59/orig -> origin/gh/tugsbayasgalan/59/orig 2025-12-04T08:54:18.6433237Z * [new branch] gh/tugsbayasgalan/6/base -> origin/gh/tugsbayasgalan/6/base 2025-12-04T08:54:18.6433336Z * [new branch] gh/tugsbayasgalan/6/head -> origin/gh/tugsbayasgalan/6/head 2025-12-04T08:54:18.6433419Z * [new branch] gh/tugsbayasgalan/6/orig -> origin/gh/tugsbayasgalan/6/orig 2025-12-04T08:54:18.6433501Z * [new branch] gh/tugsbayasgalan/60/base -> origin/gh/tugsbayasgalan/60/base 2025-12-04T08:54:18.6433583Z * [new branch] gh/tugsbayasgalan/60/head -> origin/gh/tugsbayasgalan/60/head 2025-12-04T08:54:18.6433670Z * [new branch] gh/tugsbayasgalan/60/orig -> origin/gh/tugsbayasgalan/60/orig 2025-12-04T08:54:18.6433751Z * [new branch] gh/tugsbayasgalan/61/base -> origin/gh/tugsbayasgalan/61/base 2025-12-04T08:54:18.6433833Z * [new branch] gh/tugsbayasgalan/61/head -> origin/gh/tugsbayasgalan/61/head 2025-12-04T08:54:18.6433914Z * [new branch] gh/tugsbayasgalan/61/orig -> origin/gh/tugsbayasgalan/61/orig 2025-12-04T08:54:18.6433996Z * [new branch] gh/tugsbayasgalan/63/base -> origin/gh/tugsbayasgalan/63/base 2025-12-04T08:54:18.6434079Z * [new branch] gh/tugsbayasgalan/63/head -> origin/gh/tugsbayasgalan/63/head 2025-12-04T08:54:18.6434161Z * [new branch] gh/tugsbayasgalan/63/orig -> origin/gh/tugsbayasgalan/63/orig 2025-12-04T08:54:18.6434242Z * [new branch] gh/tugsbayasgalan/67/base -> origin/gh/tugsbayasgalan/67/base 2025-12-04T08:54:18.6434347Z * [new branch] gh/tugsbayasgalan/67/head -> origin/gh/tugsbayasgalan/67/head 2025-12-04T08:54:18.6434429Z * [new branch] gh/tugsbayasgalan/67/orig -> origin/gh/tugsbayasgalan/67/orig 2025-12-04T08:54:18.6434510Z * [new branch] gh/tugsbayasgalan/68/base -> origin/gh/tugsbayasgalan/68/base 2025-12-04T08:54:18.6434593Z * [new branch] gh/tugsbayasgalan/68/head -> origin/gh/tugsbayasgalan/68/head 2025-12-04T08:54:18.6434675Z * [new branch] gh/tugsbayasgalan/68/orig -> origin/gh/tugsbayasgalan/68/orig 2025-12-04T08:54:18.6434757Z * [new branch] gh/tugsbayasgalan/7/base -> origin/gh/tugsbayasgalan/7/base 2025-12-04T08:54:18.6434840Z * [new branch] gh/tugsbayasgalan/7/head -> origin/gh/tugsbayasgalan/7/head 2025-12-04T08:54:18.6434919Z * [new branch] gh/tugsbayasgalan/7/orig -> origin/gh/tugsbayasgalan/7/orig 2025-12-04T08:54:18.6435003Z * [new branch] gh/tugsbayasgalan/70/base -> origin/gh/tugsbayasgalan/70/base 2025-12-04T08:54:18.6435086Z * [new branch] gh/tugsbayasgalan/70/head -> origin/gh/tugsbayasgalan/70/head 2025-12-04T08:54:18.6435167Z * [new branch] gh/tugsbayasgalan/70/orig -> origin/gh/tugsbayasgalan/70/orig 2025-12-04T08:54:18.6435249Z * [new branch] gh/tugsbayasgalan/71/base -> origin/gh/tugsbayasgalan/71/base 2025-12-04T08:54:18.6435332Z * [new branch] gh/tugsbayasgalan/71/head -> origin/gh/tugsbayasgalan/71/head 2025-12-04T08:54:18.6435413Z * [new branch] gh/tugsbayasgalan/71/orig -> origin/gh/tugsbayasgalan/71/orig 2025-12-04T08:54:18.6435496Z * [new branch] gh/tugsbayasgalan/72/base -> origin/gh/tugsbayasgalan/72/base 2025-12-04T08:54:18.6435577Z * [new branch] gh/tugsbayasgalan/72/head -> origin/gh/tugsbayasgalan/72/head 2025-12-04T08:54:18.6435659Z * [new branch] gh/tugsbayasgalan/72/orig -> origin/gh/tugsbayasgalan/72/orig 2025-12-04T08:54:18.6435743Z * [new branch] gh/tugsbayasgalan/73/base -> origin/gh/tugsbayasgalan/73/base 2025-12-04T08:54:18.6435824Z * [new branch] gh/tugsbayasgalan/73/head -> origin/gh/tugsbayasgalan/73/head 2025-12-04T08:54:18.6435906Z * [new branch] gh/tugsbayasgalan/73/orig -> origin/gh/tugsbayasgalan/73/orig 2025-12-04T08:54:18.6435989Z * [new branch] gh/tugsbayasgalan/74/base -> origin/gh/tugsbayasgalan/74/base 2025-12-04T08:54:18.6436070Z * [new branch] gh/tugsbayasgalan/74/head -> origin/gh/tugsbayasgalan/74/head 2025-12-04T08:54:18.6436176Z * [new branch] gh/tugsbayasgalan/74/orig -> origin/gh/tugsbayasgalan/74/orig 2025-12-04T08:54:18.6436258Z * [new branch] gh/tugsbayasgalan/75/base -> origin/gh/tugsbayasgalan/75/base 2025-12-04T08:54:18.6436339Z * [new branch] gh/tugsbayasgalan/75/head -> origin/gh/tugsbayasgalan/75/head 2025-12-04T08:54:18.6436423Z * [new branch] gh/tugsbayasgalan/75/orig -> origin/gh/tugsbayasgalan/75/orig 2025-12-04T08:54:18.6436506Z * [new branch] gh/tugsbayasgalan/76/base -> origin/gh/tugsbayasgalan/76/base 2025-12-04T08:54:18.6436587Z * [new branch] gh/tugsbayasgalan/76/head -> origin/gh/tugsbayasgalan/76/head 2025-12-04T08:54:18.6436670Z * [new branch] gh/tugsbayasgalan/76/orig -> origin/gh/tugsbayasgalan/76/orig 2025-12-04T08:54:18.6436753Z * [new branch] gh/tugsbayasgalan/77/base -> origin/gh/tugsbayasgalan/77/base 2025-12-04T08:54:18.6436836Z * [new branch] gh/tugsbayasgalan/77/head -> origin/gh/tugsbayasgalan/77/head 2025-12-04T08:54:18.6436919Z * [new branch] gh/tugsbayasgalan/77/orig -> origin/gh/tugsbayasgalan/77/orig 2025-12-04T08:54:18.6437001Z * [new branch] gh/tugsbayasgalan/78/base -> origin/gh/tugsbayasgalan/78/base 2025-12-04T08:54:18.6437082Z * [new branch] gh/tugsbayasgalan/78/head -> origin/gh/tugsbayasgalan/78/head 2025-12-04T08:54:18.6437193Z * [new branch] gh/tugsbayasgalan/78/orig -> origin/gh/tugsbayasgalan/78/orig 2025-12-04T08:54:18.6437275Z * [new branch] gh/tugsbayasgalan/79/base -> origin/gh/tugsbayasgalan/79/base 2025-12-04T08:54:18.6437357Z * [new branch] gh/tugsbayasgalan/79/head -> origin/gh/tugsbayasgalan/79/head 2025-12-04T08:54:18.6437439Z * [new branch] gh/tugsbayasgalan/79/orig -> origin/gh/tugsbayasgalan/79/orig 2025-12-04T08:54:18.6437519Z * [new branch] gh/tugsbayasgalan/8/base -> origin/gh/tugsbayasgalan/8/base 2025-12-04T08:54:18.6437601Z * [new branch] gh/tugsbayasgalan/8/head -> origin/gh/tugsbayasgalan/8/head 2025-12-04T08:54:18.6437681Z * [new branch] gh/tugsbayasgalan/8/orig -> origin/gh/tugsbayasgalan/8/orig 2025-12-04T08:54:18.6437763Z * [new branch] gh/tugsbayasgalan/80/base -> origin/gh/tugsbayasgalan/80/base 2025-12-04T08:54:18.6437846Z * [new branch] gh/tugsbayasgalan/80/head -> origin/gh/tugsbayasgalan/80/head 2025-12-04T08:54:18.6437931Z * [new branch] gh/tugsbayasgalan/80/orig -> origin/gh/tugsbayasgalan/80/orig 2025-12-04T08:54:18.6438011Z * [new branch] gh/tugsbayasgalan/81/base -> origin/gh/tugsbayasgalan/81/base 2025-12-04T08:54:18.6438136Z * [new branch] gh/tugsbayasgalan/81/head -> origin/gh/tugsbayasgalan/81/head 2025-12-04T08:54:18.6438221Z * [new branch] gh/tugsbayasgalan/81/orig -> origin/gh/tugsbayasgalan/81/orig 2025-12-04T08:54:18.6438303Z * [new branch] gh/tugsbayasgalan/82/base -> origin/gh/tugsbayasgalan/82/base 2025-12-04T08:54:18.6438385Z * [new branch] gh/tugsbayasgalan/82/head -> origin/gh/tugsbayasgalan/82/head 2025-12-04T08:54:18.6438468Z * [new branch] gh/tugsbayasgalan/82/orig -> origin/gh/tugsbayasgalan/82/orig 2025-12-04T08:54:18.6438551Z * [new branch] gh/tugsbayasgalan/83/base -> origin/gh/tugsbayasgalan/83/base 2025-12-04T08:54:18.6438635Z * [new branch] gh/tugsbayasgalan/83/head -> origin/gh/tugsbayasgalan/83/head 2025-12-04T08:54:18.6438717Z * [new branch] gh/tugsbayasgalan/83/orig -> origin/gh/tugsbayasgalan/83/orig 2025-12-04T08:54:18.6438798Z * [new branch] gh/tugsbayasgalan/84/base -> origin/gh/tugsbayasgalan/84/base 2025-12-04T08:54:18.6438880Z * [new branch] gh/tugsbayasgalan/84/head -> origin/gh/tugsbayasgalan/84/head 2025-12-04T08:54:18.6438962Z * [new branch] gh/tugsbayasgalan/84/orig -> origin/gh/tugsbayasgalan/84/orig 2025-12-04T08:54:18.6439068Z * [new branch] gh/tugsbayasgalan/85/base -> origin/gh/tugsbayasgalan/85/base 2025-12-04T08:54:18.6439151Z * [new branch] gh/tugsbayasgalan/85/head -> origin/gh/tugsbayasgalan/85/head 2025-12-04T08:54:18.6439232Z * [new branch] gh/tugsbayasgalan/85/orig -> origin/gh/tugsbayasgalan/85/orig 2025-12-04T08:54:18.6439315Z * [new branch] gh/tugsbayasgalan/86/base -> origin/gh/tugsbayasgalan/86/base 2025-12-04T08:54:18.6439397Z * [new branch] gh/tugsbayasgalan/86/head -> origin/gh/tugsbayasgalan/86/head 2025-12-04T08:54:18.6439479Z * [new branch] gh/tugsbayasgalan/86/orig -> origin/gh/tugsbayasgalan/86/orig 2025-12-04T08:54:18.6439560Z * [new branch] gh/tugsbayasgalan/87/base -> origin/gh/tugsbayasgalan/87/base 2025-12-04T08:54:18.6439644Z * [new branch] gh/tugsbayasgalan/87/head -> origin/gh/tugsbayasgalan/87/head 2025-12-04T08:54:18.6439727Z * [new branch] gh/tugsbayasgalan/87/orig -> origin/gh/tugsbayasgalan/87/orig 2025-12-04T08:54:18.6439809Z * [new branch] gh/tugsbayasgalan/88/base -> origin/gh/tugsbayasgalan/88/base 2025-12-04T08:54:18.6439891Z * [new branch] gh/tugsbayasgalan/88/head -> origin/gh/tugsbayasgalan/88/head 2025-12-04T08:54:18.6439998Z * [new branch] gh/tugsbayasgalan/88/orig -> origin/gh/tugsbayasgalan/88/orig 2025-12-04T08:54:18.6440082Z * [new branch] gh/tugsbayasgalan/89/base -> origin/gh/tugsbayasgalan/89/base 2025-12-04T08:54:18.6440164Z * [new branch] gh/tugsbayasgalan/89/head -> origin/gh/tugsbayasgalan/89/head 2025-12-04T08:54:18.6440245Z * [new branch] gh/tugsbayasgalan/89/orig -> origin/gh/tugsbayasgalan/89/orig 2025-12-04T08:54:18.6440327Z * [new branch] gh/tugsbayasgalan/9/base -> origin/gh/tugsbayasgalan/9/base 2025-12-04T08:54:18.6440407Z * [new branch] gh/tugsbayasgalan/9/head -> origin/gh/tugsbayasgalan/9/head 2025-12-04T08:54:18.6440488Z * [new branch] gh/tugsbayasgalan/9/orig -> origin/gh/tugsbayasgalan/9/orig 2025-12-04T08:54:18.6440573Z * [new branch] gh/tugsbayasgalan/90/base -> origin/gh/tugsbayasgalan/90/base 2025-12-04T08:54:18.6440654Z * [new branch] gh/tugsbayasgalan/90/head -> origin/gh/tugsbayasgalan/90/head 2025-12-04T08:54:18.6440738Z * [new branch] gh/tugsbayasgalan/90/orig -> origin/gh/tugsbayasgalan/90/orig 2025-12-04T08:54:18.6440820Z * [new branch] gh/tugsbayasgalan/91/base -> origin/gh/tugsbayasgalan/91/base 2025-12-04T08:54:18.6440901Z * [new branch] gh/tugsbayasgalan/91/head -> origin/gh/tugsbayasgalan/91/head 2025-12-04T08:54:18.6440981Z * [new branch] gh/tugsbayasgalan/91/orig -> origin/gh/tugsbayasgalan/91/orig 2025-12-04T08:54:18.6441062Z * [new branch] gh/tugsbayasgalan/92/base -> origin/gh/tugsbayasgalan/92/base 2025-12-04T08:54:18.6441145Z * [new branch] gh/tugsbayasgalan/92/head -> origin/gh/tugsbayasgalan/92/head 2025-12-04T08:54:18.6441226Z * [new branch] gh/tugsbayasgalan/92/orig -> origin/gh/tugsbayasgalan/92/orig 2025-12-04T08:54:18.6441310Z * [new branch] gh/tugsbayasgalan/93/base -> origin/gh/tugsbayasgalan/93/base 2025-12-04T08:54:18.6441393Z * [new branch] gh/tugsbayasgalan/93/head -> origin/gh/tugsbayasgalan/93/head 2025-12-04T08:54:18.6441474Z * [new branch] gh/tugsbayasgalan/93/orig -> origin/gh/tugsbayasgalan/93/orig 2025-12-04T08:54:18.6441542Z * [new branch] gh/v0i0/14/base -> origin/gh/v0i0/14/base 2025-12-04T08:54:18.6441606Z * [new branch] gh/v0i0/14/head -> origin/gh/v0i0/14/head 2025-12-04T08:54:18.6441672Z * [new branch] gh/v0i0/14/orig -> origin/gh/v0i0/14/orig 2025-12-04T08:54:18.6441736Z * [new branch] gh/v0i0/15/base -> origin/gh/v0i0/15/base 2025-12-04T08:54:18.6441822Z * [new branch] gh/v0i0/15/head -> origin/gh/v0i0/15/head 2025-12-04T08:54:18.6441886Z * [new branch] gh/v0i0/15/orig -> origin/gh/v0i0/15/orig 2025-12-04T08:54:18.6441949Z * [new branch] gh/v0i0/16/base -> origin/gh/v0i0/16/base 2025-12-04T08:54:18.6442012Z * [new branch] gh/v0i0/16/head -> origin/gh/v0i0/16/head 2025-12-04T08:54:18.6442076Z * [new branch] gh/v0i0/16/orig -> origin/gh/v0i0/16/orig 2025-12-04T08:54:18.6442138Z * [new branch] gh/v0i0/17/base -> origin/gh/v0i0/17/base 2025-12-04T08:54:18.6442201Z * [new branch] gh/v0i0/17/head -> origin/gh/v0i0/17/head 2025-12-04T08:54:18.6442265Z * [new branch] gh/v0i0/17/orig -> origin/gh/v0i0/17/orig 2025-12-04T08:54:18.6442326Z * [new branch] gh/v0i0/18/base -> origin/gh/v0i0/18/base 2025-12-04T08:54:18.6442390Z * [new branch] gh/v0i0/18/head -> origin/gh/v0i0/18/head 2025-12-04T08:54:18.6442453Z * [new branch] gh/v0i0/18/orig -> origin/gh/v0i0/18/orig 2025-12-04T08:54:18.6442515Z * [new branch] gh/v0i0/19/base -> origin/gh/v0i0/19/base 2025-12-04T08:54:18.6442597Z * [new branch] gh/v0i0/19/head -> origin/gh/v0i0/19/head 2025-12-04T08:54:18.6442662Z * [new branch] gh/v0i0/19/orig -> origin/gh/v0i0/19/orig 2025-12-04T08:54:18.6442743Z * [new branch] gh/vishal9-team/1/base -> origin/gh/vishal9-team/1/base 2025-12-04T08:54:18.6442820Z * [new branch] gh/vishal9-team/1/head -> origin/gh/vishal9-team/1/head 2025-12-04T08:54:18.6442896Z * [new branch] gh/vishal9-team/2/base -> origin/gh/vishal9-team/2/base 2025-12-04T08:54:18.6442969Z * [new branch] gh/vishal9-team/2/head -> origin/gh/vishal9-team/2/head 2025-12-04T08:54:18.6443044Z * [new branch] gh/vishal9-team/2/orig -> origin/gh/vishal9-team/2/orig 2025-12-04T08:54:18.6443119Z * [new branch] gh/vishal9-team/3/base -> origin/gh/vishal9-team/3/base 2025-12-04T08:54:18.6443194Z * [new branch] gh/vishal9-team/3/head -> origin/gh/vishal9-team/3/head 2025-12-04T08:54:18.6443270Z * [new branch] gh/vishal9-team/3/orig -> origin/gh/vishal9-team/3/orig 2025-12-04T08:54:18.6443344Z * [new branch] gh/vishal9-team/4/base -> origin/gh/vishal9-team/4/base 2025-12-04T08:54:18.6443417Z * [new branch] gh/vishal9-team/4/head -> origin/gh/vishal9-team/4/head 2025-12-04T08:54:18.6443491Z * [new branch] gh/vishal9-team/4/orig -> origin/gh/vishal9-team/4/orig 2025-12-04T08:54:18.6443558Z * [new branch] gh/vkuzo/1/next -> origin/gh/vkuzo/1/next 2025-12-04T08:54:18.6443625Z * [new branch] gh/vkuzo/2/next -> origin/gh/vkuzo/2/next 2025-12-04T08:54:18.6443692Z * [new branch] gh/vkuzo/3/next -> origin/gh/vkuzo/3/next 2025-12-04T08:54:18.6443766Z * [new branch] gh/wconstab/424/base -> origin/gh/wconstab/424/base 2025-12-04T08:54:18.6443838Z * [new branch] gh/wconstab/424/head -> origin/gh/wconstab/424/head 2025-12-04T08:54:18.6443914Z * [new branch] gh/wconstab/424/orig -> origin/gh/wconstab/424/orig 2025-12-04T08:54:18.6443985Z * [new branch] gh/wconstab/435/base -> origin/gh/wconstab/435/base 2025-12-04T08:54:18.6444056Z * [new branch] gh/wconstab/435/head -> origin/gh/wconstab/435/head 2025-12-04T08:54:18.6444128Z * [new branch] gh/wconstab/435/orig -> origin/gh/wconstab/435/orig 2025-12-04T08:54:18.6444198Z * [new branch] gh/wconstab/444/base -> origin/gh/wconstab/444/base 2025-12-04T08:54:18.6444268Z * [new branch] gh/wconstab/444/head -> origin/gh/wconstab/444/head 2025-12-04T08:54:18.6444362Z * [new branch] gh/wconstab/444/orig -> origin/gh/wconstab/444/orig 2025-12-04T08:54:18.6444432Z * [new branch] gh/wconstab/447/base -> origin/gh/wconstab/447/base 2025-12-04T08:54:18.6444502Z * [new branch] gh/wconstab/447/head -> origin/gh/wconstab/447/head 2025-12-04T08:54:18.6444576Z * [new branch] gh/wconstab/447/orig -> origin/gh/wconstab/447/orig 2025-12-04T08:54:18.6444646Z * [new branch] gh/wconstab/448/base -> origin/gh/wconstab/448/base 2025-12-04T08:54:18.6444717Z * [new branch] gh/wconstab/448/head -> origin/gh/wconstab/448/head 2025-12-04T08:54:18.6444789Z * [new branch] gh/wconstab/448/orig -> origin/gh/wconstab/448/orig 2025-12-04T08:54:18.6444859Z * [new branch] gh/wconstab/449/base -> origin/gh/wconstab/449/base 2025-12-04T08:54:18.6444931Z * [new branch] gh/wconstab/449/head -> origin/gh/wconstab/449/head 2025-12-04T08:54:18.6445003Z * [new branch] gh/wconstab/449/orig -> origin/gh/wconstab/449/orig 2025-12-04T08:54:18.6445073Z * [new branch] gh/wconstab/450/base -> origin/gh/wconstab/450/base 2025-12-04T08:54:18.6445146Z * [new branch] gh/wconstab/450/head -> origin/gh/wconstab/450/head 2025-12-04T08:54:18.6445239Z * [new branch] gh/wconstab/450/orig -> origin/gh/wconstab/450/orig 2025-12-04T08:54:18.6445309Z * [new branch] gh/wconstab/451/base -> origin/gh/wconstab/451/base 2025-12-04T08:54:18.6445381Z * [new branch] gh/wconstab/451/head -> origin/gh/wconstab/451/head 2025-12-04T08:54:18.6445450Z * [new branch] gh/wconstab/451/orig -> origin/gh/wconstab/451/orig 2025-12-04T08:54:18.6445520Z * [new branch] gh/wconstab/452/base -> origin/gh/wconstab/452/base 2025-12-04T08:54:18.6445593Z * [new branch] gh/wconstab/452/head -> origin/gh/wconstab/452/head 2025-12-04T08:54:18.6445665Z * [new branch] gh/wconstab/452/orig -> origin/gh/wconstab/452/orig 2025-12-04T08:54:18.6445735Z * [new branch] gh/wconstab/453/base -> origin/gh/wconstab/453/base 2025-12-04T08:54:18.6445807Z * [new branch] gh/wconstab/453/head -> origin/gh/wconstab/453/head 2025-12-04T08:54:18.6445878Z * [new branch] gh/wconstab/453/orig -> origin/gh/wconstab/453/orig 2025-12-04T08:54:18.6445947Z * [new branch] gh/wconstab/454/base -> origin/gh/wconstab/454/base 2025-12-04T08:54:18.6446020Z * [new branch] gh/wconstab/454/head -> origin/gh/wconstab/454/head 2025-12-04T08:54:18.6446090Z * [new branch] gh/wconstab/454/orig -> origin/gh/wconstab/454/orig 2025-12-04T08:54:18.6446160Z * [new branch] gh/wconstab/455/base -> origin/gh/wconstab/455/base 2025-12-04T08:54:18.6446231Z * [new branch] gh/wconstab/455/head -> origin/gh/wconstab/455/head 2025-12-04T08:54:18.6446304Z * [new branch] gh/wconstab/455/orig -> origin/gh/wconstab/455/orig 2025-12-04T08:54:18.6446374Z * [new branch] gh/wconstab/456/base -> origin/gh/wconstab/456/base 2025-12-04T08:54:18.6446446Z * [new branch] gh/wconstab/456/head -> origin/gh/wconstab/456/head 2025-12-04T08:54:18.6446516Z * [new branch] gh/wconstab/456/orig -> origin/gh/wconstab/456/orig 2025-12-04T08:54:18.6446589Z * [new branch] gh/wconstab/457/base -> origin/gh/wconstab/457/base 2025-12-04T08:54:18.6446659Z * [new branch] gh/wconstab/457/head -> origin/gh/wconstab/457/head 2025-12-04T08:54:18.6446730Z * [new branch] gh/wconstab/457/orig -> origin/gh/wconstab/457/orig 2025-12-04T08:54:18.6446801Z * [new branch] gh/wconstab/458/base -> origin/gh/wconstab/458/base 2025-12-04T08:54:18.6446897Z * [new branch] gh/wconstab/458/head -> origin/gh/wconstab/458/head 2025-12-04T08:54:18.6446968Z * [new branch] gh/wconstab/458/orig -> origin/gh/wconstab/458/orig 2025-12-04T08:54:18.6447040Z * [new branch] gh/wconstab/459/base -> origin/gh/wconstab/459/base 2025-12-04T08:54:18.6447110Z * [new branch] gh/wconstab/459/head -> origin/gh/wconstab/459/head 2025-12-04T08:54:18.6447182Z * [new branch] gh/wconstab/459/orig -> origin/gh/wconstab/459/orig 2025-12-04T08:54:18.6447254Z * [new branch] gh/wconstab/460/base -> origin/gh/wconstab/460/base 2025-12-04T08:54:18.6447324Z * [new branch] gh/wconstab/460/head -> origin/gh/wconstab/460/head 2025-12-04T08:54:18.6447393Z * [new branch] gh/wconstab/460/orig -> origin/gh/wconstab/460/orig 2025-12-04T08:54:18.6447465Z * [new branch] gh/wconstab/461/base -> origin/gh/wconstab/461/base 2025-12-04T08:54:18.6447536Z * [new branch] gh/wconstab/461/head -> origin/gh/wconstab/461/head 2025-12-04T08:54:18.6447606Z * [new branch] gh/wconstab/461/orig -> origin/gh/wconstab/461/orig 2025-12-04T08:54:18.6447678Z * [new branch] gh/wconstab/462/base -> origin/gh/wconstab/462/base 2025-12-04T08:54:18.6447749Z * [new branch] gh/wconstab/462/head -> origin/gh/wconstab/462/head 2025-12-04T08:54:18.6447838Z * [new branch] gh/wconstab/462/orig -> origin/gh/wconstab/462/orig 2025-12-04T08:54:18.6447912Z * [new branch] gh/wconstab/463/base -> origin/gh/wconstab/463/base 2025-12-04T08:54:18.6447982Z * [new branch] gh/wconstab/463/head -> origin/gh/wconstab/463/head 2025-12-04T08:54:18.6448053Z * [new branch] gh/wconstab/463/orig -> origin/gh/wconstab/463/orig 2025-12-04T08:54:18.6448173Z * [new branch] gh/wconstab/464/base -> origin/gh/wconstab/464/base 2025-12-04T08:54:18.6448247Z * [new branch] gh/wconstab/464/head -> origin/gh/wconstab/464/head 2025-12-04T08:54:18.6448320Z * [new branch] gh/wconstab/464/orig -> origin/gh/wconstab/464/orig 2025-12-04T08:54:18.6448390Z * [new branch] gh/wconstab/465/base -> origin/gh/wconstab/465/base 2025-12-04T08:54:18.6448461Z * [new branch] gh/wconstab/465/head -> origin/gh/wconstab/465/head 2025-12-04T08:54:18.6448537Z * [new branch] gh/wconstab/465/orig -> origin/gh/wconstab/465/orig 2025-12-04T08:54:18.6448607Z * [new branch] gh/wconstab/466/base -> origin/gh/wconstab/466/base 2025-12-04T08:54:18.6448677Z * [new branch] gh/wconstab/466/head -> origin/gh/wconstab/466/head 2025-12-04T08:54:18.6448749Z * [new branch] gh/wconstab/466/orig -> origin/gh/wconstab/466/orig 2025-12-04T08:54:18.6448818Z * [new branch] gh/wconstab/467/base -> origin/gh/wconstab/467/base 2025-12-04T08:54:18.6448891Z * [new branch] gh/wconstab/467/head -> origin/gh/wconstab/467/head 2025-12-04T08:54:18.6448963Z * [new branch] gh/wconstab/467/orig -> origin/gh/wconstab/467/orig 2025-12-04T08:54:18.6449033Z * [new branch] gh/wconstab/468/base -> origin/gh/wconstab/468/base 2025-12-04T08:54:18.6449104Z * [new branch] gh/wconstab/468/head -> origin/gh/wconstab/468/head 2025-12-04T08:54:18.6449177Z * [new branch] gh/wconstab/468/orig -> origin/gh/wconstab/468/orig 2025-12-04T08:54:18.6449249Z * [new branch] gh/weifengpy/39/base -> origin/gh/weifengpy/39/base 2025-12-04T08:54:18.6449321Z * [new branch] gh/weifengpy/39/head -> origin/gh/weifengpy/39/head 2025-12-04T08:54:18.6449394Z * [new branch] gh/weifengpy/39/orig -> origin/gh/weifengpy/39/orig 2025-12-04T08:54:18.6449465Z * [new branch] gh/weifengpy/40/base -> origin/gh/weifengpy/40/base 2025-12-04T08:54:18.6449563Z * [new branch] gh/weifengpy/40/head -> origin/gh/weifengpy/40/head 2025-12-04T08:54:18.6449634Z * [new branch] gh/weifengpy/40/orig -> origin/gh/weifengpy/40/orig 2025-12-04T08:54:18.6449705Z * [new branch] gh/weifengpy/41/base -> origin/gh/weifengpy/41/base 2025-12-04T08:54:18.6449779Z * [new branch] gh/weifengpy/41/head -> origin/gh/weifengpy/41/head 2025-12-04T08:54:18.6449850Z * [new branch] gh/weifengpy/41/orig -> origin/gh/weifengpy/41/orig 2025-12-04T08:54:18.6449931Z * [new branch] gh/williamwen42/250/base -> origin/gh/williamwen42/250/base 2025-12-04T08:54:18.6450014Z * [new branch] gh/williamwen42/250/head -> origin/gh/williamwen42/250/head 2025-12-04T08:54:18.6450092Z * [new branch] gh/williamwen42/250/orig -> origin/gh/williamwen42/250/orig 2025-12-04T08:54:18.6450170Z * [new branch] gh/williamwen42/279/base -> origin/gh/williamwen42/279/base 2025-12-04T08:54:18.6450251Z * [new branch] gh/williamwen42/279/head -> origin/gh/williamwen42/279/head 2025-12-04T08:54:18.6450328Z * [new branch] gh/williamwen42/279/orig -> origin/gh/williamwen42/279/orig 2025-12-04T08:54:18.6450405Z * [new branch] gh/williamwen42/282/base -> origin/gh/williamwen42/282/base 2025-12-04T08:54:18.6450517Z * [new branch] gh/williamwen42/282/head -> origin/gh/williamwen42/282/head 2025-12-04T08:54:18.6450595Z * [new branch] gh/williamwen42/282/orig -> origin/gh/williamwen42/282/orig 2025-12-04T08:54:18.6450671Z * [new branch] gh/williamwen42/287/base -> origin/gh/williamwen42/287/base 2025-12-04T08:54:18.6450751Z * [new branch] gh/williamwen42/287/head -> origin/gh/williamwen42/287/head 2025-12-04T08:54:18.6450828Z * [new branch] gh/williamwen42/287/orig -> origin/gh/williamwen42/287/orig 2025-12-04T08:54:18.6450906Z * [new branch] gh/williamwen42/288/base -> origin/gh/williamwen42/288/base 2025-12-04T08:54:18.6450988Z * [new branch] gh/williamwen42/288/head -> origin/gh/williamwen42/288/head 2025-12-04T08:54:18.6451065Z * [new branch] gh/williamwen42/288/orig -> origin/gh/williamwen42/288/orig 2025-12-04T08:54:18.6451144Z * [new branch] gh/williamwen42/296/base -> origin/gh/williamwen42/296/base 2025-12-04T08:54:18.6451224Z * [new branch] gh/williamwen42/296/head -> origin/gh/williamwen42/296/head 2025-12-04T08:54:18.6451300Z * [new branch] gh/williamwen42/296/orig -> origin/gh/williamwen42/296/orig 2025-12-04T08:54:18.6451380Z * [new branch] gh/williamwen42/297/base -> origin/gh/williamwen42/297/base 2025-12-04T08:54:18.6451456Z * [new branch] gh/williamwen42/297/head -> origin/gh/williamwen42/297/head 2025-12-04T08:54:18.6451533Z * [new branch] gh/williamwen42/297/orig -> origin/gh/williamwen42/297/orig 2025-12-04T08:54:18.6451613Z * [new branch] gh/williamwen42/306/base -> origin/gh/williamwen42/306/base 2025-12-04T08:54:18.6451690Z * [new branch] gh/williamwen42/306/head -> origin/gh/williamwen42/306/head 2025-12-04T08:54:18.6451767Z * [new branch] gh/williamwen42/306/orig -> origin/gh/williamwen42/306/orig 2025-12-04T08:54:18.6451847Z * [new branch] gh/williamwen42/309/base -> origin/gh/williamwen42/309/base 2025-12-04T08:54:18.6451925Z * [new branch] gh/williamwen42/309/head -> origin/gh/williamwen42/309/head 2025-12-04T08:54:18.6452001Z * [new branch] gh/williamwen42/309/orig -> origin/gh/williamwen42/309/orig 2025-12-04T08:54:18.6452080Z * [new branch] gh/williamwen42/310/base -> origin/gh/williamwen42/310/base 2025-12-04T08:54:18.6452157Z * [new branch] gh/williamwen42/310/head -> origin/gh/williamwen42/310/head 2025-12-04T08:54:18.6452256Z * [new branch] gh/williamwen42/310/orig -> origin/gh/williamwen42/310/orig 2025-12-04T08:54:18.6452336Z * [new branch] gh/williamwen42/311/base -> origin/gh/williamwen42/311/base 2025-12-04T08:54:18.6452412Z * [new branch] gh/williamwen42/311/head -> origin/gh/williamwen42/311/head 2025-12-04T08:54:18.6452491Z * [new branch] gh/williamwen42/311/orig -> origin/gh/williamwen42/311/orig 2025-12-04T08:54:18.6452569Z * [new branch] gh/williamwen42/319/base -> origin/gh/williamwen42/319/base 2025-12-04T08:54:18.6452646Z * [new branch] gh/williamwen42/319/head -> origin/gh/williamwen42/319/head 2025-12-04T08:54:18.6452724Z * [new branch] gh/williamwen42/319/orig -> origin/gh/williamwen42/319/orig 2025-12-04T08:54:18.6452801Z * [new branch] gh/williamwen42/325/base -> origin/gh/williamwen42/325/base 2025-12-04T08:54:18.6452878Z * [new branch] gh/williamwen42/325/head -> origin/gh/williamwen42/325/head 2025-12-04T08:54:18.6452960Z * [new branch] gh/williamwen42/325/orig -> origin/gh/williamwen42/325/orig 2025-12-04T08:54:18.6453036Z * [new branch] gh/williamwen42/326/base -> origin/gh/williamwen42/326/base 2025-12-04T08:54:18.6453113Z * [new branch] gh/williamwen42/326/head -> origin/gh/williamwen42/326/head 2025-12-04T08:54:18.6453212Z * [new branch] gh/williamwen42/326/orig -> origin/gh/williamwen42/326/orig 2025-12-04T08:54:18.6453290Z * [new branch] gh/williamwen42/327/base -> origin/gh/williamwen42/327/base 2025-12-04T08:54:18.6453368Z * [new branch] gh/williamwen42/327/head -> origin/gh/williamwen42/327/head 2025-12-04T08:54:18.6453446Z * [new branch] gh/williamwen42/327/orig -> origin/gh/williamwen42/327/orig 2025-12-04T08:54:18.6453523Z * [new branch] gh/williamwen42/328/base -> origin/gh/williamwen42/328/base 2025-12-04T08:54:18.6453600Z * [new branch] gh/williamwen42/328/head -> origin/gh/williamwen42/328/head 2025-12-04T08:54:18.6453680Z * [new branch] gh/williamwen42/328/orig -> origin/gh/williamwen42/328/orig 2025-12-04T08:54:18.6453758Z * [new branch] gh/williamwen42/329/base -> origin/gh/williamwen42/329/base 2025-12-04T08:54:18.6453836Z * [new branch] gh/williamwen42/329/head -> origin/gh/williamwen42/329/head 2025-12-04T08:54:18.6453917Z * [new branch] gh/williamwen42/329/orig -> origin/gh/williamwen42/329/orig 2025-12-04T08:54:18.6453995Z * [new branch] gh/williamwen42/330/base -> origin/gh/williamwen42/330/base 2025-12-04T08:54:18.6454075Z * [new branch] gh/williamwen42/330/head -> origin/gh/williamwen42/330/head 2025-12-04T08:54:18.6454154Z * [new branch] gh/williamwen42/330/orig -> origin/gh/williamwen42/330/orig 2025-12-04T08:54:18.6454231Z * [new branch] gh/williamwen42/331/base -> origin/gh/williamwen42/331/base 2025-12-04T08:54:18.6454312Z * [new branch] gh/williamwen42/331/head -> origin/gh/williamwen42/331/head 2025-12-04T08:54:18.6454389Z * [new branch] gh/williamwen42/331/orig -> origin/gh/williamwen42/331/orig 2025-12-04T08:54:18.6454466Z * [new branch] gh/williamwen42/332/base -> origin/gh/williamwen42/332/base 2025-12-04T08:54:18.6454545Z * [new branch] gh/williamwen42/332/head -> origin/gh/williamwen42/332/head 2025-12-04T08:54:18.6454621Z * [new branch] gh/williamwen42/332/orig -> origin/gh/williamwen42/332/orig 2025-12-04T08:54:18.6454698Z * [new branch] gh/williamwen42/333/base -> origin/gh/williamwen42/333/base 2025-12-04T08:54:18.6454777Z * [new branch] gh/williamwen42/333/head -> origin/gh/williamwen42/333/head 2025-12-04T08:54:18.6454854Z * [new branch] gh/williamwen42/333/orig -> origin/gh/williamwen42/333/orig 2025-12-04T08:54:18.6454930Z * [new branch] gh/williamwen42/334/base -> origin/gh/williamwen42/334/base 2025-12-04T08:54:18.6455033Z * [new branch] gh/williamwen42/334/head -> origin/gh/williamwen42/334/head 2025-12-04T08:54:18.6455110Z * [new branch] gh/williamwen42/334/orig -> origin/gh/williamwen42/334/orig 2025-12-04T08:54:18.6455187Z * [new branch] gh/williamwen42/335/base -> origin/gh/williamwen42/335/base 2025-12-04T08:54:18.6455268Z * [new branch] gh/williamwen42/335/head -> origin/gh/williamwen42/335/head 2025-12-04T08:54:18.6455345Z * [new branch] gh/williamwen42/335/orig -> origin/gh/williamwen42/335/orig 2025-12-04T08:54:18.6455423Z * [new branch] gh/williamwen42/336/base -> origin/gh/williamwen42/336/base 2025-12-04T08:54:18.6455503Z * [new branch] gh/williamwen42/336/head -> origin/gh/williamwen42/336/head 2025-12-04T08:54:18.6455580Z * [new branch] gh/williamwen42/336/orig -> origin/gh/williamwen42/336/orig 2025-12-04T08:54:18.6455662Z * [new branch] gh/williamwen42/337/base -> origin/gh/williamwen42/337/base 2025-12-04T08:54:18.6455741Z * [new branch] gh/williamwen42/337/head -> origin/gh/williamwen42/337/head 2025-12-04T08:54:18.6455819Z * [new branch] gh/williamwen42/337/orig -> origin/gh/williamwen42/337/orig 2025-12-04T08:54:18.6455917Z * [new branch] gh/williamwen42/338/base -> origin/gh/williamwen42/338/base 2025-12-04T08:54:18.6455995Z * [new branch] gh/williamwen42/338/head -> origin/gh/williamwen42/338/head 2025-12-04T08:54:18.6456073Z * [new branch] gh/williamwen42/338/orig -> origin/gh/williamwen42/338/orig 2025-12-04T08:54:18.6456152Z * [new branch] gh/williamwen42/339/base -> origin/gh/williamwen42/339/base 2025-12-04T08:54:18.6456229Z * [new branch] gh/williamwen42/339/head -> origin/gh/williamwen42/339/head 2025-12-04T08:54:18.6456305Z * [new branch] gh/williamwen42/339/orig -> origin/gh/williamwen42/339/orig 2025-12-04T08:54:18.6456385Z * [new branch] gh/williamwen42/340/base -> origin/gh/williamwen42/340/base 2025-12-04T08:54:18.6456463Z * [new branch] gh/williamwen42/340/head -> origin/gh/williamwen42/340/head 2025-12-04T08:54:18.6456540Z * [new branch] gh/williamwen42/340/orig -> origin/gh/williamwen42/340/orig 2025-12-04T08:54:18.6456621Z * [new branch] gh/williamwen42/341/base -> origin/gh/williamwen42/341/base 2025-12-04T08:54:18.6456698Z * [new branch] gh/williamwen42/341/head -> origin/gh/williamwen42/341/head 2025-12-04T08:54:18.6456775Z * [new branch] gh/williamwen42/341/orig -> origin/gh/williamwen42/341/orig 2025-12-04T08:54:18.6456854Z * [new branch] gh/williamwen42/342/base -> origin/gh/williamwen42/342/base 2025-12-04T08:54:18.6456931Z * [new branch] gh/williamwen42/342/head -> origin/gh/williamwen42/342/head 2025-12-04T08:54:18.6457012Z * [new branch] gh/williamwen42/342/orig -> origin/gh/williamwen42/342/orig 2025-12-04T08:54:18.6457089Z * [new branch] gh/williamwen42/343/base -> origin/gh/williamwen42/343/base 2025-12-04T08:54:18.6457166Z * [new branch] gh/williamwen42/343/head -> origin/gh/williamwen42/343/head 2025-12-04T08:54:18.6457247Z * [new branch] gh/williamwen42/343/orig -> origin/gh/williamwen42/343/orig 2025-12-04T08:54:18.6457326Z * [new branch] gh/williamwen42/344/base -> origin/gh/williamwen42/344/base 2025-12-04T08:54:18.6457402Z * [new branch] gh/williamwen42/344/head -> origin/gh/williamwen42/344/head 2025-12-04T08:54:18.6457481Z * [new branch] gh/williamwen42/344/orig -> origin/gh/williamwen42/344/orig 2025-12-04T08:54:18.6457557Z * [new branch] gh/williamwen42/345/base -> origin/gh/williamwen42/345/base 2025-12-04T08:54:18.6457636Z * [new branch] gh/williamwen42/345/head -> origin/gh/williamwen42/345/head 2025-12-04T08:54:18.6457746Z * [new branch] gh/williamwen42/345/orig -> origin/gh/williamwen42/345/orig 2025-12-04T08:54:18.6457823Z * [new branch] gh/williamwen42/346/base -> origin/gh/williamwen42/346/base 2025-12-04T08:54:18.6457900Z * [new branch] gh/williamwen42/346/head -> origin/gh/williamwen42/346/head 2025-12-04T08:54:18.6457981Z * [new branch] gh/williamwen42/346/orig -> origin/gh/williamwen42/346/orig 2025-12-04T08:54:18.6458058Z * [new branch] gh/williamwen42/347/base -> origin/gh/williamwen42/347/base 2025-12-04T08:54:18.6458321Z * [new branch] gh/williamwen42/347/head -> origin/gh/williamwen42/347/head 2025-12-04T08:54:18.6458402Z * [new branch] gh/williamwen42/347/orig -> origin/gh/williamwen42/347/orig 2025-12-04T08:54:18.6458480Z * [new branch] gh/williamwen42/348/base -> origin/gh/williamwen42/348/base 2025-12-04T08:54:18.6458559Z * [new branch] gh/williamwen42/348/head -> origin/gh/williamwen42/348/head 2025-12-04T08:54:18.6458637Z * [new branch] gh/williamwen42/348/orig -> origin/gh/williamwen42/348/orig 2025-12-04T08:54:18.6458715Z * [new branch] gh/williamwen42/349/base -> origin/gh/williamwen42/349/base 2025-12-04T08:54:18.6458825Z * [new branch] gh/williamwen42/349/head -> origin/gh/williamwen42/349/head 2025-12-04T08:54:18.6458903Z * [new branch] gh/williamwen42/349/orig -> origin/gh/williamwen42/349/orig 2025-12-04T08:54:18.6458981Z * [new branch] gh/williamwen42/350/base -> origin/gh/williamwen42/350/base 2025-12-04T08:54:18.6459060Z * [new branch] gh/williamwen42/350/head -> origin/gh/williamwen42/350/head 2025-12-04T08:54:18.6459138Z * [new branch] gh/williamwen42/350/orig -> origin/gh/williamwen42/350/orig 2025-12-04T08:54:18.6459214Z * [new branch] gh/williamwen42/351/base -> origin/gh/williamwen42/351/base 2025-12-04T08:54:18.6459294Z * [new branch] gh/williamwen42/351/head -> origin/gh/williamwen42/351/head 2025-12-04T08:54:18.6459371Z * [new branch] gh/williamwen42/351/orig -> origin/gh/williamwen42/351/orig 2025-12-04T08:54:18.6459448Z * [new branch] gh/williamwen42/352/base -> origin/gh/williamwen42/352/base 2025-12-04T08:54:18.6459528Z * [new branch] gh/williamwen42/352/head -> origin/gh/williamwen42/352/head 2025-12-04T08:54:18.6459606Z * [new branch] gh/williamwen42/352/orig -> origin/gh/williamwen42/352/orig 2025-12-04T08:54:18.6459682Z * [new branch] gh/williamwen42/353/base -> origin/gh/williamwen42/353/base 2025-12-04T08:54:18.6459760Z * [new branch] gh/williamwen42/353/head -> origin/gh/williamwen42/353/head 2025-12-04T08:54:18.6459837Z * [new branch] gh/williamwen42/353/orig -> origin/gh/williamwen42/353/orig 2025-12-04T08:54:18.6459915Z * [new branch] gh/williamwen42/354/base -> origin/gh/williamwen42/354/base 2025-12-04T08:54:18.6459994Z * [new branch] gh/williamwen42/354/head -> origin/gh/williamwen42/354/head 2025-12-04T08:54:18.6460070Z * [new branch] gh/williamwen42/354/orig -> origin/gh/williamwen42/354/orig 2025-12-04T08:54:18.6460151Z * [new branch] gh/williamwen42/355/base -> origin/gh/williamwen42/355/base 2025-12-04T08:54:18.6460228Z * [new branch] gh/williamwen42/355/head -> origin/gh/williamwen42/355/head 2025-12-04T08:54:18.6460305Z * [new branch] gh/williamwen42/355/orig -> origin/gh/williamwen42/355/orig 2025-12-04T08:54:18.6460385Z * [new branch] gh/williamwen42/356/base -> origin/gh/williamwen42/356/base 2025-12-04T08:54:18.6460462Z * [new branch] gh/williamwen42/356/head -> origin/gh/williamwen42/356/head 2025-12-04T08:54:18.6460538Z * [new branch] gh/williamwen42/356/orig -> origin/gh/williamwen42/356/orig 2025-12-04T08:54:18.6460645Z * [new branch] gh/williamwen42/357/base -> origin/gh/williamwen42/357/base 2025-12-04T08:54:18.6460722Z * [new branch] gh/williamwen42/357/head -> origin/gh/williamwen42/357/head 2025-12-04T08:54:18.6460798Z * [new branch] gh/williamwen42/357/orig -> origin/gh/williamwen42/357/orig 2025-12-04T08:54:18.6460878Z * [new branch] gh/williamwen42/358/base -> origin/gh/williamwen42/358/base 2025-12-04T08:54:18.6460955Z * [new branch] gh/williamwen42/358/head -> origin/gh/williamwen42/358/head 2025-12-04T08:54:18.6461032Z * [new branch] gh/williamwen42/358/orig -> origin/gh/williamwen42/358/orig 2025-12-04T08:54:18.6461104Z * [new branch] gh/xmfan/169/base -> origin/gh/xmfan/169/base 2025-12-04T08:54:18.6461173Z * [new branch] gh/xmfan/169/head -> origin/gh/xmfan/169/head 2025-12-04T08:54:18.6461240Z * [new branch] gh/xmfan/170/base -> origin/gh/xmfan/170/base 2025-12-04T08:54:18.6461310Z * [new branch] gh/xmfan/170/head -> origin/gh/xmfan/170/head 2025-12-04T08:54:18.6461376Z * [new branch] gh/xmfan/274/base -> origin/gh/xmfan/274/base 2025-12-04T08:54:18.6461442Z * [new branch] gh/xmfan/274/head -> origin/gh/xmfan/274/head 2025-12-04T08:54:18.6461534Z * [new branch] gh/xmfan/274/orig -> origin/gh/xmfan/274/orig 2025-12-04T08:54:18.6461601Z * [new branch] gh/xmfan/277/base -> origin/gh/xmfan/277/base 2025-12-04T08:54:18.6461671Z * [new branch] gh/xmfan/277/head -> origin/gh/xmfan/277/head 2025-12-04T08:54:18.6461737Z * [new branch] gh/xmfan/277/orig -> origin/gh/xmfan/277/orig 2025-12-04T08:54:18.6461802Z * [new branch] gh/xmfan/301/base -> origin/gh/xmfan/301/base 2025-12-04T08:54:18.6461871Z * [new branch] gh/xmfan/301/head -> origin/gh/xmfan/301/head 2025-12-04T08:54:18.6461940Z * [new branch] gh/xmfan/301/orig -> origin/gh/xmfan/301/orig 2025-12-04T08:54:18.6462005Z * [new branch] gh/xmfan/304/base -> origin/gh/xmfan/304/base 2025-12-04T08:54:18.6462073Z * [new branch] gh/xmfan/304/head -> origin/gh/xmfan/304/head 2025-12-04T08:54:18.6462142Z * [new branch] gh/xmfan/304/orig -> origin/gh/xmfan/304/orig 2025-12-04T08:54:18.6462208Z * [new branch] gh/xmfan/309/base -> origin/gh/xmfan/309/base 2025-12-04T08:54:18.6462277Z * [new branch] gh/xmfan/309/head -> origin/gh/xmfan/309/head 2025-12-04T08:54:18.6462343Z * [new branch] gh/xmfan/309/orig -> origin/gh/xmfan/309/orig 2025-12-04T08:54:18.6462409Z * [new branch] gh/xmfan/310/base -> origin/gh/xmfan/310/base 2025-12-04T08:54:18.6462477Z * [new branch] gh/xmfan/310/head -> origin/gh/xmfan/310/head 2025-12-04T08:54:18.6462546Z * [new branch] gh/xmfan/310/orig -> origin/gh/xmfan/310/orig 2025-12-04T08:54:18.6462613Z * [new branch] gh/xmfan/311/base -> origin/gh/xmfan/311/base 2025-12-04T08:54:18.6462681Z * [new branch] gh/xmfan/311/head -> origin/gh/xmfan/311/head 2025-12-04T08:54:18.6462749Z * [new branch] gh/xmfan/311/orig -> origin/gh/xmfan/311/orig 2025-12-04T08:54:18.6462814Z * [new branch] gh/xmfan/312/base -> origin/gh/xmfan/312/base 2025-12-04T08:54:18.6462883Z * [new branch] gh/xmfan/312/head -> origin/gh/xmfan/312/head 2025-12-04T08:54:18.6462949Z * [new branch] gh/xmfan/312/orig -> origin/gh/xmfan/312/orig 2025-12-04T08:54:18.6463014Z * [new branch] gh/xmfan/313/base -> origin/gh/xmfan/313/base 2025-12-04T08:54:18.6463084Z * [new branch] gh/xmfan/313/head -> origin/gh/xmfan/313/head 2025-12-04T08:54:18.6463169Z * [new branch] gh/xmfan/313/orig -> origin/gh/xmfan/313/orig 2025-12-04T08:54:18.6463247Z * [new branch] gh/xuanzhang816/27/base -> origin/gh/xuanzhang816/27/base 2025-12-04T08:54:18.6463326Z * [new branch] gh/xuanzhang816/27/head -> origin/gh/xuanzhang816/27/head 2025-12-04T08:54:18.6463402Z * [new branch] gh/xuanzhang816/27/orig -> origin/gh/xuanzhang816/27/orig 2025-12-04T08:54:18.6463479Z * [new branch] gh/xuanzhang816/32/base -> origin/gh/xuanzhang816/32/base 2025-12-04T08:54:18.6463554Z * [new branch] gh/xuanzhang816/32/head -> origin/gh/xuanzhang816/32/head 2025-12-04T08:54:18.6463630Z * [new branch] gh/xuanzhang816/32/orig -> origin/gh/xuanzhang816/32/orig 2025-12-04T08:54:18.6463707Z * [new branch] gh/xuanzhang816/33/base -> origin/gh/xuanzhang816/33/base 2025-12-04T08:54:18.6463782Z * [new branch] gh/xuanzhang816/33/head -> origin/gh/xuanzhang816/33/head 2025-12-04T08:54:18.6463858Z * [new branch] gh/xuanzhang816/33/orig -> origin/gh/xuanzhang816/33/orig 2025-12-04T08:54:18.6463934Z * [new branch] gh/xuanzhang816/34/base -> origin/gh/xuanzhang816/34/base 2025-12-04T08:54:18.6464008Z * [new branch] gh/xuanzhang816/34/head -> origin/gh/xuanzhang816/34/head 2025-12-04T08:54:18.6464104Z * [new branch] gh/xuanzhang816/34/orig -> origin/gh/xuanzhang816/34/orig 2025-12-04T08:54:18.6464181Z * [new branch] gh/xuanzhang816/35/base -> origin/gh/xuanzhang816/35/base 2025-12-04T08:54:18.6464255Z * [new branch] gh/xuanzhang816/35/head -> origin/gh/xuanzhang816/35/head 2025-12-04T08:54:18.6464329Z * [new branch] gh/xuanzhang816/35/orig -> origin/gh/xuanzhang816/35/orig 2025-12-04T08:54:18.6464404Z * [new branch] gh/yanbing-j/11/base -> origin/gh/yanbing-j/11/base 2025-12-04T08:54:18.6464478Z * [new branch] gh/yanbing-j/11/head -> origin/gh/yanbing-j/11/head 2025-12-04T08:54:18.6464549Z * [new branch] gh/yanbing-j/11/orig -> origin/gh/yanbing-j/11/orig 2025-12-04T08:54:18.6464622Z * [new branch] gh/yanbing-j/12/base -> origin/gh/yanbing-j/12/base 2025-12-04T08:54:18.6464691Z * [new branch] gh/yanbing-j/12/head -> origin/gh/yanbing-j/12/head 2025-12-04T08:54:18.6464762Z * [new branch] gh/yanbing-j/12/orig -> origin/gh/yanbing-j/12/orig 2025-12-04T08:54:18.6464835Z * [new branch] gh/yanbing-j/13/base -> origin/gh/yanbing-j/13/base 2025-12-04T08:54:18.6464905Z * [new branch] gh/yanbing-j/13/head -> origin/gh/yanbing-j/13/head 2025-12-04T08:54:18.6464977Z * [new branch] gh/yanbing-j/13/orig -> origin/gh/yanbing-j/13/orig 2025-12-04T08:54:18.6465047Z * [new branch] gh/yanbing-j/14/base -> origin/gh/yanbing-j/14/base 2025-12-04T08:54:18.6465117Z * [new branch] gh/yanbing-j/14/head -> origin/gh/yanbing-j/14/head 2025-12-04T08:54:18.6465188Z * [new branch] gh/yanbing-j/14/orig -> origin/gh/yanbing-j/14/orig 2025-12-04T08:54:18.6465258Z * [new branch] gh/yanbing-j/15/base -> origin/gh/yanbing-j/15/base 2025-12-04T08:54:18.6465328Z * [new branch] gh/yanbing-j/15/head -> origin/gh/yanbing-j/15/head 2025-12-04T08:54:18.6465401Z * [new branch] gh/yanbing-j/15/orig -> origin/gh/yanbing-j/15/orig 2025-12-04T08:54:18.6465472Z * [new branch] gh/yanbing-j/18/base -> origin/gh/yanbing-j/18/base 2025-12-04T08:54:18.6465542Z * [new branch] gh/yanbing-j/18/head -> origin/gh/yanbing-j/18/head 2025-12-04T08:54:18.6465612Z * [new branch] gh/yanbing-j/18/orig -> origin/gh/yanbing-j/18/orig 2025-12-04T08:54:18.6465681Z * [new branch] gh/yanbing-j/19/base -> origin/gh/yanbing-j/19/base 2025-12-04T08:54:18.6465799Z * [new branch] gh/yanbing-j/19/head -> origin/gh/yanbing-j/19/head 2025-12-04T08:54:18.6465871Z * [new branch] gh/yanbing-j/19/orig -> origin/gh/yanbing-j/19/orig 2025-12-04T08:54:18.6465940Z * [new branch] gh/yanbing-j/20/base -> origin/gh/yanbing-j/20/base 2025-12-04T08:54:18.6466010Z * [new branch] gh/yanbing-j/20/head -> origin/gh/yanbing-j/20/head 2025-12-04T08:54:18.6466083Z * [new branch] gh/yanbing-j/20/orig -> origin/gh/yanbing-j/20/orig 2025-12-04T08:54:18.6466152Z * [new branch] gh/yanbing-j/21/base -> origin/gh/yanbing-j/21/base 2025-12-04T08:54:18.6466223Z * [new branch] gh/yanbing-j/21/head -> origin/gh/yanbing-j/21/head 2025-12-04T08:54:18.6466295Z * [new branch] gh/yanbing-j/22/base -> origin/gh/yanbing-j/22/base 2025-12-04T08:54:18.6466366Z * [new branch] gh/yanbing-j/22/head -> origin/gh/yanbing-j/22/head 2025-12-04T08:54:18.6466440Z * [new branch] gh/yanbing-j/22/orig -> origin/gh/yanbing-j/22/orig 2025-12-04T08:54:18.6466509Z * [new branch] gh/yanbing-j/23/base -> origin/gh/yanbing-j/23/base 2025-12-04T08:54:18.6466578Z * [new branch] gh/yanbing-j/23/head -> origin/gh/yanbing-j/23/head 2025-12-04T08:54:18.6466677Z * [new branch] gh/yanbing-j/23/orig -> origin/gh/yanbing-j/23/orig 2025-12-04T08:54:18.6466747Z * [new branch] gh/yanbing-j/24/base -> origin/gh/yanbing-j/24/base 2025-12-04T08:54:18.6466817Z * [new branch] gh/yanbing-j/24/head -> origin/gh/yanbing-j/24/head 2025-12-04T08:54:18.6466888Z * [new branch] gh/yanbing-j/24/orig -> origin/gh/yanbing-j/24/orig 2025-12-04T08:54:18.6466958Z * [new branch] gh/yanbing-j/25/base -> origin/gh/yanbing-j/25/base 2025-12-04T08:54:18.6467027Z * [new branch] gh/yanbing-j/25/head -> origin/gh/yanbing-j/25/head 2025-12-04T08:54:18.6467100Z * [new branch] gh/yanbing-j/25/orig -> origin/gh/yanbing-j/25/orig 2025-12-04T08:54:18.6467169Z * [new branch] gh/yanbing-j/26/base -> origin/gh/yanbing-j/26/base 2025-12-04T08:54:18.6467239Z * [new branch] gh/yanbing-j/26/head -> origin/gh/yanbing-j/26/head 2025-12-04T08:54:18.6467311Z * [new branch] gh/yanbing-j/26/orig -> origin/gh/yanbing-j/26/orig 2025-12-04T08:54:18.6467390Z * [new branch] gh/yang-yu-hang/1/base -> origin/gh/yang-yu-hang/1/base 2025-12-04T08:54:18.6467466Z * [new branch] gh/yang-yu-hang/1/head -> origin/gh/yang-yu-hang/1/head 2025-12-04T08:54:18.6467543Z * [new branch] gh/yang-yu-hang/1/orig -> origin/gh/yang-yu-hang/1/orig 2025-12-04T08:54:18.6467617Z * [new branch] gh/yang-yu-hang/2/base -> origin/gh/yang-yu-hang/2/base 2025-12-04T08:54:18.6467690Z * [new branch] gh/yang-yu-hang/2/head -> origin/gh/yang-yu-hang/2/head 2025-12-04T08:54:18.6467767Z * [new branch] gh/yang-yu-hang/2/orig -> origin/gh/yang-yu-hang/2/orig 2025-12-04T08:54:18.6467839Z * [new branch] gh/yang-yu-hang/3/base -> origin/gh/yang-yu-hang/3/base 2025-12-04T08:54:18.6467912Z * [new branch] gh/yang-yu-hang/3/head -> origin/gh/yang-yu-hang/3/head 2025-12-04T08:54:18.6467986Z * [new branch] gh/yang-yu-hang/3/orig -> origin/gh/yang-yu-hang/3/orig 2025-12-04T08:54:18.6468059Z * [new branch] gh/yangw-dev/12/base -> origin/gh/yangw-dev/12/base 2025-12-04T08:54:18.6468194Z * [new branch] gh/yangw-dev/12/head -> origin/gh/yangw-dev/12/head 2025-12-04T08:54:18.6468268Z * [new branch] gh/yangw-dev/12/orig -> origin/gh/yangw-dev/12/orig 2025-12-04T08:54:18.6468339Z * [new branch] gh/yangw-dev/13/base -> origin/gh/yangw-dev/13/base 2025-12-04T08:54:18.6468410Z * [new branch] gh/yangw-dev/13/head -> origin/gh/yangw-dev/13/head 2025-12-04T08:54:18.6468509Z * [new branch] gh/yangw-dev/13/orig -> origin/gh/yangw-dev/13/orig 2025-12-04T08:54:18.6468579Z * [new branch] gh/yangw-dev/14/base -> origin/gh/yangw-dev/14/base 2025-12-04T08:54:18.6468650Z * [new branch] gh/yangw-dev/14/head -> origin/gh/yangw-dev/14/head 2025-12-04T08:54:18.6468724Z * [new branch] gh/yangw-dev/14/orig -> origin/gh/yangw-dev/14/orig 2025-12-04T08:54:18.6468793Z * [new branch] gh/yangw-dev/15/base -> origin/gh/yangw-dev/15/base 2025-12-04T08:54:18.6468865Z * [new branch] gh/yangw-dev/15/head -> origin/gh/yangw-dev/15/head 2025-12-04T08:54:18.6468935Z * [new branch] gh/yangw-dev/15/orig -> origin/gh/yangw-dev/15/orig 2025-12-04T08:54:18.6469005Z * [new branch] gh/yangw-dev/19/base -> origin/gh/yangw-dev/19/base 2025-12-04T08:54:18.6469077Z * [new branch] gh/yangw-dev/19/head -> origin/gh/yangw-dev/19/head 2025-12-04T08:54:18.6469148Z * [new branch] gh/yangw-dev/19/orig -> origin/gh/yangw-dev/19/orig 2025-12-04T08:54:18.6469216Z * [new branch] gh/yangw-dev/26/base -> origin/gh/yangw-dev/26/base 2025-12-04T08:54:18.6469288Z * [new branch] gh/yangw-dev/26/head -> origin/gh/yangw-dev/26/head 2025-12-04T08:54:18.6469401Z * [new branch] gh/yangw-dev/26/orig -> origin/gh/yangw-dev/26/orig 2025-12-04T08:54:18.6469472Z * [new branch] gh/yangw-dev/27/base -> origin/gh/yangw-dev/27/base 2025-12-04T08:54:18.6469544Z * [new branch] gh/yangw-dev/27/head -> origin/gh/yangw-dev/27/head 2025-12-04T08:54:18.6469614Z * [new branch] gh/yangw-dev/27/orig -> origin/gh/yangw-dev/27/orig 2025-12-04T08:54:18.6469684Z * [new branch] gh/ydwu4/292/base -> origin/gh/ydwu4/292/base 2025-12-04T08:54:18.6469755Z * [new branch] gh/ydwu4/292/head -> origin/gh/ydwu4/292/head 2025-12-04T08:54:18.6469823Z * [new branch] gh/ydwu4/292/orig -> origin/gh/ydwu4/292/orig 2025-12-04T08:54:18.6469892Z * [new branch] gh/ydwu4/294/base -> origin/gh/ydwu4/294/base 2025-12-04T08:54:18.6469957Z * [new branch] gh/ydwu4/294/head -> origin/gh/ydwu4/294/head 2025-12-04T08:54:18.6470024Z * [new branch] gh/ydwu4/294/orig -> origin/gh/ydwu4/294/orig 2025-12-04T08:54:18.6470091Z * [new branch] gh/ydwu4/295/base -> origin/gh/ydwu4/295/base 2025-12-04T08:54:18.6470156Z * [new branch] gh/ydwu4/295/head -> origin/gh/ydwu4/295/head 2025-12-04T08:54:18.6470222Z * [new branch] gh/ydwu4/295/orig -> origin/gh/ydwu4/295/orig 2025-12-04T08:54:18.6470290Z * [new branch] gh/ydwu4/296/base -> origin/gh/ydwu4/296/base 2025-12-04T08:54:18.6470355Z * [new branch] gh/ydwu4/296/head -> origin/gh/ydwu4/296/head 2025-12-04T08:54:18.6470424Z * [new branch] gh/ydwu4/296/orig -> origin/gh/ydwu4/296/orig 2025-12-04T08:54:18.6470492Z * [new branch] gh/ydwu4/306/base -> origin/gh/ydwu4/306/base 2025-12-04T08:54:18.6470558Z * [new branch] gh/ydwu4/306/head -> origin/gh/ydwu4/306/head 2025-12-04T08:54:18.6470624Z * [new branch] gh/ydwu4/306/orig -> origin/gh/ydwu4/306/orig 2025-12-04T08:54:18.6470693Z * [new branch] gh/ydwu4/312/base -> origin/gh/ydwu4/312/base 2025-12-04T08:54:18.6470758Z * [new branch] gh/ydwu4/312/head -> origin/gh/ydwu4/312/head 2025-12-04T08:54:18.6470823Z * [new branch] gh/ydwu4/312/orig -> origin/gh/ydwu4/312/orig 2025-12-04T08:54:18.6470891Z * [new branch] gh/ydwu4/322/base -> origin/gh/ydwu4/322/base 2025-12-04T08:54:18.6470956Z * [new branch] gh/ydwu4/322/head -> origin/gh/ydwu4/322/head 2025-12-04T08:54:18.6471048Z * [new branch] gh/ydwu4/322/orig -> origin/gh/ydwu4/322/orig 2025-12-04T08:54:18.6471115Z * [new branch] gh/ydwu4/327/base -> origin/gh/ydwu4/327/base 2025-12-04T08:54:18.6471181Z * [new branch] gh/ydwu4/327/head -> origin/gh/ydwu4/327/head 2025-12-04T08:54:18.6471250Z * [new branch] gh/ydwu4/327/orig -> origin/gh/ydwu4/327/orig 2025-12-04T08:54:18.6471317Z * [new branch] gh/ydwu4/328/base -> origin/gh/ydwu4/328/base 2025-12-04T08:54:18.6471383Z * [new branch] gh/ydwu4/328/head -> origin/gh/ydwu4/328/head 2025-12-04T08:54:18.6471451Z * [new branch] gh/ydwu4/328/orig -> origin/gh/ydwu4/328/orig 2025-12-04T08:54:18.6471517Z * [new branch] gh/ydwu4/329/base -> origin/gh/ydwu4/329/base 2025-12-04T08:54:18.6471582Z * [new branch] gh/ydwu4/329/head -> origin/gh/ydwu4/329/head 2025-12-04T08:54:18.6471651Z * [new branch] gh/ydwu4/329/orig -> origin/gh/ydwu4/329/orig 2025-12-04T08:54:18.6471717Z * [new branch] gh/ydwu4/330/base -> origin/gh/ydwu4/330/base 2025-12-04T08:54:18.6471782Z * [new branch] gh/ydwu4/330/head -> origin/gh/ydwu4/330/head 2025-12-04T08:54:18.6471872Z * [new branch] gh/ydwu4/330/orig -> origin/gh/ydwu4/330/orig 2025-12-04T08:54:18.6471938Z * [new branch] gh/ydwu4/331/base -> origin/gh/ydwu4/331/base 2025-12-04T08:54:18.6472003Z * [new branch] gh/ydwu4/331/head -> origin/gh/ydwu4/331/head 2025-12-04T08:54:18.6472072Z * [new branch] gh/ydwu4/331/orig -> origin/gh/ydwu4/331/orig 2025-12-04T08:54:18.6473940Z * [new branch] gh/ydwu4/332/base -> origin/gh/ydwu4/332/base 2025-12-04T08:54:18.6474014Z * [new branch] gh/ydwu4/332/head -> origin/gh/ydwu4/332/head 2025-12-04T08:54:18.6474088Z * [new branch] gh/ydwu4/332/orig -> origin/gh/ydwu4/332/orig 2025-12-04T08:54:18.6474156Z * [new branch] gh/ydwu4/333/base -> origin/gh/ydwu4/333/base 2025-12-04T08:54:18.6474220Z * [new branch] gh/ydwu4/333/head -> origin/gh/ydwu4/333/head 2025-12-04T08:54:18.6474288Z * [new branch] gh/ydwu4/333/orig -> origin/gh/ydwu4/333/orig 2025-12-04T08:54:18.6474356Z * [new branch] gh/ydwu4/334/base -> origin/gh/ydwu4/334/base 2025-12-04T08:54:18.6474423Z * [new branch] gh/ydwu4/334/head -> origin/gh/ydwu4/334/head 2025-12-04T08:54:18.6474494Z * [new branch] gh/ydwu4/334/orig -> origin/gh/ydwu4/334/orig 2025-12-04T08:54:18.6474559Z * [new branch] gh/ydwu4/335/base -> origin/gh/ydwu4/335/base 2025-12-04T08:54:18.6474625Z * [new branch] gh/ydwu4/335/head -> origin/gh/ydwu4/335/head 2025-12-04T08:54:18.6474695Z * [new branch] gh/ydwu4/335/orig -> origin/gh/ydwu4/335/orig 2025-12-04T08:54:18.6474761Z * [new branch] gh/ydwu4/337/base -> origin/gh/ydwu4/337/base 2025-12-04T08:54:18.6474829Z * [new branch] gh/ydwu4/337/head -> origin/gh/ydwu4/337/head 2025-12-04T08:54:18.6474894Z * [new branch] gh/ydwu4/337/orig -> origin/gh/ydwu4/337/orig 2025-12-04T08:54:18.6474962Z * [new branch] gh/ydwu4/339/base -> origin/gh/ydwu4/339/base 2025-12-04T08:54:18.6475031Z * [new branch] gh/ydwu4/339/head -> origin/gh/ydwu4/339/head 2025-12-04T08:54:18.6475096Z * [new branch] gh/ydwu4/339/orig -> origin/gh/ydwu4/339/orig 2025-12-04T08:54:18.6475161Z * [new branch] gh/yf225/133/base -> origin/gh/yf225/133/base 2025-12-04T08:54:18.6475227Z * [new branch] gh/yf225/133/head -> origin/gh/yf225/133/head 2025-12-04T08:54:18.6475328Z * [new branch] gh/yf225/93/base -> origin/gh/yf225/93/base 2025-12-04T08:54:18.6475395Z * [new branch] gh/yf225/93/head -> origin/gh/yf225/93/head 2025-12-04T08:54:18.6475471Z * [new branch] gh/yifuwang/152/base -> origin/gh/yifuwang/152/base 2025-12-04T08:54:18.6475547Z * [new branch] gh/yifuwang/152/head -> origin/gh/yifuwang/152/head 2025-12-04T08:54:18.6475623Z * [new branch] gh/yifuwang/152/orig -> origin/gh/yifuwang/152/orig 2025-12-04T08:54:18.6475702Z * [new branch] gh/yifuwang/195/base -> origin/gh/yifuwang/195/base 2025-12-04T08:54:18.6475774Z * [new branch] gh/yifuwang/195/head -> origin/gh/yifuwang/195/head 2025-12-04T08:54:18.6475848Z * [new branch] gh/yifuwang/195/orig -> origin/gh/yifuwang/195/orig 2025-12-04T08:54:18.6475923Z * [new branch] gh/yiming0416/1/base -> origin/gh/yiming0416/1/base 2025-12-04T08:54:18.6475996Z * [new branch] gh/yiming0416/1/head -> origin/gh/yiming0416/1/head 2025-12-04T08:54:18.6476067Z * [new branch] gh/yiming0416/2/base -> origin/gh/yiming0416/2/base 2025-12-04T08:54:18.6476140Z * [new branch] gh/yiming0416/2/head -> origin/gh/yiming0416/2/head 2025-12-04T08:54:18.6476213Z * [new branch] gh/yushangdi/1/base -> origin/gh/yushangdi/1/base 2025-12-04T08:54:18.6476307Z * [new branch] gh/yushangdi/1/head -> origin/gh/yushangdi/1/head 2025-12-04T08:54:18.6476383Z * [new branch] gh/yushangdi/10/base -> origin/gh/yushangdi/10/base 2025-12-04T08:54:18.6476455Z * [new branch] gh/yushangdi/10/head -> origin/gh/yushangdi/10/head 2025-12-04T08:54:18.6476529Z * [new branch] gh/yushangdi/10/orig -> origin/gh/yushangdi/10/orig 2025-12-04T08:54:18.6476599Z * [new branch] gh/yushangdi/11/base -> origin/gh/yushangdi/11/base 2025-12-04T08:54:18.6476670Z * [new branch] gh/yushangdi/11/head -> origin/gh/yushangdi/11/head 2025-12-04T08:54:18.6476740Z * [new branch] gh/yushangdi/11/orig -> origin/gh/yushangdi/11/orig 2025-12-04T08:54:18.6476813Z * [new branch] gh/yushangdi/2/base -> origin/gh/yushangdi/2/base 2025-12-04T08:54:18.6476884Z * [new branch] gh/yushangdi/2/head -> origin/gh/yushangdi/2/head 2025-12-04T08:54:18.6476956Z * [new branch] gh/yushangdi/7/base -> origin/gh/yushangdi/7/base 2025-12-04T08:54:18.6477027Z * [new branch] gh/yushangdi/7/head -> origin/gh/yushangdi/7/head 2025-12-04T08:54:18.6477097Z * [new branch] gh/yushangdi/7/orig -> origin/gh/yushangdi/7/orig 2025-12-04T08:54:18.6477166Z * [new branch] gh/yushangdi/8/base -> origin/gh/yushangdi/8/base 2025-12-04T08:54:18.6477237Z * [new branch] gh/yushangdi/8/head -> origin/gh/yushangdi/8/head 2025-12-04T08:54:18.6477310Z * [new branch] gh/yushangdi/8/orig -> origin/gh/yushangdi/8/orig 2025-12-04T08:54:18.6477380Z * [new branch] gh/yushangdi/9/base -> origin/gh/yushangdi/9/base 2025-12-04T08:54:18.6477451Z * [new branch] gh/yushangdi/9/head -> origin/gh/yushangdi/9/head 2025-12-04T08:54:18.6477520Z * [new branch] gh/yushangdi/9/orig -> origin/gh/yushangdi/9/orig 2025-12-04T08:54:18.6477590Z * [new branch] gh/zklaus/19/base -> origin/gh/zklaus/19/base 2025-12-04T08:54:18.6477658Z * [new branch] gh/zklaus/19/head -> origin/gh/zklaus/19/head 2025-12-04T08:54:18.6477725Z * [new branch] gh/zklaus/19/orig -> origin/gh/zklaus/19/orig 2025-12-04T08:54:18.6477792Z * [new branch] gh/zklaus/20/base -> origin/gh/zklaus/20/base 2025-12-04T08:54:18.6477860Z * [new branch] gh/zklaus/20/head -> origin/gh/zklaus/20/head 2025-12-04T08:54:18.6477950Z * [new branch] gh/zklaus/20/orig -> origin/gh/zklaus/20/orig 2025-12-04T08:54:18.6478016Z * [new branch] gh/zklaus/21/base -> origin/gh/zklaus/21/base 2025-12-04T08:54:18.6478130Z * [new branch] gh/zklaus/21/head -> origin/gh/zklaus/21/head 2025-12-04T08:54:18.6478199Z * [new branch] gh/zklaus/21/orig -> origin/gh/zklaus/21/orig 2025-12-04T08:54:18.6478267Z * [new branch] gh/zklaus/22/base -> origin/gh/zklaus/22/base 2025-12-04T08:54:18.6478336Z * [new branch] gh/zklaus/22/head -> origin/gh/zklaus/22/head 2025-12-04T08:54:18.6478402Z * [new branch] gh/zklaus/22/orig -> origin/gh/zklaus/22/orig 2025-12-04T08:54:18.6478470Z * [new branch] gh/zklaus/23/base -> origin/gh/zklaus/23/base 2025-12-04T08:54:18.6478535Z * [new branch] gh/zklaus/23/head -> origin/gh/zklaus/23/head 2025-12-04T08:54:18.6478604Z * [new branch] gh/zklaus/23/orig -> origin/gh/zklaus/23/orig 2025-12-04T08:54:18.6478674Z * [new branch] gh/zklaus/24/base -> origin/gh/zklaus/24/base 2025-12-04T08:54:18.6478740Z * [new branch] gh/zklaus/24/head -> origin/gh/zklaus/24/head 2025-12-04T08:54:18.6478805Z * [new branch] gh/zklaus/24/orig -> origin/gh/zklaus/24/orig 2025-12-04T08:54:18.6478915Z * [new branch] gh/zou3519/1197/base -> origin/gh/zou3519/1197/base 2025-12-04T08:54:18.6478986Z * [new branch] gh/zou3519/1197/head -> origin/gh/zou3519/1197/head 2025-12-04T08:54:18.6479056Z * [new branch] gh/zou3519/1197/orig -> origin/gh/zou3519/1197/orig 2025-12-04T08:54:18.6479125Z * [new branch] gh/zou3519/1199/base -> origin/gh/zou3519/1199/base 2025-12-04T08:54:18.6479194Z * [new branch] gh/zou3519/1199/head -> origin/gh/zou3519/1199/head 2025-12-04T08:54:18.6479263Z * [new branch] gh/zou3519/1199/orig -> origin/gh/zou3519/1199/orig 2025-12-04T08:54:18.6479332Z * [new branch] gh/zou3519/1200/base -> origin/gh/zou3519/1200/base 2025-12-04T08:54:18.6479399Z * [new branch] gh/zou3519/1200/head -> origin/gh/zou3519/1200/head 2025-12-04T08:54:18.6479467Z * [new branch] gh/zou3519/1200/orig -> origin/gh/zou3519/1200/orig 2025-12-04T08:54:18.6479537Z * [new branch] gh/zou3519/1201/base -> origin/gh/zou3519/1201/base 2025-12-04T08:54:18.6479604Z * [new branch] gh/zou3519/1201/head -> origin/gh/zou3519/1201/head 2025-12-04T08:54:18.6479671Z * [new branch] gh/zou3519/1201/orig -> origin/gh/zou3519/1201/orig 2025-12-04T08:54:18.6479740Z * [new branch] gh/zou3519/1202/base -> origin/gh/zou3519/1202/base 2025-12-04T08:54:18.6479807Z * [new branch] gh/zou3519/1202/head -> origin/gh/zou3519/1202/head 2025-12-04T08:54:18.6479878Z * [new branch] gh/zou3519/1202/orig -> origin/gh/zou3519/1202/orig 2025-12-04T08:54:18.6479947Z * [new branch] gh/zpcore/1/base -> origin/gh/zpcore/1/base 2025-12-04T08:54:18.6480014Z * [new branch] gh/zpcore/1/head -> origin/gh/zpcore/1/head 2025-12-04T08:54:18.6480083Z * [new branch] gh/zpcore/11/base -> origin/gh/zpcore/11/base 2025-12-04T08:54:18.6480150Z * [new branch] gh/zpcore/11/head -> origin/gh/zpcore/11/head 2025-12-04T08:54:18.6480219Z * [new branch] gh/zpcore/11/orig -> origin/gh/zpcore/11/orig 2025-12-04T08:54:18.6480287Z * [new branch] gh/zpcore/12/base -> origin/gh/zpcore/12/base 2025-12-04T08:54:18.6480353Z * [new branch] gh/zpcore/12/head -> origin/gh/zpcore/12/head 2025-12-04T08:54:18.6480419Z * [new branch] gh/zpcore/12/orig -> origin/gh/zpcore/12/orig 2025-12-04T08:54:18.6481337Z * [new branch] gh/zpcore/13/base -> origin/gh/zpcore/13/base 2025-12-04T08:54:18.6481404Z * [new branch] gh/zpcore/13/head -> origin/gh/zpcore/13/head 2025-12-04T08:54:18.6481470Z * [new branch] gh/zpcore/13/orig -> origin/gh/zpcore/13/orig 2025-12-04T08:54:18.6481539Z * [new branch] gh/zpcore/14/base -> origin/gh/zpcore/14/base 2025-12-04T08:54:18.6481608Z * [new branch] gh/zpcore/14/head -> origin/gh/zpcore/14/head 2025-12-04T08:54:18.6481674Z * [new branch] gh/zpcore/14/orig -> origin/gh/zpcore/14/orig 2025-12-04T08:54:18.6481741Z * [new branch] gh/zpcore/15/base -> origin/gh/zpcore/15/base 2025-12-04T08:54:18.6481807Z * [new branch] gh/zpcore/15/head -> origin/gh/zpcore/15/head 2025-12-04T08:54:18.6481872Z * [new branch] gh/zpcore/15/orig -> origin/gh/zpcore/15/orig 2025-12-04T08:54:18.6481941Z * [new branch] gh/zpcore/2/base -> origin/gh/zpcore/2/base 2025-12-04T08:54:18.6482009Z * [new branch] gh/zpcore/2/head -> origin/gh/zpcore/2/head 2025-12-04T08:54:18.6482075Z * [new branch] gh/zpcore/21/base -> origin/gh/zpcore/21/base 2025-12-04T08:54:18.6482142Z * [new branch] gh/zpcore/21/head -> origin/gh/zpcore/21/head 2025-12-04T08:54:18.6482233Z * [new branch] gh/zpcore/21/orig -> origin/gh/zpcore/21/orig 2025-12-04T08:54:18.6482300Z * [new branch] gh/zpcore/22/base -> origin/gh/zpcore/22/base 2025-12-04T08:54:18.6482367Z * [new branch] gh/zpcore/22/head -> origin/gh/zpcore/22/head 2025-12-04T08:54:18.6482434Z * [new branch] gh/zpcore/22/orig -> origin/gh/zpcore/22/orig 2025-12-04T08:54:18.6482501Z * [new branch] gh/zpcore/23/base -> origin/gh/zpcore/23/base 2025-12-04T08:54:18.6482566Z * [new branch] gh/zpcore/23/head -> origin/gh/zpcore/23/head 2025-12-04T08:54:18.6482635Z * [new branch] gh/zpcore/23/orig -> origin/gh/zpcore/23/orig 2025-12-04T08:54:18.6482702Z * [new branch] gh/zpcore/24/base -> origin/gh/zpcore/24/base 2025-12-04T08:54:18.6482769Z * [new branch] gh/zpcore/24/head -> origin/gh/zpcore/24/head 2025-12-04T08:54:18.6482837Z * [new branch] gh/zpcore/24/orig -> origin/gh/zpcore/24/orig 2025-12-04T08:54:18.6482904Z * [new branch] gh/zpcore/25/base -> origin/gh/zpcore/25/base 2025-12-04T08:54:18.6482970Z * [new branch] gh/zpcore/25/head -> origin/gh/zpcore/25/head 2025-12-04T08:54:18.6483037Z * [new branch] gh/zpcore/25/orig -> origin/gh/zpcore/25/orig 2025-12-04T08:54:18.6483104Z * [new branch] gh/zpcore/26/base -> origin/gh/zpcore/26/base 2025-12-04T08:54:18.6483170Z * [new branch] gh/zpcore/26/head -> origin/gh/zpcore/26/head 2025-12-04T08:54:18.6483238Z * [new branch] gh/zpcore/26/orig -> origin/gh/zpcore/26/orig 2025-12-04T08:54:18.6483307Z * [new branch] gh/zpcore/27/base -> origin/gh/zpcore/27/base 2025-12-04T08:54:18.6483373Z * [new branch] gh/zpcore/27/head -> origin/gh/zpcore/27/head 2025-12-04T08:54:18.6483440Z * [new branch] gh/zpcore/27/orig -> origin/gh/zpcore/27/orig 2025-12-04T08:54:18.6483507Z * [new branch] gh/zpcore/28/base -> origin/gh/zpcore/28/base 2025-12-04T08:54:18.6483573Z * [new branch] gh/zpcore/28/head -> origin/gh/zpcore/28/head 2025-12-04T08:54:18.6483638Z * [new branch] gh/zpcore/28/orig -> origin/gh/zpcore/28/orig 2025-12-04T08:54:18.6483705Z * [new branch] gh/zpcore/3/base -> origin/gh/zpcore/3/base 2025-12-04T08:54:18.6483771Z * [new branch] gh/zpcore/3/head -> origin/gh/zpcore/3/head 2025-12-04T08:54:18.6483859Z * [new branch] gh/zpcore/4/base -> origin/gh/zpcore/4/base 2025-12-04T08:54:18.6483926Z * [new branch] gh/zpcore/4/head -> origin/gh/zpcore/4/head 2025-12-04T08:54:18.6483992Z * [new branch] gh/zpcore/5/base -> origin/gh/zpcore/5/base 2025-12-04T08:54:18.6484058Z * [new branch] gh/zpcore/5/head -> origin/gh/zpcore/5/head 2025-12-04T08:54:18.6484125Z * [new branch] gh/zpcore/6/base -> origin/gh/zpcore/6/base 2025-12-04T08:54:18.6484191Z * [new branch] gh/zpcore/6/head -> origin/gh/zpcore/6/head 2025-12-04T08:54:18.6484257Z * [new branch] gh/zpcore/7/base -> origin/gh/zpcore/7/base 2025-12-04T08:54:18.6484323Z * [new branch] gh/zpcore/7/head -> origin/gh/zpcore/7/head 2025-12-04T08:54:18.6484388Z * [new branch] gh/zpcore/8/base -> origin/gh/zpcore/8/base 2025-12-04T08:54:18.6484457Z * [new branch] gh/zpcore/8/head -> origin/gh/zpcore/8/head 2025-12-04T08:54:18.6484525Z * [new branch] google-main -> origin/google-main 2025-12-04T08:54:18.6484611Z * [new branch] guangyey/external_stream -> origin/guangyey/external_stream 2025-12-04T08:54:18.6484685Z * [new branch] guangyey/test_2025 -> origin/guangyey/test_2025 2025-12-04T08:54:18.6484844Z * [new branch] guilhermeleobas/cherry-pick-55d87d9dfd9 -> origin/guilhermeleobas/cherry-pick-55d87d9dfd9 2025-12-04T08:54:18.6484961Z * [new branch] hameerabbasi/complex_tensor_subclass -> origin/hameerabbasi/complex_tensor_subclass 2025-12-04T08:54:18.6485100Z * [new branch] hameerabbasi/fix-ctensor-gradcheck-tests -> origin/hameerabbasi/fix-ctensor-gradcheck-tests 2025-12-04T08:54:18.6485207Z * [new branch] hameerabbasi/gradcheck-allclose -> origin/hameerabbasi/gradcheck-allclose 2025-12-04T08:54:18.6485275Z * [new branch] hc_baseline -> origin/hc_baseline 2025-12-04T08:54:18.6485338Z * [new branch] hhh_rand -> origin/hhh_rand 2025-12-04T08:54:18.6485400Z * [new branch] huba/f1 -> origin/huba/f1 2025-12-04T08:54:18.6485587Z * [new branch] increase-timeout-linux-jammy-cuda12_8-py3_10-gcc11-test -> origin/increase-timeout-linux-jammy-cuda12_8-py3_10-gcc11-test 2025-12-04T08:54:18.6485649Z * [new branch] inlining -> origin/inlining 2025-12-04T08:54:18.6485720Z * [new branch] inlining-ezyang -> origin/inlining-ezyang 2025-12-04T08:54:18.6485805Z * [new branch] install-torchao-0.13.0 -> origin/install-torchao-0.13.0 2025-12-04T08:54:18.6485980Z * [new branch] instrument-trunk-pull-linux-with-job-test-filters -> origin/instrument-trunk-pull-linux-with-job-test-filters 2025-12-04T08:54:18.6486050Z * [new branch] invoke-subgraph -> origin/invoke-subgraph 2025-12-04T08:54:18.6486118Z * [new branch] issue#58739 -> origin/issue#58739 2025-12-04T08:54:18.6486198Z * [new branch] jainapurva-patch-1 -> origin/jainapurva-patch-1 2025-12-04T08:54:18.6486260Z * [new branch] jathu/o3 -> origin/jathu/o3 2025-12-04T08:54:18.6486325Z * [new branch] jathu/sve -> origin/jathu/sve 2025-12-04T08:54:18.6486446Z * [new branch] jcaip/test-cusparselt-version-0.6.2 -> origin/jcaip/test-cusparselt-version-0.6.2 2025-12-04T08:54:18.6486550Z * [new branch] jcaip/update-cusparselt-0.6.2 -> origin/jcaip/update-cusparselt-0.6.2 2025-12-04T08:54:18.6486662Z * [new branch] jiannanWang/memorysnapshot_filter -> origin/jiannanWang/memorysnapshot_filter 2025-12-04T08:54:18.6486770Z * [new branch] jiannanWang/profilerstepwarning -> origin/jiannanWang/profilerstepwarning 2025-12-04T08:54:18.6486874Z * [new branch] jithunnair-amd-patch-1 -> origin/jithunnair-amd-patch-1 2025-12-04T08:54:18.6486960Z * [new branch] jithunnair-amd-patch-10 -> origin/jithunnair-amd-patch-10 2025-12-04T08:54:18.6487043Z * [new branch] jithunnair-amd-patch-2 -> origin/jithunnair-amd-patch-2 2025-12-04T08:54:18.6487123Z * [new branch] jithunnair-amd-patch-3 -> origin/jithunnair-amd-patch-3 2025-12-04T08:54:18.6487204Z * [new branch] jithunnair-amd-patch-4 -> origin/jithunnair-amd-patch-4 2025-12-04T08:54:18.6487283Z * [new branch] jithunnair-amd-patch-5 -> origin/jithunnair-amd-patch-5 2025-12-04T08:54:18.6487362Z * [new branch] jithunnair-amd-patch-6 -> origin/jithunnair-amd-patch-6 2025-12-04T08:54:18.6487440Z * [new branch] jithunnair-amd-patch-7 -> origin/jithunnair-amd-patch-7 2025-12-04T08:54:18.6487518Z * [new branch] jithunnair-amd-patch-8 -> origin/jithunnair-amd-patch-8 2025-12-04T08:54:18.6487599Z * [new branch] jithunnair-amd-patch-9 -> origin/jithunnair-amd-patch-9 2025-12-04T08:54:18.6487676Z * [new branch] justinchu/native-qdq -> origin/justinchu/native-qdq 2025-12-04T08:54:18.6487748Z * [new branch] kainan666/xlf_debug -> origin/kainan666/xlf_debug 2025-12-04T08:54:18.6487834Z * [new branch] kainan_test -> origin/kainan_test 2025-12-04T08:54:18.6487912Z * [new branch] larryliu0820-patch-1 -> origin/larryliu0820-patch-1 2025-12-04T08:54:18.6488017Z * [new branch] leslie/test_group_gemm_epilogues -> origin/leslie/test_group_gemm_epilogues 2025-12-04T08:54:18.6488168Z * [new branch] lessw2020/fix_cutlass_cache_error -> origin/lessw2020/fix_cutlass_cache_error 2025-12-04T08:54:18.6488249Z * [new branch] liaoxuan/shm_all_reduce -> origin/liaoxuan/shm_all_reduce 2025-12-04T08:54:18.6488351Z * [new branch] liaoxuan/test_fa_disable_softmax -> origin/liaoxuan/test_fa_disable_softmax 2025-12-04T08:54:18.6488430Z * [new branch] liaoxuan/test_int8_sdpa -> origin/liaoxuan/test_int8_sdpa 2025-12-04T08:54:18.6488498Z * [new branch] llama4-stable -> origin/llama4-stable 2025-12-04T08:54:18.6488565Z * [new branch] lts/release/1.8 -> origin/lts/release/1.8 2025-12-04T08:54:18.6488641Z * [new branch] lucaskabela/#94773 -> origin/lucaskabela/#94773 2025-12-04T08:54:18.6488716Z * [new branch] lucaskabela/fix_164876 -> origin/lucaskabela/fix_164876 2025-12-04T08:54:18.6488799Z * [new branch] lucaskabela/flop_counter -> origin/lucaskabela/flop_counter 2025-12-04T08:54:18.6488896Z * [new branch] lucaskabela/func_under_decomp -> origin/lucaskabela/func_under_decomp 2025-12-04T08:54:18.6488999Z * [new branch] lucaskabela/functional_in_dynamo -> origin/lucaskabela/functional_in_dynamo 2025-12-04T08:54:18.6489126Z * [new branch] lucaskabela/install_params_as_graph_attr -> origin/lucaskabela/install_params_as_graph_attr 2025-12-04T08:54:18.6489240Z * [new branch] lucaskabela/parameters_as_graph_attr -> origin/lucaskabela/parameters_as_graph_attr 2025-12-04T08:54:18.6489373Z * [new branch] lucaskabela/remove_aot_dispatcher_metadata -> origin/lucaskabela/remove_aot_dispatcher_metadata 2025-12-04T08:54:18.6489452Z * [new branch] lucaskabela/rnn_decomp -> origin/lucaskabela/rnn_decomp 2025-12-04T08:54:18.6489544Z * [new branch] lucaskabela/typing_backends -> origin/lucaskabela/typing_backends 2025-12-04T08:54:18.6489641Z * [new branch] lucaskabela/typing_ctx_manager -> origin/lucaskabela/typing_ctx_manager 2025-12-04T08:54:18.6489736Z * [new branch] lucaskabela/typing_nn_module -> origin/lucaskabela/typing_nn_module 2025-12-04T08:54:18.6489875Z * [new branch] lucaskabela/typing_user_defined -> origin/lucaskabela/typing_user_defined 2025-12-04T08:54:18.6489969Z * [new branch] lucaskabela/typing_variables -> origin/lucaskabela/typing_variables 2025-12-04T08:54:18.6490078Z * [new branch] lucaskabela/typing_variables_dicts -> origin/lucaskabela/typing_variables_dicts 2025-12-04T08:54:18.6490200Z * [new branch] lucaskabela/typing_variables_functions -> origin/lucaskabela/typing_variables_functions 2025-12-04T08:54:18.6490306Z * [new branch] lucaskabela/typing_variables_lists -> origin/lucaskabela/typing_variables_lists 2025-12-04T08:54:18.6490379Z * [new branch] lw/torch_box_by_ref -> origin/lw/torch_box_by_ref 2025-12-04T08:54:18.6490439Z * [new branch] main -> origin/main 2025-12-04T08:54:18.6490511Z * [new branch] malfet-patch-1 -> origin/malfet-patch-1 2025-12-04T08:54:18.6490581Z * [new branch] malfet-patch-2 -> origin/malfet-patch-2 2025-12-04T08:54:18.6490647Z * [new branch] malfet-patch-3 -> origin/malfet-patch-3 2025-12-04T08:54:18.6490714Z * [new branch] malfet-patch-4 -> origin/malfet-patch-4 2025-12-04T08:54:18.6490780Z * [new branch] malfet-patch-5 -> origin/malfet-patch-5 2025-12-04T08:54:18.6490877Z * [new branch] malfet-patch-6 -> origin/malfet-patch-6 2025-12-04T08:54:18.6490944Z * [new branch] malfet-patch-7 -> origin/malfet-patch-7 2025-12-04T08:54:18.6491008Z * [new branch] malfet-patch-8 -> origin/malfet-patch-8 2025-12-04T08:54:18.6491082Z * [new branch] malfet/add-3.14-ci -> origin/malfet/add-3.14-ci 2025-12-04T08:54:18.6491243Z * [new branch] malfet/be-do-not-make-typos-in-build-artifacts -> origin/malfet/be-do-not-make-typos-in-build-artifacts 2025-12-04T08:54:18.6491411Z * [new branch] malfet/be-move-more-settings-to-checkout-pytorch -> origin/malfet/be-move-more-settings-to-checkout-pytorch 2025-12-04T08:54:18.6491538Z * [new branch] malfet/be-remove-misisng-neon-headers -> origin/malfet/be-remove-misisng-neon-headers 2025-12-04T08:54:18.6491636Z * [new branch] malfet/mps-implement-col2im -> origin/malfet/mps-implement-col2im 2025-12-04T08:54:18.6491753Z * [new branch] manuel/aoti_metal_shimify-thread_safe -> origin/manuel/aoti_metal_shimify-thread_safe 2025-12-04T08:54:18.6491845Z * [new branch] manuel/inductor_link_openmp -> origin/manuel/inductor_link_openmp 2025-12-04T08:54:18.6491921Z * [new branch] masnesral/metaconda -> origin/masnesral/metaconda 2025-12-04T08:54:18.6491997Z * [new branch] mem_profiler_flaky_fix -> origin/mem_profiler_flaky_fix 2025-12-04T08:54:18.6492077Z * [new branch] mem_profiler_stack_trace -> origin/mem_profiler_stack_trace 2025-12-04T08:54:18.6492154Z * [new branch] memory_profiler_stack -> origin/memory_profiler_stack 2025-12-04T08:54:18.6492227Z * [new branch] metascroy-patch-1 -> origin/metascroy-patch-1 2025-12-04T08:54:18.6492292Z * [new branch] mingw_posix -> origin/mingw_posix 2025-12-04T08:54:18.6492366Z * [new branch] mlazos/S429861-debug -> origin/mlazos/S429861-debug 2025-12-04T08:54:18.6492428Z * [new branch] mlazos/aa -> origin/mlazos/aa 2025-12-04T08:54:18.6492491Z * [new branch] mlazos/acts -> origin/mlazos/acts 2025-12-04T08:54:18.6492562Z * [new branch] mlazos/arg-renames -> origin/mlazos/arg-renames 2025-12-04T08:54:18.6492639Z * [new branch] mlazos/bad-cudagraphs -> origin/mlazos/bad-cudagraphs 2025-12-04T08:54:18.6492740Z * [new branch] mlazos/baseline-graph-breaks -> origin/mlazos/baseline-graph-breaks 2025-12-04T08:54:18.6492831Z * [new branch] mlazos/beta-tensor -> origin/mlazos/beta-tensor 2025-12-04T08:54:18.6492897Z * [new branch] mlazos/buffers -> origin/mlazos/buffers 2025-12-04T08:54:18.6492967Z * [new branch] mlazos/buffers2 -> origin/mlazos/buffers2 2025-12-04T08:54:18.6493033Z * [new branch] mlazos/buffers3 -> origin/mlazos/buffers3 2025-12-04T08:54:18.6493098Z * [new branch] mlazos/bwd -> origin/mlazos/bwd 2025-12-04T08:54:18.6493171Z * [new branch] mlazos/combo-test -> origin/mlazos/combo-test 2025-12-04T08:54:18.6493244Z * [new branch] mlazos/ctx-cleanup -> origin/mlazos/ctx-cleanup 2025-12-04T08:54:18.6493318Z * [new branch] mlazos/cuda-cmd-log -> origin/mlazos/cuda-cmd-log 2025-12-04T08:54:18.6493400Z * [new branch] mlazos/cudagraph-tests -> origin/mlazos/cudagraph-tests 2025-12-04T08:54:18.6493503Z * [new branch] mlazos/cudagraphs-measurement -> origin/mlazos/cudagraphs-measurement 2025-12-04T08:54:18.6493576Z * [new branch] mlazos/cutlass-test -> origin/mlazos/cutlass-test 2025-12-04T08:54:18.6493657Z * [new branch] mlazos/cutlass-topo-bug -> origin/mlazos/cutlass-topo-bug 2025-12-04T08:54:18.6493755Z * [new branch] mlazos/dataclass-proxy -> origin/mlazos/dataclass-proxy 2025-12-04T08:54:18.6493824Z * [new branch] mlazos/dc-attrs -> origin/mlazos/dc-attrs 2025-12-04T08:54:18.6493893Z * [new branch] mlazos/dc-helion -> origin/mlazos/dc-helion 2025-12-04T08:54:18.6493959Z * [new branch] mlazos/dict-fix -> origin/mlazos/dict-fix 2025-12-04T08:54:18.6494031Z * [new branch] mlazos/disable-tf -> origin/mlazos/disable-tf 2025-12-04T08:54:18.6494097Z * [new branch] mlazos/dupe-fix -> origin/mlazos/dupe-fix 2025-12-04T08:54:18.6494166Z * [new branch] mlazos/dyn-batch -> origin/mlazos/dyn-batch 2025-12-04T08:54:18.6494230Z * [new branch] mlazos/evt -> origin/mlazos/evt 2025-12-04T08:54:18.6494310Z * [new branch] mlazos/extract-examples -> origin/mlazos/extract-examples 2025-12-04T08:54:18.6494379Z * [new branch] mlazos/foreach-op -> origin/mlazos/foreach-op 2025-12-04T08:54:18.6494444Z * [new branch] mlazos/fp8 -> origin/mlazos/fp8 2025-12-04T08:54:18.6494511Z * [new branch] mlazos/fp8-bias -> origin/mlazos/fp8-bias 2025-12-04T08:54:18.6494588Z * [new branch] mlazos/fp8-bias-fusion -> origin/mlazos/fp8-bias-fusion 2025-12-04T08:54:18.6494658Z * [new branch] mlazos/fp8-fixes -> origin/mlazos/fp8-fixes 2025-12-04T08:54:18.6494722Z * [new branch] mlazos/freezing -> origin/mlazos/freezing 2025-12-04T08:54:18.6494790Z * [new branch] mlazos/h-comp -> origin/mlazos/h-comp 2025-12-04T08:54:18.6494857Z * [new branch] mlazos/h-comp2 -> origin/mlazos/h-comp2 2025-12-04T08:54:18.6494923Z * [new branch] mlazos/hash-hop -> origin/mlazos/hash-hop 2025-12-04T08:54:18.6494984Z * [new branch] mlazos/hc -> origin/mlazos/hc 2025-12-04T08:54:18.6495055Z * [new branch] mlazos/hc-cycles -> origin/mlazos/hc-cycles 2025-12-04T08:54:18.6495121Z * [new branch] mlazos/hc-fixes -> origin/mlazos/hc-fixes 2025-12-04T08:54:18.6495188Z * [new branch] mlazos/hc-fixes3 -> origin/mlazos/hc-fixes3 2025-12-04T08:54:18.6495256Z * [new branch] mlazos/hc-fixes4 -> origin/mlazos/hc-fixes4 2025-12-04T08:54:18.6495320Z * [new branch] mlazos/hc-hf -> origin/mlazos/hc-hf 2025-12-04T08:54:18.6495384Z * [new branch] mlazos/hc-mut -> origin/mlazos/hc-mut 2025-12-04T08:54:18.6495467Z * [new branch] mlazos/hc10 -> origin/mlazos/hc10 2025-12-04T08:54:18.6495528Z * [new branch] mlazos/hc11 -> origin/mlazos/hc11 2025-12-04T08:54:18.6495589Z * [new branch] mlazos/hc12 -> origin/mlazos/hc12 2025-12-04T08:54:18.6495652Z * [new branch] mlazos/hc13 -> origin/mlazos/hc13 2025-12-04T08:54:18.6495712Z * [new branch] mlazos/hc14 -> origin/mlazos/hc14 2025-12-04T08:54:18.6495772Z * [new branch] mlazos/hc15 -> origin/mlazos/hc15 2025-12-04T08:54:18.6495832Z * [new branch] mlazos/hc2 -> origin/mlazos/hc2 2025-12-04T08:54:18.6495893Z * [new branch] mlazos/hc4 -> origin/mlazos/hc4 2025-12-04T08:54:18.6495954Z * [new branch] mlazos/hc5 -> origin/mlazos/hc5 2025-12-04T08:54:18.6496016Z * [new branch] mlazos/hc6 -> origin/mlazos/hc6 2025-12-04T08:54:18.6496074Z * [new branch] mlazos/hc7 -> origin/mlazos/hc7 2025-12-04T08:54:18.6496134Z * [new branch] mlazos/hc8 -> origin/mlazos/hc8 2025-12-04T08:54:18.6496193Z * [new branch] mlazos/hc9 -> origin/mlazos/hc9 2025-12-04T08:54:18.6496283Z * [new branch] mlazos/hc_baseline2 -> origin/mlazos/hc_baseline2 2025-12-04T08:54:18.6496366Z * [new branch] mlazos/inductor-streams -> origin/mlazos/inductor-streams 2025-12-04T08:54:18.6496427Z * [new branch] mlazos/main -> origin/mlazos/main 2025-12-04T08:54:18.6496488Z * [new branch] mlazos/mcg2 -> origin/mlazos/mcg2 2025-12-04T08:54:18.6496564Z * [new branch] mlazos/meta-guards -> origin/mlazos/meta-guards 2025-12-04T08:54:18.6496665Z * [new branch] mlazos/mlazos/foreach-map-adam -> origin/mlazos/mlazos/foreach-map-adam 2025-12-04T08:54:18.6496762Z * [new branch] mlazos/mlazos/tf-mode-backup -> origin/mlazos/mlazos/tf-mode-backup 2025-12-04T08:54:18.6496831Z * [new branch] mlazos/mod-fix -> origin/mlazos/mod-fix 2025-12-04T08:54:18.6496897Z * [new branch] mlazos/mode-fix -> origin/mlazos/mode-fix 2025-12-04T08:54:18.6496964Z * [new branch] mlazos/offsets -> origin/mlazos/offsets 2025-12-04T08:54:18.6497038Z * [new branch] mlazos/overguarding -> origin/mlazos/overguarding 2025-12-04T08:54:18.6497111Z * [new branch] mlazos/proxy-ctors -> origin/mlazos/proxy-ctors 2025-12-04T08:54:18.6497179Z * [new branch] mlazos/quant-fix -> origin/mlazos/quant-fix 2025-12-04T08:54:18.6497249Z * [new branch] mlazos/resnet-fix -> origin/mlazos/resnet-fix 2025-12-04T08:54:18.6497321Z * [new branch] mlazos/rm-buf-names -> origin/mlazos/rm-buf-names 2025-12-04T08:54:18.6497389Z * [new branch] mlazos/rm-code -> origin/mlazos/rm-code 2025-12-04T08:54:18.6497456Z * [new branch] mlazos/rm-spam -> origin/mlazos/rm-spam 2025-12-04T08:54:18.6497518Z * [new branch] mlazos/rtp -> origin/mlazos/rtp 2025-12-04T08:54:18.6497599Z * [new branch] mlazos/static-idx-dbg -> origin/mlazos/static-idx-dbg 2025-12-04T08:54:18.6497683Z * [new branch] mlazos/static-inputs-log -> origin/mlazos/static-inputs-log 2025-12-04T08:54:18.6497746Z * [new branch] mlazos/stests -> origin/mlazos/stests 2025-12-04T08:54:18.6497818Z * [new branch] mlazos/stream-ops -> origin/mlazos/stream-ops 2025-12-04T08:54:18.6497883Z * [new branch] mlazos/td-fix2 -> origin/mlazos/td-fix2 2025-12-04T08:54:18.6497961Z * [new branch] mlazos/tensor-hasattr2 -> origin/mlazos/tensor-hasattr2 2025-12-04T08:54:18.6498047Z * [new branch] mlazos/test -> origin/mlazos/test 2025-12-04T08:54:18.6498156Z * [new branch] mlazos/tf-mode -> origin/mlazos/tf-mode 2025-12-04T08:54:18.6498236Z * [new branch] mlazos/tf-mode-backup2 -> origin/mlazos/tf-mode-backup2 2025-12-04T08:54:18.6498314Z * [new branch] mlazos/tf-mode-reland -> origin/mlazos/tf-mode-reland 2025-12-04T08:54:18.6498390Z * [new branch] mlazos/tf-mode-reland2 -> origin/mlazos/tf-mode-reland2 2025-12-04T08:54:18.6498464Z * [new branch] mlazos/tf-mode-reland3 -> origin/mlazos/tf-mode-reland3 2025-12-04T08:54:18.6498542Z * [new branch] mlazos/triton-no-epi -> origin/mlazos/triton-no-epi 2025-12-04T08:54:18.6498612Z * [new branch] mlazos/tune-proto -> origin/mlazos/tune-proto 2025-12-04T08:54:18.6498684Z * [new branch] mlazos/tuple-fixes -> origin/mlazos/tuple-fixes 2025-12-04T08:54:18.6498760Z * [new branch] mlazos/tuple-fixes2 -> origin/mlazos/tuple-fixes2 2025-12-04T08:54:18.6498835Z * [new branch] mlazos/tuple-handling -> origin/mlazos/tuple-handling 2025-12-04T08:54:18.6498914Z * [new branch] mlazos/user-stream-base -> origin/mlazos/user-stream-base 2025-12-04T08:54:18.6499014Z * [new branch] mlazos/user-streams -> origin/mlazos/user-streams 2025-12-04T08:54:18.6499107Z * [new branch] mlazos/user-streams-backup -> origin/mlazos/user-streams-backup 2025-12-04T08:54:18.6499202Z * [new branch] mlazos/user-streams-backup2 -> origin/mlazos/user-streams-backup2 2025-12-04T08:54:18.6499270Z * [new branch] mlazos/vary-beta -> origin/mlazos/vary-beta 2025-12-04T08:54:18.6499340Z * [new branch] mlazos/vary-beta2 -> origin/mlazos/vary-beta2 2025-12-04T08:54:18.6499413Z * [new branch] mlazos/weird-perf1 -> origin/mlazos/weird-perf1 2025-12-04T08:54:18.6499488Z * [new branch] mm_out_dtype_compile -> origin/mm_out_dtype_compile 2025-12-04T08:54:18.6499552Z * [new branch] module-shim -> origin/module-shim 2025-12-04T08:54:18.6499615Z * [new branch] move_config -> origin/move_config 2025-12-04T08:54:18.6499685Z * [new branch] msaroufim/reduce -> origin/msaroufim/reduce 2025-12-04T08:54:18.6499753Z * [new branch] mtia/basic-cmake -> origin/mtia/basic-cmake 2025-12-04T08:54:18.6499856Z * [new branch] mwizak/fix-triton-block-shape -> origin/mwizak/fix-triton-block-shape 2025-12-04T08:54:18.6499922Z * [new branch] my_varlen_backup -> origin/my_varlen_backup 2025-12-04T08:54:18.6499996Z * [new branch] nativert_num_outputs -> origin/nativert_num_outputs 2025-12-04T08:54:18.6500059Z * [new branch] new-codegen -> origin/new-codegen 2025-12-04T08:54:18.6500126Z * [new branch] newtest-base -> origin/newtest-base 2025-12-04T08:54:18.6500196Z * [new branch] ngimel/addmm_dtype -> origin/ngimel/addmm_dtype 2025-12-04T08:54:18.6500262Z * [new branch] ngimel/div_inv -> origin/ngimel/div_inv 2025-12-04T08:54:18.6500340Z * [new branch] ngimel/error_index_list -> origin/ngimel/error_index_list 2025-12-04T08:54:18.6500410Z * [new branch] ngimel/gather_grid -> origin/ngimel/gather_grid 2025-12-04T08:54:18.6500498Z * [new branch] ngimel/gather_grid_release -> origin/ngimel/gather_grid_release 2025-12-04T08:54:18.6500563Z * [new branch] ngimel/gg_new -> origin/ngimel/gg_new 2025-12-04T08:54:18.6500629Z * [new branch] ngimel/hostalloc -> origin/ngimel/hostalloc 2025-12-04T08:54:18.6500699Z * [new branch] ngimel/storage_id -> origin/ngimel/storage_id 2025-12-04T08:54:18.6500797Z * [new branch] nightly -> origin/nightly 2025-12-04T08:54:18.6500914Z * [new branch] nikitaved/addmm_1_rowcol_lt_path_check -> origin/nikitaved/addmm_1_rowcol_lt_path_check 2025-12-04T08:54:18.6501035Z * [new branch] nikitaved/addmm_epilogue_fusions_2d_bias -> origin/nikitaved/addmm_epilogue_fusions_2d_bias 2025-12-04T08:54:18.6501163Z * [new branch] nikitaved/addmm_epilogue_fusions_inductor -> origin/nikitaved/addmm_epilogue_fusions_inductor 2025-12-04T08:54:18.6501285Z * [new branch] nikitaved/addmm_epilogue_fusions_scratch -> origin/nikitaved/addmm_epilogue_fusions_scratch 2025-12-04T08:54:18.6501400Z * [new branch] nikitaved/grad_addmm_epilogue_fusions -> origin/nikitaved/grad_addmm_epilogue_fusions 2025-12-04T08:54:18.6501510Z * [new branch] nikitaved/simpler_can_use_32bit_index -> origin/nikitaved/simpler_can_use_32bit_index 2025-12-04T08:54:18.6501581Z * [new branch] nikitaved/test -> origin/nikitaved/test 2025-12-04T08:54:18.6501705Z * [new branch] nmacchioni-perf-test-async-autotune -> origin/nmacchioni-perf-test-async-autotune 2025-12-04T08:54:18.6501782Z * [new branch] no_distributed_log_spew -> origin/no_distributed_log_spew 2025-12-04T08:54:18.6501866Z * [new branch] nofun-hack -> origin/nofun-hack 2025-12-04T08:54:18.6501928Z * [new branch] norm_bench -> origin/norm_bench 2025-12-04T08:54:18.6502004Z * [new branch] nullplay/fuse_matmul -> origin/nullplay/fuse_matmul 2025-12-04T08:54:18.6502079Z * [new branch] nullplay_fuse_matmul -> origin/nullplay_fuse_matmul 2025-12-04T08:54:18.6502146Z * [new branch] optimizer_test -> origin/optimizer_test 2025-12-04T08:54:18.6502214Z * [new branch] orig/release/1.10 -> origin/orig/release/1.10 2025-12-04T08:54:18.6502285Z * [new branch] orig/release/1.11 -> origin/orig/release/1.11 2025-12-04T08:54:18.6502351Z * [new branch] orig/release/1.12 -> origin/orig/release/1.12 2025-12-04T08:54:18.6502418Z * [new branch] orig/release/1.13 -> origin/orig/release/1.13 2025-12-04T08:54:18.6502484Z * [new branch] orig/release/1.6 -> origin/orig/release/1.6 2025-12-04T08:54:18.6502550Z * [new branch] orig/release/1.7 -> origin/orig/release/1.7 2025-12-04T08:54:18.6502616Z * [new branch] orig/release/1.8 -> origin/orig/release/1.8 2025-12-04T08:54:18.6502682Z * [new branch] orig/release/1.9 -> origin/orig/release/1.9 2025-12-04T08:54:18.6502747Z * [new branch] orig/release/2.0 -> origin/orig/release/2.0 2025-12-04T08:54:18.6502812Z * [new branch] orig/release/2.1 -> origin/orig/release/2.1 2025-12-04T08:54:18.6502877Z * [new branch] orig/release/2.2 -> origin/orig/release/2.2 2025-12-04T08:54:18.6502942Z * [new branch] orig/release/2.3 -> origin/orig/release/2.3 2025-12-04T08:54:18.6503008Z * [new branch] orig/release/2.4 -> origin/orig/release/2.4 2025-12-04T08:54:18.6503072Z * [new branch] orig/release/2.5 -> origin/orig/release/2.5 2025-12-04T08:54:18.6503138Z * [new branch] orig/release/2.6 -> origin/orig/release/2.6 2025-12-04T08:54:18.6503205Z * [new branch] orig/release/2.7 -> origin/orig/release/2.7 2025-12-04T08:54:18.6503269Z * [new branch] orig/release/2.8 -> origin/orig/release/2.8 2025-12-04T08:54:18.6503333Z * [new branch] orig/release/2.9 -> origin/orig/release/2.9 2025-12-04T08:54:18.6503419Z * [new branch] origin/gh/fxdawnn/1/base -> origin/origin/gh/fxdawnn/1/base 2025-12-04T08:54:18.6503525Z * [new branch] origin/gh/fxdawnn/1/orig -> origin/origin/gh/fxdawnn/1/orig 2025-12-04T08:54:18.6503607Z * [new branch] origin/gh/zpcore/14/orig -> origin/origin/gh/zpcore/14/orig 2025-12-04T08:54:18.6503677Z * [new branch] oulgen-patch-1 -> origin/oulgen-patch-1 2025-12-04T08:54:18.6503747Z * [new branch] oulgen-patch-2 -> origin/oulgen-patch-2 2025-12-04T08:54:18.6503816Z * [new branch] oulgen-patch-3 -> origin/oulgen-patch-3 2025-12-04T08:54:18.6503882Z * [new branch] oulgen-patch-4 -> origin/oulgen-patch-4 2025-12-04T08:54:18.6503950Z * [new branch] padded-tensor -> origin/padded-tensor 2025-12-04T08:54:18.6504013Z * [new branch] pca2 -> origin/pca2 2025-12-04T08:54:18.6504085Z * [new branch] per_channel_backup -> origin/per_channel_backup 2025-12-04T08:54:18.6504150Z * [new branch] perf_ops -> origin/perf_ops 2025-12-04T08:54:18.6504215Z * [new branch] perf_ops_2_9 -> origin/perf_ops_2_9 2025-12-04T08:54:18.6504287Z * [new branch] pianpwk-patch-1 -> origin/pianpwk-patch-1 2025-12-04T08:54:18.6504374Z * [new branch] pianpwk/__draft_debug_mode -> origin/pianpwk/__draft_debug_mode 2025-12-04T08:54:18.6504503Z * [new branch] pianpwk/_debug_mode_for_triton_draft -> origin/pianpwk/_debug_mode_for_triton_draft 2025-12-04T08:54:18.6504605Z * [new branch] pianpwk/_debug_nn_module_compile -> origin/pianpwk/_debug_nn_module_compile 2025-12-04T08:54:18.6504692Z * [new branch] pianpwk/_draft_triton_11_3 -> origin/pianpwk/_draft_triton_11_3 2025-12-04T08:54:18.6504783Z * [new branch] pianpwk/_manual_bucket_draft -> origin/pianpwk/_manual_bucket_draft 2025-12-04T08:54:18.6504884Z * [new branch] pianpwk/_profile_w_dispatch_keys -> origin/pianpwk/_profile_w_dispatch_keys 2025-12-04T08:54:18.6504983Z * [new branch] pianpwk/_super_draft_debug_mode -> origin/pianpwk/_super_draft_debug_mode 2025-12-04T08:54:18.6505086Z * [new branch] pianpwk/_unbacked_local_shard_size -> origin/pianpwk/_unbacked_local_shard_size 2025-12-04T08:54:18.6505163Z * [new branch] pianpwk/anomaly_tb -> origin/pianpwk/anomaly_tb 2025-12-04T08:54:18.6505244Z * [new branch] pianpwk/auto_fx_annotate -> origin/pianpwk/auto_fx_annotate 2025-12-04T08:54:18.6505355Z * [new branch] pianpwk/backed_size_oblivious_export -> origin/pianpwk/backed_size_oblivious_export 2025-12-04T08:54:18.6505443Z * [new branch] pianpwk/bert_dynamic_perf -> origin/pianpwk/bert_dynamic_perf 2025-12-04T08:54:18.6505539Z * [new branch] pianpwk/debug_fwd_stack_traces -> origin/pianpwk/debug_fwd_stack_traces 2025-12-04T08:54:18.6505624Z * [new branch] pianpwk/debug_hash_tensor -> origin/pianpwk/debug_hash_tensor 2025-12-04T08:54:18.6505716Z * [new branch] pianpwk/debug_mode_annotate -> origin/pianpwk/debug_mode_annotate 2025-12-04T08:54:18.6505804Z * [new branch] pianpwk/debug_mode_defaults -> origin/pianpwk/debug_mode_defaults 2025-12-04T08:54:18.6505884Z * [new branch] pianpwk/debug_mode_hacks -> origin/pianpwk/debug_mode_hacks 2025-12-04T08:54:18.6505993Z * [new branch] pianpwk/debug_mode_opcall_refactor -> origin/pianpwk/debug_mode_opcall_refactor 2025-12-04T08:54:18.6506081Z * [new branch] pianpwk/debug_mode_show_ids -> origin/pianpwk/debug_mode_show_ids 2025-12-04T08:54:18.6506164Z * [new branch] pianpwk/debug_mode_triton -> origin/pianpwk/debug_mode_triton 2025-12-04T08:54:18.6506260Z * [new branch] pianpwk/debug_show_stack_trace -> origin/pianpwk/debug_show_stack_trace 2025-12-04T08:54:18.6506360Z * [new branch] pianpwk/debug_wait_on_collective -> origin/pianpwk/debug_wait_on_collective 2025-12-04T08:54:18.6506478Z * [new branch] pianpwk/debugmode_compile_tf -> origin/pianpwk/debugmode_compile_tf 2025-12-04T08:54:18.6506602Z * [new branch] pianpwk/dispatch_key_debugging_for_debug -> origin/pianpwk/dispatch_key_debugging_for_debug 2025-12-04T08:54:18.6506707Z * [new branch] pianpwk/draft_debug_mode_tfcompile -> origin/pianpwk/draft_debug_mode_tfcompile 2025-12-04T08:54:18.6506802Z * [new branch] pianpwk/draft_multikernel_nn -> origin/pianpwk/draft_multikernel_nn 2025-12-04T08:54:18.6506916Z * [new branch] pianpwk/draft_multikernel_status_10_5 -> origin/pianpwk/draft_multikernel_status_10_5 2025-12-04T08:54:18.6507008Z * [new branch] pianpwk/dtensor_custom_chunk -> origin/pianpwk/dtensor_custom_chunk 2025-12-04T08:54:18.6507112Z * [new branch] pianpwk/dtensor_unbacked_keypath -> origin/pianpwk/dtensor_unbacked_keypath 2025-12-04T08:54:18.6507192Z * [new branch] pianpwk/event_list_tree -> origin/pianpwk/event_list_tree 2025-12-04T08:54:18.6507272Z * [new branch] pianpwk/false_numel_refs -> origin/pianpwk/false_numel_refs 2025-12-04T08:54:18.6507351Z * [new branch] pianpwk/maybe_guard_rel -> origin/pianpwk/maybe_guard_rel 2025-12-04T08:54:18.6507473Z * [new branch] pianpwk/multikernel_hints_draft -> origin/pianpwk/multikernel_hints_draft 2025-12-04T08:54:18.6507582Z * [new branch] pianpwk/no_size_oblivious_slice_scat -> origin/pianpwk/no_size_oblivious_slice_scat 2025-12-04T08:54:18.6507698Z * [new branch] pianpwk/oblivious_reshape_view_better -> origin/pianpwk/oblivious_reshape_view_better 2025-12-04T08:54:18.6507780Z * [new branch] pianpwk/pre_forward_hook -> origin/pianpwk/pre_forward_hook 2025-12-04T08:54:18.6507887Z * [new branch] pianpwk/skip_python_keys_alternate -> origin/pianpwk/skip_python_keys_alternate 2025-12-04T08:54:18.6507993Z * [new branch] pianpwk/skip_python_keys_in_guards -> origin/pianpwk/skip_python_keys_in_guards 2025-12-04T08:54:18.6508125Z * [new branch] pianpwk/sym_tokens_draft -> origin/pianpwk/sym_tokens_draft 2025-12-04T08:54:18.6508204Z * [new branch] pianpwk/symint_one_hot -> origin/pianpwk/symint_one_hot 2025-12-04T08:54:18.6508318Z * [new branch] pianpwk/test_pointwise_guard_or_false -> origin/pianpwk/test_pointwise_guard_or_false 2025-12-04T08:54:18.6508416Z * [new branch] pianpwk/totally_draft_sym_wrap -> origin/pianpwk/totally_draft_sym_wrap 2025-12-04T08:54:18.6508503Z * [new branch] pianpwk/try_dumb_stuff -> origin/pianpwk/try_dumb_stuff 2025-12-04T08:54:18.6508582Z * [new branch] pianpwk/try_dumb_stuff_2 -> origin/pianpwk/try_dumb_stuff_2 2025-12-04T08:54:18.6508673Z * [new branch] pianpwk/unbacked_dtensor_mm -> origin/pianpwk/unbacked_dtensor_mm 2025-12-04T08:54:18.6508771Z * [new branch] pianpwk/unbacked_tracing_12_2 -> origin/pianpwk/unbacked_tracing_12_2 2025-12-04T08:54:18.6508853Z * [new branch] pianpwk/user_symints -> origin/pianpwk/user_symints 2025-12-04T08:54:18.6508931Z * [new branch] pianpwk/wan21_reshape -> origin/pianpwk/wan21_reshape 2025-12-04T08:54:18.6509026Z * [new branch] piz/fix_partial_backward_1112 -> origin/piz/fix_partial_backward_1112 2025-12-04T08:54:18.6509099Z * [new branch] piz/prop_cache_clean -> origin/piz/prop_cache_clean 2025-12-04T08:54:18.6509167Z * [new branch] pool-separate -> origin/pool-separate 2025-12-04T08:54:18.6509232Z * [new branch] pr-156087 -> origin/pr-156087 2025-12-04T08:54:18.6509291Z * [new branch] pr/131860 -> origin/pr/131860 2025-12-04T08:54:18.6509361Z * [new branch] predispatch_to -> origin/predispatch_to 2025-12-04T08:54:18.6509459Z * [new branch] protect-c17 -> origin/protect-c17 2025-12-04T08:54:18.6509525Z * [new branch] pt-opt-cuda3 -> origin/pt-opt-cuda3 2025-12-04T08:54:18.6509606Z * [new branch] python_compiled_autograd -> origin/python_compiled_autograd 2025-12-04T08:54:18.6509736Z * [new branch] q1l1/fix_device_moved_constant_type_unknown -> origin/q1l1/fix_device_moved_constant_type_unknown 2025-12-04T08:54:18.6509873Z * [new branch] q1l1/fix_wrong_default_type_for_kernel_call_args -> origin/q1l1/fix_wrong_default_type_for_kernel_call_args 2025-12-04T08:54:18.6509954Z * [new branch] qchip/export-D54134695 -> origin/qchip/export-D54134695 2025-12-04T08:54:18.6510029Z * [new branch] quote-pytest_cache -> origin/quote-pytest_cache 2025-12-04T08:54:18.6510125Z * [new branch] reland-accgrad-stream-warn -> origin/reland-accgrad-stream-warn 2025-12-04T08:54:18.6510192Z * [new branch] release/1.10 -> origin/release/1.10 2025-12-04T08:54:18.6510256Z * [new branch] release/1.11 -> origin/release/1.11 2025-12-04T08:54:18.6510318Z * [new branch] release/1.12 -> origin/release/1.12 2025-12-04T08:54:18.6510382Z * [new branch] release/1.13 -> origin/release/1.13 2025-12-04T08:54:18.6510477Z * [new branch] release/1.4 -> origin/release/1.4 2025-12-04T08:54:18.6510541Z * [new branch] release/1.4.1 -> origin/release/1.4.1 2025-12-04T08:54:18.6510603Z * [new branch] release/1.5 -> origin/release/1.5 2025-12-04T08:54:18.6510664Z * [new branch] release/1.6 -> origin/release/1.6 2025-12-04T08:54:18.6510724Z * [new branch] release/1.7 -> origin/release/1.7 2025-12-04T08:54:18.6510792Z * [new branch] release/1.8 -> origin/release/1.8 2025-12-04T08:54:18.6510860Z * [new branch] release/1.9 -> origin/release/1.9 2025-12-04T08:54:18.6510925Z * [new branch] release/2.0 -> origin/release/2.0 2025-12-04T08:54:18.6510991Z * [new branch] release/2.1 -> origin/release/2.1 2025-12-04T08:54:18.6511050Z * [new branch] release/2.2 -> origin/release/2.2 2025-12-04T08:54:18.6511111Z * [new branch] release/2.3 -> origin/release/2.3 2025-12-04T08:54:18.6511172Z * [new branch] release/2.4 -> origin/release/2.4 2025-12-04T08:54:18.6511232Z * [new branch] release/2.5 -> origin/release/2.5 2025-12-04T08:54:18.6511291Z * [new branch] release/2.6 -> origin/release/2.6 2025-12-04T08:54:18.6511351Z * [new branch] release/2.7 -> origin/release/2.7 2025-12-04T08:54:18.6511410Z * [new branch] release/2.8 -> origin/release/2.8 2025-12-04T08:54:18.6511474Z * [new branch] release/2.9 -> origin/release/2.9 2025-12-04T08:54:18.6511537Z * [new branch] release_notes -> origin/release_notes 2025-12-04T08:54:18.6511612Z * [new branch] remove_pyinterpreter -> origin/remove_pyinterpreter 2025-12-04T08:54:18.6511737Z * [new branch] replace-pytorch-labs-20250812-195836 -> origin/replace-pytorch-labs-20250812-195836 2025-12-04T08:54:18.6511855Z * [new branch] replace-pytorch-labs-20250812-200248 -> origin/replace-pytorch-labs-20250812-200248 2025-12-04T08:54:18.6511971Z * [new branch] replace-pytorch-labs-20250812-200324 -> origin/replace-pytorch-labs-20250812-200324 2025-12-04T08:54:18.6512087Z * [new branch] replace-pytorch-labs-20250812-204020 -> origin/replace-pytorch-labs-20250812-204020 2025-12-04T08:54:18.6512216Z * [new branch] revert-131069-gh/krzysztofjordan/1/head -> origin/revert-131069-gh/krzysztofjordan/1/head 2025-12-04T08:54:18.6512348Z * [new branch] revert-131469-gh/andrewor14/51/head -> origin/revert-131469-gh/andrewor14/51/head 2025-12-04T08:54:18.6512451Z * [new branch] revert-152361-gh/fadara01/1/head -> origin/revert-152361-gh/fadara01/1/head 2025-12-04T08:54:18.6512553Z * [new branch] revert-156870-gh/skarjala/3/head -> origin/revert-156870-gh/skarjala/3/head 2025-12-04T08:54:18.6512725Z * [new branch] revert-157914-cherry-pick-157503-by-pytorch_bot_bot_ -> origin/revert-157914-cherry-pick-157503-by-pytorch_bot_bot_ 2025-12-04T08:54:18.6512823Z * [new branch] revert-hoo-invoke-subgraph -> origin/revert-hoo-invoke-subgraph 2025-12-04T08:54:18.6512923Z * [new branch] revert_always_build_distributed -> origin/revert_always_build_distributed 2025-12-04T08:54:18.6512991Z * [new branch] rms_norm_patch -> origin/rms_norm_patch 2025-12-04T08:54:18.6513089Z * [new branch] ruisi/fix_all_to_all_estimation -> origin/ruisi/fix_all_to_all_estimation 2025-12-04T08:54:18.6513171Z * [new branch] ruisi/fix_comm_estimation -> origin/ruisi/fix_comm_estimation 2025-12-04T08:54:18.6513277Z * [new branch] ruisi/fix_dynamic_shape_estimation -> origin/ruisi/fix_dynamic_shape_estimation 2025-12-04T08:54:18.6513392Z * [new branch] ruisi/fix_llama3_autobucketing -> origin/ruisi/fix_llama3_autobucketing 2025-12-04T08:54:18.6513495Z * [new branch] ruisi/fix_manual_bucketing_ep_pass -> origin/ruisi/fix_manual_bucketing_ep_pass 2025-12-04T08:54:18.6513577Z * [new branch] ruisi/manual_bucket_pass -> origin/ruisi/manual_bucket_pass 2025-12-04T08:54:18.6513723Z * [new branch] ryanguo99/cleanup-dynamo-expected-failures -> origin/ryanguo99/cleanup-dynamo-expected-failures 2025-12-04T08:54:18.6513810Z * [new branch] ryanguo99/fix-closure-var -> origin/ryanguo99/fix-closure-var 2025-12-04T08:54:18.6513889Z * [new branch] rzou/faketensor_bench -> origin/rzou/faketensor_bench 2025-12-04T08:54:18.6513950Z * [new branch] rzou/njt -> origin/rzou/njt 2025-12-04T08:54:18.6514012Z * [new branch] rzou/pca -> origin/rzou/pca 2025-12-04T08:54:18.6514080Z * [new branch] rzou/realprop -> origin/rzou/realprop 2025-12-04T08:54:18.6514143Z * [new branch] samplevllm -> origin/samplevllm 2025-12-04T08:54:18.6514309Z * [new branch] sanchitintel/weird_thing_with_test_cpu_select_algorithm -> origin/sanchitintel/weird_thing_with_test_cpu_select_algorithm 2025-12-04T08:54:18.6514402Z * [new branch] sapling-pr-archive-SS-JIA -> origin/sapling-pr-archive-SS-JIA 2025-12-04T08:54:18.6514514Z * [new branch] sapling-pr-archive-tushar00jain -> origin/sapling-pr-archive-tushar00jain 2025-12-04T08:54:18.6514577Z * [new branch] save -> origin/save 2025-12-04T08:54:18.6514638Z * [new branch] scaled_mm -> origin/scaled_mm 2025-12-04T08:54:18.6514701Z * [new branch] scan_attempt -> origin/scan_attempt 2025-12-04T08:54:18.6514763Z * [new branch] sdym/2.5.1 -> origin/sdym/2.5.1 2025-12-04T08:54:18.6514870Z * [new branch] sekyondaMeta-dynamoconfig-fix -> origin/sekyondaMeta-dynamoconfig-fix 2025-12-04T08:54:18.6514945Z * [new branch] shengf/fx-xform-perf -> origin/shengf/fx-xform-perf 2025-12-04T08:54:18.6515022Z * [new branch] shoumikhin-patch-1 -> origin/shoumikhin-patch-1 2025-12-04T08:54:18.6515096Z * [new branch] solve-accuracy-fix -> origin/solve-accuracy-fix 2025-12-04T08:54:18.6515175Z * [new branch] some_rocm_inductor_skips -> origin/some_rocm_inductor_skips 2025-12-04T08:54:18.6515275Z * [new branch] soulitzer/stash-tls-ac -> origin/soulitzer/stash-tls-ac 2025-12-04T08:54:18.6515356Z * [new branch] sparse-mm-bf16-support -> origin/sparse-mm-bf16-support 2025-12-04T08:54:18.6515428Z * [new branch] starterTaskUpdate -> origin/starterTaskUpdate 2025-12-04T08:54:18.6515489Z * [new branch] suo -> origin/suo 2025-12-04T08:54:18.6515553Z * [new branch] sve-poc -> origin/sve-poc 2025-12-04T08:54:18.6515615Z * [new branch] switch-bn -> origin/switch-bn 2025-12-04T08:54:18.6515709Z * [new branch] sy_annotation_in_autograd_hop -> origin/sy_annotation_in_autograd_hop 2025-12-04T08:54:18.6515777Z * [new branch] sy_aot_eager_record -> origin/sy_aot_eager_record 2025-12-04T08:54:18.6515847Z * [new branch] sy_custom_bucketing -> origin/sy_custom_bucketing 2025-12-04T08:54:18.6515917Z * [new branch] sy_debug_mode_test -> origin/sy_debug_mode_test 2025-12-04T08:54:18.6515981Z * [new branch] sy_deserialize -> origin/sy_deserialize 2025-12-04T08:54:18.6516047Z * [new branch] sy_dump_gm_code -> origin/sy_dump_gm_code 2025-12-04T08:54:18.6516109Z * [new branch] sy_exp -> origin/sy_exp 2025-12-04T08:54:18.6516200Z * [new branch] sy_export_annotation -> origin/sy_export_annotation 2025-12-04T08:54:18.6516270Z * [new branch] sy_invoke_subgraph -> origin/sy_invoke_subgraph 2025-12-04T08:54:18.6516336Z * [new branch] sy_kernel_bw_name -> origin/sy_kernel_bw_name 2025-12-04T08:54:18.6516398Z * [new branch] sy_multi_arch -> origin/sy_multi_arch 2025-12-04T08:54:18.6516466Z * [new branch] sy_nn_module_stack -> origin/sy_nn_module_stack 2025-12-04T08:54:18.6516535Z * [new branch] sy_original_dtensor -> origin/sy_original_dtensor 2025-12-04T08:54:18.6516603Z * [new branch] sy_profiler_cia -> origin/sy_profiler_cia 2025-12-04T08:54:18.6516667Z * [new branch] symm_mem_sync -> origin/symm_mem_sync 2025-12-04T08:54:18.6516750Z * [new branch] sympy-bottleneck-repro -> origin/sympy-bottleneck-repro 2025-12-04T08:54:18.6516827Z * [new branch] tensordict_integration -> origin/tensordict_integration 2025-12-04T08:54:18.6516907Z * [new branch] test-move-conda-builds -> origin/test-move-conda-builds 2025-12-04T08:54:18.6516969Z * [new branch] test-old -> origin/test-old 2025-12-04T08:54:18.6517032Z * [new branch] test/bmm_heur -> origin/test/bmm_heur 2025-12-04T08:54:18.6517133Z * [new branch] tianren/customOp_autotune_fix -> origin/tianren/customOp_autotune_fix 2025-12-04T08:54:18.6517243Z * [new branch] tianren/customOp_enable_max_autotune -> origin/tianren/customOp_enable_max_autotune 2025-12-04T08:54:18.6517324Z * [new branch] tianren/customOp_fusion -> origin/tianren/customOp_fusion 2025-12-04T08:54:18.6517449Z * [new branch] tianren/customop_collectiveop_benchmark -> origin/tianren/customop_collectiveop_benchmark 2025-12-04T08:54:18.6517583Z * [new branch] tianren/customop_collectiveop_benchmark_fix -> origin/tianren/customop_collectiveop_benchmark_fix 2025-12-04T08:54:18.6517682Z * [new branch] tianren/customop_dynamic_config -> origin/tianren/customop_dynamic_config 2025-12-04T08:54:18.6517775Z * [new branch] tianren/dynamic_range_input -> origin/tianren/dynamic_range_input 2025-12-04T08:54:18.6517872Z * [new branch] tianren/dynamic_range_input_fix -> origin/tianren/dynamic_range_input_fix 2025-12-04T08:54:18.6517976Z * [new branch] tianren/dynamic_range_input_merge -> origin/tianren/dynamic_range_input_merge 2025-12-04T08:54:18.6518130Z * [new branch] tianren/flex_paged_attn_fix_temp -> origin/tianren/flex_paged_attn_fix_temp 2025-12-04T08:54:18.6518210Z * [new branch] tianren/fx_codegen_dump -> origin/tianren/fx_codegen_dump 2025-12-04T08:54:18.6518292Z * [new branch] tianren/symmetric_memory -> origin/tianren/symmetric_memory 2025-12-04T08:54:18.6518358Z * [new branch] tianren/test -> origin/tianren/test 2025-12-04T08:54:18.6518431Z * [new branch] tidy_performance_cyy -> origin/tidy_performance_cyy 2025-12-04T08:54:18.6518492Z * [new branch] tmp -> origin/tmp 2025-12-04T08:54:18.6518558Z * [new branch] torchtitan_ep -> origin/torchtitan_ep 2025-12-04T08:54:18.6518634Z * [new branch] torchtitan_integration -> origin/torchtitan_integration 2025-12-04T08:54:18.6518718Z * [new branch] trace_fsdp_torchtune_lora -> origin/trace_fsdp_torchtune_lora 2025-12-04T08:54:18.6518802Z * [new branch] traceable_fsdp_unit_tests -> origin/traceable_fsdp_unit_tests 2025-12-04T08:54:18.6518870Z * [new branch] tree_loop_vec_base -> origin/tree_loop_vec_base 2025-12-04T08:54:18.6518937Z * [new branch] triton_kernel -> origin/triton_kernel 2025-12-04T08:54:18.6519024Z * [new branch] tt_pkg_1908 -> origin/tt_pkg_1908 2025-12-04T08:54:18.6519085Z * [new branch] type_dec -> origin/type_dec 2025-12-04T08:54:18.6519179Z * [new branch] udate-sphinx-dependancies -> origin/udate-sphinx-dependancies 2025-12-04T08:54:18.6519317Z * [new branch] update-audio-commit-hash/17630256502-1803-1 -> origin/update-audio-commit-hash/17630256502-1803-1 2025-12-04T08:54:18.6519450Z * [new branch] update-audio-commit-hash/19087141161-1916-1 -> origin/update-audio-commit-hash/19087141161-1916-1 2025-12-04T08:54:18.6519581Z * [new branch] update-audio-commit-hash/19250643381-1929-1 -> origin/update-audio-commit-hash/19250643381-1929-1 2025-12-04T08:54:18.6519709Z * [new branch] update-audio-commit-hash/19397724337-1935-1 -> origin/update-audio-commit-hash/19397724337-1935-1 2025-12-04T08:54:18.6519840Z * [new branch] update-audio-commit-hash/19555670148-1941-1 -> origin/update-audio-commit-hash/19555670148-1941-1 2025-12-04T08:54:18.6519968Z * [new branch] update-audio-commit-hash/19750627930-1946-1 -> origin/update-audio-commit-hash/19750627930-1946-1 2025-12-04T08:54:18.6520101Z * [new branch] update-triton-commit-hash/13663274526-1487-2 -> origin/update-triton-commit-hash/13663274526-1487-2 2025-12-04T08:54:18.6520234Z * [new branch] update-vision-commit-hash/19087141161-1916-1 -> origin/update-vision-commit-hash/19087141161-1916-1 2025-12-04T08:54:18.6520365Z * [new branch] update-vision-commit-hash/19184897099-1925-1 -> origin/update-vision-commit-hash/19184897099-1925-1 2025-12-04T08:54:18.6520498Z * [new branch] update-vision-commit-hash/19250643381-1929-1 -> origin/update-vision-commit-hash/19250643381-1929-1 2025-12-04T08:54:18.6520630Z * [new branch] update-vision-commit-hash/19381328640-1934-1 -> origin/update-vision-commit-hash/19381328640-1934-1 2025-12-04T08:54:18.6520761Z * [new branch] update-vision-commit-hash/19485237164-1938-1 -> origin/update-vision-commit-hash/19485237164-1938-1 2025-12-04T08:54:18.6520890Z * [new branch] update-vllm-commit-hash/18451675449-1879-1 -> origin/update-vllm-commit-hash/18451675449-1879-1 2025-12-04T08:54:18.6520973Z * [new branch] update-vllm-dockerfile -> origin/update-vllm-dockerfile 2025-12-04T08:54:18.6521096Z * [new branch] update-xla-commit-hash/19224287370-211-1 -> origin/update-xla-commit-hash/19224287370-211-1 2025-12-04T08:54:18.6521247Z * [new branch] update-xla-commit-hash/19422028566-212-1 -> origin/update-xla-commit-hash/19422028566-212-1 2025-12-04T08:54:18.6521367Z * [new branch] update-xla-commit-hash/19626841311-213-1 -> origin/update-xla-commit-hash/19626841311-213-1 2025-12-04T08:54:18.6521492Z * [new branch] update_docs_torch_multinomial_issue#125388 -> origin/update_docs_torch_multinomial_issue#125388 2025-12-04T08:54:18.6521573Z * [new branch] update_operator_readme -> origin/update_operator_readme 2025-12-04T08:54:18.6521660Z * [new branch] update_slow_tests_1722488736 -> origin/update_slow_tests_1722488736 2025-12-04T08:54:18.6521747Z * [new branch] update_slow_tests_1722879173 -> origin/update_slow_tests_1722879173 2025-12-04T08:54:18.6521832Z * [new branch] update_slow_tests_1762155677 -> origin/update_slow_tests_1762155677 2025-12-04T08:54:18.6521915Z * [new branch] update_slow_tests_1763365283 -> origin/update_slow_tests_1763365283 2025-12-04T08:54:18.6522006Z * [new branch] update_submodule_FBGEMM -> origin/update_submodule_FBGEMM 2025-12-04T08:54:18.6522084Z * [new branch] update_submodule_kineto -> origin/update_submodule_kineto 2025-12-04T08:54:18.6522174Z * [new branch] update_submodule_tensorpipe -> origin/update_submodule_tensorpipe 2025-12-04T08:54:18.6522298Z * [new branch] upload-tests-for-autorevert -> origin/upload-tests-for-autorevert 2025-12-04T08:54:18.6522361Z * [new branch] v0.1.2 -> origin/v0.1.2 2025-12-04T08:54:18.6522421Z * [new branch] v1.0.1 -> origin/v1.0.1 2025-12-04T08:54:18.6522480Z * [new branch] v1.0.3 -> origin/v1.0.3 2025-12-04T08:54:18.6522537Z * [new branch] v1.1.0 -> origin/v1.1.0 2025-12-04T08:54:18.6522594Z * [new branch] v1.2.0 -> origin/v1.2.0 2025-12-04T08:54:18.6522653Z * [new branch] v1.3.0 -> origin/v1.3.0 2025-12-04T08:54:18.6522709Z * [new branch] v1.3.1 -> origin/v1.3.1 2025-12-04T08:54:18.6522773Z * [new branch] validate_fn -> origin/validate_fn 2025-12-04T08:54:18.6522842Z * [new branch] validations_2.6 -> origin/validations_2.6 2025-12-04T08:54:18.6522910Z * [new branch] validations_2.8 -> origin/validations_2.8 2025-12-04T08:54:18.6522974Z * [new branch] varlen-api -> origin/varlen-api 2025-12-04T08:54:18.6523051Z * [new branch] varlen-api-backup -> origin/varlen-api-backup 2025-12-04T08:54:18.6523128Z * [new branch] varlen_batch_invariance -> origin/varlen_batch_invariance 2025-12-04T08:54:18.6523192Z * [new branch] viable/strict -> origin/viable/strict 2025-12-04T08:54:18.6523309Z * [new branch] vishal9-team/dtensor_parallelism_toy -> origin/vishal9-team/dtensor_parallelism_toy 2025-12-04T08:54:18.6523374Z * [new branch] vllmbuildci -> origin/vllmbuildci 2025-12-04T08:54:18.6523434Z * [new branch] vllmpin -> origin/vllmpin 2025-12-04T08:54:18.6523525Z * [new branch] vscode-recommend-pyrefly -> origin/vscode-recommend-pyrefly 2025-12-04T08:54:18.6523595Z * [new branch] wdvr-patch-1 -> origin/wdvr-patch-1 2025-12-04T08:54:18.6523660Z * [new branch] wdvr/iss_145259 -> origin/wdvr/iss_145259 2025-12-04T08:54:18.6523723Z * [new branch] whc/pei -> origin/whc/pei 2025-12-04T08:54:18.6523786Z * [new branch] whc/pp_fix -> origin/whc/pp_fix 2025-12-04T08:54:18.6523850Z * [new branch] whc/sharding -> origin/whc/sharding 2025-12-04T08:54:18.6523914Z * [new branch] whc/sharding2 -> origin/whc/sharding2 2025-12-04T08:54:18.6523997Z * [new branch] whc/uneven -> origin/whc/uneven 2025-12-04T08:54:18.6524067Z * [new branch] whc/uneven-merge -> origin/whc/uneven-merge 2025-12-04T08:54:18.6524128Z * [new branch] win_warnings -> origin/win_warnings 2025-12-04T08:54:18.6524204Z * [new branch] windows_libtorch_free -> origin/windows_libtorch_free 2025-12-04T08:54:18.6524268Z * [new branch] xmfan-war -> origin/xmfan-war 2025-12-04T08:54:18.6524331Z * [new branch] xmfan/ca_0516 -> origin/xmfan/ca_0516 2025-12-04T08:54:18.6524399Z * [new branch] xmfan/ca_1051b93192 -> origin/xmfan/ca_1051b93192 2025-12-04T08:54:18.6524550Z * [new branch] xmfan/ca_1a722f62c248391fc4a542e8851a5559aa356ae8 -> origin/xmfan/ca_1a722f62c248391fc4a542e8851a5559aa356ae8 2025-12-04T08:54:18.6524621Z * [new branch] xmfan/ca_5a2be192d1 -> origin/xmfan/ca_5a2be192d1 2025-12-04T08:54:18.6524690Z * [new branch] xmfan/ca_9d59b516e9 -> origin/xmfan/ca_9d59b516e9 2025-12-04T08:54:18.6524757Z * [new branch] xmfan/ca_apr8 -> origin/xmfan/ca_apr8 2025-12-04T08:54:18.6524820Z * [new branch] xmfan/ca_base -> origin/xmfan/ca_base 2025-12-04T08:54:18.6524906Z * [new branch] xmfan/ca_dynamic -> origin/xmfan/ca_dynamic 2025-12-04T08:54:18.6524974Z * [new branch] xmfan/ca_fix_dyn -> origin/xmfan/ca_fix_dyn 2025-12-04T08:54:18.6525046Z * [new branch] xmfan/ca_fix_lowering -> origin/xmfan/ca_fix_lowering 2025-12-04T08:54:18.6525121Z * [new branch] xmfan/ca_fix_polyfills -> origin/xmfan/ca_fix_polyfills 2025-12-04T08:54:18.6525185Z * [new branch] xmfan/ca_jan3 -> origin/xmfan/ca_jan3 2025-12-04T08:54:18.6525249Z * [new branch] xmfan/ca_jun18 -> origin/xmfan/ca_jun18 2025-12-04T08:54:18.6525317Z * [new branch] xmfan/ca_jun24 -> origin/xmfan/ca_jun24 2025-12-04T08:54:18.6525384Z * [new branch] xmfan/ca_nested -> origin/xmfan/ca_nested 2025-12-04T08:54:18.6525452Z * [new branch] xmfan/ca_overhead -> origin/xmfan/ca_overhead 2025-12-04T08:54:18.6525546Z * [new branch] xmfan/ca_overhead_0eba7e5451 -> origin/xmfan/ca_overhead_0eba7e5451 2025-12-04T08:54:18.6525613Z * [new branch] xmfan/cacu_jun18 -> origin/xmfan/cacu_jun18 2025-12-04T08:54:18.6525680Z * [new branch] xmfan/cacu_jun19 -> origin/xmfan/cacu_jun19 2025-12-04T08:54:18.6525747Z * [new branch] xmfan/cacu_jun4 -> origin/xmfan/cacu_jun4 2025-12-04T08:54:18.6525829Z * [new branch] xmfan/disable_duck_shape -> origin/xmfan/disable_duck_shape 2025-12-04T08:54:18.6525925Z * [new branch] xmfan/fca_cpp_node_passthrough -> origin/xmfan/fca_cpp_node_passthrough 2025-12-04T08:54:18.6526077Z * [new branch] xmfan/post_3945954741e2d37023c5d6954f9483008e0892f9 -> origin/xmfan/post_3945954741e2d37023c5d6954f9483008e0892f9 2025-12-04T08:54:18.6526223Z * [new branch] xmfan/pre_3945954741e2d37023c5d6954f9483008e0892f9 -> origin/xmfan/pre_3945954741e2d37023c5d6954f9483008e0892f9 2025-12-04T08:54:18.6526293Z * [new branch] xmfan/single_step -> origin/xmfan/single_step 2025-12-04T08:54:18.6526359Z * [new branch] xmfan/sth_0829 -> origin/xmfan/sth_0829 2025-12-04T08:54:18.6526421Z * [new branch] xmfan/test -> origin/xmfan/test 2025-12-04T08:54:18.6526507Z * [new branch] yguo/debug-0226-constexpr -> origin/yguo/debug-0226-constexpr 2025-12-04T08:54:18.6526585Z * [new branch] yguo/new_latest_changes -> origin/yguo/new_latest_changes 2025-12-04T08:54:18.6526679Z * [new branch] yguo/patch_constexpr_changes -> origin/yguo/patch_constexpr_changes 2025-12-04T08:54:18.6526769Z * [new branch] yiming/bootcamp -> origin/yiming/bootcamp 2025-12-04T08:54:18.6526871Z * [new branch] yiming/run_with_start_end_rng_hop -> origin/yiming/run_with_start_end_rng_hop 2025-12-04T08:54:18.6526935Z * [new branch] yolo-llama3 -> origin/yolo-llama3 2025-12-04T08:54:18.6527008Z * [new branch] zainr/canary-test -> origin/zainr/canary-test 2025-12-04T08:54:18.6527094Z * [new branch] zainr/cleanup-gh-runners -> origin/zainr/cleanup-gh-runners 2025-12-04T08:54:18.6527174Z * [new branch] zainr/pull-migration-c -> origin/zainr/pull-migration-c 2025-12-04T08:54:18.6527237Z * [new branch] zainr/test2 -> origin/zainr/test2 2025-12-04T08:54:18.6527309Z * [new branch] zasdfgbnm-patch-3 -> origin/zasdfgbnm-patch-3 2025-12-04T08:54:18.6527370Z * [new branch] zb2p -> origin/zb2p 2025-12-04T08:54:18.6527454Z * [new branch] zeros-and-scatter-part2 -> origin/zeros-and-scatter-part2 2025-12-04T08:54:18.6527541Z * [new branch] zhxchen17/ci/vllm_lora_oom -> origin/zhxchen17/ci/vllm_lora_oom 2025-12-04T08:54:18.6527642Z * [new branch] zhxchen17/ci/vllm_multimodal_oom -> origin/zhxchen17/ci/vllm_multimodal_oom 2025-12-04T08:54:18.6527736Z * [new branch] zhxchen17/ci/vllm_pin -> origin/zhxchen17/ci/vllm_pin 2025-12-04T08:54:18.6527858Z * [new branch] zhxchen17/dynamo/unsafe_drop_all_guards -> origin/zhxchen17/dynamo/unsafe_drop_all_guards 2025-12-04T08:54:18.6527955Z * [new branch] zhxchen17/export/call_override -> origin/zhxchen17/export/call_override 2025-12-04T08:54:18.6528041Z * [new branch] zhxchen17/export/codemod1 -> origin/zhxchen17/export/codemod1 2025-12-04T08:54:18.6528174Z * [new branch] zhxchen17/export/ctx_return -> origin/zhxchen17/export/ctx_return 2025-12-04T08:54:18.6528303Z * [new branch] zhxchen17/export/disable_side_effect_warn -> origin/zhxchen17/export/disable_side_effect_warn 2025-12-04T08:54:18.6528402Z * [new branch] zhxchen17/export/pytree_check -> origin/zhxchen17/export/pytree_check 2025-12-04T08:54:18.6528490Z * [new branch] zhxchen17/precompile/aoti -> origin/zhxchen17/precompile/aoti 2025-12-04T08:54:18.6528586Z * [new branch] zhxchen17/precompile/globals -> origin/zhxchen17/precompile/globals 2025-12-04T08:54:18.6528702Z * [new branch] zhxchen17/precompile/inductor_guards -> origin/zhxchen17/precompile/inductor_guards 2025-12-04T08:54:18.6528774Z * [new branch] zhxchen17/scratch/0 -> origin/zhxchen17/scratch/0 2025-12-04T08:54:18.6528879Z * [new branch] zhxchen17/torch_export_api_update -> origin/zhxchen17/torch_export_api_update 2025-12-04T08:54:18.6528957Z * [new branch] zhxhcen17/moodycamel -> origin/zhxhcen17/moodycamel 2025-12-04T08:54:18.6529031Z * [new branch] zxiiro/build-times -> origin/zxiiro/build-times 2025-12-04T08:54:18.6529104Z * [new branch] zxiiro/c7i.2xlarge -> origin/zxiiro/c7i.2xlarge 2025-12-04T08:54:18.6529183Z * [new branch] zxiiro/c7i.2xlarge.h100 -> origin/zxiiro/c7i.2xlarge.h100 2025-12-04T08:54:18.6529246Z * [new branch] zxiiro/main -> origin/zxiiro/main 2025-12-04T08:54:18.6529312Z * [new branch] zxiiro/risc64 -> origin/zxiiro/risc64 2025-12-04T08:54:18.6529402Z * [new branch] zxiiro/test-multicloud-arc -> origin/zxiiro/test-multicloud-arc 2025-12-04T08:54:18.6529462Z * [new tag] ciflow/dynamo/169525 -> ciflow/dynamo/169525 2025-12-04T08:54:18.6529533Z t [tag update] ciflow/inductor/167647 -> ciflow/inductor/167647 2025-12-04T08:54:18.6529628Z t [tag update] ciflow/inductor/168266 -> ciflow/inductor/168266 2025-12-04T08:54:18.6529694Z t [tag update] ciflow/inductor/169535 -> ciflow/inductor/169535 2025-12-04T08:54:18.6529755Z * [new tag] ciflow/trunk/165728 -> ciflow/trunk/165728 2025-12-04T08:54:18.6529813Z * [new tag] ciflow/trunk/169048 -> ciflow/trunk/169048 2025-12-04T08:54:18.6529873Z * [new tag] ciflow/trunk/169125 -> ciflow/trunk/169125 2025-12-04T08:54:18.6529932Z * [new tag] ciflow/trunk/169555 -> ciflow/trunk/169555 2025-12-04T08:54:18.6529990Z * [new tag] ciflow/xpu/169555 -> ciflow/xpu/169555 2025-12-04T08:54:18.8438844Z [command]/usr/bin/git rev-parse --verify --quiet ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32^{object} 2025-12-04T08:54:18.8601805Z ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:18.8607801Z ##[endgroup] 2025-12-04T08:54:18.8608264Z ##[group]Determining the checkout info 2025-12-04T08:54:18.8610204Z ##[endgroup] 2025-12-04T08:54:18.8617201Z [command]/usr/bin/git sparse-checkout disable 2025-12-04T08:54:18.8711350Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-12-04T08:54:18.8737281Z ##[group]Checking out the ref 2025-12-04T08:54:18.8743029Z [command]/usr/bin/git checkout --progress --force ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:18.9024985Z HEAD is now at ffd9b0fb4355 Resolve collective autotuning test failure on arm (#168919) 2025-12-04T08:54:18.9032941Z ##[endgroup] 2025-12-04T08:54:18.9033479Z ##[group]Setting up auth for fetching submodules 2025-12-04T08:54:18.9043827Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-12-04T08:54:18.9087203Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-12-04T08:54:18.9128207Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-12-04T08:54:18.9179650Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-12-04T08:54:18.9233815Z ##[endgroup] 2025-12-04T08:54:18.9234330Z ##[group]Fetching submodules 2025-12-04T08:54:18.9240416Z [command]/usr/bin/git submodule sync --recursive 2025-12-04T08:54:18.9505282Z Synchronizing submodule url for 'android/libs/fbjni' 2025-12-04T08:54:18.9523403Z Synchronizing submodule url for 'third_party/FP16' 2025-12-04T08:54:18.9538179Z Synchronizing submodule url for 'third_party/FXdiv' 2025-12-04T08:54:18.9550777Z Synchronizing submodule url for 'third_party/NNPACK' 2025-12-04T08:54:18.9563107Z Synchronizing submodule url for 'third_party/NVTX' 2025-12-04T08:54:18.9577529Z Synchronizing submodule url for 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:18.9599037Z Synchronizing submodule url for 'third_party/XNNPACK' 2025-12-04T08:54:18.9635979Z Synchronizing submodule url for 'third_party/aiter' 2025-12-04T08:54:18.9652660Z Synchronizing submodule url for 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:18.9676219Z Synchronizing submodule url for 'third_party/benchmark' 2025-12-04T08:54:18.9700093Z Synchronizing submodule url for 'third_party/composable_kernel' 2025-12-04T08:54:18.9716685Z Synchronizing submodule url for 'third_party/cpp-httplib' 2025-12-04T08:54:18.9730200Z Synchronizing submodule url for 'third_party/cpuinfo' 2025-12-04T08:54:18.9753293Z Synchronizing submodule url for 'third_party/cudnn_frontend' 2025-12-04T08:54:18.9776904Z Synchronizing submodule url for 'third_party/cutlass' 2025-12-04T08:54:18.9794421Z Synchronizing submodule url for 'third_party/fbgemm' 2025-12-04T08:54:18.9809943Z Synchronizing submodule url for 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:18.9831648Z Synchronizing submodule url for 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:18.9863620Z Synchronizing submodule url for 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:18.9875586Z Synchronizing submodule url for 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:18.9891751Z Synchronizing submodule url for 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:18.9905804Z Synchronizing submodule url for 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:18.9915233Z Synchronizing submodule url for 'third_party/fbgemm/external/json' 2025-12-04T08:54:18.9931114Z Synchronizing submodule url for 'third_party/flash-attention' 2025-12-04T08:54:18.9962155Z Synchronizing submodule url for 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:18.9989920Z Synchronizing submodule url for 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:19.0006664Z Synchronizing submodule url for 'third_party/flatbuffers' 2025-12-04T08:54:19.0022269Z Synchronizing submodule url for 'third_party/fmt' 2025-12-04T08:54:19.0035611Z Synchronizing submodule url for 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:19.0048686Z Synchronizing submodule url for 'third_party/gloo' 2025-12-04T08:54:19.0074214Z Synchronizing submodule url for 'third_party/googletest' 2025-12-04T08:54:19.0087586Z Synchronizing submodule url for 'third_party/ideep' 2025-12-04T08:54:19.0114598Z Synchronizing submodule url for 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:19.0130440Z Synchronizing submodule url for 'third_party/ittapi' 2025-12-04T08:54:19.0152377Z Synchronizing submodule url for 'third_party/kineto' 2025-12-04T08:54:19.0184228Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:19.0197936Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:19.0209893Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:19.0226226Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:19.0238575Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:19.0253200Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:19.0266115Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:19.0286549Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:19.0298531Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:19.0312293Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:19.0324316Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:19.0336688Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:19.0347867Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:19.0364374Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:19.0375424Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:19.0390281Z Synchronizing submodule url for 'third_party/kleidiai' 2025-12-04T08:54:19.0402944Z Synchronizing submodule url for 'third_party/mimalloc' 2025-12-04T08:54:19.0428007Z Synchronizing submodule url for 'third_party/nlohmann' 2025-12-04T08:54:19.0441975Z Synchronizing submodule url for 'third_party/onnx' 2025-12-04T08:54:19.0461412Z Synchronizing submodule url for 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:19.0475570Z Synchronizing submodule url for 'third_party/opentelemetry-cpp' 2025-12-04T08:54:19.0492296Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:19.0501987Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:19.0517866Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:19.0528765Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:19.0546888Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:19.0560317Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:19.0572113Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:19.0585189Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:19.0596635Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:19.0608856Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:19.0631945Z Synchronizing submodule url for 'third_party/pocketfft' 2025-12-04T08:54:19.0654769Z Synchronizing submodule url for 'third_party/protobuf' 2025-12-04T08:54:19.0678652Z Synchronizing submodule url for 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:19.0689617Z Synchronizing submodule url for 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:19.0704139Z Synchronizing submodule url for 'third_party/psimd' 2025-12-04T08:54:19.0718859Z Synchronizing submodule url for 'third_party/pthreadpool' 2025-12-04T08:54:19.0732066Z Synchronizing submodule url for 'third_party/pybind11' 2025-12-04T08:54:19.0746769Z Synchronizing submodule url for 'third_party/python-peachpy' 2025-12-04T08:54:19.0760130Z Synchronizing submodule url for 'third_party/sleef' 2025-12-04T08:54:19.0773662Z Synchronizing submodule url for 'third_party/tensorpipe' 2025-12-04T08:54:19.0788144Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:19.0804864Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:19.0816050Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:19.0838452Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:19.0859004Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:19.0895435Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --recursive 2025-12-04T08:54:19.1178934Z Submodule path 'android/libs/fbjni': checked out '7e1e1fe3858c63c251c637ae41a20de425dde96f' 2025-12-04T08:54:19.1248495Z Submodule path 'third_party/FP16': checked out '4dfe081cf6bcd15db339cf2680b9281b8451eeb3' 2025-12-04T08:54:19.1306244Z Submodule path 'third_party/FXdiv': checked out 'b408327ac2a15ec3e43352421954f5b1967701d1' 2025-12-04T08:54:19.1426640Z Submodule path 'third_party/NNPACK': checked out 'c07e3a0400713d546e0dea2d5466dd22ea389c73' 2025-12-04T08:54:19.1496735Z Submodule path 'third_party/NVTX': checked out '3ebbc93ded7285963bff932c678fa367eb393ba6' 2025-12-04T08:54:19.1595916Z Submodule path 'third_party/VulkanMemoryAllocator': checked out '1d8f600fd424278486eade7ed3e877c99f0846b1' 2025-12-04T08:54:19.6369162Z Submodule path 'third_party/XNNPACK': checked out '51a0103656eff6fc9bfd39a4597923c4b542c883' 2025-12-04T08:54:19.6569334Z Submodule path 'third_party/aiter': checked out '01aae101b9e5e94d6c16a9514c9fb8df99c93150' 2025-12-04T08:54:19.6776358Z Submodule path 'third_party/aiter/3rdparty/composable_kernel': checked out 'cffe8fa2a442ac8e80dd236a1a5d24fe3d7e0cbf' 2025-12-04T08:54:19.6944603Z Submodule path 'third_party/benchmark': checked out '299e5928955cc62af9968370293b916f5130916f' 2025-12-04T08:54:19.7176573Z Submodule path 'third_party/composable_kernel': checked out '7fe50dc3da2069d6645d9deb8c017a876472a977' 2025-12-04T08:54:19.7277093Z Submodule path 'third_party/cpp-httplib': checked out '89c932f313c6437c38f2982869beacc89c2f2246' 2025-12-04T08:54:19.7935162Z Submodule path 'third_party/cpuinfo': checked out 'f858c30bcb16f8effd5ff46996f0514539e17abc' 2025-12-04T08:54:19.8019860Z Submodule path 'third_party/cudnn_frontend': checked out '0b1577c8c83401237d601d0d0db5210506705396' 2025-12-04T08:54:19.8199485Z Submodule path 'third_party/cutlass': checked out 'f88806b1e31dfa579842638740216dd41fc6c588' 2025-12-04T08:54:19.8966744Z Submodule path 'third_party/fbgemm': checked out 'c0b988d39a9e47c794d699f29930ed4d7c7e13a4' 2025-12-04T08:54:19.9336595Z Submodule path 'third_party/fbgemm/external/asmjit': checked out 'a3199e8857792cd10b7589ff5d58343d2c9008ea' 2025-12-04T08:54:20.1255389Z Submodule path 'third_party/fbgemm/external/composable_kernel': checked out '7fe50dc3da2069d6645d9deb8c017a876472a977' 2025-12-04T08:54:20.1940089Z Submodule path 'third_party/fbgemm/external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-12-04T08:54:20.2549563Z Submodule path 'third_party/fbgemm/external/cutlass': checked out '98125ce499b0fdf7ffbe0e3052f5b8709f4840f8' 2025-12-04T08:54:20.2800223Z Submodule path 'third_party/fbgemm/external/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:20.2884396Z Submodule path 'third_party/fbgemm/external/hipify_torch': checked out '63b6a7b541fa7f08f8475ca7d74054db36ff2691' 2025-12-04T08:54:20.3502478Z Submodule path 'third_party/fbgemm/external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-12-04T08:54:20.3640174Z Submodule path 'third_party/flash-attention': checked out '979702c87a8713a8e0a5e9fee122b90d2ef13be5' 2025-12-04T08:54:20.3837654Z Submodule path 'third_party/flash-attention/csrc/composable_kernel': checked out '888317e698e9803c62bd38568abc9e05d7709f33' 2025-12-04T08:54:20.3975451Z Submodule path 'third_party/flash-attention/csrc/cutlass': checked out 'c506e16788cb08416a4a57e11a9067beeee29420' 2025-12-04T08:54:20.4077543Z Submodule path 'third_party/flatbuffers': checked out 'a2cd1ea3b6d3fee220106b5fed3f7ce8da9eb757' 2025-12-04T08:54:20.4267785Z Submodule path 'third_party/fmt': checked out '407c905e45ad75fc29bf0f9bb7c5c2fd3475976f' 2025-12-04T08:54:20.4492970Z Submodule path 'third_party/gemmlowp/gemmlowp': checked out '3fb5c176c17c765a3492cd2f0321b0dab712f350' 2025-12-04T08:54:20.4637066Z Submodule path 'third_party/gloo': checked out '54cbae0d3a67fa890b4c3d9ee162b7860315e341' 2025-12-04T08:54:20.4867582Z Submodule path 'third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:20.4961412Z Submodule path 'third_party/ideep': checked out '719d8e6cd7f7a0e01b155657526d693acf97c2b3' 2025-12-04T08:54:20.9194195Z Submodule path 'third_party/ideep/mkl-dnn': checked out '8d263e693366ef8db40acc569cc7d8edf644556d' 2025-12-04T08:54:20.9327671Z Submodule path 'third_party/ittapi': checked out 'dec1d23ca65ab069d225dfe40dea14f455170959' 2025-12-04T08:54:20.9428148Z Submodule path 'third_party/kineto': checked out '31f85df8fbd89c188f14ef10f1ec65379786b943' 2025-12-04T08:54:20.9532147Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog': checked out 'd2ffe0a4e3acace628db49974246b66fc3e85fb1' 2025-12-04T08:54:20.9632197Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM': checked out 'ffde4e54bc7249a6039a5e6b45b395141e1217f9' 2025-12-04T08:54:20.9714750Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr': checked out '871ed52d350214a034f6ef8a3b8f51c5ce1bd400' 2025-12-04T08:54:20.9784265Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt': checked out 'cd4af11efc9c622896a3e4cb599fa28668ca3d05' 2025-12-04T08:54:20.9861750Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags': checked out 'e171aa2d15ed9eb17054558e0b3a6a413bb01067' 2025-12-04T08:54:20.9962168Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc': checked out '8411df715cf522606e3b1aca386ddfc0b63d34b4' 2025-12-04T08:54:21.0051177Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog': checked out 'b33e3bad4c46c8a6345525fd822af355e5ef9446' 2025-12-04T08:54:21.0136453Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:21.0261276Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json': checked out '4f8fba14066156b73f1189a2b8bd568bde5284c5' 2025-12-04T08:54:21.0334053Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs': checked out 'f68a2fa8ea36c783bdd760371411fcb495aa3150' 2025-12-04T08:54:21.0426879Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp': checked out 'b1234816facfdda29845c46696a02998a4af115a' 2025-12-04T08:54:21.0525716Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'd7ba35bbb649209c66e582d5a0244ba988a15159' 2025-12-04T08:54:21.0619803Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2025-12-04T08:54:21.0702865Z Submodule path 'third_party/kineto/libkineto/third_party/fmt': checked out '40626af88bd7df9a5fb80be7b25ac85b122d6c21' 2025-12-04T08:54:21.0777796Z Submodule path 'third_party/kineto/libkineto/third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:21.0886543Z Submodule path 'third_party/kleidiai': checked out 'd7770c89632329a9914ef1a90289917597639cbe' 2025-12-04T08:54:21.0982028Z Submodule path 'third_party/mimalloc': checked out 'fbd8b99c2b828428947d70fdc046bb55609be93e' 2025-12-04T08:54:21.1082799Z Submodule path 'third_party/nlohmann': checked out '55f93686c01528224f448c19128836e7df245f72' 2025-12-04T08:54:21.2772628Z Submodule path 'third_party/onnx': checked out 'e709452ef2bbc1d113faf678c24e6d3467696e83' 2025-12-04T08:54:21.2974403Z Submodule path 'third_party/onnx/third_party/pybind11': checked out 'a2e59f0e7065404b44dfe92a28aca47ba1378dc4' 2025-12-04T08:54:21.3094379Z Submodule path 'third_party/opentelemetry-cpp': checked out 'a799f4aed9c94b765dcdaabaeab7d5e7e2310878' 2025-12-04T08:54:21.3174647Z Submodule path 'third_party/opentelemetry-cpp/third_party/benchmark': checked out 'd572f4777349d43653b21d6c2fc63020ab326db2' 2025-12-04T08:54:21.3230749Z Submodule path 'third_party/opentelemetry-cpp/third_party/googletest': checked out 'b796f7d44681514f58a683a3a71ff17c94edb0c1' 2025-12-04T08:54:21.3305329Z Submodule path 'third_party/opentelemetry-cpp/third_party/ms-gsl': checked out '6f4529395c5b7c2d661812257cd6780c67e54afa' 2025-12-04T08:54:21.3401847Z Submodule path 'third_party/opentelemetry-cpp/third_party/nlohmann-json': checked out 'bc889afb4c5bf1c0d8ee29ef35eaaf4c8bef8a5d' 2025-12-04T08:54:21.3456957Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto': checked out '4ca4f0335c63cda7ab31ea7ed70d6553aee14dce' 2025-12-04T08:54:21.3517896Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentracing-cpp': checked out '06b57f48ded1fa3bdd3d4346f6ef29e40e08eaf5' 2025-12-04T08:54:21.3597837Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp': checked out 'c9ffcdda9086ffd9e1283ea7a0276d831f3c8a8d' 2025-12-04T08:54:21.3718277Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'eefb26f82b233268fc98577d265352720d477ba4' 2025-12-04T08:54:21.3801732Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2025-12-04T08:54:21.3980428Z Submodule path 'third_party/opentelemetry-cpp/tools/vcpkg': checked out '8eb57355a4ffb410a2e94c07b4dca2dffbee8e50' 2025-12-04T08:54:21.4068738Z Submodule path 'third_party/pocketfft': checked out '0fa0ef591e38c2758e3184c6c23e497b9f732ffa' 2025-12-04T08:54:21.5388435Z Submodule path 'third_party/protobuf': checked out 'd1eca4e4b421cd2997495c4b4e65cea6be4e9b8a' 2025-12-04T08:54:21.5506283Z Submodule path 'third_party/protobuf/third_party/benchmark': checked out '5b7683f49e1e9223cf9927b24f6fd3d6bd82e3f8' 2025-12-04T08:54:21.5728620Z Submodule path 'third_party/protobuf/third_party/googletest': checked out '5ec7f0c4a113e2f18ac2c6cc7df51ad6afc24081' 2025-12-04T08:54:21.5807660Z Submodule path 'third_party/psimd': checked out '072586a71b55b7f8c584153d223e95687148a900' 2025-12-04T08:54:21.5894780Z Submodule path 'third_party/pthreadpool': checked out '4fe0e1e183925bf8cfa6aae24237e724a96479b8' 2025-12-04T08:54:21.6099327Z Submodule path 'third_party/pybind11': checked out 'f5fbe867d2d26e4a0a9177a51f6e568868ad3dc8' 2025-12-04T08:54:21.6327380Z Submodule path 'third_party/python-peachpy': checked out 'f45429b087dd7d5bc78bb40dc7cf06425c252d67' 2025-12-04T08:54:21.6594250Z Submodule path 'third_party/sleef': checked out '5a1d179df9cf652951b59010a2d2075372d67f68' 2025-12-04T08:54:21.6713710Z Submodule path 'third_party/tensorpipe': checked out '2b4cd91092d335a697416b2a3cb398283246849d' 2025-12-04T08:54:21.6893329Z Submodule path 'third_party/tensorpipe/third_party/googletest': checked out 'aee0f9d9b5b87796ee8a0ab26b7587ec30e8858e' 2025-12-04T08:54:21.7016795Z Submodule path 'third_party/tensorpipe/third_party/libnop': checked out '910b55815be16109f04f4180e9adee14fb4ce281' 2025-12-04T08:54:21.7317803Z Submodule path 'third_party/tensorpipe/third_party/libuv': checked out '5152db2cbfeb5582e9c27c5ea1dba2cd9e10759b' 2025-12-04T08:54:21.7467205Z Submodule path 'third_party/tensorpipe/third_party/pybind11': checked out 'a23996fce38ff6ccfbcdc09f1e63f2c4be5ea2ef' 2025-12-04T08:54:21.7555570Z Submodule path 'third_party/tensorpipe/third_party/pybind11/tools/clang': checked out '6a00cbc4a9b8e68b71caf7f774b3f9c753ae84d5' 2025-12-04T08:54:21.7603920Z [command]/usr/bin/git submodule foreach --recursive git config --local gc.auto 0 2025-12-04T08:54:21.7849595Z Entering 'android/libs/fbjni' 2025-12-04T08:54:21.7886205Z Entering 'third_party/FP16' 2025-12-04T08:54:21.7923876Z Entering 'third_party/FXdiv' 2025-12-04T08:54:21.7959491Z Entering 'third_party/NNPACK' 2025-12-04T08:54:21.7991489Z Entering 'third_party/NVTX' 2025-12-04T08:54:21.8031344Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:21.8057085Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:21.8099945Z Entering 'third_party/aiter' 2025-12-04T08:54:21.8130592Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:21.8162321Z Entering 'third_party/benchmark' 2025-12-04T08:54:21.8188505Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:21.8222026Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:21.8246575Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:21.8271500Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:21.8308322Z Entering 'third_party/cutlass' 2025-12-04T08:54:21.8353685Z Entering 'third_party/fbgemm' 2025-12-04T08:54:21.8392943Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:21.8419625Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:21.8466565Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:21.8494240Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:21.8540819Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:21.8590241Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:21.8616767Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:21.8663678Z Entering 'third_party/flash-attention' 2025-12-04T08:54:21.8697187Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:21.8723319Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:21.8763554Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:21.8804128Z Entering 'third_party/fmt' 2025-12-04T08:54:21.8833840Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:21.8869126Z Entering 'third_party/gloo' 2025-12-04T08:54:21.8899299Z Entering 'third_party/googletest' 2025-12-04T08:54:21.8943909Z Entering 'third_party/ideep' 2025-12-04T08:54:21.8979768Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:21.9018324Z Entering 'third_party/ittapi' 2025-12-04T08:54:21.9042594Z Entering 'third_party/kineto' 2025-12-04T08:54:21.9066664Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:21.9088408Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:21.9122443Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:21.9145101Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:21.9168726Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:21.9191468Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:21.9215660Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:21.9250193Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:21.9273706Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:21.9300928Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:21.9333726Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:21.9361359Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:21.9400421Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:21.9451915Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:21.9494908Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:21.9531704Z Entering 'third_party/kleidiai' 2025-12-04T08:54:21.9570043Z Entering 'third_party/mimalloc' 2025-12-04T08:54:21.9600096Z Entering 'third_party/nlohmann' 2025-12-04T08:54:21.9626318Z Entering 'third_party/onnx' 2025-12-04T08:54:21.9656959Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:21.9694948Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:21.9730672Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:21.9754481Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:21.9788203Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:21.9811544Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:21.9844507Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:21.9873518Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:21.9901707Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:21.9925169Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:21.9946235Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:21.9975609Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:22.0007470Z Entering 'third_party/pocketfft' 2025-12-04T08:54:22.0034338Z Entering 'third_party/protobuf' 2025-12-04T08:54:22.0058956Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:22.0099568Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:22.0126054Z Entering 'third_party/psimd' 2025-12-04T08:54:22.0171661Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:22.0195995Z Entering 'third_party/pybind11' 2025-12-04T08:54:22.0223700Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:22.0248049Z Entering 'third_party/sleef' 2025-12-04T08:54:22.0271030Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:22.0295891Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:22.0329673Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:22.0354087Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:22.0376716Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:22.0398663Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:22.0450053Z ##[endgroup] 2025-12-04T08:54:22.0450425Z ##[group]Persisting credentials for submodules 2025-12-04T08:54:22.0456446Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-12-04T08:54:22.0702886Z Entering 'android/libs/fbjni' 2025-12-04T08:54:22.0736105Z Entering 'third_party/FP16' 2025-12-04T08:54:22.0759118Z Entering 'third_party/FXdiv' 2025-12-04T08:54:22.0787557Z Entering 'third_party/NNPACK' 2025-12-04T08:54:22.0825466Z Entering 'third_party/NVTX' 2025-12-04T08:54:22.0851418Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:22.0885599Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:22.0934135Z Entering 'third_party/aiter' 2025-12-04T08:54:22.0983354Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:22.1022898Z Entering 'third_party/benchmark' 2025-12-04T08:54:22.1046366Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:22.1087345Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:22.1115726Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:22.1142174Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:22.1170957Z Entering 'third_party/cutlass' 2025-12-04T08:54:22.1196913Z Entering 'third_party/fbgemm' 2025-12-04T08:54:22.1222253Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:22.1241721Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:22.1280584Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:22.1314464Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:22.1339072Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:22.1359358Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:22.1378315Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:22.1424018Z Entering 'third_party/flash-attention' 2025-12-04T08:54:22.1458889Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:22.1503882Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:22.1546481Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:22.1575545Z Entering 'third_party/fmt' 2025-12-04T08:54:22.1603894Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:22.1636963Z Entering 'third_party/gloo' 2025-12-04T08:54:22.1670859Z Entering 'third_party/googletest' 2025-12-04T08:54:22.1710286Z Entering 'third_party/ideep' 2025-12-04T08:54:22.1753931Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:22.1788634Z Entering 'third_party/ittapi' 2025-12-04T08:54:22.1816362Z Entering 'third_party/kineto' 2025-12-04T08:54:22.1841096Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:22.1878129Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:22.1909343Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:22.1949207Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:22.1998850Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:22.2052675Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:22.2081626Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:22.2116806Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:22.2167506Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:22.2205346Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:22.2247543Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:22.2280969Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:22.2324050Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:22.2362926Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:22.2393830Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:22.2436028Z Entering 'third_party/kleidiai' 2025-12-04T08:54:22.2478214Z Entering 'third_party/mimalloc' 2025-12-04T08:54:22.2526589Z Entering 'third_party/nlohmann' 2025-12-04T08:54:22.2573535Z Entering 'third_party/onnx' 2025-12-04T08:54:22.2620322Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:22.2666969Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:22.2717570Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:22.2750227Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:22.2782938Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:22.2802548Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:22.2823325Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:22.2843782Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:22.2862643Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:22.2883137Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:22.2920456Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:22.2965512Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:22.2996106Z Entering 'third_party/pocketfft' 2025-12-04T08:54:22.3028189Z Entering 'third_party/protobuf' 2025-12-04T08:54:22.3067116Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:22.3100513Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:22.3134304Z Entering 'third_party/psimd' 2025-12-04T08:54:22.3163221Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:22.3182628Z Entering 'third_party/pybind11' 2025-12-04T08:54:22.3227811Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:22.3274726Z Entering 'third_party/sleef' 2025-12-04T08:54:22.3315030Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:22.3356312Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:22.3410250Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:22.3457581Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:22.3483938Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:22.3509442Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:22.3565170Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-12-04T08:54:22.3832959Z Entering 'android/libs/fbjni' 2025-12-04T08:54:22.3863743Z file:/home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T08:54:22.3884857Z Entering 'third_party/FP16' 2025-12-04T08:54:22.3933649Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T08:54:22.3945525Z Entering 'third_party/FXdiv' 2025-12-04T08:54:22.3984109Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T08:54:22.4004865Z Entering 'third_party/NNPACK' 2025-12-04T08:54:22.4051535Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T08:54:22.4062303Z Entering 'third_party/NVTX' 2025-12-04T08:54:22.4102616Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T08:54:22.4115332Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:22.4160790Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T08:54:22.4172010Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:22.4223543Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T08:54:22.4241276Z Entering 'third_party/aiter' 2025-12-04T08:54:22.4267976Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T08:54:22.4278394Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:22.4318389Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T08:54:22.4342721Z Entering 'third_party/benchmark' 2025-12-04T08:54:22.4369742Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:22.4382055Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:22.4415006Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T08:54:22.4439691Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:22.4481081Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T08:54:22.4502466Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:22.4540450Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T08:54:22.4552156Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:22.4594354Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T08:54:22.4613105Z Entering 'third_party/cutlass' 2025-12-04T08:54:22.4667878Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T08:54:22.4693080Z Entering 'third_party/fbgemm' 2025-12-04T08:54:22.4741120Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T08:54:22.4756720Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:22.4820708Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T08:54:22.4842942Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:22.4875783Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T08:54:22.4889567Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:22.4918729Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T08:54:22.4927875Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:22.4958750Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T08:54:22.4973146Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:22.5008525Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T08:54:22.5027426Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:22.5052558Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T08:54:22.5072412Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:22.5094953Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T08:54:22.5106708Z Entering 'third_party/flash-attention' 2025-12-04T08:54:22.5158818Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T08:54:22.5184015Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:22.5231587Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T08:54:22.5243888Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:22.5283591Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T08:54:22.5311086Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:22.5352993Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T08:54:22.5376450Z Entering 'third_party/fmt' 2025-12-04T08:54:22.5415575Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:22.5436077Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:22.5474575Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T08:54:22.5487724Z Entering 'third_party/gloo' 2025-12-04T08:54:22.5515122Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T08:54:22.5529295Z Entering 'third_party/googletest' 2025-12-04T08:54:22.5559593Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:22.5573079Z Entering 'third_party/ideep' 2025-12-04T08:54:22.5620227Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T08:54:22.5642709Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:22.5682950Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T08:54:22.5712701Z Entering 'third_party/ittapi' 2025-12-04T08:54:22.5747423Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T08:54:22.5761087Z Entering 'third_party/kineto' 2025-12-04T08:54:22.5799916Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T08:54:22.5823043Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:22.5859339Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T08:54:22.5883609Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:22.5930500Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T08:54:22.5953446Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:22.5995925Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T08:54:22.6006287Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:22.6039526Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:22.6060632Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:22.6104542Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T08:54:22.6120066Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:22.6152808Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T08:54:22.6165408Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:22.6218615Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T08:54:22.6227002Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:22.6268111Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:22.6287144Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:22.6320321Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T08:54:22.6329537Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:22.6364950Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T08:54:22.6374807Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:22.6417310Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:22.6428873Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:22.6460742Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:22.6473303Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:22.6498433Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:22.6514423Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:22.6549810Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T08:54:22.6560076Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:22.6593202Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T08:54:22.6607079Z Entering 'third_party/kleidiai' 2025-12-04T08:54:22.6634364Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T08:54:22.6646349Z Entering 'third_party/mimalloc' 2025-12-04T08:54:22.6680333Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T08:54:22.6693941Z Entering 'third_party/nlohmann' 2025-12-04T08:54:22.6738804Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T08:54:22.6749093Z Entering 'third_party/onnx' 2025-12-04T08:54:22.6792281Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T08:54:22.6812261Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:22.6832844Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:22.6845768Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:22.6866033Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T08:54:22.6875831Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:22.6921502Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:22.6944067Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:22.6997553Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:22.7013288Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:22.7040579Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T08:54:22.7054042Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:22.7096219Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T08:54:22.7109992Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:22.7154939Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T08:54:22.7170579Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:22.7216983Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T08:54:22.7242298Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:22.7281452Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:22.7302200Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:22.7333365Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:22.7346937Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:22.7373632Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:22.7397578Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:22.7428003Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T08:54:22.7467480Z Entering 'third_party/pocketfft' 2025-12-04T08:54:22.7522844Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T08:54:22.7537802Z Entering 'third_party/protobuf' 2025-12-04T08:54:22.7573139Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T08:54:22.7587154Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:22.7618732Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:22.7635572Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:22.7680342Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:22.7704424Z Entering 'third_party/psimd' 2025-12-04T08:54:22.7729656Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T08:54:22.7745879Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:22.7780047Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T08:54:22.7806661Z Entering 'third_party/pybind11' 2025-12-04T08:54:22.7845724Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:22.7874084Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:22.7913771Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T08:54:22.7930207Z Entering 'third_party/sleef' 2025-12-04T08:54:22.7964622Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T08:54:22.7980994Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:22.8017206Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T08:54:22.8030780Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:22.8063884Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:22.8072866Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:22.8109546Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T08:54:22.8120061Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:22.8149853Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T08:54:22.8165906Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:22.8200886Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:22.8213111Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:22.8274425Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T08:54:22.8555425Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-12-04T08:54:22.8826091Z Entering 'android/libs/fbjni' 2025-12-04T08:54:22.8856582Z Entering 'third_party/FP16' 2025-12-04T08:54:22.8895141Z Entering 'third_party/FXdiv' 2025-12-04T08:54:22.8954590Z Entering 'third_party/NNPACK' 2025-12-04T08:54:22.9003315Z Entering 'third_party/NVTX' 2025-12-04T08:54:22.9044678Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:22.9102454Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:22.9148254Z Entering 'third_party/aiter' 2025-12-04T08:54:22.9177707Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:22.9223197Z Entering 'third_party/benchmark' 2025-12-04T08:54:22.9257660Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:22.9297447Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:22.9326225Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:22.9349196Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:22.9372897Z Entering 'third_party/cutlass' 2025-12-04T08:54:22.9414610Z Entering 'third_party/fbgemm' 2025-12-04T08:54:22.9442276Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:22.9461598Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:22.9504083Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:22.9526691Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:22.9550936Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:22.9571127Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:22.9602286Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:22.9635252Z Entering 'third_party/flash-attention' 2025-12-04T08:54:22.9665757Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:22.9712421Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:22.9765277Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:22.9797644Z Entering 'third_party/fmt' 2025-12-04T08:54:22.9848965Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:22.9871228Z Entering 'third_party/gloo' 2025-12-04T08:54:22.9892645Z Entering 'third_party/googletest' 2025-12-04T08:54:22.9921411Z Entering 'third_party/ideep' 2025-12-04T08:54:22.9953435Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:22.9985512Z Entering 'third_party/ittapi' 2025-12-04T08:54:23.0014584Z Entering 'third_party/kineto' 2025-12-04T08:54:23.0048907Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:23.0084928Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:23.0122344Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:23.0146005Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:23.0168408Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:23.0190327Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:23.0226490Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:23.0254107Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:23.0279396Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:23.0300587Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:23.0321663Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:23.0355885Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:23.0377457Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:23.0403448Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:23.0424913Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:23.0459137Z Entering 'third_party/kleidiai' 2025-12-04T08:54:23.0481071Z Entering 'third_party/mimalloc' 2025-12-04T08:54:23.0501965Z Entering 'third_party/nlohmann' 2025-12-04T08:54:23.0531899Z Entering 'third_party/onnx' 2025-12-04T08:54:23.0561906Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:23.0598644Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:23.0642872Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:23.0663820Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:23.0696730Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:23.0734508Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:23.0761268Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:23.0799168Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:23.0820758Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:23.0853083Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:23.0884787Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:23.0906144Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:23.0943188Z Entering 'third_party/pocketfft' 2025-12-04T08:54:23.0981557Z Entering 'third_party/protobuf' 2025-12-04T08:54:23.1009732Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:23.1029622Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:23.1052644Z Entering 'third_party/psimd' 2025-12-04T08:54:23.1083320Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:23.1113579Z Entering 'third_party/pybind11' 2025-12-04T08:54:23.1136372Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:23.1159033Z Entering 'third_party/sleef' 2025-12-04T08:54:23.1181748Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:23.1221051Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:23.1241546Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:23.1259957Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:23.1287623Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:23.1307434Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:23.1362838Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-12-04T08:54:23.1566691Z Entering 'android/libs/fbjni' 2025-12-04T08:54:23.1594680Z Entering 'third_party/FP16' 2025-12-04T08:54:23.1628959Z Entering 'third_party/FXdiv' 2025-12-04T08:54:23.1661086Z Entering 'third_party/NNPACK' 2025-12-04T08:54:23.1693126Z Entering 'third_party/NVTX' 2025-12-04T08:54:23.1713444Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:23.1734978Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:23.1797336Z Entering 'third_party/aiter' 2025-12-04T08:54:23.1841455Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:23.1890358Z Entering 'third_party/benchmark' 2025-12-04T08:54:23.1929539Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:23.1960786Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:23.1984044Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:23.2018035Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:23.2051395Z Entering 'third_party/cutlass' 2025-12-04T08:54:23.2100463Z Entering 'third_party/fbgemm' 2025-12-04T08:54:23.2135082Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:23.2160167Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:23.2185513Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:23.2211286Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:23.2233236Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:23.2270857Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:23.2291315Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:23.2314152Z Entering 'third_party/flash-attention' 2025-12-04T08:54:23.2350975Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:23.2390781Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:23.2432228Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:23.2469618Z Entering 'third_party/fmt' 2025-12-04T08:54:23.2491581Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:23.2525208Z Entering 'third_party/gloo' 2025-12-04T08:54:23.2546636Z Entering 'third_party/googletest' 2025-12-04T08:54:23.2567007Z Entering 'third_party/ideep' 2025-12-04T08:54:23.2587489Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:23.2617306Z Entering 'third_party/ittapi' 2025-12-04T08:54:23.2654855Z Entering 'third_party/kineto' 2025-12-04T08:54:23.2694687Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:23.2740737Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:23.2796883Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:23.2843450Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:23.2883410Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:23.2914862Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:23.2955376Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:23.2990095Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:23.3012939Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:23.3042191Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:23.3064162Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:23.3085293Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:23.3106812Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:23.3135267Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:23.3163148Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:23.3207841Z Entering 'third_party/kleidiai' 2025-12-04T08:54:23.3259356Z Entering 'third_party/mimalloc' 2025-12-04T08:54:23.3299901Z Entering 'third_party/nlohmann' 2025-12-04T08:54:23.3335958Z Entering 'third_party/onnx' 2025-12-04T08:54:23.3386656Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:23.3417895Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:23.3446933Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:23.3473518Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:23.3511268Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:23.3551193Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:23.3587392Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:23.3630890Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:23.3665449Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:23.3699210Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:23.3745036Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:23.3786526Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:23.3827473Z Entering 'third_party/pocketfft' 2025-12-04T08:54:23.3852678Z Entering 'third_party/protobuf' 2025-12-04T08:54:23.3878484Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:23.3920351Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:23.3966644Z Entering 'third_party/psimd' 2025-12-04T08:54:23.3992381Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:23.4017265Z Entering 'third_party/pybind11' 2025-12-04T08:54:23.4069455Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:23.4095944Z Entering 'third_party/sleef' 2025-12-04T08:54:23.4122801Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:23.4154966Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:23.4187425Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:23.4234311Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:23.4268744Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:23.4296175Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:23.4357547Z ##[endgroup] 2025-12-04T08:54:23.4568208Z [command]/usr/bin/git log -1 --format=%H 2025-12-04T08:54:23.4651940Z ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:23.4866126Z ##[group]Run actions/checkout@v4 2025-12-04T08:54:23.4866374Z with: 2025-12-04T08:54:23.4866597Z ref: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:23.4866854Z fetch-depth: 0 2025-12-04T08:54:23.4867044Z submodules: recursive 2025-12-04T08:54:23.4867241Z show-progress: false 2025-12-04T08:54:23.4867459Z repository: pytorch/pytorch 2025-12-04T08:54:23.4867750Z token: *** 2025-12-04T08:54:23.4867926Z ssh-strict: true 2025-12-04T08:54:23.4868171Z ssh-user: git 2025-12-04T08:54:23.4868359Z persist-credentials: true 2025-12-04T08:54:23.4868572Z clean: true 2025-12-04T08:54:23.4868769Z sparse-checkout-cone-mode: true 2025-12-04T08:54:23.4869009Z fetch-tags: false 2025-12-04T08:54:23.4869190Z lfs: false 2025-12-04T08:54:23.4869366Z set-safe-directory: true 2025-12-04T08:54:23.4869566Z env: 2025-12-04T08:54:23.4869739Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:23.4869937Z ##[endgroup] 2025-12-04T08:54:23.5342873Z Syncing repository: pytorch/pytorch 2025-12-04T08:54:23.5343553Z ##[group]Getting Git version info 2025-12-04T08:54:23.5344071Z Working directory is '/home/runner/_work/pytorch/pytorch' 2025-12-04T08:54:23.5357111Z [command]/usr/bin/git version 2025-12-04T08:54:23.5386868Z git version 2.52.0 2025-12-04T08:54:23.5401134Z ##[endgroup] 2025-12-04T08:54:23.5407847Z Copying '/home/runner/.gitconfig' to '/home/runner/_work/_temp/058bb40f-cb54-4573-b0fa-55fdc50ecb67/.gitconfig' 2025-12-04T08:54:23.5418215Z Temporarily overriding HOME='/home/runner/_work/_temp/058bb40f-cb54-4573-b0fa-55fdc50ecb67' before making global git config changes 2025-12-04T08:54:23.5419230Z Adding repository directory to the temporary git global config as a safe directory 2025-12-04T08:54:23.5421037Z [command]/usr/bin/git config --global --add safe.directory /home/runner/_work/pytorch/pytorch 2025-12-04T08:54:23.5466330Z [command]/usr/bin/git config --local --get remote.origin.url 2025-12-04T08:54:23.5490356Z https://github.com/pytorch/pytorch 2025-12-04T08:54:23.5502208Z ##[group]Removing previously created refs, to avoid conflicts 2025-12-04T08:54:23.5504460Z [command]/usr/bin/git rev-parse --symbolic-full-name --verify --quiet HEAD 2025-12-04T08:54:23.5530981Z HEAD 2025-12-04T08:54:23.5576804Z ##[endgroup] 2025-12-04T08:54:23.5577924Z [command]/usr/bin/git submodule status 2025-12-04T08:54:23.5842716Z 7e1e1fe3858c63c251c637ae41a20de425dde96f android/libs/fbjni (v0.1.0-12-g7e1e1fe) 2025-12-04T08:54:23.5904349Z 4dfe081cf6bcd15db339cf2680b9281b8451eeb3 third_party/FP16 (4dfe081) 2025-12-04T08:54:23.5955738Z b408327ac2a15ec3e43352421954f5b1967701d1 third_party/FXdiv (b408327) 2025-12-04T08:54:23.6005696Z c07e3a0400713d546e0dea2d5466dd22ea389c73 third_party/NNPACK (c07e3a0) 2025-12-04T08:54:23.6042865Z 3ebbc93ded7285963bff932c678fa367eb393ba6 third_party/NVTX (v3.1.0-313-g3ebbc93) 2025-12-04T08:54:23.6103954Z 1d8f600fd424278486eade7ed3e877c99f0846b1 third_party/VulkanMemoryAllocator (v2.1.0-982-g1d8f600) 2025-12-04T08:54:23.6397927Z 51a0103656eff6fc9bfd39a4597923c4b542c883 third_party/XNNPACK (remotes/origin/ds/ndk-1243-g51a0103656) 2025-12-04T08:54:23.6441710Z 01aae101b9e5e94d6c16a9514c9fb8df99c93150 third_party/aiter (v0.1.1-92-g01aae101) 2025-12-04T08:54:23.6460539Z 299e5928955cc62af9968370293b916f5130916f third_party/benchmark (v1.9.3) 2025-12-04T08:54:23.6522240Z 7fe50dc3da2069d6645d9deb8c017a876472a977 third_party/composable_kernel (rocm-6.4.3-459-g7fe50dc3d) 2025-12-04T08:54:23.6633138Z 89c932f313c6437c38f2982869beacc89c2f2246 third_party/cpp-httplib (v0.26.0) 2025-12-04T08:54:23.6729297Z f858c30bcb16f8effd5ff46996f0514539e17abc third_party/cpuinfo (f858c30) 2025-12-04T08:54:23.6761924Z 0b1577c8c83401237d601d0d0db5210506705396 third_party/cudnn_frontend (v0.5-61-g0b1577c) 2025-12-04T08:54:23.6844103Z f88806b1e31dfa579842638740216dd41fc6c588 third_party/cutlass (v4.3.1) 2025-12-04T08:54:23.6886903Z c0b988d39a9e47c794d699f29930ed4d7c7e13a4 third_party/fbgemm (v1.4.0-rc1-2-gc0b988d39) 2025-12-04T08:54:23.6950198Z 979702c87a8713a8e0a5e9fee122b90d2ef13be5 third_party/flash-attention (v2.7.4) 2025-12-04T08:54:23.6970280Z a2cd1ea3b6d3fee220106b5fed3f7ce8da9eb757 third_party/flatbuffers (v24.12.23) 2025-12-04T08:54:23.7232668Z 407c905e45ad75fc29bf0f9bb7c5c2fd3475976f third_party/fmt (12.1.0) 2025-12-04T08:54:23.7364020Z 3fb5c176c17c765a3492cd2f0321b0dab712f350 third_party/gemmlowp/gemmlowp (remotes/origin/revert-87-master-135-g3fb5c17) 2025-12-04T08:54:23.7464901Z 54cbae0d3a67fa890b4c3d9ee162b7860315e341 third_party/gloo (remotes/origin/gh/c-p-i-o/1/base-37-g54cbae0) 2025-12-04T08:54:23.7624454Z 52eb8108c5bdec04579160ae17225d66034bd723 third_party/googletest (release-1.8.0-3544-g52eb8108) 2025-12-04T08:54:23.7715775Z 719d8e6cd7f7a0e01b155657526d693acf97c2b3 third_party/ideep (pytorch-rls-v3.7.1) 2025-12-04T08:54:23.7782996Z dec1d23ca65ab069d225dfe40dea14f455170959 third_party/ittapi (v3.25.5) 2025-12-04T08:54:23.7958558Z 31f85df8fbd89c188f14ef10f1ec65379786b943 third_party/kineto (heads/main) 2025-12-04T08:54:23.8000411Z d7770c89632329a9914ef1a90289917597639cbe third_party/kleidiai (v1.15.0) 2025-12-04T08:54:23.8039164Z fbd8b99c2b828428947d70fdc046bb55609be93e third_party/mimalloc (v2.2.4) 2025-12-04T08:54:23.8072920Z 55f93686c01528224f448c19128836e7df245f72 third_party/nlohmann (v3.12.0) 2025-12-04T08:54:23.8304902Z e709452ef2bbc1d113faf678c24e6d3467696e83 third_party/onnx (v1.18.0) 2025-12-04T08:54:23.8336014Z a799f4aed9c94b765dcdaabaeab7d5e7e2310878 third_party/opentelemetry-cpp (v1.14.2) 2025-12-04T08:54:23.8382197Z 0fa0ef591e38c2758e3184c6c23e497b9f732ffa third_party/pocketfft (release_for_eigen-40-g0fa0ef5) 2025-12-04T08:54:23.8609294Z d1eca4e4b421cd2997495c4b4e65cea6be4e9b8a third_party/protobuf (v3.7.0-rc.2-1279-gd1eca4e4b) 2025-12-04T08:54:23.8718371Z 072586a71b55b7f8c584153d223e95687148a900 third_party/psimd (heads/master) 2025-12-04T08:54:23.8773286Z 4fe0e1e183925bf8cfa6aae24237e724a96479b8 third_party/pthreadpool (0.1-144-g4fe0e1e) 2025-12-04T08:54:23.8810924Z f5fbe867d2d26e4a0a9177a51f6e568868ad3dc8 third_party/pybind11 (v3.0.1) 2025-12-04T08:54:23.8906894Z f45429b087dd7d5bc78bb40dc7cf06425c252d67 third_party/python-peachpy (remotes/origin/pre-generated) 2025-12-04T08:54:23.8993367Z 5a1d179df9cf652951b59010a2d2075372d67f68 third_party/sleef (3.8) 2025-12-04T08:54:23.9060334Z 2b4cd91092d335a697416b2a3cb398283246849d third_party/tensorpipe (heads/main) 2025-12-04T08:54:23.9078593Z ##[group]Cleaning the repository 2025-12-04T08:54:23.9085782Z [command]/usr/bin/git clean -ffdx 2025-12-04T08:54:23.9215249Z [command]/usr/bin/git reset --hard HEAD 2025-12-04T08:54:24.0035202Z HEAD is now at ffd9b0fb4355 Resolve collective autotuning test failure on arm (#168919) 2025-12-04T08:54:24.0112480Z ##[endgroup] 2025-12-04T08:54:24.0116914Z ##[group]Disabling automatic garbage collection 2025-12-04T08:54:24.0126943Z [command]/usr/bin/git config --local gc.auto 0 2025-12-04T08:54:24.0169526Z ##[endgroup] 2025-12-04T08:54:24.0169998Z ##[group]Setting up auth 2025-12-04T08:54:24.0181054Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-12-04T08:54:24.0220016Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-12-04T08:54:24.0544594Z Entering 'android/libs/fbjni' 2025-12-04T08:54:24.0586478Z Entering 'third_party/FP16' 2025-12-04T08:54:24.0617062Z Entering 'third_party/FXdiv' 2025-12-04T08:54:24.0671379Z Entering 'third_party/NNPACK' 2025-12-04T08:54:24.0696292Z Entering 'third_party/NVTX' 2025-12-04T08:54:24.0747325Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:24.0771247Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:24.0812362Z Entering 'third_party/aiter' 2025-12-04T08:54:24.0854133Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:24.0900207Z Entering 'third_party/benchmark' 2025-12-04T08:54:24.0931729Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:24.0970885Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:24.0996683Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:24.1019919Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:24.1042517Z Entering 'third_party/cutlass' 2025-12-04T08:54:24.1078012Z Entering 'third_party/fbgemm' 2025-12-04T08:54:24.1103905Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:24.1137383Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:24.1173058Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:24.1201262Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:24.1228338Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:24.1254588Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:24.1289357Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:24.1337724Z Entering 'third_party/flash-attention' 2025-12-04T08:54:24.1374067Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:24.1429976Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:24.1482361Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:24.1517063Z Entering 'third_party/fmt' 2025-12-04T08:54:24.1546190Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:24.1568205Z Entering 'third_party/gloo' 2025-12-04T08:54:24.1598738Z Entering 'third_party/googletest' 2025-12-04T08:54:24.1625252Z Entering 'third_party/ideep' 2025-12-04T08:54:24.1676659Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:24.1746184Z Entering 'third_party/ittapi' 2025-12-04T08:54:24.1799372Z Entering 'third_party/kineto' 2025-12-04T08:54:24.1834534Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:24.1858132Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:24.1903459Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:24.1960734Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:24.2002980Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:24.2053578Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:24.2099460Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:24.2123645Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:24.2161052Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:24.2183437Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:24.2216924Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:24.2247121Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:24.2292704Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:24.2329818Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:24.2373908Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:24.2433205Z Entering 'third_party/kleidiai' 2025-12-04T08:54:24.2490520Z Entering 'third_party/mimalloc' 2025-12-04T08:54:24.2548345Z Entering 'third_party/nlohmann' 2025-12-04T08:54:24.2585144Z Entering 'third_party/onnx' 2025-12-04T08:54:24.2648361Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:24.2692165Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:24.2730513Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:24.2777215Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:24.2817102Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:24.2841219Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:24.2880213Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:24.2914805Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:24.2946294Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:24.2972470Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:24.3004731Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:24.3030549Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:24.3095940Z Entering 'third_party/pocketfft' 2025-12-04T08:54:24.3132231Z Entering 'third_party/protobuf' 2025-12-04T08:54:24.3183768Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:24.3222123Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:24.3248496Z Entering 'third_party/psimd' 2025-12-04T08:54:24.3294157Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:24.3349153Z Entering 'third_party/pybind11' 2025-12-04T08:54:24.3389239Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:24.3439384Z Entering 'third_party/sleef' 2025-12-04T08:54:24.3489408Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:24.3535683Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:24.3578842Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:24.3618941Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:24.3656459Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:24.3682734Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:24.3764720Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-12-04T08:54:24.3800543Z http.https://github.com/.extraheader 2025-12-04T08:54:24.3817117Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-12-04T08:54:24.3848527Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-12-04T08:54:24.4056002Z Entering 'android/libs/fbjni' 2025-12-04T08:54:24.4082838Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4113148Z Entering 'third_party/FP16' 2025-12-04T08:54:24.4132396Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4150095Z Entering 'third_party/FXdiv' 2025-12-04T08:54:24.4169632Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4203020Z Entering 'third_party/NNPACK' 2025-12-04T08:54:24.4217004Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4243618Z Entering 'third_party/NVTX' 2025-12-04T08:54:24.4268807Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4289455Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:24.4304610Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4323636Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:24.4344904Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4387884Z Entering 'third_party/aiter' 2025-12-04T08:54:24.4420041Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4462379Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:24.4492181Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4519377Z Entering 'third_party/benchmark' 2025-12-04T08:54:24.4553654Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4591203Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:24.4608478Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4637523Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:24.4663992Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4695817Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:24.4728146Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4747785Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:24.4764756Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4787671Z Entering 'third_party/cutlass' 2025-12-04T08:54:24.4802438Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4837456Z Entering 'third_party/fbgemm' 2025-12-04T08:54:24.4856430Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4878338Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:24.4903913Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4926426Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:24.4953881Z http.https://github.com/.extraheader 2025-12-04T08:54:24.4989395Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:24.5004107Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5035516Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:24.5055522Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5099754Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:24.5123388Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5153182Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:24.5174601Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5206404Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:24.5225454Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5248566Z Entering 'third_party/flash-attention' 2025-12-04T08:54:24.5270349Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5292661Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:24.5304896Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5328336Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:24.5347241Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5369199Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:24.5386103Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5406374Z Entering 'third_party/fmt' 2025-12-04T08:54:24.5435355Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5464763Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:24.5481372Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5522794Z Entering 'third_party/gloo' 2025-12-04T08:54:24.5554106Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5585371Z Entering 'third_party/googletest' 2025-12-04T08:54:24.5615587Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5647142Z Entering 'third_party/ideep' 2025-12-04T08:54:24.5677606Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5701302Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:24.5720553Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5747693Z Entering 'third_party/ittapi' 2025-12-04T08:54:24.5769110Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5803393Z Entering 'third_party/kineto' 2025-12-04T08:54:24.5837568Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5857631Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:24.5871689Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5895843Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:24.5916570Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5936122Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:24.5957697Z http.https://github.com/.extraheader 2025-12-04T08:54:24.5979350Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:24.6003336Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6021728Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:24.6035080Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6052033Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:24.6081613Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6118276Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:24.6147574Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6177107Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:24.6191149Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6220639Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:24.6234153Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6255869Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:24.6275688Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6300490Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:24.6314036Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6333512Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:24.6362999Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6396190Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:24.6409970Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6434404Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:24.6447031Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6465431Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:24.6478723Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6505744Z Entering 'third_party/kleidiai' 2025-12-04T08:54:24.6525607Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6555717Z Entering 'third_party/mimalloc' 2025-12-04T08:54:24.6576470Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6595671Z Entering 'third_party/nlohmann' 2025-12-04T08:54:24.6611081Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6636595Z Entering 'third_party/onnx' 2025-12-04T08:54:24.6665750Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6712442Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:24.6742313Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6769464Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:24.6799457Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6826097Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:24.6839516Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6868206Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:24.6898275Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6927705Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:24.6949487Z http.https://github.com/.extraheader 2025-12-04T08:54:24.6976683Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:24.7010314Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7041653Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:24.7068507Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7098392Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:24.7113318Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7131201Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:24.7148661Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7182758Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:24.7196933Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7223395Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:24.7236772Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7268651Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:24.7286337Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7315218Z Entering 'third_party/pocketfft' 2025-12-04T08:54:24.7330475Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7351372Z Entering 'third_party/protobuf' 2025-12-04T08:54:24.7365027Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7400258Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:24.7428561Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7463327Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:24.7476755Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7498002Z Entering 'third_party/psimd' 2025-12-04T08:54:24.7519572Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7550476Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:24.7566022Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7604028Z Entering 'third_party/pybind11' 2025-12-04T08:54:24.7619624Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7642451Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:24.7667759Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7685575Z Entering 'third_party/sleef' 2025-12-04T08:54:24.7713951Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7743713Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:24.7758211Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7794471Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:24.7827459Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7846147Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:24.7881823Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7914889Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:24.7927863Z http.https://github.com/.extraheader 2025-12-04T08:54:24.7956739Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:24.7992299Z http.https://github.com/.extraheader 2025-12-04T08:54:24.8011790Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:24.8038159Z http.https://github.com/.extraheader 2025-12-04T08:54:24.8086917Z [command]/usr/bin/git config --local --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:24.8130829Z [command]/usr/bin/git submodule foreach --recursive git config --local --show-origin --name-only --get-regexp remote.origin.url 2025-12-04T08:54:24.8420517Z Entering 'android/libs/fbjni' 2025-12-04T08:54:24.8440127Z file:/home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T08:54:24.8452359Z Entering 'third_party/FP16' 2025-12-04T08:54:24.8475774Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T08:54:24.8497298Z Entering 'third_party/FXdiv' 2025-12-04T08:54:24.8510500Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T08:54:24.8520653Z Entering 'third_party/NNPACK' 2025-12-04T08:54:24.8538720Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T08:54:24.8554189Z Entering 'third_party/NVTX' 2025-12-04T08:54:24.8581393Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T08:54:24.8604483Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:24.8624849Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T08:54:24.8644386Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:24.8667296Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T08:54:24.8703946Z Entering 'third_party/aiter' 2025-12-04T08:54:24.8731019Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T08:54:24.8752879Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:24.8765995Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T08:54:24.8794281Z Entering 'third_party/benchmark' 2025-12-04T08:54:24.8806955Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:24.8823823Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:24.8838857Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T08:54:24.8849773Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:24.8871435Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T08:54:24.8887774Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:24.8904796Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T08:54:24.8914691Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:24.8925614Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T08:54:24.8935326Z Entering 'third_party/cutlass' 2025-12-04T08:54:24.8954083Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T08:54:24.8967429Z Entering 'third_party/fbgemm' 2025-12-04T08:54:24.8979541Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T08:54:24.9001276Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:24.9028887Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T08:54:24.9047284Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:24.9060401Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T08:54:24.9074378Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:24.9092094Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T08:54:24.9101983Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:24.9123361Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T08:54:24.9135551Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:24.9164707Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T08:54:24.9184158Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:24.9210773Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T08:54:24.9220893Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:24.9239237Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T08:54:24.9250794Z Entering 'third_party/flash-attention' 2025-12-04T08:54:24.9269776Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T08:54:24.9281033Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:24.9294591Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T08:54:24.9321124Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:24.9333961Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T08:54:24.9364381Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:24.9389386Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T08:54:24.9406946Z Entering 'third_party/fmt' 2025-12-04T08:54:24.9419859Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:24.9441816Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:24.9469211Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T08:54:24.9480448Z Entering 'third_party/gloo' 2025-12-04T08:54:24.9492743Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T08:54:24.9506037Z Entering 'third_party/googletest' 2025-12-04T08:54:24.9517039Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:24.9539482Z Entering 'third_party/ideep' 2025-12-04T08:54:24.9552142Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T08:54:24.9571754Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:24.9596476Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T08:54:24.9612081Z Entering 'third_party/ittapi' 2025-12-04T08:54:24.9629282Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T08:54:24.9640191Z Entering 'third_party/kineto' 2025-12-04T08:54:24.9668398Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T08:54:24.9689782Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:24.9706969Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T08:54:24.9716171Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:24.9738319Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T08:54:24.9750682Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:24.9770016Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T08:54:24.9780290Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:24.9790859Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:24.9811074Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:24.9834168Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T08:54:24.9853210Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:24.9869813Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T08:54:24.9881375Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:24.9899634Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T08:54:24.9908441Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:24.9925128Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:24.9942645Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:24.9969807Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T08:54:24.9980358Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:24.9991388Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T08:54:25.0012074Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:25.0036271Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:25.0047018Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:25.0073996Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:25.0097421Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:25.0109423Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:25.0123483Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:25.0147503Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T08:54:25.0156238Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:25.0183066Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T08:54:25.0196151Z Entering 'third_party/kleidiai' 2025-12-04T08:54:25.0207014Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T08:54:25.0221688Z Entering 'third_party/mimalloc' 2025-12-04T08:54:25.0241169Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T08:54:25.0256655Z Entering 'third_party/nlohmann' 2025-12-04T08:54:25.0277142Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T08:54:25.0291086Z Entering 'third_party/onnx' 2025-12-04T08:54:25.0302898Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T08:54:25.0319706Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:25.0338878Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:25.0361901Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:25.0380273Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T08:54:25.0391185Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:25.0416537Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:25.0426789Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:25.0438234Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:25.0447166Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:25.0469511Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T08:54:25.0479616Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:25.0497869Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T08:54:25.0508148Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:25.0527779Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T08:54:25.0537908Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:25.0548949Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T08:54:25.0559891Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:25.0582584Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:25.0591930Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:25.0604088Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:25.0626643Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:25.0643888Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:25.0655854Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:25.0667233Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T08:54:25.0687293Z Entering 'third_party/pocketfft' 2025-12-04T08:54:25.0700490Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T08:54:25.0709638Z Entering 'third_party/protobuf' 2025-12-04T08:54:25.0719946Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T08:54:25.0730016Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:25.0746383Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:25.0756038Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:25.0766335Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:25.0776790Z Entering 'third_party/psimd' 2025-12-04T08:54:25.0793773Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T08:54:25.0804702Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:25.0816410Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T08:54:25.0828035Z Entering 'third_party/pybind11' 2025-12-04T08:54:25.0839861Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:25.0854956Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:25.0866658Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T08:54:25.0876365Z Entering 'third_party/sleef' 2025-12-04T08:54:25.0893011Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T08:54:25.0902944Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:25.0920453Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T08:54:25.0930689Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:25.0958011Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:25.0977343Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:25.0988168Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T08:54:25.0996903Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:25.1019917Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T08:54:25.1029476Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:25.1041800Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:25.1050851Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:25.1079100Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T08:54:25.1117627Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1145082Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1184700Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1216885Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1239916Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1274402Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1297352Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1318365Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1352535Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1375291Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1411763Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1447390Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1487932Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1518600Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1541317Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1562614Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1584819Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1619144Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1656405Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1694564Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1730772Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1752734Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1775212Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1797269Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1821322Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1843426Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1865598Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1900091Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1938001Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1962513Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.1993158Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2017280Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2051805Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2077971Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2116509Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2139970Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2162871Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2200962Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2231410Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2266918Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2304454Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2327999Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2350648Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2372759Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2407302Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2443946Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2479711Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2513797Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2547335Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2576470Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2599823Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2623003Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2661166Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2698440Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2722284Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2756822Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2793572Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2817564Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2842409Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2865681Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2888667Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2912654Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2936305Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.2975758Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3000383Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3028528Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3052087Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3076198Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3101930Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3135730Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3172844Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3211675Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3237677Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3273387Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3307839Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3330326Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3368771Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3403220Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3427887Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3452632Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3488975Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T08:54:25.3526948Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-12-04T08:54:25.3566475Z ##[endgroup] 2025-12-04T08:54:25.3567074Z ##[group]Fetching the repository 2025-12-04T08:54:25.3577473Z [command]/usr/bin/git -c protocol.version=2 fetch --prune --no-recurse-submodules origin +refs/heads/*:refs/remotes/origin/* +refs/tags/*:refs/tags/* 2025-12-04T08:54:26.8947226Z [command]/usr/bin/git rev-parse --verify --quiet ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32^{object} 2025-12-04T08:54:26.9048259Z ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:26.9053571Z ##[endgroup] 2025-12-04T08:54:26.9054108Z ##[group]Determining the checkout info 2025-12-04T08:54:26.9056183Z ##[endgroup] 2025-12-04T08:54:26.9063569Z [command]/usr/bin/git sparse-checkout disable 2025-12-04T08:54:26.9175522Z [command]/usr/bin/git config --local --unset-all extensions.worktreeConfig 2025-12-04T08:54:26.9205246Z ##[group]Checking out the ref 2025-12-04T08:54:26.9210991Z [command]/usr/bin/git checkout --progress --force ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:26.9489828Z HEAD is now at ffd9b0fb4355 Resolve collective autotuning test failure on arm (#168919) 2025-12-04T08:54:26.9495591Z ##[endgroup] 2025-12-04T08:54:26.9496129Z ##[group]Setting up auth for fetching submodules 2025-12-04T08:54:26.9498998Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2025-12-04T08:54:26.9544875Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2025-12-04T08:54:26.9580415Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2025-12-04T08:54:26.9611893Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2025-12-04T08:54:26.9641625Z ##[endgroup] 2025-12-04T08:54:26.9642110Z ##[group]Fetching submodules 2025-12-04T08:54:26.9642700Z [command]/usr/bin/git submodule sync --recursive 2025-12-04T08:54:26.9883235Z Synchronizing submodule url for 'android/libs/fbjni' 2025-12-04T08:54:26.9904470Z Synchronizing submodule url for 'third_party/FP16' 2025-12-04T08:54:26.9930518Z Synchronizing submodule url for 'third_party/FXdiv' 2025-12-04T08:54:26.9942497Z Synchronizing submodule url for 'third_party/NNPACK' 2025-12-04T08:54:26.9954499Z Synchronizing submodule url for 'third_party/NVTX' 2025-12-04T08:54:26.9966947Z Synchronizing submodule url for 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:26.9989705Z Synchronizing submodule url for 'third_party/XNNPACK' 2025-12-04T08:54:27.0009309Z Synchronizing submodule url for 'third_party/aiter' 2025-12-04T08:54:27.0047335Z Synchronizing submodule url for 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:27.0073809Z Synchronizing submodule url for 'third_party/benchmark' 2025-12-04T08:54:27.0097015Z Synchronizing submodule url for 'third_party/composable_kernel' 2025-12-04T08:54:27.0125972Z Synchronizing submodule url for 'third_party/cpp-httplib' 2025-12-04T08:54:27.0139401Z Synchronizing submodule url for 'third_party/cpuinfo' 2025-12-04T08:54:27.0164592Z Synchronizing submodule url for 'third_party/cudnn_frontend' 2025-12-04T08:54:27.0177605Z Synchronizing submodule url for 'third_party/cutlass' 2025-12-04T08:54:27.0193219Z Synchronizing submodule url for 'third_party/fbgemm' 2025-12-04T08:54:27.0208551Z Synchronizing submodule url for 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:27.0228330Z Synchronizing submodule url for 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:27.0258945Z Synchronizing submodule url for 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:27.0282180Z Synchronizing submodule url for 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:27.0298356Z Synchronizing submodule url for 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:27.0320533Z Synchronizing submodule url for 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:27.0338944Z Synchronizing submodule url for 'third_party/fbgemm/external/json' 2025-12-04T08:54:27.0360816Z Synchronizing submodule url for 'third_party/flash-attention' 2025-12-04T08:54:27.0373266Z Synchronizing submodule url for 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:27.0391287Z Synchronizing submodule url for 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:27.0411104Z Synchronizing submodule url for 'third_party/flatbuffers' 2025-12-04T08:54:27.0423642Z Synchronizing submodule url for 'third_party/fmt' 2025-12-04T08:54:27.0435196Z Synchronizing submodule url for 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:27.0446174Z Synchronizing submodule url for 'third_party/gloo' 2025-12-04T08:54:27.0457272Z Synchronizing submodule url for 'third_party/googletest' 2025-12-04T08:54:27.0467774Z Synchronizing submodule url for 'third_party/ideep' 2025-12-04T08:54:27.0484282Z Synchronizing submodule url for 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:27.0497903Z Synchronizing submodule url for 'third_party/ittapi' 2025-12-04T08:54:27.0521883Z Synchronizing submodule url for 'third_party/kineto' 2025-12-04T08:54:27.0553851Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:27.0564014Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:27.0575720Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:27.0587193Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:27.0599372Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:27.0629539Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:27.0651072Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:27.0673723Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:27.0696088Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:27.0708766Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:27.0720177Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:27.0742649Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:27.0761294Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:27.0787495Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:27.0799761Z Synchronizing submodule url for 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:27.0812599Z Synchronizing submodule url for 'third_party/kleidiai' 2025-12-04T08:54:27.0826306Z Synchronizing submodule url for 'third_party/mimalloc' 2025-12-04T08:54:27.0837760Z Synchronizing submodule url for 'third_party/nlohmann' 2025-12-04T08:54:27.0863518Z Synchronizing submodule url for 'third_party/onnx' 2025-12-04T08:54:27.0887787Z Synchronizing submodule url for 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:27.0911178Z Synchronizing submodule url for 'third_party/opentelemetry-cpp' 2025-12-04T08:54:27.0936168Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:27.0957220Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:27.0975840Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:27.0986634Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:27.1007335Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:27.1018012Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:27.1038138Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:27.1049654Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:27.1060276Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:27.1085674Z Synchronizing submodule url for 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:27.1105755Z Synchronizing submodule url for 'third_party/pocketfft' 2025-12-04T08:54:27.1116989Z Synchronizing submodule url for 'third_party/protobuf' 2025-12-04T08:54:27.1129676Z Synchronizing submodule url for 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:27.1140214Z Synchronizing submodule url for 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:27.1157389Z Synchronizing submodule url for 'third_party/psimd' 2025-12-04T08:54:27.1170614Z Synchronizing submodule url for 'third_party/pthreadpool' 2025-12-04T08:54:27.1182920Z Synchronizing submodule url for 'third_party/pybind11' 2025-12-04T08:54:27.1194234Z Synchronizing submodule url for 'third_party/python-peachpy' 2025-12-04T08:54:27.1205164Z Synchronizing submodule url for 'third_party/sleef' 2025-12-04T08:54:27.1216269Z Synchronizing submodule url for 'third_party/tensorpipe' 2025-12-04T08:54:27.1242115Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:27.1264200Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:27.1277105Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:27.1297586Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:27.1311507Z Synchronizing submodule url for 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:27.1356651Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --recursive 2025-12-04T08:54:27.1645989Z Submodule path 'android/libs/fbjni': checked out '7e1e1fe3858c63c251c637ae41a20de425dde96f' 2025-12-04T08:54:27.1701835Z Submodule path 'third_party/FP16': checked out '4dfe081cf6bcd15db339cf2680b9281b8451eeb3' 2025-12-04T08:54:27.1768529Z Submodule path 'third_party/FXdiv': checked out 'b408327ac2a15ec3e43352421954f5b1967701d1' 2025-12-04T08:54:27.1844278Z Submodule path 'third_party/NNPACK': checked out 'c07e3a0400713d546e0dea2d5466dd22ea389c73' 2025-12-04T08:54:27.1921783Z Submodule path 'third_party/NVTX': checked out '3ebbc93ded7285963bff932c678fa367eb393ba6' 2025-12-04T08:54:27.2002216Z Submodule path 'third_party/VulkanMemoryAllocator': checked out '1d8f600fd424278486eade7ed3e877c99f0846b1' 2025-12-04T08:54:27.2193273Z Submodule path 'third_party/XNNPACK': checked out '51a0103656eff6fc9bfd39a4597923c4b542c883' 2025-12-04T08:54:27.2343730Z Submodule path 'third_party/aiter': checked out '01aae101b9e5e94d6c16a9514c9fb8df99c93150' 2025-12-04T08:54:27.2547643Z Submodule path 'third_party/aiter/3rdparty/composable_kernel': checked out 'cffe8fa2a442ac8e80dd236a1a5d24fe3d7e0cbf' 2025-12-04T08:54:27.2650560Z Submodule path 'third_party/benchmark': checked out '299e5928955cc62af9968370293b916f5130916f' 2025-12-04T08:54:27.2838153Z Submodule path 'third_party/composable_kernel': checked out '7fe50dc3da2069d6645d9deb8c017a876472a977' 2025-12-04T08:54:27.2928751Z Submodule path 'third_party/cpp-httplib': checked out '89c932f313c6437c38f2982869beacc89c2f2246' 2025-12-04T08:54:27.3006610Z Submodule path 'third_party/cpuinfo': checked out 'f858c30bcb16f8effd5ff46996f0514539e17abc' 2025-12-04T08:54:27.3085746Z Submodule path 'third_party/cudnn_frontend': checked out '0b1577c8c83401237d601d0d0db5210506705396' 2025-12-04T08:54:27.3212689Z Submodule path 'third_party/cutlass': checked out 'f88806b1e31dfa579842638740216dd41fc6c588' 2025-12-04T08:54:27.3374294Z Submodule path 'third_party/fbgemm': checked out 'c0b988d39a9e47c794d699f29930ed4d7c7e13a4' 2025-12-04T08:54:27.3466022Z Submodule path 'third_party/fbgemm/external/asmjit': checked out 'a3199e8857792cd10b7589ff5d58343d2c9008ea' 2025-12-04T08:54:27.3681366Z Submodule path 'third_party/fbgemm/external/composable_kernel': checked out '7fe50dc3da2069d6645d9deb8c017a876472a977' 2025-12-04T08:54:27.3767830Z Submodule path 'third_party/fbgemm/external/cpuinfo': checked out '6543fec09b2f04ac4a666882998b534afc9c1349' 2025-12-04T08:54:27.3897765Z Submodule path 'third_party/fbgemm/external/cutlass': checked out '98125ce499b0fdf7ffbe0e3052f5b8709f4840f8' 2025-12-04T08:54:27.3955200Z Submodule path 'third_party/fbgemm/external/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:27.4022118Z Submodule path 'third_party/fbgemm/external/hipify_torch': checked out '63b6a7b541fa7f08f8475ca7d74054db36ff2691' 2025-12-04T08:54:27.4111712Z Submodule path 'third_party/fbgemm/external/json': checked out '9cca280a4d0ccf0c08f47a99aa71d1b0e52f8d03' 2025-12-04T08:54:27.4205219Z Submodule path 'third_party/flash-attention': checked out '979702c87a8713a8e0a5e9fee122b90d2ef13be5' 2025-12-04T08:54:27.4397480Z Submodule path 'third_party/flash-attention/csrc/composable_kernel': checked out '888317e698e9803c62bd38568abc9e05d7709f33' 2025-12-04T08:54:27.4552412Z Submodule path 'third_party/flash-attention/csrc/cutlass': checked out 'c506e16788cb08416a4a57e11a9067beeee29420' 2025-12-04T08:54:27.4664587Z Submodule path 'third_party/flatbuffers': checked out 'a2cd1ea3b6d3fee220106b5fed3f7ce8da9eb757' 2025-12-04T08:54:27.4740245Z Submodule path 'third_party/fmt': checked out '407c905e45ad75fc29bf0f9bb7c5c2fd3475976f' 2025-12-04T08:54:27.4838509Z Submodule path 'third_party/gemmlowp/gemmlowp': checked out '3fb5c176c17c765a3492cd2f0321b0dab712f350' 2025-12-04T08:54:27.4951800Z Submodule path 'third_party/gloo': checked out '54cbae0d3a67fa890b4c3d9ee162b7860315e341' 2025-12-04T08:54:27.5034325Z Submodule path 'third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:27.5109717Z Submodule path 'third_party/ideep': checked out '719d8e6cd7f7a0e01b155657526d693acf97c2b3' 2025-12-04T08:54:27.5305538Z Submodule path 'third_party/ideep/mkl-dnn': checked out '8d263e693366ef8db40acc569cc7d8edf644556d' 2025-12-04T08:54:27.5375497Z Submodule path 'third_party/ittapi': checked out 'dec1d23ca65ab069d225dfe40dea14f455170959' 2025-12-04T08:54:27.5470107Z Submodule path 'third_party/kineto': checked out '31f85df8fbd89c188f14ef10f1ec65379786b943' 2025-12-04T08:54:27.5607933Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog': checked out 'd2ffe0a4e3acace628db49974246b66fc3e85fb1' 2025-12-04T08:54:27.5724722Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM': checked out 'ffde4e54bc7249a6039a5e6b45b395141e1217f9' 2025-12-04T08:54:27.5806860Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr': checked out '871ed52d350214a034f6ef8a3b8f51c5ce1bd400' 2025-12-04T08:54:27.5895570Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt': checked out 'cd4af11efc9c622896a3e4cb599fa28668ca3d05' 2025-12-04T08:54:27.5954237Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags': checked out 'e171aa2d15ed9eb17054558e0b3a6a413bb01067' 2025-12-04T08:54:27.6028254Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc': checked out '8411df715cf522606e3b1aca386ddfc0b63d34b4' 2025-12-04T08:54:27.6123985Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog': checked out 'b33e3bad4c46c8a6345525fd822af355e5ef9446' 2025-12-04T08:54:27.6209160Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:27.6300383Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json': checked out '4f8fba14066156b73f1189a2b8bd568bde5284c5' 2025-12-04T08:54:27.6369800Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs': checked out 'f68a2fa8ea36c783bdd760371411fcb495aa3150' 2025-12-04T08:54:27.6470305Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp': checked out 'b1234816facfdda29845c46696a02998a4af115a' 2025-12-04T08:54:27.6619137Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'd7ba35bbb649209c66e582d5a0244ba988a15159' 2025-12-04T08:54:27.6703405Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2025-12-04T08:54:27.6775939Z Submodule path 'third_party/kineto/libkineto/third_party/fmt': checked out '40626af88bd7df9a5fb80be7b25ac85b122d6c21' 2025-12-04T08:54:27.6864284Z Submodule path 'third_party/kineto/libkineto/third_party/googletest': checked out '52eb8108c5bdec04579160ae17225d66034bd723' 2025-12-04T08:54:27.6963232Z Submodule path 'third_party/kleidiai': checked out 'd7770c89632329a9914ef1a90289917597639cbe' 2025-12-04T08:54:27.7074309Z Submodule path 'third_party/mimalloc': checked out 'fbd8b99c2b828428947d70fdc046bb55609be93e' 2025-12-04T08:54:27.7201495Z Submodule path 'third_party/nlohmann': checked out '55f93686c01528224f448c19128836e7df245f72' 2025-12-04T08:54:27.7383100Z Submodule path 'third_party/onnx': checked out 'e709452ef2bbc1d113faf678c24e6d3467696e83' 2025-12-04T08:54:27.7467111Z Submodule path 'third_party/onnx/third_party/pybind11': checked out 'a2e59f0e7065404b44dfe92a28aca47ba1378dc4' 2025-12-04T08:54:27.7584370Z Submodule path 'third_party/opentelemetry-cpp': checked out 'a799f4aed9c94b765dcdaabaeab7d5e7e2310878' 2025-12-04T08:54:27.7713838Z Submodule path 'third_party/opentelemetry-cpp/third_party/benchmark': checked out 'd572f4777349d43653b21d6c2fc63020ab326db2' 2025-12-04T08:54:27.7811206Z Submodule path 'third_party/opentelemetry-cpp/third_party/googletest': checked out 'b796f7d44681514f58a683a3a71ff17c94edb0c1' 2025-12-04T08:54:27.7893127Z Submodule path 'third_party/opentelemetry-cpp/third_party/ms-gsl': checked out '6f4529395c5b7c2d661812257cd6780c67e54afa' 2025-12-04T08:54:27.8021697Z Submodule path 'third_party/opentelemetry-cpp/third_party/nlohmann-json': checked out 'bc889afb4c5bf1c0d8ee29ef35eaaf4c8bef8a5d' 2025-12-04T08:54:27.8136345Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto': checked out '4ca4f0335c63cda7ab31ea7ed70d6553aee14dce' 2025-12-04T08:54:27.8194947Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentracing-cpp': checked out '06b57f48ded1fa3bdd3d4346f6ef29e40e08eaf5' 2025-12-04T08:54:27.8274720Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp': checked out 'c9ffcdda9086ffd9e1283ea7a0276d831f3c8a8d' 2025-12-04T08:54:27.8397676Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'eefb26f82b233268fc98577d265352720d477ba4' 2025-12-04T08:54:27.8516102Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2025-12-04T08:54:27.8708321Z Submodule path 'third_party/opentelemetry-cpp/tools/vcpkg': checked out '8eb57355a4ffb410a2e94c07b4dca2dffbee8e50' 2025-12-04T08:54:27.8815842Z Submodule path 'third_party/pocketfft': checked out '0fa0ef591e38c2758e3184c6c23e497b9f732ffa' 2025-12-04T08:54:27.9030006Z Submodule path 'third_party/protobuf': checked out 'd1eca4e4b421cd2997495c4b4e65cea6be4e9b8a' 2025-12-04T08:54:27.9121921Z Submodule path 'third_party/protobuf/third_party/benchmark': checked out '5b7683f49e1e9223cf9927b24f6fd3d6bd82e3f8' 2025-12-04T08:54:27.9205873Z Submodule path 'third_party/protobuf/third_party/googletest': checked out '5ec7f0c4a113e2f18ac2c6cc7df51ad6afc24081' 2025-12-04T08:54:27.9305129Z Submodule path 'third_party/psimd': checked out '072586a71b55b7f8c584153d223e95687148a900' 2025-12-04T08:54:27.9412929Z Submodule path 'third_party/pthreadpool': checked out '4fe0e1e183925bf8cfa6aae24237e724a96479b8' 2025-12-04T08:54:27.9542821Z Submodule path 'third_party/pybind11': checked out 'f5fbe867d2d26e4a0a9177a51f6e568868ad3dc8' 2025-12-04T08:54:27.9626722Z Submodule path 'third_party/python-peachpy': checked out 'f45429b087dd7d5bc78bb40dc7cf06425c252d67' 2025-12-04T08:54:27.9684649Z Submodule path 'third_party/sleef': checked out '5a1d179df9cf652951b59010a2d2075372d67f68' 2025-12-04T08:54:27.9784919Z Submodule path 'third_party/tensorpipe': checked out '2b4cd91092d335a697416b2a3cb398283246849d' 2025-12-04T08:54:27.9884544Z Submodule path 'third_party/tensorpipe/third_party/googletest': checked out 'aee0f9d9b5b87796ee8a0ab26b7587ec30e8858e' 2025-12-04T08:54:27.9955577Z Submodule path 'third_party/tensorpipe/third_party/libnop': checked out '910b55815be16109f04f4180e9adee14fb4ce281' 2025-12-04T08:54:28.0137215Z Submodule path 'third_party/tensorpipe/third_party/libuv': checked out '5152db2cbfeb5582e9c27c5ea1dba2cd9e10759b' 2025-12-04T08:54:28.0234254Z Submodule path 'third_party/tensorpipe/third_party/pybind11': checked out 'a23996fce38ff6ccfbcdc09f1e63f2c4be5ea2ef' 2025-12-04T08:54:28.0291283Z Submodule path 'third_party/tensorpipe/third_party/pybind11/tools/clang': checked out '6a00cbc4a9b8e68b71caf7f774b3f9c753ae84d5' 2025-12-04T08:54:28.0355948Z [command]/usr/bin/git submodule foreach --recursive git config --local gc.auto 0 2025-12-04T08:54:28.0630755Z Entering 'android/libs/fbjni' 2025-12-04T08:54:28.0664092Z Entering 'third_party/FP16' 2025-12-04T08:54:28.0700900Z Entering 'third_party/FXdiv' 2025-12-04T08:54:28.0733239Z Entering 'third_party/NNPACK' 2025-12-04T08:54:28.0771266Z Entering 'third_party/NVTX' 2025-12-04T08:54:28.0792771Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:28.0815552Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:28.0863946Z Entering 'third_party/aiter' 2025-12-04T08:54:28.0889220Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:28.0926233Z Entering 'third_party/benchmark' 2025-12-04T08:54:28.0949427Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:28.1001572Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:28.1046111Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:28.1076142Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:28.1099538Z Entering 'third_party/cutlass' 2025-12-04T08:54:28.1124124Z Entering 'third_party/fbgemm' 2025-12-04T08:54:28.1165137Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:28.1185137Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:28.1213239Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:28.1257980Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:28.1298310Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:28.1358604Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:28.1385268Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:28.1417300Z Entering 'third_party/flash-attention' 2025-12-04T08:54:28.1438296Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:28.1475122Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:28.1502201Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:28.1523567Z Entering 'third_party/fmt' 2025-12-04T08:54:28.1558407Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:28.1585205Z Entering 'third_party/gloo' 2025-12-04T08:54:28.1622293Z Entering 'third_party/googletest' 2025-12-04T08:54:28.1651411Z Entering 'third_party/ideep' 2025-12-04T08:54:28.1685701Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:28.1722004Z Entering 'third_party/ittapi' 2025-12-04T08:54:28.1746019Z Entering 'third_party/kineto' 2025-12-04T08:54:28.1778967Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:28.1814284Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:28.1847392Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:28.1865898Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:28.1894947Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:28.1916668Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:28.1960120Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:28.1980577Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:28.1998617Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:28.2016282Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:28.2033725Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:28.2052764Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:28.2104981Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:28.2139659Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:28.2167425Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:28.2190405Z Entering 'third_party/kleidiai' 2025-12-04T08:54:28.2214164Z Entering 'third_party/mimalloc' 2025-12-04T08:54:28.2236554Z Entering 'third_party/nlohmann' 2025-12-04T08:54:28.2269028Z Entering 'third_party/onnx' 2025-12-04T08:54:28.2310948Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:28.2344933Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:28.2365281Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:28.2396614Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:28.2434022Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:28.2459756Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:28.2494192Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:28.2512414Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:28.2537604Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:28.2556439Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:28.2586164Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:28.2622693Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:28.2685874Z Entering 'third_party/pocketfft' 2025-12-04T08:54:28.2706941Z Entering 'third_party/protobuf' 2025-12-04T08:54:28.2736359Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:28.2764756Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:28.2812653Z Entering 'third_party/psimd' 2025-12-04T08:54:28.2838394Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:28.2858641Z Entering 'third_party/pybind11' 2025-12-04T08:54:28.2887268Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:28.2907494Z Entering 'third_party/sleef' 2025-12-04T08:54:28.2940471Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:28.2971806Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:28.3016064Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:28.3046102Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:28.3065477Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:28.3083268Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:28.3114763Z ##[endgroup] 2025-12-04T08:54:28.3115343Z ##[group]Persisting credentials for submodules 2025-12-04T08:54:28.3125192Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2025-12-04T08:54:28.3423716Z Entering 'android/libs/fbjni' 2025-12-04T08:54:28.3465584Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3466047Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3512234Z Entering 'third_party/FP16' 2025-12-04T08:54:28.3542284Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3542736Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3578406Z Entering 'third_party/FXdiv' 2025-12-04T08:54:28.3607474Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3607925Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3639482Z Entering 'third_party/NNPACK' 2025-12-04T08:54:28.3667492Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3667942Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3699308Z Entering 'third_party/NVTX' 2025-12-04T08:54:28.3724519Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3724947Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3744617Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:28.3774499Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3774881Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3803623Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:28.3837492Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3837886Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3874838Z Entering 'third_party/aiter' 2025-12-04T08:54:28.3902867Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3903328Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3946362Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:28.3983365Z url.https://github.com/.insteadof 2025-12-04T08:54:28.3983827Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4020829Z Entering 'third_party/benchmark' 2025-12-04T08:54:28.4051866Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4052327Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4082894Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:28.4105271Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4105728Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4142987Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:28.4168681Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4169134Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4197652Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:28.4223510Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4223973Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4264786Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:28.4295080Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4295548Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4314896Z Entering 'third_party/cutlass' 2025-12-04T08:54:28.4338606Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4339072Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4387641Z Entering 'third_party/fbgemm' 2025-12-04T08:54:28.4411039Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4411495Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4442457Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:28.4469603Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4470063Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4502432Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:28.4517247Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4517636Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4555041Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:28.4577753Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4578402Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4606653Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:28.4629477Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4629932Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4663232Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:28.4691440Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4691892Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4721006Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:28.4740433Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4740910Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4758472Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:28.4780151Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4780618Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4801053Z Entering 'third_party/flash-attention' 2025-12-04T08:54:28.4821704Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4822162Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4863536Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:28.4884106Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4884567Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4916542Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:28.4930536Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4931000Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4964344Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:28.4992231Z url.https://github.com/.insteadof 2025-12-04T08:54:28.4992695Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5020482Z Entering 'third_party/fmt' 2025-12-04T08:54:28.5035944Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5036409Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5064792Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:28.5094812Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5095269Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5114226Z Entering 'third_party/gloo' 2025-12-04T08:54:28.5138402Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5138871Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5167557Z Entering 'third_party/googletest' 2025-12-04T08:54:28.5195560Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5196031Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5225111Z Entering 'third_party/ideep' 2025-12-04T08:54:28.5246620Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5247088Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5273075Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:28.5287364Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5288003Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5334260Z Entering 'third_party/ittapi' 2025-12-04T08:54:28.5363214Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5363673Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5383776Z Entering 'third_party/kineto' 2025-12-04T08:54:28.5404985Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5405459Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5435182Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:28.5455162Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5455623Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5496767Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:28.5514954Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5515422Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5535666Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:28.5549533Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5550012Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5565846Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:28.5590338Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5590805Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5607583Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:28.5624373Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5624845Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5655334Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:28.5677204Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5677675Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5709104Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:28.5727672Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5728198Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5758769Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:28.5772081Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5772543Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5813614Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:28.5832976Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5833449Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5875580Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:28.5900161Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5900829Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5918530Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:28.5940963Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5941430Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5959236Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:28.5976861Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5977330Z url.https://github.com/.insteadof 2025-12-04T08:54:28.5997341Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:28.6011907Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6012378Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6051277Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:28.6078854Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6079320Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6099963Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:28.6114676Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6114821Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6141490Z Entering 'third_party/kleidiai' 2025-12-04T08:54:28.6156101Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6156547Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6193615Z Entering 'third_party/mimalloc' 2025-12-04T08:54:28.6211075Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6211534Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6231831Z Entering 'third_party/nlohmann' 2025-12-04T08:54:28.6257219Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6257597Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6288299Z Entering 'third_party/onnx' 2025-12-04T08:54:28.6305203Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6305588Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6337524Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:28.6351834Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6352224Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6373270Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:28.6397542Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6397930Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6426995Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:28.6452372Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6452505Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6491161Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:28.6509748Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6510246Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6527924Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:28.6546755Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6547169Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6575416Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:28.6590507Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6590898Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6609528Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:28.6627066Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6627445Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6646255Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:28.6665203Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6665583Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6684934Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:28.6713118Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6713496Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6738412Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:28.6761494Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6761872Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6791971Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:28.6808271Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6808656Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6839788Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:28.6870900Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6871466Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6910445Z Entering 'third_party/pocketfft' 2025-12-04T08:54:28.6942126Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6942585Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6972752Z Entering 'third_party/protobuf' 2025-12-04T08:54:28.6993888Z url.https://github.com/.insteadof 2025-12-04T08:54:28.6994337Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7028304Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:28.7062637Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7063044Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7092341Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:28.7116419Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7116820Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7142451Z Entering 'third_party/psimd' 2025-12-04T08:54:28.7171078Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7171782Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7190171Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:28.7211020Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7211413Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7241249Z Entering 'third_party/pybind11' 2025-12-04T08:54:28.7263308Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7263699Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7282221Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:28.7315823Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7316302Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7336390Z Entering 'third_party/sleef' 2025-12-04T08:54:28.7354424Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7354846Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7384686Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:28.7405361Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7405815Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7435493Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:28.7458748Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7459206Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7490491Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:28.7506147Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7506702Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7536650Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:28.7564561Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7565052Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7594874Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:28.7615588Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7616043Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7633844Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:28.7652842Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7653306Z url.https://github.com/.insteadof 2025-12-04T08:54:28.7722424Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local 'http.https://github.com/.extraheader' 'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2025-12-04T08:54:28.7980496Z Entering 'android/libs/fbjni' 2025-12-04T08:54:28.8024463Z file:/home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T08:54:28.8035236Z Entering 'third_party/FP16' 2025-12-04T08:54:28.8069367Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T08:54:28.8082038Z Entering 'third_party/FXdiv' 2025-12-04T08:54:28.8113096Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T08:54:28.8124711Z Entering 'third_party/NNPACK' 2025-12-04T08:54:28.8168800Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T08:54:28.8182472Z Entering 'third_party/NVTX' 2025-12-04T08:54:28.8232211Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T08:54:28.8256225Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:28.8283671Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T08:54:28.8306258Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:28.8356839Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T08:54:28.8373736Z Entering 'third_party/aiter' 2025-12-04T08:54:28.8416761Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T08:54:28.8429889Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:28.8481557Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T08:54:28.8512873Z Entering 'third_party/benchmark' 2025-12-04T08:54:28.8564175Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:28.8588691Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:28.8629226Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T08:54:28.8645050Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:28.8689184Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T08:54:28.8713202Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:28.8757803Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T08:54:28.8772268Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:28.8802535Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T08:54:28.8825452Z Entering 'third_party/cutlass' 2025-12-04T08:54:28.8851707Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T08:54:28.8878854Z Entering 'third_party/fbgemm' 2025-12-04T08:54:28.8920298Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T08:54:28.8932949Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:28.8968867Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T08:54:28.8978982Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:28.8999640Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T08:54:28.9028463Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:28.9050425Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T08:54:28.9061175Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:28.9083940Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T08:54:28.9111929Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:28.9168185Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T08:54:28.9176995Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:28.9205009Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T08:54:28.9215707Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:28.9247971Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T08:54:28.9261338Z Entering 'third_party/flash-attention' 2025-12-04T08:54:28.9305054Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T08:54:28.9328493Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:28.9371436Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T08:54:28.9398922Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:28.9451899Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T08:54:28.9481558Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:28.9514653Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T08:54:28.9537467Z Entering 'third_party/fmt' 2025-12-04T08:54:28.9572081Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:28.9581870Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:28.9625858Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T08:54:28.9635825Z Entering 'third_party/gloo' 2025-12-04T08:54:28.9680796Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T08:54:28.9695758Z Entering 'third_party/googletest' 2025-12-04T08:54:28.9749290Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:28.9762110Z Entering 'third_party/ideep' 2025-12-04T08:54:28.9810836Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T08:54:28.9824264Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:28.9861093Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T08:54:28.9878431Z Entering 'third_party/ittapi' 2025-12-04T08:54:28.9916533Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T08:54:28.9928061Z Entering 'third_party/kineto' 2025-12-04T08:54:28.9976113Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T08:54:28.9988345Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:29.0033135Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T08:54:29.0043963Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:29.0080825Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T08:54:29.0093921Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:29.0133434Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T08:54:29.0154997Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:29.0185642Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T08:54:29.0196268Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:29.0238685Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T08:54:29.0248732Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:29.0279053Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T08:54:29.0301864Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:29.0328691Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T08:54:29.0339568Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:29.0366863Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:29.0377246Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:29.0422714Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T08:54:29.0435739Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:29.0464861Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T08:54:29.0480473Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:29.0514297Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:29.0523280Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:29.0553126Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:29.0563464Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:29.0609352Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:29.0624894Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:29.0672894Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T08:54:29.0684296Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:29.0720643Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T08:54:29.0732531Z Entering 'third_party/kleidiai' 2025-12-04T08:54:29.0765106Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T08:54:29.0790418Z Entering 'third_party/mimalloc' 2025-12-04T08:54:29.0827678Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T08:54:29.0838573Z Entering 'third_party/nlohmann' 2025-12-04T08:54:29.0867025Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T08:54:29.0891952Z Entering 'third_party/onnx' 2025-12-04T08:54:29.0919408Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T08:54:29.0937871Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:29.0973184Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:29.0984920Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:29.1019349Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T08:54:29.1032285Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:29.1060355Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:29.1082221Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:29.1123484Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:29.1145590Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:29.1187929Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T08:54:29.1199943Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:29.1228630Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T08:54:29.1240024Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:29.1286122Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T08:54:29.1307169Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:29.1332559Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T08:54:29.1348888Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:29.1371002Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T08:54:29.1381634Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:29.1412376Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T08:54:29.1423580Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:29.1457854Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T08:54:29.1472173Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:29.1506283Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T08:54:29.1542385Z Entering 'third_party/pocketfft' 2025-12-04T08:54:29.1574093Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T08:54:29.1587875Z Entering 'third_party/protobuf' 2025-12-04T08:54:29.1624156Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T08:54:29.1636730Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:29.1668305Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T08:54:29.1681249Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:29.1714317Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:29.1736008Z Entering 'third_party/psimd' 2025-12-04T08:54:29.1762868Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T08:54:29.1784108Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:29.1810279Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T08:54:29.1828964Z Entering 'third_party/pybind11' 2025-12-04T08:54:29.1864460Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:29.1886536Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:29.1934846Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T08:54:29.1956894Z Entering 'third_party/sleef' 2025-12-04T08:54:29.2005158Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T08:54:29.2017489Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:29.2050394Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T08:54:29.2065040Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:29.2092012Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T08:54:29.2113045Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:29.2152216Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T08:54:29.2174056Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:29.2215803Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T08:54:29.2227261Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:29.2263790Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T08:54:29.2272041Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:29.2302533Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T08:54:29.2709577Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2025-12-04T08:54:29.2984783Z Entering 'android/libs/fbjni' 2025-12-04T08:54:29.3026539Z Entering 'third_party/FP16' 2025-12-04T08:54:29.3061950Z Entering 'third_party/FXdiv' 2025-12-04T08:54:29.3084282Z Entering 'third_party/NNPACK' 2025-12-04T08:54:29.3113166Z Entering 'third_party/NVTX' 2025-12-04T08:54:29.3136650Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:29.3169355Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:29.3216818Z Entering 'third_party/aiter' 2025-12-04T08:54:29.3255182Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:29.3308567Z Entering 'third_party/benchmark' 2025-12-04T08:54:29.3362137Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:29.3423571Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:29.3481249Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:29.3532725Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:29.3562832Z Entering 'third_party/cutlass' 2025-12-04T08:54:29.3591210Z Entering 'third_party/fbgemm' 2025-12-04T08:54:29.3642165Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:29.3674524Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:29.3703801Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:29.3730749Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:29.3762436Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:29.3782725Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:29.3828681Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:29.3852906Z Entering 'third_party/flash-attention' 2025-12-04T08:54:29.3873899Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:29.3924527Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:29.3967477Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:29.4004192Z Entering 'third_party/fmt' 2025-12-04T08:54:29.4050286Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:29.4088888Z Entering 'third_party/gloo' 2025-12-04T08:54:29.4132019Z Entering 'third_party/googletest' 2025-12-04T08:54:29.4154916Z Entering 'third_party/ideep' 2025-12-04T08:54:29.4186320Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:29.4221565Z Entering 'third_party/ittapi' 2025-12-04T08:54:29.4261800Z Entering 'third_party/kineto' 2025-12-04T08:54:29.4292375Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:29.4316984Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:29.4360352Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:29.4404274Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:29.4429794Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:29.4449753Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:29.4489463Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:29.4539226Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:29.4566923Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:29.4588576Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:29.4614505Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:29.4646437Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:29.4668391Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:29.4723316Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:29.4759996Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:29.4783027Z Entering 'third_party/kleidiai' 2025-12-04T08:54:29.4828476Z Entering 'third_party/mimalloc' 2025-12-04T08:54:29.4863389Z Entering 'third_party/nlohmann' 2025-12-04T08:54:29.4907049Z Entering 'third_party/onnx' 2025-12-04T08:54:29.4954245Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:29.4999608Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:29.5046521Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:29.5097689Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:29.5125979Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:29.5159623Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:29.5197876Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:29.5229397Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:29.5250565Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:29.5272621Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:29.5295247Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:29.5318388Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:29.5351643Z Entering 'third_party/pocketfft' 2025-12-04T08:54:29.5383271Z Entering 'third_party/protobuf' 2025-12-04T08:54:29.5430895Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:29.5462666Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:29.5485088Z Entering 'third_party/psimd' 2025-12-04T08:54:29.5511653Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:29.5532529Z Entering 'third_party/pybind11' 2025-12-04T08:54:29.5573398Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:29.5612753Z Entering 'third_party/sleef' 2025-12-04T08:54:29.5652806Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:29.5674420Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:29.5693364Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:29.5712095Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:29.5742156Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:29.5774874Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:29.5826982Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2025-12-04T08:54:29.6077043Z Entering 'android/libs/fbjni' 2025-12-04T08:54:29.6106531Z Entering 'third_party/FP16' 2025-12-04T08:54:29.6155974Z Entering 'third_party/FXdiv' 2025-12-04T08:54:29.6176952Z Entering 'third_party/NNPACK' 2025-12-04T08:54:29.6198627Z Entering 'third_party/NVTX' 2025-12-04T08:54:29.6220798Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T08:54:29.6248681Z Entering 'third_party/XNNPACK' 2025-12-04T08:54:29.6293187Z Entering 'third_party/aiter' 2025-12-04T08:54:29.6321268Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T08:54:29.6363993Z Entering 'third_party/benchmark' 2025-12-04T08:54:29.6396042Z Entering 'third_party/composable_kernel' 2025-12-04T08:54:29.6424854Z Entering 'third_party/cpp-httplib' 2025-12-04T08:54:29.6446184Z Entering 'third_party/cpuinfo' 2025-12-04T08:54:29.6479840Z Entering 'third_party/cudnn_frontend' 2025-12-04T08:54:29.6514595Z Entering 'third_party/cutlass' 2025-12-04T08:54:29.6550694Z Entering 'third_party/fbgemm' 2025-12-04T08:54:29.6585323Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T08:54:29.6604769Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T08:54:29.6652043Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T08:54:29.6698889Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T08:54:29.6726808Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T08:54:29.6759281Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T08:54:29.6787676Z Entering 'third_party/fbgemm/external/json' 2025-12-04T08:54:29.6828253Z Entering 'third_party/flash-attention' 2025-12-04T08:54:29.6863778Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T08:54:29.6903323Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T08:54:29.6952738Z Entering 'third_party/flatbuffers' 2025-12-04T08:54:29.6988322Z Entering 'third_party/fmt' 2025-12-04T08:54:29.7016763Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T08:54:29.7043220Z Entering 'third_party/gloo' 2025-12-04T08:54:29.7093344Z Entering 'third_party/googletest' 2025-12-04T08:54:29.7123380Z Entering 'third_party/ideep' 2025-12-04T08:54:29.7154261Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T08:54:29.7176586Z Entering 'third_party/ittapi' 2025-12-04T08:54:29.7216481Z Entering 'third_party/kineto' 2025-12-04T08:54:29.7254931Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T08:54:29.7273640Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T08:54:29.7320502Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T08:54:29.7354084Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T08:54:29.7392128Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T08:54:29.7419601Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T08:54:29.7463582Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T08:54:29.7489626Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T08:54:29.7522356Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T08:54:29.7546472Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T08:54:29.7594726Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T08:54:29.7632762Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:29.7662577Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:29.7691041Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T08:54:29.7724423Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T08:54:29.7764455Z Entering 'third_party/kleidiai' 2025-12-04T08:54:29.7808158Z Entering 'third_party/mimalloc' 2025-12-04T08:54:29.7847061Z Entering 'third_party/nlohmann' 2025-12-04T08:54:29.7884168Z Entering 'third_party/onnx' 2025-12-04T08:54:29.7922563Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T08:54:29.7959311Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T08:54:29.8000040Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T08:54:29.8039386Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T08:54:29.8081765Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T08:54:29.8127602Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T08:54:29.8175959Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T08:54:29.8202405Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T08:54:29.8253523Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T08:54:29.8296147Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T08:54:29.8319129Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T08:54:29.8344854Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T08:54:29.8393089Z Entering 'third_party/pocketfft' 2025-12-04T08:54:29.8424459Z Entering 'third_party/protobuf' 2025-12-04T08:54:29.8459728Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T08:54:29.8492493Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T08:54:29.8539147Z Entering 'third_party/psimd' 2025-12-04T08:54:29.8583203Z Entering 'third_party/pthreadpool' 2025-12-04T08:54:29.8608362Z Entering 'third_party/pybind11' 2025-12-04T08:54:29.8653644Z Entering 'third_party/python-peachpy' 2025-12-04T08:54:29.8682871Z Entering 'third_party/sleef' 2025-12-04T08:54:29.8730888Z Entering 'third_party/tensorpipe' 2025-12-04T08:54:29.8779507Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T08:54:29.8797764Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T08:54:29.8824891Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T08:54:29.8862612Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T08:54:29.8913224Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T08:54:29.8961115Z ##[endgroup] 2025-12-04T08:54:29.9124292Z [command]/usr/bin/git log -1 --format=%H 2025-12-04T08:54:29.9199868Z ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:29.9477872Z Prepare all required actions 2025-12-04T08:54:29.9478704Z Getting action download info 2025-12-04T08:54:30.1892198Z Download action repository 'aws-actions/amazon-ecr-login@062b18b96a7aff071d4dc91bc00c4c1a7945b076' (SHA:062b18b96a7aff071d4dc91bc00c4c1a7945b076) 2025-12-04T08:54:31.0562682Z ##[group]Run ./.github/actions/setup-rocm 2025-12-04T08:54:31.0563097Z env: 2025-12-04T08:54:31.0563378Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.0563711Z ##[endgroup] 2025-12-04T08:54:31.0596730Z ##[group]Run dpkg -l | grep -E " rocm" 2025-12-04T08:54:31.0597182Z dpkg -l | grep -E " rocm" 2025-12-04T08:54:31.0608453Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:31.0608913Z env: 2025-12-04T08:54:31.0609200Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.0609518Z ##[endgroup] 2025-12-04T08:54:31.0714023Z ii rocm-cmake 0.14.0.60401-83~22.04 amd64 rocm-cmake built using CMake 2025-12-04T08:54:31.0715091Z ii rocm-core 6.4.1.60401-83~22.04 amd64 ROCm Runtime software stack 2025-12-04T08:54:31.0715798Z ii rocm-dbgapi 0.77.2.60401-83~22.04 amd64 Library to provide AMD GPU debugger API 2025-12-04T08:54:31.0716630Z ii rocm-debug-agent 2.0.4.60401-83~22.04 amd64 Radeon Open Compute Debug Agent (ROCdebug-agent) 2025-12-04T08:54:31.0717435Z ii rocm-dev 6.4.1.60401-83~22.04 amd64 Radeon Open Compute (ROCm) Runtime software stack 2025-12-04T08:54:31.0718415Z ii rocm-device-libs 1.0.0.60401-83~22.04 amd64 Radeon Open Compute - device libraries 2025-12-04T08:54:31.0719228Z ii rocm-gdb 15.2.60401-83~22.04 amd64 ROCgdb 2025-12-04T08:54:31.0719861Z ii rocm-llvm 19.0.0.25184.60401-83~22.04 amd64 ROCm core compiler 2025-12-04T08:54:31.0720547Z ii rocm-opencl 2.0.0.60401-83~22.04 amd64 clr built using CMake 2025-12-04T08:54:31.0721230Z ii rocm-opencl-dev 2.0.0.60401-83~22.04 amd64 clr built using CMake 2025-12-04T08:54:31.0721932Z ii rocm-smi-lib 7.5.0.60401-83~22.04 amd64 AMD System Management libraries 2025-12-04T08:54:31.0722667Z ii rocm-utils 6.4.1.60401-83~22.04 amd64 Radeon Open Compute (ROCm) Runtime software stack 2025-12-04T08:54:31.0723436Z ii rocminfo 1.0.0.60401-83~22.04 amd64 Radeon Open Compute (ROCm) Runtime rocminfo tool 2025-12-04T08:54:31.0746244Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty 2025-12-04T08:54:31.0746940Z # ignore expansion of "docker ps -q" since it could be empty 2025-12-04T08:54:31.0747474Z # shellcheck disable=SC2046 2025-12-04T08:54:31.0747897Z docker stop $(docker ps -q) || true 2025-12-04T08:54:31.0748390Z # Prune all stopped containers. 2025-12-04T08:54:31.0748800Z docker container prune -f 2025-12-04T08:54:31.0756844Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:31.0757288Z env: 2025-12-04T08:54:31.0757567Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.0757892Z ##[endgroup] 2025-12-04T08:54:31.1002557Z docker: 'docker stop' requires at least 1 argument 2025-12-04T08:54:31.1002671Z 2025-12-04T08:54:31.1002752Z Usage: docker stop [OPTIONS] CONTAINER [CONTAINER...] 2025-12-04T08:54:31.1002851Z 2025-12-04T08:54:31.1002913Z See 'docker stop --help' for more information 2025-12-04T08:54:31.1112161Z Total reclaimed space: 0B 2025-12-04T08:54:31.1150759Z ##[group]Run cat /etc/os-release || true 2025-12-04T08:54:31.1151228Z cat /etc/os-release || true 2025-12-04T08:54:31.1151694Z cat /etc/apt/sources.list.d/rocm.list || true 2025-12-04T08:54:31.1152426Z cat /opt/rocm/.info/version || true 2025-12-04T08:54:31.1152793Z whoami 2025-12-04T08:54:31.1162783Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:31.1163236Z env: 2025-12-04T08:54:31.1163504Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.1163821Z ##[endgroup] 2025-12-04T08:54:31.1196123Z PRETTY_NAME="Ubuntu 22.04.5 LTS" 2025-12-04T08:54:31.1196466Z NAME="Ubuntu" 2025-12-04T08:54:31.1196736Z VERSION_ID="22.04" 2025-12-04T08:54:31.1197045Z VERSION="22.04.5 LTS (Jammy Jellyfish)" 2025-12-04T08:54:31.1197436Z VERSION_CODENAME=jammy 2025-12-04T08:54:31.1197732Z ID=ubuntu 2025-12-04T08:54:31.1197975Z ID_LIKE=debian 2025-12-04T08:54:31.1198360Z HOME_URL="https://www.ubuntu.com/" 2025-12-04T08:54:31.1198768Z SUPPORT_URL="https://help.ubuntu.com/" 2025-12-04T08:54:31.1199245Z BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/" 2025-12-04T08:54:31.1199916Z PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy" 2025-12-04T08:54:31.1200778Z UBUNTU_CODENAME=jammy 2025-12-04T08:54:31.1207255Z deb [arch=amd64 signed-by=/etc/apt/keyrings/rocm.gpg] https://repo.radeon.com/rocm/apt/6.4.1 jammy main 2025-12-04T08:54:31.1217237Z 6.4.1-83 2025-12-04T08:54:31.1229155Z runner 2025-12-04T08:54:31.1258005Z ##[group]Run dpkg -l | grep -E " amdgpu" 2025-12-04T08:54:31.1258570Z dpkg -l | grep -E " amdgpu" 2025-12-04T08:54:31.1268034Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:31.1268815Z env: 2025-12-04T08:54:31.1269095Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.1269416Z ##[endgroup] 2025-12-04T08:54:31.1339102Z ii amdgpu-core 1:6.4.60401-2164967.22.04 all Core meta package for unified amdgpu driver. 2025-12-04T08:54:31.1339945Z ii amdgpu-install 6.4.60401-2164967.22.04 all AMDGPU driver repository and installer 2025-12-04T08:54:31.1355350Z ##[group]Run rocm-smi 2025-12-04T08:54:31.1355480Z rocm-smi 2025-12-04T08:54:31.1359832Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:31.1359983Z env: 2025-12-04T08:54:31.1360069Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.1360172Z ##[endgroup] 2025-12-04T08:54:31.1934010Z 2025-12-04T08:54:31.1934021Z 2025-12-04T08:54:31.1934484Z =========================================== ROCm System Management Interface =========================================== 2025-12-04T08:54:31.1935169Z ===================================================== Concise Info ===================================================== 2025-12-04T08:54:31.1935915Z Device Node IDs Temp Power Partitions SCLK MCLK Fan Perf PwrCap VRAM% GPU% 2025-12-04T08:54:31.1937235Z  (DID, GUID) (Junction) (Socket) (Mem, Compute, ID)  2025-12-04T08:54:31.1937851Z ======================================================================================================================== 2025-12-04T08:54:31.1939160Z 0 5 0x74a5, 2987 28.0°C 128.0W NPS1, SPX, 0 N/A 900Mhz 0% auto 1000.0W 0% 0% 2025-12-04T08:54:31.1939775Z ======================================================================================================================== 2025-12-04T08:54:31.1940285Z ================================================= End of ROCm SMI Log ================================================== 2025-12-04T08:54:31.2007227Z ##[group]Run rocminfo 2025-12-04T08:54:31.2007562Z rocminfo 2025-12-04T08:54:31.2016299Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:31.2016751Z env: 2025-12-04T08:54:31.2017025Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.2017342Z ##[endgroup] 2025-12-04T08:54:31.2603637Z ROCk module version 6.12.12 is loaded 2025-12-04T08:54:31.2604194Z ===================== 2025-12-04T08:54:31.2604523Z HSA System Attributes 2025-12-04T08:54:31.2604849Z ===================== 2025-12-04T08:54:31.2605420Z Runtime Version: 1.15 2025-12-04T08:54:31.2605765Z Runtime Ext Version: 1.7 2025-12-04T08:54:31.2606122Z System Timestamp Freq.: 1000.000000MHz 2025-12-04T08:54:31.2606697Z Sig. Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count) 2025-12-04T08:54:31.2607327Z Machine Model: LARGE 2025-12-04T08:54:31.2607840Z System Endianness: LITTLE 2025-12-04T08:54:31.2608415Z Mwaitx: DISABLED 2025-12-04T08:54:31.2608759Z XNACK enabled: NO 2025-12-04T08:54:31.2609094Z DMAbuf Support: YES 2025-12-04T08:54:31.2609430Z VMM Support: YES 2025-12-04T08:54:31.2609644Z 2025-12-04T08:54:31.2609760Z ========== 2025-12-04T08:54:31.2610066Z HSA Agents 2025-12-04T08:54:31.2610359Z ========== 2025-12-04T08:54:31.2610647Z ******* 2025-12-04T08:54:31.2611072Z Agent 1 2025-12-04T08:54:31.2611367Z ******* 2025-12-04T08:54:31.2611742Z Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T08:54:31.2612241Z Uuid: CPU-XX 2025-12-04T08:54:31.2612728Z Marketing Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T08:54:31.2613231Z Vendor Name: CPU 2025-12-04T08:54:31.2613704Z Feature: None specified 2025-12-04T08:54:31.2614176Z Profile: FULL_PROFILE 2025-12-04T08:54:31.2614631Z Float Round Mode: NEAR 2025-12-04T08:54:31.2615129Z Max Queue Number: 0(0x0) 2025-12-04T08:54:31.2615605Z Queue Min Size: 0(0x0) 2025-12-04T08:54:31.2616074Z Queue Max Size: 0(0x0) 2025-12-04T08:54:31.2616553Z Queue Type: MULTI 2025-12-04T08:54:31.2617002Z Node: 0 2025-12-04T08:54:31.2617444Z Device Type: CPU 2025-12-04T08:54:31.2617896Z Cache Info: 2025-12-04T08:54:31.2642168Z L1: 49152(0xc000) KB 2025-12-04T08:54:31.2642639Z Chip ID: 0(0x0) 2025-12-04T08:54:31.2643110Z ASIC Revision: 0(0x0) 2025-12-04T08:54:31.2643602Z Cacheline Size: 64(0x40) 2025-12-04T08:54:31.2644096Z Max Clock Freq. (MHz): 3300 2025-12-04T08:54:31.2644553Z BDFID: 0 2025-12-04T08:54:31.2645019Z Internal Node ID: 0 2025-12-04T08:54:31.2645513Z Compute Unit: 64 2025-12-04T08:54:31.2645993Z SIMDs per CU: 0 2025-12-04T08:54:31.2646477Z Shader Engines: 0 2025-12-04T08:54:31.2646968Z Shader Arrs. per Eng.: 0 2025-12-04T08:54:31.2647485Z WatchPts on Addr. Ranges:1 2025-12-04T08:54:31.2647943Z Memory Properties: 2025-12-04T08:54:31.2648354Z Features: None 2025-12-04T08:54:31.2648706Z Pool Info: 2025-12-04T08:54:31.2649030Z Pool 1 2025-12-04T08:54:31.2649460Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T08:54:31.2649957Z Size: 1584733356(0x5e751cac) KB 2025-12-04T08:54:31.2650434Z Allocatable: TRUE 2025-12-04T08:54:31.2650932Z Alloc Granule: 4KB 2025-12-04T08:54:31.2651664Z Alloc Recommended Granule:4KB 2025-12-04T08:54:31.2652188Z Alloc Alignment: 4KB 2025-12-04T08:54:31.2652700Z Accessible by all: TRUE 2025-12-04T08:54:31.2653139Z Pool 2 2025-12-04T08:54:31.2653557Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T08:54:31.2654038Z Size: 1584733356(0x5e751cac) KB 2025-12-04T08:54:31.2654502Z Allocatable: TRUE 2025-12-04T08:54:31.2654992Z Alloc Granule: 4KB 2025-12-04T08:54:31.2655508Z Alloc Recommended Granule:4KB 2025-12-04T08:54:31.2656021Z Alloc Alignment: 4KB 2025-12-04T08:54:31.2656527Z Accessible by all: TRUE 2025-12-04T08:54:31.2657076Z Pool 3 2025-12-04T08:54:31.2657487Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2025-12-04T08:54:31.2657955Z Size: 1584733356(0x5e751cac) KB 2025-12-04T08:54:31.2658479Z Allocatable: TRUE 2025-12-04T08:54:31.2658975Z Alloc Granule: 4KB 2025-12-04T08:54:31.2659492Z Alloc Recommended Granule:4KB 2025-12-04T08:54:31.2660000Z Alloc Alignment: 4KB 2025-12-04T08:54:31.2660505Z Accessible by all: TRUE 2025-12-04T08:54:31.2660943Z Pool 4 2025-12-04T08:54:31.2661341Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T08:54:31.2661820Z Size: 1584733356(0x5e751cac) KB 2025-12-04T08:54:31.2662287Z Allocatable: TRUE 2025-12-04T08:54:31.2662788Z Alloc Granule: 4KB 2025-12-04T08:54:31.2663308Z Alloc Recommended Granule:4KB 2025-12-04T08:54:31.2663817Z Alloc Alignment: 4KB 2025-12-04T08:54:31.2664326Z Accessible by all: TRUE 2025-12-04T08:54:31.2664770Z ISA Info: 2025-12-04T08:54:31.2665090Z ******* 2025-12-04T08:54:31.2665407Z Agent 2 2025-12-04T08:54:31.2665713Z ******* 2025-12-04T08:54:31.2666094Z Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T08:54:31.2666570Z Uuid: CPU-XX 2025-12-04T08:54:31.2667053Z Marketing Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T08:54:31.2667568Z Vendor Name: CPU 2025-12-04T08:54:31.2668134Z Feature: None specified 2025-12-04T08:54:31.2668611Z Profile: FULL_PROFILE 2025-12-04T08:54:31.2669104Z Float Round Mode: NEAR 2025-12-04T08:54:31.2669598Z Max Queue Number: 0(0x0) 2025-12-04T08:54:31.2670085Z Queue Min Size: 0(0x0) 2025-12-04T08:54:31.2670563Z Queue Max Size: 0(0x0) 2025-12-04T08:54:31.2671034Z Queue Type: MULTI 2025-12-04T08:54:31.2671490Z Node: 1 2025-12-04T08:54:31.2671950Z Device Type: CPU 2025-12-04T08:54:31.2672373Z Cache Info: 2025-12-04T08:54:31.2672746Z L1: 49152(0xc000) KB 2025-12-04T08:54:31.2673193Z Chip ID: 0(0x0) 2025-12-04T08:54:31.2673763Z ASIC Revision: 0(0x0) 2025-12-04T08:54:31.2674259Z Cacheline Size: 64(0x40) 2025-12-04T08:54:31.2675026Z Max Clock Freq. (MHz): 3300 2025-12-04T08:54:31.2675519Z BDFID: 0 2025-12-04T08:54:31.2675994Z Internal Node ID: 1 2025-12-04T08:54:31.2676477Z Compute Unit: 64 2025-12-04T08:54:31.2676958Z SIMDs per CU: 0 2025-12-04T08:54:31.2677443Z Shader Engines: 0 2025-12-04T08:54:31.2677934Z Shader Arrs. per Eng.: 0 2025-12-04T08:54:31.2678521Z WatchPts on Addr. Ranges:1 2025-12-04T08:54:31.2679112Z Memory Properties: 2025-12-04T08:54:31.2679470Z Features: None 2025-12-04T08:54:31.2679822Z Pool Info: 2025-12-04T08:54:31.2680144Z Pool 1 2025-12-04T08:54:31.2680557Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T08:54:31.2681044Z Size: 1585355616(0x5e7e9b60) KB 2025-12-04T08:54:31.2681515Z Allocatable: TRUE 2025-12-04T08:54:31.2682015Z Alloc Granule: 4KB 2025-12-04T08:54:31.2682529Z Alloc Recommended Granule:4KB 2025-12-04T08:54:31.2683059Z Alloc Alignment: 4KB 2025-12-04T08:54:31.2683572Z Accessible by all: TRUE 2025-12-04T08:54:31.2684007Z Pool 2 2025-12-04T08:54:31.2684420Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T08:54:31.2684918Z Size: 1585355616(0x5e7e9b60) KB 2025-12-04T08:54:31.2685561Z Allocatable: TRUE 2025-12-04T08:54:31.2686052Z Alloc Granule: 4KB 2025-12-04T08:54:31.2686565Z Alloc Recommended Granule:4KB 2025-12-04T08:54:31.2687095Z Alloc Alignment: 4KB 2025-12-04T08:54:31.2687605Z Accessible by all: TRUE 2025-12-04T08:54:31.2688034Z Pool 3 2025-12-04T08:54:31.2688557Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2025-12-04T08:54:31.2689027Z Size: 1585355616(0x5e7e9b60) KB 2025-12-04T08:54:31.2689484Z Allocatable: TRUE 2025-12-04T08:54:31.2689983Z Alloc Granule: 4KB 2025-12-04T08:54:31.2690502Z Alloc Recommended Granule:4KB 2025-12-04T08:54:31.2691007Z Alloc Alignment: 4KB 2025-12-04T08:54:31.2691497Z Accessible by all: TRUE 2025-12-04T08:54:31.2691923Z Pool 4 2025-12-04T08:54:31.2692314Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T08:54:31.2692775Z Size: 1585355616(0x5e7e9b60) KB 2025-12-04T08:54:31.2693224Z Allocatable: TRUE 2025-12-04T08:54:31.2693715Z Alloc Granule: 4KB 2025-12-04T08:54:31.2694219Z Alloc Recommended Granule:4KB 2025-12-04T08:54:31.2694722Z Alloc Alignment: 4KB 2025-12-04T08:54:31.2695218Z Accessible by all: TRUE 2025-12-04T08:54:31.2695652Z ISA Info: 2025-12-04T08:54:31.2696057Z ******* 2025-12-04T08:54:31.2696367Z Agent 3 2025-12-04T08:54:31.2696658Z ******* 2025-12-04T08:54:31.2697011Z Name: gfx942 2025-12-04T08:54:31.2697467Z Uuid: GPU-2beed1e7736038cc 2025-12-04T08:54:31.2697947Z Marketing Name: AMD Instinct MI325X 2025-12-04T08:54:31.2698509Z Vendor Name: AMD 2025-12-04T08:54:31.2698990Z Feature: KERNEL_DISPATCH 2025-12-04T08:54:31.2699467Z Profile: BASE_PROFILE 2025-12-04T08:54:31.2699946Z Float Round Mode: NEAR 2025-12-04T08:54:31.2700436Z Max Queue Number: 128(0x80) 2025-12-04T08:54:31.2700907Z Queue Min Size: 64(0x40) 2025-12-04T08:54:31.2701476Z Queue Max Size: 131072(0x20000) 2025-12-04T08:54:31.2701943Z Queue Type: MULTI 2025-12-04T08:54:31.2702390Z Node: 2 2025-12-04T08:54:31.2702840Z Device Type: GPU 2025-12-04T08:54:31.2703258Z Cache Info: 2025-12-04T08:54:31.2703614Z L1: 32(0x20) KB 2025-12-04T08:54:31.2704025Z L2: 4096(0x1000) KB 2025-12-04T08:54:31.2704437Z L3: 262144(0x40000) KB 2025-12-04T08:54:31.2704865Z Chip ID: 29861(0x74a5) 2025-12-04T08:54:31.2705321Z ASIC Revision: 1(0x1) 2025-12-04T08:54:31.2705803Z Cacheline Size: 128(0x80) 2025-12-04T08:54:31.2706284Z Max Clock Freq. (MHz): 2100 2025-12-04T08:54:31.2706750Z BDFID: 5376 2025-12-04T08:54:31.2707209Z Internal Node ID: 2 2025-12-04T08:54:31.2707684Z Compute Unit: 304 2025-12-04T08:54:31.2708200Z SIMDs per CU: 4 2025-12-04T08:54:31.2708685Z Shader Engines: 32 2025-12-04T08:54:31.2709165Z Shader Arrs. per Eng.: 1 2025-12-04T08:54:31.2709666Z WatchPts on Addr. Ranges:4 2025-12-04T08:54:31.2710176Z Coherent Host Access: FALSE 2025-12-04T08:54:31.2710621Z Memory Properties: 2025-12-04T08:54:31.2710984Z Features: KERNEL_DISPATCH 2025-12-04T08:54:31.2711430Z Fast F16 Operation: TRUE 2025-12-04T08:54:31.2711931Z Wavefront Size: 64(0x40) 2025-12-04T08:54:31.2712432Z Workgroup Max Size: 1024(0x400) 2025-12-04T08:54:31.2712897Z Workgroup Max Size per Dimension: 2025-12-04T08:54:31.2713296Z x 1024(0x400) 2025-12-04T08:54:31.2713702Z y 1024(0x400) 2025-12-04T08:54:31.2714095Z z 1024(0x400) 2025-12-04T08:54:31.2714529Z Max Waves Per CU: 32(0x20) 2025-12-04T08:54:31.2715034Z Max Work-item Per CU: 2048(0x800) 2025-12-04T08:54:31.2715515Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T08:54:31.2715947Z Grid Max Size per Dimension: 2025-12-04T08:54:31.2716314Z x 4294967295(0xffffffff) 2025-12-04T08:54:31.2716714Z y 4294967295(0xffffffff) 2025-12-04T08:54:31.2717220Z z 4294967295(0xffffffff) 2025-12-04T08:54:31.2717691Z Max fbarriers/Workgrp: 32 2025-12-04T08:54:31.2727437Z Packet Processor uCode:: 185 2025-12-04T08:54:31.2727977Z SDMA engine uCode:: 24 2025-12-04T08:54:31.2728571Z IOMMU Support:: None 2025-12-04T08:54:31.2729005Z Pool Info: 2025-12-04T08:54:31.2729333Z Pool 1 2025-12-04T08:54:31.2729745Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T08:54:31.2730239Z Size: 268419072(0xfffc000) KB 2025-12-04T08:54:31.2730711Z Allocatable: TRUE 2025-12-04T08:54:31.2731200Z Alloc Granule: 4KB 2025-12-04T08:54:31.2731954Z Alloc Recommended Granule:2048KB 2025-12-04T08:54:31.2732484Z Alloc Alignment: 4KB 2025-12-04T08:54:31.2732981Z Accessible by all: FALSE 2025-12-04T08:54:31.2733422Z Pool 2 2025-12-04T08:54:31.2733835Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T08:54:31.2734305Z Size: 268419072(0xfffc000) KB 2025-12-04T08:54:31.2734766Z Allocatable: TRUE 2025-12-04T08:54:31.2735249Z Alloc Granule: 4KB 2025-12-04T08:54:31.2735758Z Alloc Recommended Granule:2048KB 2025-12-04T08:54:31.2736273Z Alloc Alignment: 4KB 2025-12-04T08:54:31.2736809Z Accessible by all: FALSE 2025-12-04T08:54:31.2737248Z Pool 3 2025-12-04T08:54:31.2737643Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T08:54:31.2738145Z Size: 268419072(0xfffc000) KB 2025-12-04T08:54:31.2738607Z Allocatable: TRUE 2025-12-04T08:54:31.2739094Z Alloc Granule: 4KB 2025-12-04T08:54:31.2739594Z Alloc Recommended Granule:2048KB 2025-12-04T08:54:31.2740100Z Alloc Alignment: 4KB 2025-12-04T08:54:31.2740598Z Accessible by all: FALSE 2025-12-04T08:54:31.2741016Z Pool 4 2025-12-04T08:54:31.2741416Z Segment: GROUP 2025-12-04T08:54:31.2741873Z Size: 64(0x40) KB 2025-12-04T08:54:31.2742337Z Allocatable: FALSE 2025-12-04T08:54:31.2742831Z Alloc Granule: 0KB 2025-12-04T08:54:31.2743354Z Alloc Recommended Granule:0KB 2025-12-04T08:54:31.2743862Z Alloc Alignment: 0KB 2025-12-04T08:54:31.2744366Z Accessible by all: FALSE 2025-12-04T08:54:31.2744807Z ISA Info: 2025-12-04T08:54:31.2745130Z ISA 1 2025-12-04T08:54:31.2745562Z Name: amdgcn-amd-amdhsa--gfx942:sramecc+:xnack- 2025-12-04T08:54:31.2746090Z Machine Models: HSA_MACHINE_MODEL_LARGE 2025-12-04T08:54:31.2746604Z Profiles: HSA_PROFILE_BASE 2025-12-04T08:54:31.2747120Z Default Rounding Mode: NEAR 2025-12-04T08:54:31.2747654Z Default Rounding Mode: NEAR 2025-12-04T08:54:31.2748309Z Fast f16: TRUE 2025-12-04T08:54:31.2748802Z Workgroup Max Size: 1024(0x400) 2025-12-04T08:54:31.2749264Z Workgroup Max Size per Dimension: 2025-12-04T08:54:31.2749698Z x 1024(0x400) 2025-12-04T08:54:31.2750124Z y 1024(0x400) 2025-12-04T08:54:31.2750526Z z 1024(0x400) 2025-12-04T08:54:31.2750995Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T08:54:31.2751443Z Grid Max Size per Dimension: 2025-12-04T08:54:31.2751823Z x 4294967295(0xffffffff) 2025-12-04T08:54:31.2752236Z y 4294967295(0xffffffff) 2025-12-04T08:54:31.2752641Z z 4294967295(0xffffffff) 2025-12-04T08:54:31.2753203Z FBarrier Max Size: 32 2025-12-04T08:54:31.2753644Z ISA 2 2025-12-04T08:54:31.2754095Z Name: amdgcn-amd-amdhsa--gfx9-4-generic:sramecc+:xnack- 2025-12-04T08:54:31.2754658Z Machine Models: HSA_MACHINE_MODEL_LARGE 2025-12-04T08:54:31.2755175Z Profiles: HSA_PROFILE_BASE 2025-12-04T08:54:31.2755675Z Default Rounding Mode: NEAR 2025-12-04T08:54:31.2756207Z Default Rounding Mode: NEAR 2025-12-04T08:54:31.2756686Z Fast f16: TRUE 2025-12-04T08:54:31.2757172Z Workgroup Max Size: 1024(0x400) 2025-12-04T08:54:31.2757641Z Workgroup Max Size per Dimension: 2025-12-04T08:54:31.2758043Z x 1024(0x400) 2025-12-04T08:54:31.2758528Z y 1024(0x400) 2025-12-04T08:54:31.2758938Z z 1024(0x400) 2025-12-04T08:54:31.2759381Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T08:54:31.2759821Z Grid Max Size per Dimension: 2025-12-04T08:54:31.2760211Z x 4294967295(0xffffffff) 2025-12-04T08:54:31.2760630Z y 4294967295(0xffffffff) 2025-12-04T08:54:31.2761042Z z 4294967295(0xffffffff) 2025-12-04T08:54:31.2761498Z FBarrier Max Size: 32 2025-12-04T08:54:31.2761924Z *** Done *** 2025-12-04T08:54:31.2786101Z ##[group]Run ngpu=$(rocminfo | grep -c -E 'Name:.*\sgfx') 2025-12-04T08:54:31.2786678Z ngpu=$(rocminfo | grep -c -E 'Name:.*\sgfx') 2025-12-04T08:54:31.2787599Z msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2025-12-04T08:54:31.2788542Z if [[ $ngpu -eq 0 ]]; then 2025-12-04T08:54:31.2789018Z  echo "Error: Failed to detect any GPUs on the runner" 2025-12-04T08:54:31.2789479Z  echo "$msg" 2025-12-04T08:54:31.2789811Z  exit 1 2025-12-04T08:54:31.2790113Z fi 2025-12-04T08:54:31.2798530Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:31.2798987Z env: 2025-12-04T08:54:31.2799262Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.2799603Z ##[endgroup] 2025-12-04T08:54:31.3456934Z ##[group]Run pytorch/pytorch/.github/actions/diskspace-cleanup@main 2025-12-04T08:54:31.3457432Z with: 2025-12-04T08:54:31.3457718Z diskspace-cutoff: 70 2025-12-04T08:54:31.3458022Z env: 2025-12-04T08:54:31.3458403Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.3458718Z ##[endgroup] 2025-12-04T08:54:31.3507684Z ##[group]Run set -ex 2025-12-04T08:54:31.3508193Z set -ex 2025-12-04T08:54:31.3508514Z diskspace_cutoff=70 2025-12-04T08:54:31.3509135Z docker_root_dir=$(docker info -f '{{.DockerRootDir}}') 2025-12-04T08:54:31.3509650Z if [ ! -d "$docker_root_dir" ]; then 2025-12-04T08:54:31.3510298Z  echo "Docker root directory ($docker_root_dir) does not exist. Skipping disk space check." 2025-12-04T08:54:31.3510897Z  exit 0 2025-12-04T08:54:31.3511173Z fi 2025-12-04T08:54:31.3511696Z diskspace=$(df -H --output=pcent ${docker_root_dir} | sed -n 2p | sed 's/%//' | sed 's/ //') 2025-12-04T08:54:31.3512757Z msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2025-12-04T08:54:31.3513665Z if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2025-12-04T08:54:31.3514131Z  docker system prune -af 2025-12-04T08:54:31.3514676Z  diskspace_new=$(df -H --output=pcent ${docker_root_dir} | sed -n 2p | sed 's/%//' | sed 's/ //') 2025-12-04T08:54:31.3515489Z  if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2025-12-04T08:54:31.3516035Z  diskspace_cutoff_int=$((diskspace_cutoff + 0)) 2025-12-04T08:54:31.3516539Z  difference=$((100 - diskspace_cutoff_int)) 2025-12-04T08:54:31.3517209Z  echo "Error: Available diskspace is less than $difference percent. Not enough diskspace." 2025-12-04T08:54:31.3517835Z  echo "$msg" 2025-12-04T08:54:31.3518234Z  exit 1 2025-12-04T08:54:31.3518539Z  else 2025-12-04T08:54:31.3518900Z  difference=$((diskspace - diskspace_new)) 2025-12-04T08:54:31.3519397Z  echo "Diskspace saved: $difference percent" 2025-12-04T08:54:31.3519811Z  fi 2025-12-04T08:54:31.3520076Z fi 2025-12-04T08:54:31.3528402Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:31.3528872Z env: 2025-12-04T08:54:31.3529153Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.3529478Z ##[endgroup] 2025-12-04T08:54:31.3558175Z + diskspace_cutoff=70 2025-12-04T08:54:31.3563002Z ++ docker info -f '{{.DockerRootDir}}' 2025-12-04T08:54:31.3917072Z + docker_root_dir=/home/runner/docker-data 2025-12-04T08:54:31.3917255Z + '[' '!' -d /home/runner/docker-data ']' 2025-12-04T08:54:31.3925184Z ++ df -H --output=pcent /home/runner/docker-data 2025-12-04T08:54:31.3925732Z ++ sed -n 2p 2025-12-04T08:54:31.3926014Z ++ sed s/%// 2025-12-04T08:54:31.3926599Z ++ sed 's/ //' 2025-12-04T08:54:31.3941378Z + diskspace=' 5' 2025-12-04T08:54:31.3942173Z + msg='Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified' 2025-12-04T08:54:31.3943055Z + [[ 5 -ge 70 ]] 2025-12-04T08:54:31.3993166Z ##[group]Run RUNNER_ARTIFACT_DIR="${RUNNER_TEMP}/artifacts" 2025-12-04T08:54:31.3993813Z RUNNER_ARTIFACT_DIR="${RUNNER_TEMP}/artifacts" 2025-12-04T08:54:31.3994333Z rm -rf "${RUNNER_ARTIFACT_DIR}" 2025-12-04T08:54:31.3994764Z mkdir -p "${RUNNER_ARTIFACT_DIR}" 2025-12-04T08:54:31.3995307Z echo "RUNNER_ARTIFACT_DIR=${RUNNER_ARTIFACT_DIR}" >> "${GITHUB_ENV}" 2025-12-04T08:54:31.3995802Z  2025-12-04T08:54:31.3996186Z RUNNER_TEST_RESULTS_DIR="${RUNNER_TEMP}/test-results" 2025-12-04T08:54:31.3996688Z rm -rf "${RUNNER_TEST_RESULTS_DIR}" 2025-12-04T08:54:31.3997108Z mkdir -p "${RUNNER_TEST_RESULTS_DIR}" 2025-12-04T08:54:31.3997680Z echo "RUNNER_TEST_RESULTS_DIR=${RUNNER_TEST_RESULTS_DIR}" >> "${GITHUB_ENV}" 2025-12-04T08:54:31.3998293Z  2025-12-04T08:54:31.3998595Z RUNNER_DOCS_DIR="${RUNNER_TEMP}/docs" 2025-12-04T08:54:31.3999005Z rm -rf "${RUNNER_DOCS_DIR}" 2025-12-04T08:54:31.3999385Z mkdir -p "${RUNNER_DOCS_DIR}" 2025-12-04T08:54:31.3999873Z echo "RUNNER_DOCS_DIR=${RUNNER_DOCS_DIR}" >> "${GITHUB_ENV}" 2025-12-04T08:54:31.4010724Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:31.4011173Z env: 2025-12-04T08:54:31.4011457Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.4011778Z ##[endgroup] 2025-12-04T08:54:31.4115852Z ##[group]Run env | grep '^GITHUB' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:31.4116560Z env | grep '^GITHUB' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:31.4117159Z env | grep '^CI' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:31.4127327Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:31.4127778Z env: 2025-12-04T08:54:31.4128118Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.4128532Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:31.4129071Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:31.4129587Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:31.4130201Z ##[endgroup] 2025-12-04T08:54:31.4212560Z ##[group]Run # All GPUs are visible to the runner; visibility, if needed, will be set by run_test.py. 2025-12-04T08:54:31.4213462Z # All GPUs are visible to the runner; visibility, if needed, will be set by run_test.py. 2025-12-04T08:54:31.4214101Z # Add render group for container creation. 2025-12-04T08:54:31.4214635Z render_gid=`cat /etc/group | grep render | cut -d: -f3` 2025-12-04T08:54:31.4215272Z # Ensure GPU isolation if pod is part of kubernetes setup with DEVICE_FLAG. 2025-12-04T08:54:31.4215913Z if [ -f "/etc/podinfo/gha-render-devices" ]; then 2025-12-04T08:54:31.4216442Z  DEVICE_FLAG=$(cat /etc/podinfo/gha-render-devices) 2025-12-04T08:54:31.4216887Z else 2025-12-04T08:54:31.4217208Z  DEVICE_FLAG="--device /dev/dri" 2025-12-04T08:54:31.4217566Z fi 2025-12-04T08:54:31.4218198Z # The --group-add daemon and --group-add bin are needed in the Ubuntu 24.04 and Almalinux OSs respectively. 2025-12-04T08:54:31.4219110Z # This is due to the device files (/dev/kfd & /dev/dri) being owned by video group on bare metal. 2025-12-04T08:54:31.4219921Z # This video group ID maps to subgid 1 inside the docker image due to the /etc/subgid entries. 2025-12-04T08:54:31.4220777Z # The group name corresponding to group ID 1 can change depending on the OS, so both are necessary. 2025-12-04T08:54:31.4222235Z echo "GPU_FLAG=--device=/dev/mem --device=/dev/kfd $DEVICE_FLAG --group-add video --group-add $render_gid --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host" >> "${GITHUB_ENV}" 2025-12-04T08:54:31.4231442Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:31.4231909Z env: 2025-12-04T08:54:31.4232192Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.4232601Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:31.4233160Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:31.4233685Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:31.4234076Z ##[endgroup] 2025-12-04T08:54:31.4362968Z ##[group]Run aws-actions/configure-aws-credentials@ececac1a45f3b08a01d2dd070d28d111c5fe6722 2025-12-04T08:54:31.4363595Z with: 2025-12-04T08:54:31.4364044Z role-to-assume: arn:aws:iam::308535385114:role/gha_workflow_s3_and_ecr_read_only 2025-12-04T08:54:31.4364581Z aws-region: us-east-1 2025-12-04T08:54:31.4364926Z role-duration-seconds: 18000 2025-12-04T08:54:31.4365295Z audience: sts.amazonaws.com 2025-12-04T08:54:31.4365624Z env: 2025-12-04T08:54:31.4365895Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:31.4366306Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:31.4366852Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:31.4367363Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:31.4368893Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:31.4370098Z ##[endgroup] 2025-12-04T08:54:31.7739371Z Assuming role with OIDC 2025-12-04T08:54:32.1388505Z Authenticated as assumedRoleId AROAUPVRELQNLLCOPFEJR:GitHubActions 2025-12-04T08:54:32.2571053Z ##[group]Run aws-actions/amazon-ecr-login@062b18b96a7aff071d4dc91bc00c4c1a7945b076 2025-12-04T08:54:32.2571662Z with: 2025-12-04T08:54:32.2571976Z mask-password: true 2025-12-04T08:54:32.2572321Z registry-type: private 2025-12-04T08:54:32.2572673Z skip-logout: false 2025-12-04T08:54:32.2572985Z env: 2025-12-04T08:54:32.2573285Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:32.2573732Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:32.2574309Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:32.2574851Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:32.2576358Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:32.2577612Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:32.2577991Z AWS_REGION: us-east-1 2025-12-04T08:54:32.2578954Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:32.2579461Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:32.2586280Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:32.2586619Z ##[endgroup] 2025-12-04T08:54:32.7020966Z Logging into registry 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:33.4258980Z ##[group]Run env | grep '^GITHUB' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:33.4259712Z env | grep '^GITHUB' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:33.4260352Z env | grep '^CI' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:33.4261000Z env | grep '^RUNNER' >> "${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" 2025-12-04T08:54:33.4271210Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:33.4271689Z env: 2025-12-04T08:54:33.4271991Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:33.4272433Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:33.4273008Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:33.4273549Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:33.4274816Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:33.4276064Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:33.4276441Z AWS_REGION: us-east-1 2025-12-04T08:54:33.4276934Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:33.4277479Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:33.4284910Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:33.4285261Z ##[endgroup] 2025-12-04T08:54:33.4608623Z ##[group]Run pytorch/test-infra/.github/actions/calculate-docker-image@main 2025-12-04T08:54:33.4609240Z with: 2025-12-04T08:54:33.4610147Z docker-image-name: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:33.4611143Z use-custom-docker-registry: true 2025-12-04T08:54:33.4611563Z docker-build-dir: .ci/docker 2025-12-04T08:54:33.4611962Z docker-build-script: ./build.sh 2025-12-04T08:54:33.4612359Z working-directory: . 2025-12-04T08:54:33.4612825Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:33.4613330Z force-push: false 2025-12-04T08:54:33.4613636Z env: 2025-12-04T08:54:33.4613934Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:33.4614384Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:33.4614984Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:33.4615568Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:33.4616863Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:33.4618238Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:33.4618615Z AWS_REGION: us-east-1 2025-12-04T08:54:33.4619164Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:33.4619669Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:33.4626513Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:33.4626892Z ##[endgroup] 2025-12-04T08:54:33.4652338Z ##[group]Run set -ex 2025-12-04T08:54:33.4652759Z set -ex 2025-12-04T08:54:33.4653062Z  2025-12-04T08:54:33.4653568Z # If the docker build directory or the build script doesn't exist, the action will 2025-12-04T08:54:33.4654385Z # gracefully return the docker image name as it is. Pulling docker image in Linux 2025-12-04T08:54:33.4655296Z # job could then download the pre-built image as usual 2025-12-04T08:54:33.4656144Z if [[ -d "${DOCKER_BUILD_DIR}" ]] && [[ -f "${DOCKER_BUILD_DIR}/${DOCKER_BUILD_SCRIPT}" ]] && [[ "${USE_CUSTOM_DOCKER_REGISTRY}" == "true" ]]; then 2025-12-04T08:54:33.4656945Z  echo "skip=false" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:33.4657370Z else 2025-12-04T08:54:33.4657723Z  echo "skip=true" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:33.4658607Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:33.4659110Z  2025-12-04T08:54:33.4659790Z  echo "Not using custom ECR registry. Either it was not requested or there is no Docker build script in the ${REPO_NAME} repo..." 2025-12-04T08:54:33.4660557Z  exit 0 2025-12-04T08:54:33.4660854Z fi 2025-12-04T08:54:33.4661139Z  2025-12-04T08:54:33.4661603Z if [[ "${DOCKER_IMAGE_NAME}" == *"${DOCKER_REGISTRY}/${REPO_NAME}"* ]]; then 2025-12-04T08:54:33.4662347Z  # The docker image name already includes the ECR prefix and tag, so we can just 2025-12-04T08:54:33.4663011Z  # use it as it is, but first let's extract the tag 2025-12-04T08:54:33.4663627Z  DOCKER_TAG=$(echo "${DOCKER_IMAGE_NAME}" | awk -F '[:,]' '{print $2}') 2025-12-04T08:54:33.4664265Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:33.4664876Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:33.4665376Z else 2025-12-04T08:54:33.4665736Z  if [[ "${DOCKER_IMAGE_NAME}" == *:* ]]; then 2025-12-04T08:54:33.4666231Z  CUSTOM_TAG_PREFIX=${DOCKER_IMAGE_NAME#*:} 2025-12-04T08:54:33.4666735Z  DOCKER_IMAGE_NAME=${DOCKER_IMAGE_NAME%%:*} 2025-12-04T08:54:33.4667159Z  fi 2025-12-04T08:54:33.4667950Z  DOCKER_TAG=${CUSTOM_TAG_PREFIX:+${CUSTOM_TAG_PREFIX}-}$(git rev-parse HEAD:"${DOCKER_BUILD_DIR}") 2025-12-04T08:54:33.4668780Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:33.4669563Z  echo "docker-image=${DOCKER_REGISTRY}/${REPO_NAME}/${DOCKER_IMAGE_NAME}:${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:33.4670411Z  echo "custom-tag-prefix=${CUSTOM_TAG_PREFIX}" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:33.4670938Z fi 2025-12-04T08:54:33.4681478Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:33.4681955Z env: 2025-12-04T08:54:33.4682260Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:33.4682707Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:33.4683299Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:33.4683856Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:33.4685145Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:33.4686411Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:33.4686789Z AWS_REGION: us-east-1 2025-12-04T08:54:33.4687246Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:33.4687750Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:33.4694639Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:33.4694989Z REPO_NAME: pytorch 2025-12-04T08:54:33.4695914Z DOCKER_IMAGE_NAME: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:33.4696893Z DOCKER_BUILD_DIR: .ci/docker 2025-12-04T08:54:33.4697283Z DOCKER_BUILD_SCRIPT: ./build.sh 2025-12-04T08:54:33.4697786Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:33.4698392Z USE_CUSTOM_DOCKER_REGISTRY: true 2025-12-04T08:54:33.4698780Z CUSTOM_TAG_PREFIX: 2025-12-04T08:54:33.4699242Z ##[endgroup] 2025-12-04T08:54:33.4735160Z + [[ -d .ci/docker ]] 2025-12-04T08:54:33.4735543Z + [[ -f .ci/docker/./build.sh ]] 2025-12-04T08:54:33.4735938Z + [[ true == \t\r\u\e ]] 2025-12-04T08:54:33.4736278Z + echo skip=false 2025-12-04T08:54:33.4737485Z + [[ 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a == *\3\0\8\5\3\5\3\8\5\1\1\4\.\d\k\r\.\e\c\r\.\u\s\-\e\a\s\t\-\1\.\a\m\a\z\o\n\a\w\s\.\c\o\m\/\p\y\t\o\r\c\h* ]] 2025-12-04T08:54:33.4748327Z ++ echo 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:33.4749252Z ++ awk -F '[:,]' '{print $2}' 2025-12-04T08:54:33.4767965Z + DOCKER_TAG=pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:33.4768938Z + echo docker-tag=pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:33.4770191Z + echo docker-image=308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:33.4816751Z ##[group]Run set +e 2025-12-04T08:54:33.4817191Z set +e 2025-12-04T08:54:33.4817496Z set -x 2025-12-04T08:54:33.4817788Z  2025-12-04T08:54:33.4818134Z login() { 2025-12-04T08:54:33.4818853Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2025-12-04T08:54:33.4819514Z } 2025-12-04T08:54:33.4819761Z  2025-12-04T08:54:33.4820040Z retry () { 2025-12-04T08:54:33.4820406Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2025-12-04T08:54:33.4820803Z } 2025-12-04T08:54:33.4821185Z  2025-12-04T08:54:33.4821603Z retry login "${DOCKER_REGISTRY}" 2025-12-04T08:54:33.4821986Z  2025-12-04T08:54:33.4822279Z START_TIME=$(date +%s) 2025-12-04T08:54:33.4822711Z # Wait up to 120 minutes 2025-12-04T08:54:33.4823437Z while [[ $(( $(date +%s) - 7200 )) -lt $START_TIME ]]; do 2025-12-04T08:54:33.4824044Z  # Check if image already exists, if it does then skip building it 2025-12-04T08:54:33.4824655Z  if docker manifest inspect "${DOCKER_IMAGE}"; then 2025-12-04T08:54:33.4825108Z  exit 0 2025-12-04T08:54:33.4825413Z  fi 2025-12-04T08:54:33.4825689Z  2025-12-04T08:54:33.4826171Z  # NB: This flag is used by Docker build workflow to push the image to ECR, so we can 2025-12-04T08:54:33.4826977Z  # use this to differentiate between the Docker build and regular build jobs. For the 2025-12-04T08:54:33.4827770Z  # latter, it will wait for the Docker images to become available before continuing 2025-12-04T08:54:33.4828462Z  if [ "${DOCKER_PUSH:-false}" == "true" ]; then 2025-12-04T08:54:33.4828979Z  # It's a Docker build job, let's build the image 2025-12-04T08:54:33.4829420Z  break 2025-12-04T08:54:33.4829694Z  else 2025-12-04T08:54:33.4830129Z  # It's a regular build job, wait for the image to become available 2025-12-04T08:54:33.4830633Z  sleep 300 2025-12-04T08:54:33.4830955Z  fi 2025-12-04T08:54:33.4831237Z done 2025-12-04T08:54:33.4831520Z  2025-12-04T08:54:33.4831966Z # NB: This part requires a full checkout. Otherwise, the merge base will 2025-12-04T08:54:33.4832651Z # be empty. The default action would be to continue rebuild the image 2025-12-04T08:54:33.4833287Z if [[ "$BASE_REVISION" = "$(git rev-parse HEAD)" ]]; then 2025-12-04T08:54:33.4833858Z  # if we're on the base branch then use the parent commit 2025-12-04T08:54:33.4834359Z  MERGE_BASE=$(git rev-parse HEAD~) 2025-12-04T08:54:33.4834751Z else 2025-12-04T08:54:33.4835170Z  # otherwise we're on a PR, so use the most recent base commit 2025-12-04T08:54:33.4835954Z  MERGE_BASE=$(git merge-base HEAD "$BASE_REVISION") 2025-12-04T08:54:33.4836398Z fi 2025-12-04T08:54:33.4836676Z  2025-12-04T08:54:33.4836986Z if [[ -z "${MERGE_BASE}" ]]; then 2025-12-04T08:54:33.4837442Z  echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:33.4837856Z  2025-12-04T08:54:33.4838779Z  echo "Finding merge base only works with full checkout, please set fetch-depth to 0, continuing ..." 2025-12-04T08:54:33.4839437Z  exit 0 2025-12-04T08:54:33.4839694Z fi 2025-12-04T08:54:33.4839974Z  2025-12-04T08:54:33.4840373Z if ! git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}"; then 2025-12-04T08:54:33.4841197Z  echo "Directory '${DOCKER_BUILD_DIR}' not found in commit $MERGE_BASE, you should rebase onto a more recent commit" 2025-12-04T08:54:33.4841885Z  exit 1 2025-12-04T08:54:33.4842184Z fi 2025-12-04T08:54:33.4842464Z  2025-12-04T08:54:33.4842939Z PREVIOUS_DOCKER_TAG=$(git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}") 2025-12-04T08:54:33.4843735Z # If no image exists but the hash is the same as the previous hash then we should error out here 2025-12-04T08:54:33.4844450Z if [[ "${PREVIOUS_DOCKER_TAG}" == "${DOCKER_TAG}" ]]; then 2025-12-04T08:54:33.4845260Z  echo "WARNING: Something has gone wrong and the previous image isn't available for the merge-base of your branch" 2025-12-04T08:54:33.4846168Z  echo " Will re-build docker image to store in local cache, TTS may be longer" 2025-12-04T08:54:33.4846719Z fi 2025-12-04T08:54:33.4846994Z  2025-12-04T08:54:33.4847345Z echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2025-12-04T08:54:33.4857965Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:33.4858500Z env: 2025-12-04T08:54:33.4858805Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:33.4859258Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:33.4859900Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:33.4860441Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:33.4861707Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:33.4862959Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:33.4863341Z AWS_REGION: us-east-1 2025-12-04T08:54:33.4863888Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:33.4864398Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:33.4871130Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:33.4871491Z DOCKER_BUILD_DIR: .ci/docker 2025-12-04T08:54:33.4871955Z BASE_REVISION: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T08:54:33.4872985Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:33.4874155Z DOCKER_TAG: pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:33.4874904Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:33.4875392Z DOCKER_PUSH: 2025-12-04T08:54:33.4875704Z ##[endgroup] 2025-12-04T08:54:33.4907924Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:33.4908580Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:33.4913500Z + aws ecr get-login-password --region us-east-1 2025-12-04T08:54:33.4914151Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:33.4915026Z /home/runner/_work/_temp/16d572d4-9bab-4dba-9533-5dce113d33b9.sh: line 5: aws: command not found 2025-12-04T08:54:33.5018662Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:33.5029210Z + sleep 1 2025-12-04T08:54:34.5045456Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:34.5049228Z + aws ecr get-login-password --region us-east-1 2025-12-04T08:54:34.5050040Z /home/runner/_work/_temp/16d572d4-9bab-4dba-9533-5dce113d33b9.sh: line 5: aws: command not found 2025-12-04T08:54:34.5051522Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:34.5146443Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:34.5161461Z + sleep 2 2025-12-04T08:54:36.5181419Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:36.5186373Z + aws ecr get-login-password --region us-east-1 2025-12-04T08:54:36.5187172Z /home/runner/_work/_temp/16d572d4-9bab-4dba-9533-5dce113d33b9.sh: line 5: aws: command not found 2025-12-04T08:54:36.5188262Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:36.5287397Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:36.5301018Z ++ date +%s 2025-12-04T08:54:36.5307969Z + START_TIME=1764838476 2025-12-04T08:54:36.5311298Z ++ date +%s 2025-12-04T08:54:36.5319370Z + [[ 1764831276 -lt 1764838476 ]] 2025-12-04T08:54:36.5320416Z + docker manifest inspect 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:38.0290105Z { 2025-12-04T08:54:38.0290592Z "schemaVersion": 2, 2025-12-04T08:54:38.0291230Z "mediaType": "application/vnd.docker.distribution.manifest.v2+json", 2025-12-04T08:54:38.0291802Z "config": { 2025-12-04T08:54:38.0292241Z "mediaType": "application/vnd.docker.container.image.v1+json", 2025-12-04T08:54:38.0292753Z "size": 30520, 2025-12-04T08:54:38.0293263Z "digest": "sha256:45252333063339f104d56e41f20304e9511ab21c7768e8d156b95ddf24a9dbe5" 2025-12-04T08:54:38.0293819Z }, 2025-12-04T08:54:38.0294081Z "layers": [ 2025-12-04T08:54:38.0294360Z { 2025-12-04T08:54:38.0294783Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0295364Z "size": 30447951, 2025-12-04T08:54:38.0296804Z "digest": "sha256:63e5bc7682b85ae57a1221210f64d62e7a90b0a30f19af4ca734b8242ae49d63" 2025-12-04T08:54:38.0297371Z }, 2025-12-04T08:54:38.0297634Z { 2025-12-04T08:54:38.0298039Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0298649Z "size": 1554, 2025-12-04T08:54:38.0299334Z "digest": "sha256:835841cca3b7e1464290cdb78e48773e03583413fbed852c3cc5165a392ea44d" 2025-12-04T08:54:38.0300328Z }, 2025-12-04T08:54:38.0300724Z { 2025-12-04T08:54:38.0301233Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0301911Z "size": 313275691, 2025-12-04T08:54:38.0302542Z "digest": "sha256:aac69780afc8611a5f94a235792d39ae055249c8319ef43b78675998a9b2f825" 2025-12-04T08:54:38.0327548Z }, 2025-12-04T08:54:38.0327971Z { 2025-12-04T08:54:38.0328649Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0329322Z "size": 704, 2025-12-04T08:54:38.0329865Z "digest": "sha256:029495b23122c840ca0e52d487afa8d2c4dbf1991cd7f204ec3e434dcf947bf4" 2025-12-04T08:54:38.0330439Z }, 2025-12-04T08:54:38.0330697Z { 2025-12-04T08:54:38.0331109Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0331605Z "size": 1218, 2025-12-04T08:54:38.0332111Z "digest": "sha256:d0fb85b008332051a3f7c052721ef68bde404b46c23fa43ad040373bd367826c" 2025-12-04T08:54:38.0332664Z }, 2025-12-04T08:54:38.0332908Z { 2025-12-04T08:54:38.0333315Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0333812Z "size": 484, 2025-12-04T08:54:38.0334313Z "digest": "sha256:59b63930883363c7d2aaab27cc61555d9f3e119dc18247a8624c98ebdaa354a5" 2025-12-04T08:54:38.0334865Z }, 2025-12-04T08:54:38.0335118Z { 2025-12-04T08:54:38.0335523Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0336354Z "size": 110363202, 2025-12-04T08:54:38.0336899Z "digest": "sha256:dc112c89d57aa1e85082e40a56e5bc743d64f834ae2f98afe91f60c248354d38" 2025-12-04T08:54:38.0337470Z }, 2025-12-04T08:54:38.0337719Z { 2025-12-04T08:54:38.0338193Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0338694Z "size": 4436, 2025-12-04T08:54:38.0339189Z "digest": "sha256:522eab2402e5001810155ef7eb56940b7c01a4fef62ac588886981c3b8ee8e1e" 2025-12-04T08:54:38.0339876Z + exit 0 2025-12-04T08:54:38.0340128Z }, 2025-12-04T08:54:38.0340371Z { 2025-12-04T08:54:38.0340768Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0341257Z "size": 1755, 2025-12-04T08:54:38.0341752Z "digest": "sha256:2b5a11b41761d8ea3b829e4772e4064cb6c4e4989126af324d0057661e4493a1" 2025-12-04T08:54:38.0342305Z }, 2025-12-04T08:54:38.0342553Z { 2025-12-04T08:54:38.0342955Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0343461Z "size": 724, 2025-12-04T08:54:38.0343963Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T08:54:38.0344737Z }, 2025-12-04T08:54:38.0344990Z { 2025-12-04T08:54:38.0345398Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0345903Z "size": 3185588166, 2025-12-04T08:54:38.0346432Z "digest": "sha256:73e33534e9eb94cf29418d65944168962b65fe21f55e9b8bad18c76e9b3a37b8" 2025-12-04T08:54:38.0346983Z }, 2025-12-04T08:54:38.0347234Z { 2025-12-04T08:54:38.0347636Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0348193Z "size": 396, 2025-12-04T08:54:38.0348715Z "digest": "sha256:5bfdaeb5578d6ffcd7db29c48303cbceb13c591210feaa216a8daa7a6d445b4b" 2025-12-04T08:54:38.0349293Z }, 2025-12-04T08:54:38.0349540Z { 2025-12-04T08:54:38.0349948Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0350452Z "size": 236863, 2025-12-04T08:54:38.0350992Z "digest": "sha256:c07d27e4d3a5ba4ad5325bb785b2e4f058fe5e10ec1aeeb413a1e152b073f203" 2025-12-04T08:54:38.0351730Z }, 2025-12-04T08:54:38.0351986Z { 2025-12-04T08:54:38.0352398Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0352900Z "size": 787, 2025-12-04T08:54:38.0353418Z "digest": "sha256:b21856d1bf420da6fa8ec7331b82ab355d4f4178644e7d3a3d3d0fbc3610109a" 2025-12-04T08:54:38.0353987Z }, 2025-12-04T08:54:38.0354239Z { 2025-12-04T08:54:38.0354648Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0355153Z "size": 106, 2025-12-04T08:54:38.0355666Z "digest": "sha256:cb19d84867e4063f55db9459c28c50a2abc37c06d3c1ca82ba95fa8427cc438a" 2025-12-04T08:54:38.0356233Z }, 2025-12-04T08:54:38.0356485Z { 2025-12-04T08:54:38.0356888Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0357388Z "size": 1496, 2025-12-04T08:54:38.0357901Z "digest": "sha256:8165374f8dccf88a7791a5d31afbe29e4d4542b4f1cf1904945e07f9af6bf8ba" 2025-12-04T08:54:38.0358612Z }, 2025-12-04T08:54:38.0358873Z { 2025-12-04T08:54:38.0359270Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0359775Z "size": 458789560, 2025-12-04T08:54:38.0360315Z "digest": "sha256:1aecc77354ceba59ec6f0d37a558f2dbb6d5c0854553ee8505ac8707b422da6d" 2025-12-04T08:54:38.0360887Z }, 2025-12-04T08:54:38.0361138Z { 2025-12-04T08:54:38.0361546Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0362215Z "size": 164, 2025-12-04T08:54:38.0362730Z "digest": "sha256:465d3fd643aa2ea0ad07335cda66f12f1d7e5e800c4e9385ec466bc8a1ceabda" 2025-12-04T08:54:38.0363298Z }, 2025-12-04T08:54:38.0363550Z { 2025-12-04T08:54:38.0363954Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0364457Z "size": 104, 2025-12-04T08:54:38.0364963Z "digest": "sha256:6c503e779d6f41ca7f51309875df2b725c171926aece7009c4b8a64d1ba3f58e" 2025-12-04T08:54:38.0365666Z }, 2025-12-04T08:54:38.0365919Z { 2025-12-04T08:54:38.0366332Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0366830Z "size": 724, 2025-12-04T08:54:38.0367329Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T08:54:38.0367878Z }, 2025-12-04T08:54:38.0368197Z { 2025-12-04T08:54:38.0368603Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0369101Z "size": 196, 2025-12-04T08:54:38.0369609Z "digest": "sha256:f7e9a021f0ee3d11a50dcb96378af8103a21f6c3c142f54529207648f3ed00b2" 2025-12-04T08:54:38.0370164Z }, 2025-12-04T08:54:38.0370412Z { 2025-12-04T08:54:38.0370814Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0371310Z "size": 2583, 2025-12-04T08:54:38.0371814Z "digest": "sha256:8e023b349080fb11ee55491bc9b842b30e9e3a90246d05b303a73dc62038caf2" 2025-12-04T08:54:38.0372380Z }, 2025-12-04T08:54:38.0372630Z { 2025-12-04T08:54:38.0373042Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0373549Z "size": 7577171420, 2025-12-04T08:54:38.0374083Z "digest": "sha256:8188df80e595a3dbcf84623c6a58a655269898cbb60029435f136d7f9d34ccaa" 2025-12-04T08:54:38.0374642Z }, 2025-12-04T08:54:38.0374891Z { 2025-12-04T08:54:38.0375289Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0375798Z "size": 135, 2025-12-04T08:54:38.0376312Z "digest": "sha256:3c2c2f8c74bfa16c4bf9a832c97bbb1d55205b2b4a2cead02cf74301ca1001fb" 2025-12-04T08:54:38.0376880Z }, 2025-12-04T08:54:38.0377124Z { 2025-12-04T08:54:38.0377685Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0378223Z "size": 104, 2025-12-04T08:54:38.0378735Z "digest": "sha256:2aa7784fbe3300f8bbfb6bb51cff3b01fd091e829c2bc7ab9e25261a0dd9b3bd" 2025-12-04T08:54:38.0379303Z }, 2025-12-04T08:54:38.0379547Z { 2025-12-04T08:54:38.0379951Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0380618Z "size": 612, 2025-12-04T08:54:38.0381123Z "digest": "sha256:2b3b5215d3ebe8789f0444457bfd5a6e218289b64aa07653ac3d03ddda5e6708" 2025-12-04T08:54:38.0381677Z }, 2025-12-04T08:54:38.0381921Z { 2025-12-04T08:54:38.0382316Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0382812Z "size": 838191945, 2025-12-04T08:54:38.0383345Z "digest": "sha256:99b1f1ea3e857834cebd01763d90fbd700aeb9c2d2ef23eda2cfff5652c9708b" 2025-12-04T08:54:38.0383906Z }, 2025-12-04T08:54:38.0384149Z { 2025-12-04T08:54:38.0384546Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0385030Z "size": 111, 2025-12-04T08:54:38.0385535Z "digest": "sha256:18d6daba0a5768a37ad106b57974f6b7efd35c43a87c246bcd3f43fea88f2d2b" 2025-12-04T08:54:38.0386097Z }, 2025-12-04T08:54:38.0386342Z { 2025-12-04T08:54:38.0386739Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0387241Z "size": 1555, 2025-12-04T08:54:38.0387757Z "digest": "sha256:5277f2a503ebd17ba9d9b86cc9bac86265504adeb449c0647616ddaacd3cbc41" 2025-12-04T08:54:38.0388382Z }, 2025-12-04T08:54:38.0388628Z { 2025-12-04T08:54:38.0389025Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0389521Z "size": 107, 2025-12-04T08:54:38.0390024Z "digest": "sha256:3198a9717aace920fd5de085319adf75091af05fc4318ce4b16a8a5b0e8d449e" 2025-12-04T08:54:38.0390582Z }, 2025-12-04T08:54:38.0390828Z { 2025-12-04T08:54:38.0391225Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0391722Z "size": 166, 2025-12-04T08:54:38.0392211Z "digest": "sha256:99a4918e5808277879449e97ccd7190db6b9aa2d742b57a3b831ce0198522bdd" 2025-12-04T08:54:38.0392754Z }, 2025-12-04T08:54:38.0392998Z { 2025-12-04T08:54:38.0393396Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0394018Z "size": 3526081, 2025-12-04T08:54:38.0394541Z "digest": "sha256:15bb11dfc6acc3537d527d6771c8e711e5605e99f82ec41e805d4600b8a97516" 2025-12-04T08:54:38.0395091Z }, 2025-12-04T08:54:38.0395337Z { 2025-12-04T08:54:38.0395746Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0396246Z "size": 107, 2025-12-04T08:54:38.0396752Z "digest": "sha256:bd87c8766e90e33db17514558ac591cc3f4149afd7abeaef4dd5770bbfa14210" 2025-12-04T08:54:38.0397311Z }, 2025-12-04T08:54:38.0397555Z { 2025-12-04T08:54:38.0397954Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0398519Z "size": 829, 2025-12-04T08:54:38.0399015Z "digest": "sha256:1969e15d0c13874ea5883ed829235a19ef6dc21c8aa6172032b78a8ffa6ff262" 2025-12-04T08:54:38.0399565Z }, 2025-12-04T08:54:38.0399808Z { 2025-12-04T08:54:38.0400206Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0400711Z "size": 26973054, 2025-12-04T08:54:38.0401240Z "digest": "sha256:24a03847d382b73c11969f8f73916a6bedf5ccea12f6f4290b3880f29ceda32a" 2025-12-04T08:54:38.0401795Z }, 2025-12-04T08:54:38.0402041Z { 2025-12-04T08:54:38.0402440Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0402935Z "size": 104, 2025-12-04T08:54:38.0403441Z "digest": "sha256:816e2e34e01839a35d624dbf4bd9ac9bea4c975104af47a0e6b6b6dee6c6f98d" 2025-12-04T08:54:38.0404000Z }, 2025-12-04T08:54:38.0404244Z { 2025-12-04T08:54:38.0404637Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0405129Z "size": 424, 2025-12-04T08:54:38.0405627Z "digest": "sha256:b168858b85373f8ddca549d79267a06de4fa945d04bf791c55c9ddc93957fa3c" 2025-12-04T08:54:38.0406182Z }, 2025-12-04T08:54:38.0406427Z { 2025-12-04T08:54:38.0406820Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0407319Z "size": 19309386, 2025-12-04T08:54:38.0407863Z "digest": "sha256:6b8d5ff02e267e38322afbb8a58ed63ce9d75b10e9e73255e6affcbc6b6539bf" 2025-12-04T08:54:38.0408600Z }, 2025-12-04T08:54:38.0408850Z { 2025-12-04T08:54:38.0409251Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0409742Z "size": 826, 2025-12-04T08:54:38.0410245Z "digest": "sha256:4e3b10a5dd6aed29f238d604925e2a4f873141c1087c8dd4fdde5c61e7560893" 2025-12-04T08:54:38.0410804Z }, 2025-12-04T08:54:38.0411051Z { 2025-12-04T08:54:38.0411448Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0411935Z "size": 724, 2025-12-04T08:54:38.0412421Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T08:54:38.0412967Z }, 2025-12-04T08:54:38.0413211Z { 2025-12-04T08:54:38.0413610Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0414101Z "size": 149, 2025-12-04T08:54:38.0414599Z "digest": "sha256:3092fab73b59190b9facfc49bf18f58612172bc2fd68dfa339a1118632616939" 2025-12-04T08:54:38.0415161Z }, 2025-12-04T08:54:38.0415416Z { 2025-12-04T08:54:38.0415817Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0416310Z "size": 136, 2025-12-04T08:54:38.0416820Z "digest": "sha256:20020dd28a15ba092fcbfe906ee39cdddfcc9d0b7eb42fdd6f4c08a984fa9c00" 2025-12-04T08:54:38.0417387Z }, 2025-12-04T08:54:38.0417631Z { 2025-12-04T08:54:38.0418029Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0418599Z "size": 140, 2025-12-04T08:54:38.0419102Z "digest": "sha256:ae5280ce969dcff08c091e9a5f7641f13561b2b0ee44d78b7c3f81d8fe8e6d32" 2025-12-04T08:54:38.0419661Z }, 2025-12-04T08:54:38.0419905Z { 2025-12-04T08:54:38.0420299Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0420792Z "size": 32, 2025-12-04T08:54:38.0421299Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T08:54:38.0421967Z }, 2025-12-04T08:54:38.0422212Z { 2025-12-04T08:54:38.0422616Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0423105Z "size": 222, 2025-12-04T08:54:38.0423614Z "digest": "sha256:fe17d9eb0fd26d3af4c724bf570d833978b131cedb7dc17a800aa388a246b3cd" 2025-12-04T08:54:38.0424177Z }, 2025-12-04T08:54:38.0424416Z { 2025-12-04T08:54:38.0424818Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0425310Z "size": 346, 2025-12-04T08:54:38.0425802Z "digest": "sha256:a51e0dab2d596e6563483f27c12660007160847d177ba4c31812a8f44ada5754" 2025-12-04T08:54:38.0426345Z }, 2025-12-04T08:54:38.0426588Z { 2025-12-04T08:54:38.0426985Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0427478Z "size": 88300, 2025-12-04T08:54:38.0427997Z "digest": "sha256:6eb176cefd72d37ecbcdf074289a8f1de732d8816cc695ece7e4709d098094d6" 2025-12-04T08:54:38.0428621Z }, 2025-12-04T08:54:38.0428876Z { 2025-12-04T08:54:38.0429281Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0429777Z "size": 106, 2025-12-04T08:54:38.0430278Z "digest": "sha256:e7b8cf2e8d5a4c56db9726ce62c1176032408b3b1c25a000592361cb4245e2b5" 2025-12-04T08:54:38.0430833Z }, 2025-12-04T08:54:38.0431079Z { 2025-12-04T08:54:38.0431479Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0431975Z "size": 1671, 2025-12-04T08:54:38.0432489Z "digest": "sha256:ef3a5060abce88884bc8bd815aa41c46427f34eeb132fe0ddd85a3f86e6dc83d" 2025-12-04T08:54:38.0433052Z }, 2025-12-04T08:54:38.0433299Z { 2025-12-04T08:54:38.0433693Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0434183Z "size": 724, 2025-12-04T08:54:38.0434846Z "digest": "sha256:9681563a88ff9e62494a2740e537440d3df978d466c9478d6a941fae8b57b084" 2025-12-04T08:54:38.0435393Z }, 2025-12-04T08:54:38.0435639Z { 2025-12-04T08:54:38.0436050Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0436666Z "size": 138, 2025-12-04T08:54:38.0437183Z "digest": "sha256:a6f4ec14b42b8f0a83d20aa6a985ddb6a1bf64e0ed3d44afd3484b87d4ed5ad3" 2025-12-04T08:54:38.0437747Z }, 2025-12-04T08:54:38.0437994Z { 2025-12-04T08:54:38.0438449Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0438943Z "size": 119, 2025-12-04T08:54:38.0439452Z "digest": "sha256:7e5a0c956cfbd6f8074fbfd3b1d416e6635d632835ec00c8dd4c015a21da19b4" 2025-12-04T08:54:38.0440013Z }, 2025-12-04T08:54:38.0440261Z { 2025-12-04T08:54:38.0440662Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0441159Z "size": 6238423049, 2025-12-04T08:54:38.0441693Z "digest": "sha256:b4f78730cfe76ce091b78b2e2e3d52be03f1097b3e4c3de5bd79f8d13a853132" 2025-12-04T08:54:38.0442262Z }, 2025-12-04T08:54:38.0442509Z { 2025-12-04T08:54:38.0442912Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0443412Z "size": 174, 2025-12-04T08:54:38.0443909Z "digest": "sha256:081028f24389b112683689fd362e8c0d6f358082710e72feab91cea6383feb4d" 2025-12-04T08:54:38.0444451Z }, 2025-12-04T08:54:38.0444700Z { 2025-12-04T08:54:38.0445102Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0445598Z "size": 1896, 2025-12-04T08:54:38.0446128Z "digest": "sha256:a534dcf4b9a9e5fabed742c8a8fc43c9cfe7346ea88ab3c177c3b14fd3afe00a" 2025-12-04T08:54:38.0446702Z }, 2025-12-04T08:54:38.0446949Z { 2025-12-04T08:54:38.0447347Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0447849Z "size": 197577597, 2025-12-04T08:54:38.0448425Z "digest": "sha256:2e77500302cc13224427e1d74e471bd79d5109ba6a5099a83df1d10b786f71ba" 2025-12-04T08:54:38.0448976Z }, 2025-12-04T08:54:38.0449223Z { 2025-12-04T08:54:38.0449628Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0450236Z "size": 304, 2025-12-04T08:54:38.0450756Z "digest": "sha256:bc08246bb4ba18c3ec5bc69e16b6b4e929c5bd0f3fae10eeb0b1a622a63d6fa2" 2025-12-04T08:54:38.0451327Z }, 2025-12-04T08:54:38.0451576Z { 2025-12-04T08:54:38.0451978Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0452472Z "size": 32, 2025-12-04T08:54:38.0452982Z "digest": "sha256:4f4fb700ef54461cfa02571ae0db9a0dc1e0cdb5577484a6d75e68dc38e8acc1" 2025-12-04T08:54:38.0453544Z }, 2025-12-04T08:54:38.0453791Z { 2025-12-04T08:54:38.0454191Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0454682Z "size": 106, 2025-12-04T08:54:38.0455185Z "digest": "sha256:ff0c473ca120ebdcaa2ba10b3274e82032edd5196019e76d4e7584553704ae81" 2025-12-04T08:54:38.0455746Z }, 2025-12-04T08:54:38.0455993Z { 2025-12-04T08:54:38.0456394Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2025-12-04T08:54:38.0456892Z "size": 54145662, 2025-12-04T08:54:38.0457438Z "digest": "sha256:6bbc14b250efb3cdaad12c91573c6bb9129ad3e3432f0ed1a7eaebc9958d162f" 2025-12-04T08:54:38.0458001Z } 2025-12-04T08:54:38.0458297Z ] 2025-12-04T08:54:38.0458552Z } 2025-12-04T08:54:38.0497740Z ##[group]Run set -eux 2025-12-04T08:54:38.0498590Z set -eux 2025-12-04T08:54:38.0499121Z # It's ok if this steps fails, it would then be an anonymous user like what we used to have 2025-12-04T08:54:38.0500536Z aws secretsmanager get-secret-value --secret-id docker_hub_readonly_token | jq --raw-output '.SecretString' | jq -r .docker_hub_readonly_token | docker login --username pytorchbot --password-stdin || true 2025-12-04T08:54:38.0512369Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:38.0512859Z env: 2025-12-04T08:54:38.0513162Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:38.0513608Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:38.0514182Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:38.0514735Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:38.0516140Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:38.0517348Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:38.0517724Z AWS_REGION: us-east-1 2025-12-04T08:54:38.0518328Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:38.0518823Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:38.0525522Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:38.0525867Z ##[endgroup] 2025-12-04T08:54:38.0564931Z + aws secretsmanager get-secret-value --secret-id docker_hub_readonly_token 2025-12-04T08:54:38.0565539Z + jq --raw-output .SecretString 2025-12-04T08:54:38.0566207Z /home/runner/_work/_temp/61bec0a1-afbf-41f5-ad2d-408d5baad57f.sh: line 3: aws: command not found 2025-12-04T08:54:38.0566896Z + jq -r .docker_hub_readonly_token 2025-12-04T08:54:38.0567410Z + docker login --username pytorchbot --password-stdin 2025-12-04T08:54:38.0678792Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:38.0685717Z + true 2025-12-04T08:54:38.0812082Z ##[group]Run pytorch/test-infra/.github/actions/pull-docker-image@main 2025-12-04T08:54:38.0812654Z with: 2025-12-04T08:54:38.0813528Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:38.0814593Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:38.0815086Z env: 2025-12-04T08:54:38.0815388Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:38.0815835Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:38.0816412Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:38.0816951Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:38.0818309Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:38.0819803Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:38.0820179Z AWS_REGION: us-east-1 2025-12-04T08:54:38.0820703Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:38.0821211Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:38.0828004Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:38.0828419Z ##[endgroup] 2025-12-04T08:54:38.0847779Z ##[group]Run set -x 2025-12-04T08:54:38.0848232Z set -x 2025-12-04T08:54:38.0848531Z set +e 2025-12-04T08:54:38.0848826Z  2025-12-04T08:54:38.0849107Z login() { 2025-12-04T08:54:38.0849716Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2025-12-04T08:54:38.0850345Z } 2025-12-04T08:54:38.0850628Z  2025-12-04T08:54:38.0850923Z retry () { 2025-12-04T08:54:38.0851290Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2025-12-04T08:54:38.0851695Z } 2025-12-04T08:54:38.0851975Z  2025-12-04T08:54:38.0852295Z retry login "${DOCKER_REGISTRY}" 2025-12-04T08:54:38.0852681Z  2025-12-04T08:54:38.0853286Z IMAGE_SIZE=$(docker manifest inspect "${DOCKER_IMAGE}" | jq '[.layers[].size, .config.size] | add / 1024 / 1024') 2025-12-04T08:54:38.0854102Z echo "Compressed size of image in MB: ${IMAGE_SIZE}" 2025-12-04T08:54:38.0854571Z  2025-12-04T08:54:38.0854846Z set -e 2025-12-04T08:54:38.0855289Z # ignore output since only exit code is used for conditional 2025-12-04T08:54:38.0855902Z # only pull docker image if it's not available locally 2025-12-04T08:54:38.0856573Z if ! docker inspect --type=image "${DOCKER_IMAGE}" >/dev/null 2>/dev/null; then 2025-12-04T08:54:38.0857193Z  retry docker pull "${DOCKER_IMAGE}" 2025-12-04T08:54:38.0857602Z fi 2025-12-04T08:54:38.0867739Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T08:54:38.0868256Z env: 2025-12-04T08:54:38.0868554Z GIT_DEFAULT_BRANCH: main 2025-12-04T08:54:38.0869010Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T08:54:38.0869587Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T08:54:38.0870135Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T08:54:38.0871399Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T08:54:38.0872648Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T08:54:38.0873022Z AWS_REGION: us-east-1 2025-12-04T08:54:38.0873461Z AWS_ACCESS_KEY_ID: *** 2025-12-04T08:54:38.0873954Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T08:54:38.0880787Z AWS_SESSION_TOKEN: *** 2025-12-04T08:54:38.0881687Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:38.0882932Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:38.0883428Z ##[endgroup] 2025-12-04T08:54:38.0919142Z + set +e 2025-12-04T08:54:38.0919561Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:38.0920115Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:38.0923331Z + aws ecr get-login-password --region us-east-1 2025-12-04T08:54:38.0924075Z /home/runner/_work/_temp/85f5cc5c-1a95-4eee-9057-3311ffbfbc1c.sh: line 5: aws: command not found 2025-12-04T08:54:38.0924938Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:38.1026608Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:38.1038575Z + sleep 1 2025-12-04T08:54:39.1057679Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:39.1062658Z + aws ecr get-login-password --region us-east-1 2025-12-04T08:54:39.1063466Z /home/runner/_work/_temp/85f5cc5c-1a95-4eee-9057-3311ffbfbc1c.sh: line 5: aws: command not found 2025-12-04T08:54:39.1064350Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:39.1162842Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:39.1178329Z + sleep 2 2025-12-04T08:54:41.1196788Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:41.1203576Z + aws ecr get-login-password --region us-east-1 2025-12-04T08:54:41.1204467Z /home/runner/_work/_temp/85f5cc5c-1a95-4eee-9057-3311ffbfbc1c.sh: line 5: aws: command not found 2025-12-04T08:54:41.1205350Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T08:54:41.1304203Z Error: Cannot perform an interactive login from a non TTY device 2025-12-04T08:54:41.1321547Z ++ docker manifest inspect 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:41.1322688Z ++ jq '[.layers[].size, .config.size] | add / 1024 / 1024' 2025-12-04T08:54:42.6123953Z + IMAGE_SIZE=18171.470620155334 2025-12-04T08:54:42.6124540Z + echo 'Compressed size of image in MB: 18171.470620155334' 2025-12-04T08:54:42.6125022Z + set -e 2025-12-04T08:54:42.6125992Z + docker inspect --type=image 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:42.6127041Z Compressed size of image in MB: 18171.470620155334 2025-12-04T08:54:42.6262621Z + retry docker pull 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:42.6264125Z + docker pull 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T08:54:43.7561404Z pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a: Pulling from pytorch/ci-image 2025-12-04T08:54:43.7562382Z 63e5bc7682b8: Pulling fs layer 2025-12-04T08:54:43.7562899Z 835841cca3b7: Pulling fs layer 2025-12-04T08:54:43.7563270Z aac69780afc8: Pulling fs layer 2025-12-04T08:54:43.7563614Z 029495b23122: Pulling fs layer 2025-12-04T08:54:43.7563974Z d0fb85b00833: Pulling fs layer 2025-12-04T08:54:43.7564318Z 59b639308833: Pulling fs layer 2025-12-04T08:54:43.7564683Z dc112c89d57a: Pulling fs layer 2025-12-04T08:54:43.7565031Z 522eab2402e5: Pulling fs layer 2025-12-04T08:54:43.7565379Z 2b5a11b41761: Pulling fs layer 2025-12-04T08:54:43.7566192Z 029495b23122: Waiting 2025-12-04T08:54:43.7566712Z 9681563a88ff: Pulling fs layer 2025-12-04T08:54:43.7567188Z 59b639308833: Waiting 2025-12-04T08:54:43.7567653Z d0fb85b00833: Waiting 2025-12-04T08:54:43.7568343Z 73e33534e9eb: Pulling fs layer 2025-12-04T08:54:43.7568793Z 5bfdaeb5578d: Pulling fs layer 2025-12-04T08:54:43.7569308Z 522eab2402e5: Waiting 2025-12-04T08:54:43.7569735Z dc112c89d57a: Waiting 2025-12-04T08:54:43.7590664Z c07d27e4d3a5: Pulling fs layer 2025-12-04T08:54:43.7591131Z b21856d1bf42: Pulling fs layer 2025-12-04T08:54:43.7592172Z cb19d84867e4: Pulling fs layer 2025-12-04T08:54:43.7592540Z 8165374f8dcc: Pulling fs layer 2025-12-04T08:54:43.7592912Z 1aecc77354ce: Pulling fs layer 2025-12-04T08:54:43.7593280Z 465d3fd643aa: Pulling fs layer 2025-12-04T08:54:43.7593629Z 9681563a88ff: Waiting 2025-12-04T08:54:43.7593944Z 73e33534e9eb: Waiting 2025-12-04T08:54:43.7594270Z 6c503e779d6f: Pulling fs layer 2025-12-04T08:54:43.7594617Z 2b5a11b41761: Waiting 2025-12-04T08:54:43.7594931Z 5bfdaeb5578d: Waiting 2025-12-04T08:54:43.7595262Z f7e9a021f0ee: Pulling fs layer 2025-12-04T08:54:43.7595611Z c07d27e4d3a5: Waiting 2025-12-04T08:54:43.7595928Z 8e023b349080: Pulling fs layer 2025-12-04T08:54:43.7596273Z b21856d1bf42: Waiting 2025-12-04T08:54:43.7596596Z 8188df80e595: Pulling fs layer 2025-12-04T08:54:43.7597184Z 3c2c2f8c74bf: Pulling fs layer 2025-12-04T08:54:43.7597533Z cb19d84867e4: Waiting 2025-12-04T08:54:43.7597860Z 2aa7784fbe33: Pulling fs layer 2025-12-04T08:54:43.7598288Z 1aecc77354ce: Waiting 2025-12-04T08:54:43.7598620Z 2b3b5215d3eb: Pulling fs layer 2025-12-04T08:54:43.7598955Z 8165374f8dcc: Waiting 2025-12-04T08:54:43.7599258Z 3c2c2f8c74bf: Waiting 2025-12-04T08:54:43.7599567Z 8188df80e595: Waiting 2025-12-04T08:54:43.7599887Z 99b1f1ea3e85: Pulling fs layer 2025-12-04T08:54:43.7600234Z 2aa7784fbe33: Waiting 2025-12-04T08:54:43.7600544Z 2b3b5215d3eb: Waiting 2025-12-04T08:54:43.7600849Z f7e9a021f0ee: Waiting 2025-12-04T08:54:43.7601171Z 18d6daba0a57: Pulling fs layer 2025-12-04T08:54:43.7601521Z 6c503e779d6f: Waiting 2025-12-04T08:54:43.7601868Z 99b1f1ea3e85: Waiting 2025-12-04T08:54:43.7602192Z 5277f2a503eb: Pulling fs layer 2025-12-04T08:54:43.7602554Z 3198a9717aac: Pulling fs layer 2025-12-04T08:54:43.7602898Z 8e023b349080: Waiting 2025-12-04T08:54:43.7603201Z 18d6daba0a57: Waiting 2025-12-04T08:54:43.7603517Z 5277f2a503eb: Waiting 2025-12-04T08:54:43.7603821Z 465d3fd643aa: Waiting 2025-12-04T08:54:43.7604139Z 99a4918e5808: Pulling fs layer 2025-12-04T08:54:43.7604479Z 3198a9717aac: Waiting 2025-12-04T08:54:43.7604811Z 15bb11dfc6ac: Pulling fs layer 2025-12-04T08:54:43.7605171Z bd87c8766e90: Pulling fs layer 2025-12-04T08:54:43.7605527Z 1969e15d0c13: Pulling fs layer 2025-12-04T08:54:43.7605877Z 24a03847d382: Pulling fs layer 2025-12-04T08:54:43.7606262Z 816e2e34e018: Pulling fs layer 2025-12-04T08:54:43.7606594Z 99a4918e5808: Waiting 2025-12-04T08:54:43.7606905Z bd87c8766e90: Waiting 2025-12-04T08:54:43.7607216Z 15bb11dfc6ac: Waiting 2025-12-04T08:54:43.7607537Z b168858b8537: Pulling fs layer 2025-12-04T08:54:43.7607880Z 24a03847d382: Waiting 2025-12-04T08:54:43.7608319Z 6b8d5ff02e26: Pulling fs layer 2025-12-04T08:54:43.7608664Z 816e2e34e018: Waiting 2025-12-04T08:54:43.7608985Z 4e3b10a5dd6a: Pulling fs layer 2025-12-04T08:54:43.7609329Z 1969e15d0c13: Waiting 2025-12-04T08:54:43.7609642Z b168858b8537: Waiting 2025-12-04T08:54:43.7609948Z 4e3b10a5dd6a: Waiting 2025-12-04T08:54:43.7610270Z 3092fab73b59: Pulling fs layer 2025-12-04T08:54:43.7610623Z 6b8d5ff02e26: Waiting 2025-12-04T08:54:43.7610955Z 20020dd28a15: Pulling fs layer 2025-12-04T08:54:43.7611324Z ae5280ce969d: Pulling fs layer 2025-12-04T08:54:43.7611674Z 3092fab73b59: Waiting 2025-12-04T08:54:43.7611981Z 20020dd28a15: Waiting 2025-12-04T08:54:43.7612307Z 4f4fb700ef54: Pulling fs layer 2025-12-04T08:54:43.7612668Z fe17d9eb0fd2: Pulling fs layer 2025-12-04T08:54:43.7613039Z a51e0dab2d59: Pulling fs layer 2025-12-04T08:54:43.7613391Z ae5280ce969d: Waiting 2025-12-04T08:54:43.7613721Z 6eb176cefd72: Pulling fs layer 2025-12-04T08:54:43.7614071Z 4f4fb700ef54: Waiting 2025-12-04T08:54:43.7614402Z e7b8cf2e8d5a: Pulling fs layer 2025-12-04T08:54:43.7614754Z 6eb176cefd72: Waiting 2025-12-04T08:54:43.7615085Z ef3a5060abce: Pulling fs layer 2025-12-04T08:54:43.7615438Z e7b8cf2e8d5a: Waiting 2025-12-04T08:54:43.7615751Z ef3a5060abce: Waiting 2025-12-04T08:54:43.7616082Z a6f4ec14b42b: Pulling fs layer 2025-12-04T08:54:43.7616447Z 7e5a0c956cfb: Pulling fs layer 2025-12-04T08:54:43.7616799Z a6f4ec14b42b: Waiting 2025-12-04T08:54:43.7617125Z b4f78730cfe7: Pulling fs layer 2025-12-04T08:54:43.7617607Z 7e5a0c956cfb: Waiting 2025-12-04T08:54:43.7617940Z 081028f24389: Pulling fs layer 2025-12-04T08:54:43.7618365Z a534dcf4b9a9: Pulling fs layer 2025-12-04T08:54:43.7618732Z 2e77500302cc: Pulling fs layer 2025-12-04T08:54:43.7619078Z 081028f24389: Waiting 2025-12-04T08:54:43.7619388Z b4f78730cfe7: Waiting 2025-12-04T08:54:43.7619697Z a534dcf4b9a9: Waiting 2025-12-04T08:54:43.7620025Z bc08246bb4ba: Pulling fs layer 2025-12-04T08:54:43.7620398Z ff0c473ca120: Pulling fs layer 2025-12-04T08:54:43.7620751Z 2e77500302cc: Waiting 2025-12-04T08:54:43.7621077Z 6bbc14b250ef: Pulling fs layer 2025-12-04T08:54:43.7621427Z bc08246bb4ba: Waiting 2025-12-04T08:54:43.7621742Z ff0c473ca120: Waiting 2025-12-04T08:54:43.7622054Z 6bbc14b250ef: Waiting 2025-12-04T08:54:44.3798250Z 835841cca3b7: Verifying Checksum 2025-12-04T08:54:44.3798705Z 835841cca3b7: Download complete 2025-12-04T08:54:45.0315667Z 029495b23122: Verifying Checksum 2025-12-04T08:54:45.0316205Z 029495b23122: Download complete 2025-12-04T08:54:45.6376048Z 63e5bc7682b8: Verifying Checksum 2025-12-04T08:54:45.6376541Z 63e5bc7682b8: Download complete 2025-12-04T08:54:45.6646424Z d0fb85b00833: Verifying Checksum 2025-12-04T08:54:45.6646908Z d0fb85b00833: Download complete 2025-12-04T08:54:46.2241229Z 59b639308833: Verifying Checksum 2025-12-04T08:54:46.2241697Z 59b639308833: Download complete 2025-12-04T08:54:46.6043635Z 63e5bc7682b8: Pull complete 2025-12-04T08:54:46.6127495Z 835841cca3b7: Pull complete 2025-12-04T08:54:46.8455577Z 522eab2402e5: Download complete 2025-12-04T08:54:47.5324241Z 2b5a11b41761: Verifying Checksum 2025-12-04T08:54:47.5324770Z 2b5a11b41761: Download complete 2025-12-04T08:54:48.1812195Z 9681563a88ff: Download complete 2025-12-04T08:54:49.3775929Z dc112c89d57a: Verifying Checksum 2025-12-04T08:54:49.3776503Z dc112c89d57a: Download complete 2025-12-04T08:54:50.0302282Z 5bfdaeb5578d: Verifying Checksum 2025-12-04T08:54:50.0302800Z 5bfdaeb5578d: Download complete 2025-12-04T08:54:51.0415097Z c07d27e4d3a5: Verifying Checksum 2025-12-04T08:54:51.0415629Z c07d27e4d3a5: Download complete 2025-12-04T08:54:51.7228442Z b21856d1bf42: Verifying Checksum 2025-12-04T08:54:51.7228965Z b21856d1bf42: Download complete 2025-12-04T08:54:52.3831023Z cb19d84867e4: Download complete 2025-12-04T08:54:53.0651393Z 8165374f8dcc: Download complete 2025-12-04T08:54:54.2096827Z aac69780afc8: Verifying Checksum 2025-12-04T08:54:54.2097350Z aac69780afc8: Download complete 2025-12-04T08:54:54.9156920Z 465d3fd643aa: Verifying Checksum 2025-12-04T08:54:54.9157465Z 465d3fd643aa: Download complete 2025-12-04T08:54:55.5380191Z 6c503e779d6f: Verifying Checksum 2025-12-04T08:54:55.5380665Z 6c503e779d6f: Download complete 2025-12-04T08:54:56.1619020Z f7e9a021f0ee: Verifying Checksum 2025-12-04T08:54:56.1619480Z f7e9a021f0ee: Download complete 2025-12-04T08:54:56.8248423Z 8e023b349080: Download complete 2025-12-04T08:55:01.6636261Z aac69780afc8: Pull complete 2025-12-04T08:55:01.6694433Z 029495b23122: Pull complete 2025-12-04T08:55:01.6801917Z d0fb85b00833: Pull complete 2025-12-04T08:55:01.6873931Z 59b639308833: Pull complete 2025-12-04T08:55:03.6467950Z dc112c89d57a: Pull complete 2025-12-04T08:55:03.6549452Z 522eab2402e5: Pull complete 2025-12-04T08:55:03.6599443Z 2b5a11b41761: Pull complete 2025-12-04T08:55:03.6657918Z 9681563a88ff: Pull complete 2025-12-04T08:55:05.1187103Z 1aecc77354ce: Verifying Checksum 2025-12-04T08:55:05.1187632Z 1aecc77354ce: Download complete 2025-12-04T08:55:05.7498043Z 3c2c2f8c74bf: Download complete 2025-12-04T08:55:06.3695341Z 2aa7784fbe33: Verifying Checksum 2025-12-04T08:55:06.3695865Z 2aa7784fbe33: Download complete 2025-12-04T08:55:06.9732161Z 2b3b5215d3eb: Verifying Checksum 2025-12-04T08:55:26.6507068Z 99b1f1ea3e85: Verifying Checksum 2025-12-04T08:55:26.6507638Z 99b1f1ea3e85: Download complete 2025-12-04T08:55:27.2462556Z 18d6daba0a57: Download complete 2025-12-04T08:55:27.8192308Z 5277f2a503eb: Download complete 2025-12-04T08:55:28.4104145Z 3198a9717aac: Download complete 2025-12-04T08:55:29.0009520Z 99a4918e5808: Download complete 2025-12-04T08:55:30.1699948Z 15bb11dfc6ac: Verifying Checksum 2025-12-04T08:55:30.1701065Z 15bb11dfc6ac: Download complete 2025-12-04T08:55:30.7547531Z bd87c8766e90: Verifying Checksum 2025-12-04T08:55:30.7547983Z bd87c8766e90: Download complete 2025-12-04T08:55:31.3335395Z 1969e15d0c13: Verifying Checksum 2025-12-04T08:55:31.3335888Z 1969e15d0c13: Download complete 2025-12-04T08:55:32.8881611Z 24a03847d382: Verifying Checksum 2025-12-04T08:55:32.8882313Z 24a03847d382: Download complete 2025-12-04T08:55:33.4832033Z 816e2e34e018: Verifying Checksum 2025-12-04T08:55:33.4832608Z 816e2e34e018: Download complete 2025-12-04T08:55:34.0488029Z b168858b8537: Download complete 2025-12-04T08:55:35.4734177Z 6b8d5ff02e26: Verifying Checksum 2025-12-04T08:55:35.4734756Z 6b8d5ff02e26: Download complete 2025-12-04T08:55:36.0644084Z 4e3b10a5dd6a: Verifying Checksum 2025-12-04T08:55:36.0644551Z 4e3b10a5dd6a: Download complete 2025-12-04T08:55:36.6507263Z 3092fab73b59: Download complete 2025-12-04T08:55:37.2288055Z 20020dd28a15: Download complete 2025-12-04T08:55:37.8004567Z ae5280ce969d: Download complete 2025-12-04T08:55:38.1111034Z 4f4fb700ef54: Verifying Checksum 2025-12-04T08:55:38.1111574Z 4f4fb700ef54: Download complete 2025-12-04T08:55:38.6794261Z fe17d9eb0fd2: Verifying Checksum 2025-12-04T08:55:38.6794714Z fe17d9eb0fd2: Download complete 2025-12-04T08:55:39.2563106Z a51e0dab2d59: Verifying Checksum 2025-12-04T08:55:39.2563617Z a51e0dab2d59: Download complete 2025-12-04T08:55:39.9943486Z 6eb176cefd72: Verifying Checksum 2025-12-04T08:55:39.9943968Z 6eb176cefd72: Download complete 2025-12-04T08:55:40.5745457Z e7b8cf2e8d5a: Verifying Checksum 2025-12-04T08:55:40.5745921Z e7b8cf2e8d5a: Download complete 2025-12-04T08:55:41.1861879Z ef3a5060abce: Verifying Checksum 2025-12-04T08:55:41.1862364Z ef3a5060abce: Download complete 2025-12-04T08:55:41.7648536Z a6f4ec14b42b: Verifying Checksum 2025-12-04T08:55:41.7648980Z a6f4ec14b42b: Download complete 2025-12-04T08:55:42.3253146Z 7e5a0c956cfb: Verifying Checksum 2025-12-04T08:55:42.3253623Z 7e5a0c956cfb: Download complete 2025-12-04T09:03:01.1180351Z 73e33534e9eb: Verifying Checksum 2025-12-04T09:03:01.1180821Z 73e33534e9eb: Download complete 2025-12-04T09:03:01.8132254Z 081028f24389: Verifying Checksum 2025-12-04T09:03:01.8132721Z 081028f24389: Download complete 2025-12-04T09:03:02.3912025Z a534dcf4b9a9: Download complete 2025-12-04T09:03:07.9365286Z 2e77500302cc: Verifying Checksum 2025-12-04T09:03:07.9365804Z 2e77500302cc: Download complete 2025-12-04T09:03:08.5293933Z bc08246bb4ba: Verifying Checksum 2025-12-04T09:03:08.5294451Z bc08246bb4ba: Download complete 2025-12-04T09:03:09.1090775Z ff0c473ca120: Download complete 2025-12-04T09:03:11.4031422Z 6bbc14b250ef: Verifying Checksum 2025-12-04T09:03:11.4031910Z 6bbc14b250ef: Download complete 2025-12-04T09:03:42.9312727Z 73e33534e9eb: Pull complete 2025-12-04T09:03:42.9401517Z 5bfdaeb5578d: Pull complete 2025-12-04T09:03:42.9634117Z c07d27e4d3a5: Pull complete 2025-12-04T09:03:42.9688720Z b21856d1bf42: Pull complete 2025-12-04T09:03:42.9740602Z cb19d84867e4: Pull complete 2025-12-04T09:03:42.9802847Z 8165374f8dcc: Pull complete 2025-12-04T09:03:49.8950518Z 1aecc77354ce: Pull complete 2025-12-04T09:03:49.9001192Z 465d3fd643aa: Pull complete 2025-12-04T09:03:49.9093257Z 6c503e779d6f: Pull complete 2025-12-04T09:03:49.9210057Z f7e9a021f0ee: Pull complete 2025-12-04T09:03:49.9265524Z 8e023b349080: Pull complete 2025-12-04T09:21:04.9384872Z 8188df80e595: Verifying Checksum 2025-12-04T09:21:04.9385434Z 8188df80e595: Download complete 2025-12-04T09:22:38.3863445Z 8188df80e595: Pull complete 2025-12-04T09:22:38.3917831Z 3c2c2f8c74bf: Pull complete 2025-12-04T09:22:38.3977035Z 2aa7784fbe33: Pull complete 2025-12-04T09:22:38.4038491Z 2b3b5215d3eb: Pull complete 2025-12-04T09:22:49.2532046Z 99b1f1ea3e85: Pull complete 2025-12-04T09:22:49.2578373Z 18d6daba0a57: Pull complete 2025-12-04T09:22:49.2628975Z 5277f2a503eb: Pull complete 2025-12-04T09:22:49.2679050Z 3198a9717aac: Pull complete 2025-12-04T09:22:49.2759104Z 99a4918e5808: Pull complete 2025-12-04T09:22:49.3164086Z 15bb11dfc6ac: Pull complete 2025-12-04T09:22:49.3219301Z bd87c8766e90: Pull complete 2025-12-04T09:22:49.3254527Z 1969e15d0c13: Pull complete 2025-12-04T09:22:49.6630977Z 24a03847d382: Pull complete 2025-12-04T09:22:49.6681431Z 816e2e34e018: Pull complete 2025-12-04T09:22:49.6758461Z b168858b8537: Pull complete 2025-12-04T09:22:49.8325209Z 6b8d5ff02e26: Pull complete 2025-12-04T09:22:49.8381416Z 4e3b10a5dd6a: Pull complete 2025-12-04T09:22:49.8518271Z 3092fab73b59: Pull complete 2025-12-04T09:22:49.8578394Z 20020dd28a15: Pull complete 2025-12-04T09:22:49.8633699Z ae5280ce969d: Pull complete 2025-12-04T09:22:49.8679248Z 4f4fb700ef54: Pull complete 2025-12-04T09:22:49.8743694Z fe17d9eb0fd2: Pull complete 2025-12-04T09:22:49.8797568Z a51e0dab2d59: Pull complete 2025-12-04T09:22:49.8859913Z 6eb176cefd72: Pull complete 2025-12-04T09:22:49.8902208Z e7b8cf2e8d5a: Pull complete 2025-12-04T09:22:49.8936640Z ef3a5060abce: Pull complete 2025-12-04T09:22:49.9066168Z a6f4ec14b42b: Pull complete 2025-12-04T09:22:49.9115372Z 7e5a0c956cfb: Pull complete 2025-12-04T09:26:22.4102359Z b4f78730cfe7: Verifying Checksum 2025-12-04T09:26:22.4102921Z b4f78730cfe7: Download complete 2025-12-04T09:27:32.3747267Z b4f78730cfe7: Pull complete 2025-12-04T09:27:32.3821487Z 081028f24389: Pull complete 2025-12-04T09:27:32.3887880Z a534dcf4b9a9: Pull complete 2025-12-04T09:27:36.9774227Z 2e77500302cc: Pull complete 2025-12-04T09:27:36.9846359Z bc08246bb4ba: Pull complete 2025-12-04T09:27:36.9995017Z ff0c473ca120: Pull complete 2025-12-04T09:27:38.2035930Z 6bbc14b250ef: Pull complete 2025-12-04T09:27:38.2068597Z Digest: sha256:5e190224966743059cf8506170eaec525eada34e38cf646e02d1dbeadfe5a366 2025-12-04T09:27:38.2077575Z Status: Downloaded newer image for 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:27:38.2089350Z 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:27:38.2167956Z Prepare all required actions 2025-12-04T09:27:38.2182496Z ##[group]Run ./.github/actions/get-workflow-job-id 2025-12-04T09:27:38.2182641Z with: 2025-12-04T09:27:38.2182882Z github-token: *** 2025-12-04T09:27:38.2182983Z env: 2025-12-04T09:27:38.2183077Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:27:38.2183216Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:27:38.2183392Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:27:38.2183556Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:27:38.2183939Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:27:38.2184330Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:27:38.2184446Z AWS_REGION: us-east-1 2025-12-04T09:27:38.2184568Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:27:38.2184725Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:27:38.2186772Z AWS_SESSION_TOKEN: *** 2025-12-04T09:27:38.2186876Z ##[endgroup] 2025-12-04T09:27:38.2193431Z ##[group]Run set -eux 2025-12-04T09:27:38.2193543Z set -eux 2025-12-04T09:27:38.2193714Z python3 .github/scripts/get_workflow_job_id.py "${GITHUB_RUN_ID}" "${RUNNER_NAME}" 2025-12-04T09:27:38.2198047Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:27:38.2198247Z env: 2025-12-04T09:27:38.2198342Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:27:38.2198475Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:27:38.2198649Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:27:38.2198812Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:27:38.2199200Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:27:38.2199568Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:27:38.2199684Z AWS_REGION: us-east-1 2025-12-04T09:27:38.2199853Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:27:38.2200006Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:27:38.2202043Z AWS_SESSION_TOKEN: *** 2025-12-04T09:27:38.2202193Z GITHUB_TOKEN: *** 2025-12-04T09:27:38.2202291Z ##[endgroup] 2025-12-04T09:27:38.2219251Z + python3 .github/scripts/get_workflow_job_id.py 19922849170 linux.rocm.gpu.gfx942.1.b-gwk9b-runner-ppkbp 2025-12-04T09:27:39.2665241Z Setting output job-id=57116213162 2025-12-04T09:27:39.2666133Z Setting output job-name=linux-jammy-rocm-py3.10 / test (default, 4, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:27:39.2865772Z Prepare all required actions 2025-12-04T09:27:39.2866306Z Getting action download info 2025-12-04T09:27:39.6755393Z Download action repository 'seemethere/download-artifact-s3@v4' (SHA:1da556a7aa0a088e3153970611f6c432d58e80e6) 2025-12-04T09:27:40.8037454Z Download action repository 'actions/download-artifact@v4' (SHA:d3f86a106a0bac45b974a628896c90dbdf5c8093) 2025-12-04T09:27:41.8447533Z ##[group]Run ./.github/actions/download-build-artifacts 2025-12-04T09:27:41.8448008Z with: 2025-12-04T09:27:41.8448716Z name: linux-jammy-rocm-py3.10 2025-12-04T09:27:41.8449104Z s3-bucket: gha-artifacts 2025-12-04T09:27:41.8449432Z env: 2025-12-04T09:27:41.8449719Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:27:41.8450143Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:27:41.8450705Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:27:41.8451229Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:27:41.8452524Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:27:41.8453775Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:27:41.8454141Z AWS_REGION: us-east-1 2025-12-04T09:27:41.8454585Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:27:41.8455068Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:27:41.8461873Z AWS_SESSION_TOKEN: *** 2025-12-04T09:27:41.8462202Z ##[endgroup] 2025-12-04T09:27:41.8541129Z ##[group]Run seemethere/download-artifact-s3@v4 2025-12-04T09:27:41.8541562Z with: 2025-12-04T09:27:41.8541879Z name: linux-jammy-rocm-py3.10 2025-12-04T09:27:41.8542264Z s3-bucket: gha-artifacts 2025-12-04T09:27:41.8542603Z region: us-east-1 2025-12-04T09:27:41.8542899Z env: 2025-12-04T09:27:41.8543189Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:27:41.8543617Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:27:41.8544200Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:27:41.8544749Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:27:41.8546024Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:27:41.8547248Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:27:41.8547618Z AWS_REGION: us-east-1 2025-12-04T09:27:41.8548055Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:27:41.8548602Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:27:41.8555334Z AWS_SESSION_TOKEN: *** 2025-12-04T09:27:41.8555668Z ##[endgroup] 2025-12-04T09:27:42.0801985Z (node:17217) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2025-12-04T09:27:42.0802609Z 2025-12-04T09:27:42.0802879Z Please migrate your code to use AWS SDK for JavaScript (v3). 2025-12-04T09:27:42.0803592Z For more information, check the migration guide at https://a.co/7PzMCcy 2025-12-04T09:27:42.0804293Z (Use `node --trace-warnings ...` to show where the warning was created) 2025-12-04T09:27:42.3534246Z Found 1 objects with prefix pytorch/pytorch/19922849170/linux-jammy-rocm-py3.10/ 2025-12-04T09:27:42.3535072Z Starting download (1/1): /home/runner/_work/pytorch/pytorch/artifacts.zip 2025-12-04T09:30:54.7332365Z Finished download (1/1): /home/runner/_work/pytorch/pytorch/artifacts.zip 2025-12-04T09:30:54.7345850Z Artifact download has finished successfully 2025-12-04T09:30:54.7670350Z ##[group]Run unzip -o artifacts.zip 2025-12-04T09:30:54.7670789Z unzip -o artifacts.zip 2025-12-04T09:30:54.7680879Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:30:54.7681358Z env: 2025-12-04T09:30:54.7681656Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:30:54.7682092Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:30:54.7682954Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:30:54.7683681Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:30:54.7684933Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:30:54.7686577Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:30:54.7686964Z AWS_REGION: us-east-1 2025-12-04T09:30:54.7687442Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:30:54.7687939Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:30:54.7694728Z AWS_SESSION_TOKEN: *** 2025-12-04T09:30:54.7695078Z ##[endgroup] 2025-12-04T09:30:54.7750761Z Archive: artifacts.zip 2025-12-04T09:30:54.7752662Z creating: dist/ 2025-12-04T09:30:54.7840156Z inflating: dist/.ninja_log 2025-12-04T09:30:57.7466190Z inflating: dist/torch-2.10.0a0+gitffd9b0f-cp310-cp310-linux_x86_64.whl 2025-12-04T09:30:57.7468556Z creating: build/ 2025-12-04T09:30:57.7469076Z creating: build/custom_test_artifacts/ 2025-12-04T09:30:57.7469666Z creating: build/custom_test_artifacts/custom-op-build/ 2025-12-04T09:30:57.7470289Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/ 2025-12-04T09:30:57.7471020Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/pkgRedirects/ 2025-12-04T09:30:57.7471848Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeConfigureLog.yaml 2025-12-04T09:30:57.7472652Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/ 2025-12-04T09:30:57.7473459Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeSystem.cmake 2025-12-04T09:30:57.7474307Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/ 2025-12-04T09:30:57.7475164Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/tmp/ 2025-12-04T09:30:57.7476120Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/CMakeCCompilerId.c 2025-12-04T09:30:57.7477086Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdC/a.out 2025-12-04T09:30:57.7477988Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeCCompiler.cmake 2025-12-04T09:30:57.7478917Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/ 2025-12-04T09:30:57.7479756Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/tmp/ 2025-12-04T09:30:57.7480736Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-12-04T09:30:57.7481735Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CompilerIdCXX/a.out 2025-12-04T09:30:57.7482648Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeCXXCompiler.cmake 2025-12-04T09:30:57.7483637Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_C.bin 2025-12-04T09:30:57.7484696Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CXX.bin 2025-12-04T09:30:57.7485601Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeScratch/ 2025-12-04T09:30:57.7486349Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeTmp/ 2025-12-04T09:30:57.7487115Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/cmake.check_cache 2025-12-04T09:30:57.7487909Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/ 2025-12-04T09:30:57.7488832Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.ts 2025-12-04T09:30:57.7489813Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.make 2025-12-04T09:30:57.7491467Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/depend.make 2025-12-04T09:30:57.7492560Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/link.txt 2025-12-04T09:30:57.7493476Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/cmake_clean.cmake 2025-12-04T09:30:57.7494401Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/build.make 2025-12-04T09:30:57.7495322Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/DependInfo.cmake 2025-12-04T09:30:57.7496244Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/flags.make 2025-12-04T09:30:57.7497152Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/progress.make 2025-12-04T09:30:57.7498099Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o.d 2025-12-04T09:30:57.7598022Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o 2025-12-04T09:30:57.7599038Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/ 2025-12-04T09:30:57.7599998Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.ts 2025-12-04T09:30:57.7601044Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.make 2025-12-04T09:30:57.7602049Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/depend.make 2025-12-04T09:30:57.7602981Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/link.txt 2025-12-04T09:30:57.7603946Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/cmake_clean.cmake 2025-12-04T09:30:57.7604917Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/build.make 2025-12-04T09:30:57.7605893Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/DependInfo.cmake 2025-12-04T09:30:57.7606866Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/flags.make 2025-12-04T09:30:57.7607813Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/progress.make 2025-12-04T09:30:57.7611678Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o.d 2025-12-04T09:30:57.7655921Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o 2025-12-04T09:30:57.7657016Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-12-04T09:30:57.7657971Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/TargetDirectories.txt 2025-12-04T09:30:57.7658894Z extracting: build/custom_test_artifacts/custom-op-build/CMakeFiles/progress.marks 2025-12-04T09:30:57.7659697Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile2 2025-12-04T09:30:57.7660469Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile.cmake 2025-12-04T09:30:57.7661274Z inflating: build/custom_test_artifacts/custom-op-build/hipblaslt_test_outer_vec.cc 2025-12-04T09:30:57.7662064Z inflating: build/custom_test_artifacts/custom-op-build/hipblaslt_test_vec_ext.cc 2025-12-04T09:30:57.7662803Z inflating: build/custom_test_artifacts/custom-op-build/CMakeCache.txt 2025-12-04T09:30:57.7663474Z inflating: build/custom_test_artifacts/custom-op-build/Makefile 2025-12-04T09:30:57.7664165Z inflating: build/custom_test_artifacts/custom-op-build/cmake_install.cmake 2025-12-04T09:30:57.7750826Z inflating: build/custom_test_artifacts/custom-op-build/libcustom_ops.so 2025-12-04T09:30:57.7780633Z inflating: build/custom_test_artifacts/custom-op-build/test_custom_ops 2025-12-04T09:30:57.7781321Z creating: build/custom_test_artifacts/jit-hook-build/ 2025-12-04T09:30:57.7782082Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/ 2025-12-04T09:30:57.7782881Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/pkgRedirects/ 2025-12-04T09:30:57.7783711Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeConfigureLog.yaml 2025-12-04T09:30:57.7784488Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/ 2025-12-04T09:30:57.7785264Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeSystem.cmake 2025-12-04T09:30:57.7786096Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/ 2025-12-04T09:30:57.7786906Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/tmp/ 2025-12-04T09:30:57.7787843Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/CMakeCCompilerId.c 2025-12-04T09:30:57.7788856Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdC/a.out 2025-12-04T09:30:57.7789750Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeCCompiler.cmake 2025-12-04T09:30:57.7790604Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/ 2025-12-04T09:30:57.7791429Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/tmp/ 2025-12-04T09:30:57.7792391Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-12-04T09:30:57.7793397Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CompilerIdCXX/a.out 2025-12-04T09:30:57.7794298Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeCXXCompiler.cmake 2025-12-04T09:30:57.7795280Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_C.bin 2025-12-04T09:30:57.7796325Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CXX.bin 2025-12-04T09:30:57.7797238Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeScratch/ 2025-12-04T09:30:57.7797959Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeTmp/ 2025-12-04T09:30:57.7798782Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/cmake.check_cache 2025-12-04T09:30:57.7799585Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/ 2025-12-04T09:30:57.7800472Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.ts 2025-12-04T09:30:57.7801487Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.make 2025-12-04T09:30:57.7802456Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/depend.make 2025-12-04T09:30:57.7803359Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/link.txt 2025-12-04T09:30:57.7804306Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/cmake_clean.cmake 2025-12-04T09:30:57.7805255Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/build.make 2025-12-04T09:30:57.7806340Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/DependInfo.cmake 2025-12-04T09:30:57.7807269Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/flags.make 2025-12-04T09:30:57.7808214Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/progress.make 2025-12-04T09:30:57.7809215Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o.d 2025-12-04T09:30:57.7838807Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o 2025-12-04T09:30:57.7839815Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-12-04T09:30:57.7840855Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/TargetDirectories.txt 2025-12-04T09:30:57.7841762Z extracting: build/custom_test_artifacts/jit-hook-build/CMakeFiles/progress.marks 2025-12-04T09:30:57.7842533Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile2 2025-12-04T09:30:57.7843290Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile.cmake 2025-12-04T09:30:57.7844071Z inflating: build/custom_test_artifacts/jit-hook-build/hipblaslt_test_outer_vec.cc 2025-12-04T09:30:57.7844847Z inflating: build/custom_test_artifacts/jit-hook-build/hipblaslt_test_vec_ext.cc 2025-12-04T09:30:57.7845580Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeCache.txt 2025-12-04T09:30:57.7846239Z inflating: build/custom_test_artifacts/jit-hook-build/Makefile 2025-12-04T09:30:57.7846904Z inflating: build/custom_test_artifacts/jit-hook-build/cmake_install.cmake 2025-12-04T09:30:57.7863251Z inflating: build/custom_test_artifacts/jit-hook-build/test_jit_hooks 2025-12-04T09:30:57.7863902Z creating: build/custom_test_artifacts/custom-backend-build/ 2025-12-04T09:30:57.7864547Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/ 2025-12-04T09:30:57.7865281Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/pkgRedirects/ 2025-12-04T09:30:57.7866191Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeConfigureLog.yaml 2025-12-04T09:30:57.7867038Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/ 2025-12-04T09:30:57.7867872Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeSystem.cmake 2025-12-04T09:30:57.7868814Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/ 2025-12-04T09:30:57.7869832Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/tmp/ 2025-12-04T09:30:57.7870852Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/CMakeCCompilerId.c 2025-12-04T09:30:57.7871866Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdC/a.out 2025-12-04T09:30:57.7872807Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeCCompiler.cmake 2025-12-04T09:30:57.7873732Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/ 2025-12-04T09:30:57.7874626Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/tmp/ 2025-12-04T09:30:57.7875650Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/CMakeCXXCompilerId.cpp 2025-12-04T09:30:57.7876714Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CompilerIdCXX/a.out 2025-12-04T09:30:57.7877680Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeCXXCompiler.cmake 2025-12-04T09:30:57.7878795Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_C.bin 2025-12-04T09:30:57.7879906Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.31.6/CMakeDetermineCompilerABI_CXX.bin 2025-12-04T09:30:57.7880859Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeScratch/ 2025-12-04T09:30:57.7881647Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeTmp/ 2025-12-04T09:30:57.7882459Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/cmake.check_cache 2025-12-04T09:30:57.7883320Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/ 2025-12-04T09:30:57.7884281Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.ts 2025-12-04T09:30:57.7885469Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.make 2025-12-04T09:30:57.7886629Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/depend.make 2025-12-04T09:30:57.7887605Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/link.txt 2025-12-04T09:30:57.7888657Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/cmake_clean.cmake 2025-12-04T09:30:57.7889669Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/build.make 2025-12-04T09:30:57.7890689Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/DependInfo.cmake 2025-12-04T09:30:57.7891706Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/flags.make 2025-12-04T09:30:57.7892699Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/progress.make 2025-12-04T09:30:57.7893774Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o.d 2025-12-04T09:30:57.7943108Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o 2025-12-04T09:30:57.7944194Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/ 2025-12-04T09:30:57.7945233Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.ts 2025-12-04T09:30:57.7946375Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.make 2025-12-04T09:30:57.7947471Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/depend.make 2025-12-04T09:30:57.7948566Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/link.txt 2025-12-04T09:30:57.7949637Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/cmake_clean.cmake 2025-12-04T09:30:57.7950719Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/build.make 2025-12-04T09:30:57.7951801Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/DependInfo.cmake 2025-12-04T09:30:57.7952883Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/flags.make 2025-12-04T09:30:57.7953935Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/progress.make 2025-12-04T09:30:57.7956559Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o.d 2025-12-04T09:30:57.7986475Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o 2025-12-04T09:30:57.7987684Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeDirectoryInformation.cmake 2025-12-04T09:30:57.7988750Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/TargetDirectories.txt 2025-12-04T09:30:57.7989650Z extracting: build/custom_test_artifacts/custom-backend-build/CMakeFiles/progress.marks 2025-12-04T09:30:57.7990484Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile2 2025-12-04T09:30:57.7991299Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile.cmake 2025-12-04T09:30:57.7992153Z inflating: build/custom_test_artifacts/custom-backend-build/hipblaslt_test_outer_vec.cc 2025-12-04T09:30:57.7992993Z inflating: build/custom_test_artifacts/custom-backend-build/hipblaslt_test_vec_ext.cc 2025-12-04T09:30:57.7993785Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeCache.txt 2025-12-04T09:30:57.7994499Z inflating: build/custom_test_artifacts/custom-backend-build/Makefile 2025-12-04T09:30:57.7995372Z inflating: build/custom_test_artifacts/custom-backend-build/cmake_install.cmake 2025-12-04T09:30:57.8045288Z inflating: build/custom_test_artifacts/custom-backend-build/libcustom_backend.so 2025-12-04T09:30:57.8066080Z inflating: build/custom_test_artifacts/custom-backend-build/test_custom_backend 2025-12-04T09:30:57.8066729Z creating: build/lib/ 2025-12-04T09:30:57.8112114Z inflating: build/lib/libprotobuf-lite.a 2025-12-04T09:30:57.8357123Z inflating: build/lib/libprotobuf.a 2025-12-04T09:30:57.8631393Z inflating: build/lib/libprotoc.a 2025-12-04T09:30:57.8636610Z inflating: build/lib/libpthreadpool.a 2025-12-04T09:30:57.8640651Z inflating: build/lib/libcpuinfo.a 2025-12-04T09:30:57.8644750Z inflating: build/lib/libcpuinfo_internals.a 2025-12-04T09:30:57.8645304Z inflating: build/lib/libclog.a 2025-12-04T09:30:57.8655879Z inflating: build/lib/libpytorch_qnnpack.a 2025-12-04T09:30:57.8656648Z inflating: build/lib/libnnpack_reference_layers.a 2025-12-04T09:30:57.8666610Z inflating: build/lib/libnnpack.a 2025-12-04T09:30:57.8768512Z inflating: build/lib/libmicrokernels-prod.a 2025-12-04T09:30:57.9238177Z inflating: build/lib/libmicrokernels-all.a 2025-12-04T09:30:57.9276207Z inflating: build/lib/libgtest.a 2025-12-04T09:30:57.9285557Z inflating: build/lib/libgmock.a 2025-12-04T09:30:57.9286047Z inflating: build/lib/libgtest_main.a 2025-12-04T09:30:57.9286476Z inflating: build/lib/libgmock_main.a 2025-12-04T09:30:57.9335755Z inflating: build/lib/libXNNPACK.a 2025-12-04T09:30:57.9377527Z inflating: build/lib/libbenchmark.a 2025-12-04T09:30:57.9378054Z inflating: build/lib/libbenchmark_main.a 2025-12-04T09:30:57.9378587Z inflating: build/lib/libjitprofiling.a 2025-12-04T09:30:57.9382317Z inflating: build/lib/libittnotify.a 2025-12-04T09:30:57.9418685Z inflating: build/lib/libasmjit.a 2025-12-04T09:30:58.0046685Z inflating: build/lib/libfbgemm.a 2025-12-04T09:30:58.0063459Z inflating: build/lib/libtensorpipe_uv.a 2025-12-04T09:30:58.0359573Z inflating: build/lib/libtensorpipe.a 2025-12-04T09:30:58.0425672Z inflating: build/lib/libgloo.a 2025-12-04T09:30:58.0451306Z inflating: build/lib/libonnx_proto.a 2025-12-04T09:30:58.0673864Z inflating: build/lib/libgloo_hip.a 2025-12-04T09:30:58.1071060Z inflating: build/lib/libonnx.a 2025-12-04T09:30:58.6619430Z inflating: build/lib/libdnnl.a 2025-12-04T09:30:58.6630130Z inflating: build/lib/libfmt.a 2025-12-04T09:30:58.6799765Z inflating: build/lib/libkineto.a 2025-12-04T09:30:58.6864376Z inflating: build/lib/libc10.so 2025-12-04T09:30:58.6864903Z inflating: build/lib/libtorch_global_deps.so 2025-12-04T09:30:58.6865747Z inflating: build/lib/libcaffe2_nvrtc.so 2025-12-04T09:30:58.6891067Z inflating: build/lib/libc10_hip.so 2025-12-04T09:30:58.7164973Z inflating: build/lib/libfbgemm_genai.a 2025-12-04T09:31:00.4202881Z inflating: build/lib/libtorch_cpu.so 2025-12-04T09:31:00.4204763Z inflating: build/lib/libshm.so 2025-12-04T09:31:01.2591443Z inflating: build/lib/libtorch_hip.so 2025-12-04T09:31:01.2592044Z inflating: build/lib/libtorch.so 2025-12-04T09:31:01.2602191Z inflating: build/lib/libjitbackend_test.so 2025-12-04T09:31:01.2615689Z inflating: build/lib/libbackend_with_compiler.so 2025-12-04T09:31:01.2654970Z inflating: build/lib/libtorchbind_test.so 2025-12-04T09:31:01.2669965Z inflating: build/lib/libaoti_custom_ops.so 2025-12-04T09:31:01.3965731Z inflating: build/lib/libtorch_python.so 2025-12-04T09:31:01.3984990Z inflating: build/lib/libnnapi_backend.so 2025-12-04T09:31:01.3985443Z creating: build/bin/ 2025-12-04T09:31:01.3985817Z creating: build/bin/CMakeFiles/ 2025-12-04T09:31:01.3986261Z inflating: build/bin/cmake_install.cmake 2025-12-04T09:31:01.3986716Z inflating: build/bin/CTestTestfile.cmake 2025-12-04T09:31:01.4263391Z inflating: build/bin/protoc-3.13.0.0 2025-12-04T09:31:01.4529569Z inflating: build/bin/protoc 2025-12-04T09:31:01.4563676Z inflating: build/bin/c10_AllocatorConfig_test 2025-12-04T09:31:01.4596339Z inflating: build/bin/c10_CompileTimeFunctionPointer_test 2025-12-04T09:31:01.4630216Z inflating: build/bin/c10_DeviceGuard_test 2025-12-04T09:31:01.4664434Z inflating: build/bin/c10_Device_test 2025-12-04T09:31:01.4703323Z inflating: build/bin/c10_DispatchKeySet_test 2025-12-04T09:31:01.4738539Z inflating: build/bin/c10_Scalar_test 2025-12-04T09:31:01.4771117Z inflating: build/bin/c10_StreamGuard_test 2025-12-04T09:31:01.4808162Z inflating: build/bin/c10_SymInt_test 2025-12-04T09:31:01.4844871Z inflating: build/bin/c10_SizesAndStrides_test 2025-12-04T09:31:01.4879584Z inflating: build/bin/c10_Bitset_test 2025-12-04T09:31:01.4924625Z inflating: build/bin/c10_cow_test 2025-12-04T09:31:01.4960265Z inflating: build/bin/c10_InlineDeviceGuard_test 2025-12-04T09:31:01.4996807Z inflating: build/bin/c10_InlineStreamGuard_test 2025-12-04T09:31:01.5029515Z inflating: build/bin/c10_ArrayRef_test 2025-12-04T09:31:01.5062113Z inflating: build/bin/c10_ConstexprCrc_test 2025-12-04T09:31:01.5094804Z inflating: build/bin/c10_DeadlockDetection_test 2025-12-04T09:31:01.5129658Z inflating: build/bin/c10_IntrusiveList_test 2025-12-04T09:31:01.5162883Z inflating: build/bin/c10_Half_test 2025-12-04T09:31:01.5197928Z inflating: build/bin/c10_Enumerate_test 2025-12-04T09:31:01.5232149Z inflating: build/bin/c10_LeftRight_test 2025-12-04T09:31:01.5265130Z inflating: build/bin/c10_NetworkFlow_test 2025-12-04T09:31:01.5295565Z inflating: build/bin/c10_Semaphore_test 2025-12-04T09:31:01.5326529Z inflating: build/bin/c10_Synchronized_test 2025-12-04T09:31:01.5358376Z inflating: build/bin/c10_TypeIndex_test 2025-12-04T09:31:01.5392187Z inflating: build/bin/c10_ThreadLocal_test 2025-12-04T09:31:01.5424092Z inflating: build/bin/c10_accumulate_test 2025-12-04T09:31:01.5458304Z inflating: build/bin/c10_bfloat16_test 2025-12-04T09:31:01.5488665Z inflating: build/bin/c10_error_test 2025-12-04T09:31:01.5519779Z inflating: build/bin/c10_bit_cast_test 2025-12-04T09:31:01.5553707Z inflating: build/bin/c10_complex_test 2025-12-04T09:31:01.5585811Z inflating: build/bin/c10_exception_test 2025-12-04T09:31:01.5620787Z inflating: build/bin/c10_complex_math_test 2025-12-04T09:31:01.5651720Z inflating: build/bin/c10_flags_test 2025-12-04T09:31:01.5683240Z inflating: build/bin/c10_irange_test 2025-12-04T09:31:01.5714257Z inflating: build/bin/c10_generic_math_test 2025-12-04T09:31:01.5804152Z inflating: build/bin/c10_intrusive_ptr_test 2025-12-04T09:31:01.5838951Z inflating: build/bin/c10_logging_test 2025-12-04T09:31:01.5869760Z inflating: build/bin/c10_nofatal_test 2025-12-04T09:31:01.5902698Z inflating: build/bin/c10_lazy_test 2025-12-04T09:31:01.5940365Z inflating: build/bin/c10_ordered_preserving_dict_test 2025-12-04T09:31:01.5972988Z inflating: build/bin/c10_registry_test 2025-12-04T09:31:01.6004825Z inflating: build/bin/c10_ssize_test 2025-12-04T09:31:01.6049719Z inflating: build/bin/c10_optional_test 2025-12-04T09:31:01.6137364Z inflating: build/bin/c10_small_vector_test 2025-12-04T09:31:01.6171762Z inflating: build/bin/c10_string_util_test 2025-12-04T09:31:01.6202446Z inflating: build/bin/c10_tempfile_test 2025-12-04T09:31:01.6232963Z inflating: build/bin/c10_string_view_test 2025-12-04T09:31:01.6259772Z inflating: build/bin/c10_intrusive_ptr_benchmark 2025-12-04T09:31:01.6294197Z inflating: build/bin/c10_typeid_test 2025-12-04T09:31:01.6324248Z inflating: build/bin/c10_hip_HIPAssertionsTest_1_var_test 2025-12-04T09:31:01.6354517Z inflating: build/bin/c10_hip_HIPAssertionsTest_catches_stream 2025-12-04T09:31:01.6384727Z inflating: build/bin/c10_hip_HIPAssertionsTest_catches_thread_and_block_and_device 2025-12-04T09:31:01.6414813Z inflating: build/bin/c10_hip_HIPAssertionsTest_from_2_processes 2025-12-04T09:31:01.6445265Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_blocks_and_threads 2025-12-04T09:31:01.6475126Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_multiple_blocks 2025-12-04T09:31:01.6505252Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_same_block 2025-12-04T09:31:01.6535540Z inflating: build/bin/c10_hip_HIPTest 2025-12-04T09:31:01.6863463Z inflating: build/bin/vec_test_all_types_DEFAULT 2025-12-04T09:31:01.7199014Z inflating: build/bin/vec_test_all_types_AVX512 2025-12-04T09:31:01.7541509Z inflating: build/bin/vec_test_all_types_AVX2 2025-12-04T09:31:01.7599142Z inflating: build/bin/test_aoti_abi_check 2025-12-04T09:31:01.7629405Z inflating: build/bin/test_vec_half_DEFAULT 2025-12-04T09:31:01.7660188Z inflating: build/bin/test_vec_half_AVX2 2025-12-04T09:31:01.7690858Z inflating: build/bin/test_vec_half_AVX512 2025-12-04T09:31:01.7723371Z inflating: build/bin/BackoffTest 2025-12-04T09:31:01.7757569Z inflating: build/bin/FileStoreTest 2025-12-04T09:31:01.7791920Z inflating: build/bin/TCPStoreTest 2025-12-04T09:31:01.7825080Z inflating: build/bin/HashStoreTest 2025-12-04T09:31:01.7865872Z inflating: build/bin/ProcessGroupGlooTest 2025-12-04T09:31:01.7867151Z inflating: build/bin/example_allreduce 2025-12-04T09:31:01.7869603Z inflating: build/bin/torch_shm_manager 2025-12-04T09:31:01.7902850Z inflating: build/bin/static_runtime_bench 2025-12-04T09:31:01.8047097Z inflating: build/bin/static_runtime_test 2025-12-04T09:31:01.8090833Z inflating: build/bin/Dict_test 2025-12-04T09:31:01.8123119Z inflating: build/bin/Dimname_test 2025-12-04T09:31:01.8162456Z inflating: build/bin/MaybeOwned_test 2025-12-04T09:31:01.8197256Z inflating: build/bin/NamedTensor_test 2025-12-04T09:31:01.8233229Z inflating: build/bin/apply_utils_test 2025-12-04T09:31:01.8269140Z inflating: build/bin/atest 2025-12-04T09:31:01.8308017Z inflating: build/bin/basic 2025-12-04T09:31:01.8341890Z inflating: build/bin/broadcast_test 2025-12-04T09:31:01.8373210Z inflating: build/bin/cpu_allocator_test 2025-12-04T09:31:01.8408844Z inflating: build/bin/cpu_generator_test 2025-12-04T09:31:01.8441168Z inflating: build/bin/cpu_profiling_allocator_test 2025-12-04T09:31:01.8496281Z inflating: build/bin/cpu_rng_test 2025-12-04T09:31:01.8528208Z inflating: build/bin/dlconvertor_test 2025-12-04T09:31:01.8563292Z inflating: build/bin/extension_backend_test 2025-12-04T09:31:01.8597181Z inflating: build/bin/half_test 2025-12-04T09:31:01.8655057Z inflating: build/bin/ivalue_test 2025-12-04T09:31:01.8685852Z inflating: build/bin/lazy_tensor_test 2025-12-04T09:31:01.8718323Z inflating: build/bin/math_kernel_test 2025-12-04T09:31:01.8750643Z inflating: build/bin/memory_format_test 2025-12-04T09:31:01.8783504Z inflating: build/bin/memory_overlapping_test 2025-12-04T09:31:01.8815966Z inflating: build/bin/mobile_memory_cleanup 2025-12-04T09:31:01.8850547Z inflating: build/bin/native_test 2025-12-04T09:31:01.8882042Z inflating: build/bin/operator_name_test 2025-12-04T09:31:01.8913155Z inflating: build/bin/operators_test 2025-12-04T09:31:01.8945224Z inflating: build/bin/packedtensoraccessor_test 2025-12-04T09:31:01.8985891Z inflating: build/bin/pow_test 2025-12-04T09:31:01.9020430Z inflating: build/bin/quantized_test 2025-12-04T09:31:01.9050953Z inflating: build/bin/reduce_ops_test 2025-12-04T09:31:01.9082286Z inflating: build/bin/reportMemoryUsage_test 2025-12-04T09:31:01.9116603Z inflating: build/bin/scalar_tensor_test 2025-12-04T09:31:01.9151570Z inflating: build/bin/scalar_test 2025-12-04T09:31:01.9183296Z inflating: build/bin/StorageUtils_test 2025-12-04T09:31:01.9215197Z inflating: build/bin/stride_properties_test 2025-12-04T09:31:01.9262449Z inflating: build/bin/tensor_iterator_test 2025-12-04T09:31:01.9295805Z inflating: build/bin/test_parallel 2025-12-04T09:31:01.9327074Z inflating: build/bin/thread_init_test 2025-12-04T09:31:01.9360961Z inflating: build/bin/type_ptr_test 2025-12-04T09:31:01.9396736Z inflating: build/bin/type_test 2025-12-04T09:31:01.9428867Z inflating: build/bin/undefined_tensor_test 2025-12-04T09:31:01.9459368Z inflating: build/bin/verify_api_visibility 2025-12-04T09:31:01.9502063Z inflating: build/bin/legacy_vmap_test 2025-12-04T09:31:01.9533706Z inflating: build/bin/weakref_test 2025-12-04T09:31:01.9565409Z inflating: build/bin/wrapdim_test 2025-12-04T09:31:01.9629322Z inflating: build/bin/List_test 2025-12-04T09:31:01.9660682Z inflating: build/bin/xla_tensor_test 2025-12-04T09:31:01.9696891Z inflating: build/bin/IListRef_test 2025-12-04T09:31:01.9766884Z inflating: build/bin/kernel_function_legacy_test 2025-12-04T09:31:01.9806983Z inflating: build/bin/KernelFunction_test 2025-12-04T09:31:01.9863707Z inflating: build/bin/kernel_function_test 2025-12-04T09:31:01.9937366Z inflating: build/bin/kernel_lambda_legacy_test 2025-12-04T09:31:01.9997489Z inflating: build/bin/kernel_lambda_test 2025-12-04T09:31:02.0034016Z inflating: build/bin/kernel_stackbased_test 2025-12-04T09:31:02.0090328Z inflating: build/bin/make_boxed_from_unboxed_functor_test 2025-12-04T09:31:02.0121933Z inflating: build/bin/CppSignature_test 2025-12-04T09:31:02.0152276Z inflating: build/bin/op_allowlist_test 2025-12-04T09:31:02.0330046Z inflating: build/bin/op_registration_test 2025-12-04T09:31:02.0360466Z inflating: build/bin/hip_complex_math_test 2025-12-04T09:31:02.0394290Z inflating: build/bin/backend_fallback_test 2025-12-04T09:31:02.0424834Z inflating: build/bin/hip_complex_test 2025-12-04T09:31:02.0465421Z inflating: build/bin/inline_container_test 2025-12-04T09:31:02.0497828Z inflating: build/bin/hip_apply_test 2025-12-04T09:31:02.0528185Z inflating: build/bin/hip_distributions_test 2025-12-04T09:31:02.0558479Z inflating: build/bin/hip_generator_test 2025-12-04T09:31:02.0588585Z inflating: build/bin/hip_half_test 2025-12-04T09:31:02.0618842Z inflating: build/bin/hip_integer_divider_test 2025-12-04T09:31:02.0649112Z inflating: build/bin/hip_optional_test 2025-12-04T09:31:02.0679439Z inflating: build/bin/hip_packedtensoraccessor_test 2025-12-04T09:31:02.0709533Z inflating: build/bin/hip_vectorized_test 2025-12-04T09:31:02.0741341Z inflating: build/bin/hip_dlconvertor_test 2025-12-04T09:31:02.1363794Z inflating: build/bin/test_jit 2025-12-04T09:31:02.1562641Z inflating: build/bin/test_lazy 2025-12-04T09:31:02.1596996Z inflating: build/bin/test_dist_autograd 2025-12-04T09:31:02.1638465Z inflating: build/bin/test_cpp_rpc 2025-12-04T09:31:02.1639470Z inflating: build/bin/parallel_benchmark 2025-12-04T09:31:02.2306729Z inflating: build/bin/test_api 2025-12-04T09:31:02.2307214Z creating: .additional_ci_files/ 2025-12-04T09:31:02.2343078Z inflating: .additional_ci_files/test-times.json 2025-12-04T09:31:02.2475352Z inflating: .additional_ci_files/test-class-times.json 2025-12-04T09:31:02.2517308Z ##[group]Run rm artifacts.zip 2025-12-04T09:31:02.2517754Z rm artifacts.zip 2025-12-04T09:31:02.2528420Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:31:02.2528908Z env: 2025-12-04T09:31:02.2529207Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:02.2529636Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:02.2530201Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:02.2530734Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:02.2531981Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:02.2533206Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:02.2533579Z AWS_REGION: us-east-1 2025-12-04T09:31:02.2534069Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:02.2534748Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:02.2541675Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:02.2542013Z ##[endgroup] 2025-12-04T09:31:02.3659885Z ##[group]Run df -H 2025-12-04T09:31:02.3660247Z df -H 2025-12-04T09:31:02.3670609Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:31:02.3671087Z env: 2025-12-04T09:31:02.3671385Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:02.3671819Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:02.3672395Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:02.3672936Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:02.3674203Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:02.3675452Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:02.3675834Z AWS_REGION: us-east-1 2025-12-04T09:31:02.3676343Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:02.3676858Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:02.3683657Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:02.3683996Z ##[endgroup] 2025-12-04T09:31:02.4023127Z Filesystem Size Used Avail Use% Mounted on 2025-12-04T09:31:02.4023613Z overlay 16T 766G 15T 5% / 2025-12-04T09:31:02.4024049Z tmpfs 68M 0 68M 0% /dev 2025-12-04T09:31:02.4024485Z /dev/md0 16T 766G 15T 5% /run 2025-12-04T09:31:02.4025092Z shm 68M 4.1k 68M 1% /dev/shm 2025-12-04T09:31:02.4025676Z amdprj2-k8s_2 5.5T 120G 5.4T 3% /home/runner/pytorch-data 2025-12-04T09:31:02.4026305Z tmpfs 3.3T 13k 3.3T 1% /run/secrets/kubernetes.io/serviceaccount 2025-12-04T09:31:02.4026855Z tmpfs 1.7T 0 1.7T 0% /proc/acpi 2025-12-04T09:31:02.4027304Z tmpfs 1.7T 0 1.7T 0% /proc/scsi 2025-12-04T09:31:02.4027752Z tmpfs 1.7T 0 1.7T 0% /sys/firmware 2025-12-04T09:31:02.4028331Z tmpfs 1.7T 0 1.7T 0% /sys/devices/virtual/powercap 2025-12-04T09:31:02.4074546Z Prepare all required actions 2025-12-04T09:31:02.4075087Z Getting action download info 2025-12-04T09:31:02.8426413Z ##[group]Run ./.github/actions/download-td-artifacts 2025-12-04T09:31:02.8426883Z with: 2025-12-04T09:31:02.8427170Z env: 2025-12-04T09:31:02.8427473Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:02.8427912Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:02.8428577Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:02.8429117Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:02.8430387Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:02.8431635Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:02.8432021Z AWS_REGION: us-east-1 2025-12-04T09:31:02.8432519Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:02.8433055Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:02.8439992Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:02.8440336Z ##[endgroup] 2025-12-04T09:31:02.8477950Z ##[group]Run seemethere/download-artifact-s3@v4 2025-12-04T09:31:02.8478467Z with: 2025-12-04T09:31:02.8478769Z name: td_results 2025-12-04T09:31:02.8479104Z s3-bucket: gha-artifacts 2025-12-04T09:31:02.8479458Z region: us-east-1 2025-12-04T09:31:02.8479766Z env: 2025-12-04T09:31:02.8480065Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:02.8480509Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:02.8481097Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:02.8481641Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:02.8482909Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:02.8484320Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:02.8484701Z AWS_REGION: us-east-1 2025-12-04T09:31:02.8485125Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:02.8485623Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:02.8493022Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:02.8493365Z ##[endgroup] 2025-12-04T09:31:03.0854928Z (node:17260) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2025-12-04T09:31:03.0855558Z 2025-12-04T09:31:03.0855825Z Please migrate your code to use AWS SDK for JavaScript (v3). 2025-12-04T09:31:03.0856501Z For more information, check the migration guide at https://a.co/7PzMCcy 2025-12-04T09:31:03.0857188Z (Use `node --trace-warnings ...` to show where the warning was created) 2025-12-04T09:31:03.3791676Z Found 1 objects with prefix pytorch/pytorch/19922849170/td_results/ 2025-12-04T09:31:03.3792433Z Starting download (1/1): /home/runner/_work/pytorch/pytorch/td_results.json 2025-12-04T09:31:03.7895541Z Finished download (1/1): /home/runner/_work/pytorch/pytorch/td_results.json 2025-12-04T09:31:03.7906518Z Artifact download has finished successfully 2025-12-04T09:31:03.8180246Z ##[group]Run mkdir -p .additional_ci_files 2025-12-04T09:31:03.8180752Z mkdir -p .additional_ci_files 2025-12-04T09:31:03.8181291Z mv td_results.json .additional_ci_files/td_results.json || true 2025-12-04T09:31:03.8191888Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:31:03.8192366Z env: 2025-12-04T09:31:03.8192669Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:03.8193103Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:03.8193673Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:03.8194211Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:03.8195735Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:03.8196993Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:03.8197370Z AWS_REGION: us-east-1 2025-12-04T09:31:03.8197900Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:03.8198474Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:03.8205145Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:03.8205485Z ##[endgroup] 2025-12-04T09:31:03.8318696Z ##[group]Run .github/scripts/parse_ref.py 2025-12-04T09:31:03.8319205Z .github/scripts/parse_ref.py 2025-12-04T09:31:03.8333325Z shell: /usr/bin/bash -e {0} 2025-12-04T09:31:03.8333685Z env: 2025-12-04T09:31:03.8333988Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:03.8334438Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:03.8335026Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:03.8335567Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:03.8337284Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:03.8338650Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:03.8339031Z AWS_REGION: us-east-1 2025-12-04T09:31:03.8339504Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:03.8340050Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:03.8346824Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:03.8347163Z ##[endgroup] 2025-12-04T09:31:03.8450580Z Setting output branch=main 2025-12-04T09:31:03.8606042Z Prepare all required actions 2025-12-04T09:31:03.8606655Z Getting action download info 2025-12-04T09:31:04.0987932Z ##[group]Run ./.github/actions/filter-test-configs 2025-12-04T09:31:04.0988479Z with: 2025-12-04T09:31:04.0989026Z github-token: *** 2025-12-04T09:31:04.0999167Z test-matrix: {"include": [{"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]} 2025-12-04T09:31:04.1010452Z job-name: linux-jammy-rocm-py3.10 / test (default, 4, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:31:04.1011136Z env: 2025-12-04T09:31:04.1011444Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:04.1011895Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:04.1012487Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:04.1013037Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:04.1014314Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:04.1015540Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:04.1015922Z AWS_REGION: us-east-1 2025-12-04T09:31:04.1016519Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:04.1017011Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:04.1023906Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:04.1024244Z ##[endgroup] 2025-12-04T09:31:04.1071176Z ##[group]Run nick-fields/retry@v3.0.0 2025-12-04T09:31:04.1071583Z with: 2025-12-04T09:31:04.1071857Z shell: bash 2025-12-04T09:31:04.1072159Z timeout_minutes: 10 2025-12-04T09:31:04.1072482Z max_attempts: 5 2025-12-04T09:31:04.1072946Z retry_wait_seconds: 30 2025-12-04T09:31:04.1073905Z command: set -eux # PyYAML 6.0 doesn't work with MacOS x86 anymore # This must run on Python-3.7 (AmazonLinux2) so can't use request=3.32.2 python3 -m pip install requests==2.27.1 pyyaml==6.0.2 2025-12-04T09:31:04.1074898Z polling_interval_seconds: 1 2025-12-04T09:31:04.1075266Z warning_on_retry: true 2025-12-04T09:31:04.1075611Z continue_on_error: false 2025-12-04T09:31:04.1075949Z env: 2025-12-04T09:31:04.1076239Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:04.1076684Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:04.1077265Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:04.1077808Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:04.1079130Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:04.1080375Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:04.1080748Z AWS_REGION: us-east-1 2025-12-04T09:31:04.1081164Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:04.1081647Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:04.1088382Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:04.1088870Z GITHUB_TOKEN: *** 2025-12-04T09:31:04.1089177Z ##[endgroup] 2025-12-04T09:31:04.1518653Z + python3 -m pip install requests==2.27.1 pyyaml==6.0.2 2025-12-04T09:31:04.2970681Z Defaulting to user installation because normal site-packages is not writeable 2025-12-04T09:31:04.3935729Z Collecting requests==2.27.1 2025-12-04T09:31:04.4284037Z Downloading requests-2.27.1-py2.py3-none-any.whl (63 kB) 2025-12-04T09:31:04.4402812Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 63.1/63.1 KB 5.0 MB/s eta 0:00:00 2025-12-04T09:31:04.4849544Z Collecting pyyaml==6.0.2 2025-12-04T09:31:04.4902026Z Downloading PyYAML-6.0.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (751 kB) 2025-12-04T09:31:04.5398310Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 751.2/751.2 KB 16.3 MB/s eta 0:00:00 2025-12-04T09:31:04.5651362Z Collecting certifi>=2017.4.17 2025-12-04T09:31:04.5702823Z Downloading certifi-2025.11.12-py3-none-any.whl (159 kB) 2025-12-04T09:31:04.5764035Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 159.4/159.4 KB 29.9 MB/s eta 0:00:00 2025-12-04T09:31:04.5901116Z Collecting idna<4,>=2.5 2025-12-04T09:31:04.5952703Z Downloading idna-3.11-py3-none-any.whl (71 kB) 2025-12-04T09:31:04.5980931Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 71.0/71.0 KB 32.2 MB/s eta 0:00:00 2025-12-04T09:31:04.6255252Z Collecting urllib3<1.27,>=1.21.1 2025-12-04T09:31:04.6309159Z Downloading urllib3-1.26.20-py2.py3-none-any.whl (144 kB) 2025-12-04T09:31:04.6373760Z ━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━ 144.2/144.2 KB 26.6 MB/s eta 0:00:00 2025-12-04T09:31:04.7272062Z Collecting charset-normalizer~=2.0.0 2025-12-04T09:31:04.7324141Z Downloading charset_normalizer-2.0.12-py3-none-any.whl (39 kB) 2025-12-04T09:31:04.7842263Z Installing collected packages: urllib3, pyyaml, idna, charset-normalizer, certifi, requests 2025-12-04T09:31:04.8779069Z WARNING: The script normalizer is installed in '/home/runner/.local/bin' which is not on PATH. 2025-12-04T09:31:04.8779407Z Consider adding this directory to PATH or, if you prefer to suppress this warning, use --no-warn-script-location. 2025-12-04T09:31:04.8946486Z Successfully installed certifi-2025.11.12 charset-normalizer-2.0.12 idna-3.11 pyyaml-6.0.2 requests-2.27.1 urllib3-1.26.20 2025-12-04T09:31:05.1512298Z Command completed after 1 attempt(s). 2025-12-04T09:31:05.1589828Z ##[group]Run set -x 2025-12-04T09:31:05.1590196Z set -x 2025-12-04T09:31:05.1590487Z  2025-12-04T09:31:05.1590972Z # Use relative path here as this could be checked out anywhere, not necessarily 2025-12-04T09:31:05.1591577Z # in runner workspace 2025-12-04T09:31:05.1592078Z python3 "${GITHUB_ACTION_PATH}/../../scripts/parse_ref.py" 2025-12-04T09:31:05.1602375Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:31:05.1602853Z env: 2025-12-04T09:31:05.1603158Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:05.1603599Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:05.1604167Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:05.1604722Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:05.1606028Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:05.1607271Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:05.1607649Z AWS_REGION: us-east-1 2025-12-04T09:31:05.1608163Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:05.1608663Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:05.1615482Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:05.1615822Z ##[endgroup] 2025-12-04T09:31:05.1654271Z + python3 /home/runner/_work/pytorch/pytorch/./.github/actions/filter-test-configs/../../scripts/parse_ref.py 2025-12-04T09:31:05.1738975Z Setting output branch=main 2025-12-04T09:31:05.1791239Z ##[group]Run echo "Workflow: ${GITHUB_WORKFLOW}" 2025-12-04T09:31:05.1791814Z echo "Workflow: ${GITHUB_WORKFLOW}" 2025-12-04T09:31:05.1792277Z echo "Job name: ${JOB_NAME}" 2025-12-04T09:31:05.1792663Z  2025-12-04T09:31:05.1793145Z # Use relative path here as this could be checked out anywhere, not necessarily 2025-12-04T09:31:05.1793760Z # in runner workspace 2025-12-04T09:31:05.1794310Z python3 "${GITHUB_ACTION_PATH}/../../scripts/filter_test_configs.py" \ 2025-12-04T09:31:05.1794911Z  --workflow "${GITHUB_WORKFLOW}" \ 2025-12-04T09:31:05.1795341Z  --job-name "${JOB_NAME}" \ 2025-12-04T09:31:05.1805982Z  --test-matrix "{"include": [{"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]}" \ 2025-12-04T09:31:05.1816459Z  --selected-test-configs "" \ 2025-12-04T09:31:05.1816904Z  --pr-number "${PR_NUMBER}" \ 2025-12-04T09:31:05.1817314Z  --tag "${TAG}" \ 2025-12-04T09:31:05.1817705Z  --event-name "${EVENT_NAME}" \ 2025-12-04T09:31:05.1818179Z  --schedule "${SCHEDULE}" \ 2025-12-04T09:31:05.1818589Z  --branch "${HEAD_BRANCH}" 2025-12-04T09:31:05.1828763Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:31:05.1829239Z env: 2025-12-04T09:31:05.1829541Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:05.1829981Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:05.1830557Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:05.1831096Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:05.1832367Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:05.1833602Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:05.1833981Z AWS_REGION: us-east-1 2025-12-04T09:31:05.1834478Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:05.1834985Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:05.1841818Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:05.1842369Z GITHUB_TOKEN: *** 2025-12-04T09:31:05.1843013Z JOB_NAME: linux-jammy-rocm-py3.10 / test (default, 4, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:31:05.1843682Z PR_NUMBER: 2025-12-04T09:31:05.1843979Z TAG: 2025-12-04T09:31:05.1844262Z EVENT_NAME: schedule 2025-12-04T09:31:05.1844594Z SCHEDULE: 29 8 * * * 2025-12-04T09:31:05.1844909Z HEAD_BRANCH: main 2025-12-04T09:31:05.1845221Z ##[endgroup] 2025-12-04T09:31:05.1875457Z Workflow: trunk-rocm-mi300 2025-12-04T09:31:05.1876180Z Job name: linux-jammy-rocm-py3.10 / test (default, 4, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:31:05.7367930Z INFO:root:Issue https://github.com/pytorch/pytorch/issues/167616 created by jithunnair-amd has unstable all the test jobs for trunk-rocm-mi300 / linux-jammy-rocm-py3.10 / test (default, 4, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:31:06.1478908Z Setting output keep-going=True 2025-12-04T09:31:06.1479402Z Setting output ci-verbose-test-logs=False 2025-12-04T09:31:06.1479871Z Setting output ci-test-showlocals=False 2025-12-04T09:31:06.1480307Z Setting output ci-no-test-timeout=False 2025-12-04T09:31:06.1480719Z Setting output ci-no-td=False 2025-12-04T09:31:06.1481121Z Setting output ci-td-distributed=False 2025-12-04T09:31:06.1481536Z Setting output is-unstable=True 2025-12-04T09:31:06.1481937Z Setting output reenabled-issues= 2025-12-04T09:31:06.1505283Z Setting output test-matrix={"include": [{"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]} 2025-12-04T09:31:06.1528467Z Setting output is-test-matrix-empty=False 2025-12-04T09:31:06.1660852Z ##[group]Run echo "Filtered matrix:" 2025-12-04T09:31:06.1661402Z echo "Filtered matrix:" 2025-12-04T09:31:06.1685023Z echo "{"include": [{"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 1, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 2, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 3, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 4, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 5, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "default", "shard": 6, "num_shards": 6, "runner": "linux.rocm.gpu.gfx942.1.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "distributed", "shard": 1, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "distributed", "shard": 2, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "mem_leak_check": "mem_leak_check", "unstable": "unstable", "rerun_disabled_tests": "rerun_disabled_tests"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable", "mem_leak_check": "mem_leak_check"}, {"config": "distributed", "shard": 3, "num_shards": 3, "runner": "linux.rocm.gpu.gfx942.4.b", "rerun_disabled_tests": "rerun_disabled_tests", "unstable": "unstable"}]}" 2025-12-04T09:31:06.1709054Z  2025-12-04T09:31:06.1709342Z echo 2025-12-04T09:31:06.1709719Z echo "Is the current job unstable? True" 2025-12-04T09:31:06.1710146Z  2025-12-04T09:31:06.1710424Z echo 2025-12-04T09:31:06.1710763Z echo "Is keep-going label set? True" 2025-12-04T09:31:06.1711177Z  2025-12-04T09:31:06.1711449Z echo 2025-12-04T09:31:06.1711763Z echo "Reenabled issues? " 2025-12-04T09:31:06.1722419Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:31:06.1722934Z env: 2025-12-04T09:31:06.1723239Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:06.1723687Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:06.1724273Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:06.1724826Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:06.1726142Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:06.1727417Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:06.1727810Z AWS_REGION: us-east-1 2025-12-04T09:31:06.1728383Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:06.1728900Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:06.1735901Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:06.1736255Z ##[endgroup] 2025-12-04T09:31:06.1774639Z Filtered matrix: 2025-12-04T09:31:06.1802478Z {include: [{config: default, shard: 1, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: default, shard: 1, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: default, shard: 1, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: default, shard: 1, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: default, shard: 2, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: default, shard: 2, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: default, shard: 2, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: default, shard: 2, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: default, shard: 3, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: default, shard: 3, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: default, shard: 3, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: default, shard: 3, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: default, shard: 4, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: default, shard: 4, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: default, shard: 4, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: default, shard: 4, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: default, shard: 5, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: default, shard: 5, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: default, shard: 5, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: default, shard: 5, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: default, shard: 6, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: default, shard: 6, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: default, shard: 6, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: default, shard: 6, num_shards: 6, runner: linux.rocm.gpu.gfx942.1.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: distributed, shard: 1, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: distributed, shard: 1, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: distributed, shard: 1, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: distributed, shard: 1, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: distributed, shard: 2, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: distributed, shard: 2, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: distributed, shard: 2, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: distributed, shard: 2, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}, {config: distributed, shard: 3, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, mem_leak_check: mem_leak_check, unstable: unstable}, {config: distributed, shard: 3, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, mem_leak_check: mem_leak_check, unstable: unstable, rerun_disabled_tests: rerun_disabled_tests}, {config: distributed, shard: 3, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable, mem_leak_check: mem_leak_check}, {config: distributed, shard: 3, num_shards: 3, runner: linux.rocm.gpu.gfx942.4.b, rerun_disabled_tests: rerun_disabled_tests, unstable: unstable}]} 2025-12-04T09:31:06.1825445Z 2025-12-04T09:31:06.1825707Z Is the current job unstable? True 2025-12-04T09:31:06.1825976Z 2025-12-04T09:31:06.1826129Z Is keep-going label set? True 2025-12-04T09:31:06.1826386Z 2025-12-04T09:31:06.1826515Z Reenabled issues? 2025-12-04T09:31:06.1878928Z ##[group]Run echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2025-12-04T09:31:06.1879612Z echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2025-12-04T09:31:06.1888325Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:31:06.1888973Z env: 2025-12-04T09:31:06.1889280Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:06.1889723Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:06.1890299Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:06.1890844Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:06.1892107Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:06.1893357Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:06.1893735Z AWS_REGION: us-east-1 2025-12-04T09:31:06.1894229Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:06.1894797Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:06.1901662Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:06.1902004Z JOB_TIMEOUT: 600 2025-12-04T09:31:06.1902317Z ##[endgroup] 2025-12-04T09:31:06.1977691Z ##[group]Run env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:31:06.1978780Z env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:31:06.1979357Z env | grep '^CI' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2025-12-04T09:31:06.1990140Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T09:31:06.1990615Z env: 2025-12-04T09:31:06.1990916Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:06.1991355Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:06.1991927Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:06.1992467Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:06.1993776Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:06.1995080Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:06.1995451Z AWS_REGION: us-east-1 2025-12-04T09:31:06.1995926Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:06.1996418Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:06.2003257Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:06.2003600Z ##[endgroup] 2025-12-04T09:31:06.2165088Z ##[group]Run set -x 2025-12-04T09:31:06.2165515Z set -x 2025-12-04T09:31:06.2165808Z  2025-12-04T09:31:06.2166153Z if [[ $TEST_CONFIG == 'multigpu' ]]; then 2025-12-04T09:31:06.2166660Z  TEST_COMMAND=.ci/pytorch/multigpu-test.sh 2025-12-04T09:31:06.2167160Z elif [[ $BUILD_ENVIRONMENT == *onnx* ]]; then 2025-12-04T09:31:06.2167622Z  TEST_COMMAND=.ci/caffe2/test.sh 2025-12-04T09:31:06.2168012Z else 2025-12-04T09:31:06.2168420Z  TEST_COMMAND=.ci/pytorch/test.sh 2025-12-04T09:31:06.2168803Z fi 2025-12-04T09:31:06.2169078Z  2025-12-04T09:31:06.2169503Z # detached container should get cleaned up by teardown_ec2_linux 2025-12-04T09:31:06.2170180Z # TODO: Stop building test binaries as part of the build phase 2025-12-04T09:31:06.2170756Z # Used for GPU_FLAG since that doesn't play nice 2025-12-04T09:31:06.2171262Z # shellcheck disable=SC2086,SC2090 2025-12-04T09:31:06.2171692Z container_name=$(docker run \ 2025-12-04T09:31:06.2172104Z  ${GPU_FLAG:-} \ 2025-12-04T09:31:06.2172475Z  -e BUILD_ENVIRONMENT \ 2025-12-04T09:31:06.2172861Z  -e PR_NUMBER \ 2025-12-04T09:31:06.2173217Z  -e GITHUB_ACTIONS \ 2025-12-04T09:31:06.2173589Z  -e GITHUB_REPOSITORY \ 2025-12-04T09:31:06.2173972Z  -e GITHUB_WORKFLOW \ 2025-12-04T09:31:06.2174813Z  -e GITHUB_JOB \ 2025-12-04T09:31:06.2175167Z  -e GITHUB_RUN_ID \ 2025-12-04T09:31:06.2175530Z  -e GITHUB_RUN_NUMBER \ 2025-12-04T09:31:06.2175915Z  -e GITHUB_RUN_ATTEMPT \ 2025-12-04T09:31:06.2176492Z  -e JOB_ID \ 2025-12-04T09:31:06.2176829Z  -e JOB_NAME \ 2025-12-04T09:31:06.2177175Z  -e BASE_SHA \ 2025-12-04T09:31:06.2177507Z  -e BRANCH \ 2025-12-04T09:31:06.2177825Z  -e SHA1 \ 2025-12-04T09:31:06.2178209Z  -e AWS_DEFAULT_REGION \ 2025-12-04T09:31:06.2178594Z  -e IN_WHEEL_TEST \ 2025-12-04T09:31:06.2178956Z  -e SHARD_NUMBER \ 2025-12-04T09:31:06.2179319Z  -e TEST_CONFIG \ 2025-12-04T09:31:06.2179678Z  -e NUM_TEST_SHARDS \ 2025-12-04T09:31:06.2180053Z  -e REENABLED_ISSUES \ 2025-12-04T09:31:06.2180443Z  -e CONTINUE_THROUGH_ERROR \ 2025-12-04T09:31:06.2180843Z  -e VERBOSE_TEST_LOGS \ 2025-12-04T09:31:06.2181222Z  -e TEST_SHOWLOCALS \ 2025-12-04T09:31:06.2181591Z  -e NO_TEST_TIMEOUT \ 2025-12-04T09:31:06.2181947Z  -e NO_TD \ 2025-12-04T09:31:06.2182324Z  -e MAX_JOBS="$(nproc --ignore=2)" \ 2025-12-04T09:31:06.2182795Z  -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK \ 2025-12-04T09:31:06.2183262Z  -e PYTORCH_TEST_RERUN_DISABLED_TESTS \ 2025-12-04T09:31:06.2183701Z  -e TESTS_TO_INCLUDE \ 2025-12-04T09:31:06.2184089Z  -e HUGGING_FACE_HUB_TOKEN \ 2025-12-04T09:31:06.2184488Z  -e DASHBOARD_TAG \ 2025-12-04T09:31:06.2184963Z  --env-file="${RUNNER_TEMP}/github_env_${GITHUB_RUN_ID}" \ 2025-12-04T09:31:06.2185486Z  --ulimit stack=10485760:83886080 \ 2025-12-04T09:31:06.2185896Z  --ulimit core=0 \ 2025-12-04T09:31:06.2186334Z  --env-file="/tmp/github_env_${GITHUB_RUN_ID}" \ 2025-12-04T09:31:06.2186846Z  --security-opt seccomp=unconfined \ 2025-12-04T09:31:06.2187289Z  --cap-add=SYS_PTRACE \ 2025-12-04T09:31:06.2187678Z  --shm-size="8g" \ 2025-12-04T09:31:06.2188032Z  --tty \ 2025-12-04T09:31:06.2188447Z  --detach \ 2025-12-04T09:31:06.2188812Z  --name="${container_name}" \ 2025-12-04T09:31:06.2189226Z  --user jenkins \ 2025-12-04T09:31:06.2189683Z  -v "${GITHUB_WORKSPACE}:/var/lib/jenkins/workspace" \ 2025-12-04T09:31:06.2190191Z  -w /var/lib/jenkins/workspace \ 2025-12-04T09:31:06.2190785Z  "${DOCKER_IMAGE}" 2025-12-04T09:31:06.2191131Z ) 2025-12-04T09:31:06.2191469Z # save container name for later step 2025-12-04T09:31:06.2191994Z echo "CONTAINER_NAME=${container_name}" >> "$GITHUB_ENV" 2025-12-04T09:31:06.2192882Z # jenkins user does not have write permission to mounted workspace; work-around by copying within container to jenkins home 2025-12-04T09:31:06.2194033Z docker exec -t "${container_name}" sh -c "cd .. && cp -R workspace pytorch && cd pytorch && pip install dist/*.whl && ${TEST_COMMAND}" 2025-12-04T09:31:06.2203465Z shell: /usr/bin/bash -e {0} 2025-12-04T09:31:06.2203828Z env: 2025-12-04T09:31:06.2204132Z GIT_DEFAULT_BRANCH: main 2025-12-04T09:31:06.2204590Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T09:31:06.2205172Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T09:31:06.2205717Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T09:31:06.2206991Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T09:31:06.2208330Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T09:31:06.2208712Z AWS_REGION: us-east-1 2025-12-04T09:31:06.2209163Z AWS_ACCESS_KEY_ID: *** 2025-12-04T09:31:06.2209664Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T09:31:06.2216388Z AWS_SESSION_TOKEN: *** 2025-12-04T09:31:06.2216791Z BUILD_ENVIRONMENT: linux-jammy-rocm-py3.10 2025-12-04T09:31:06.2217216Z PR_NUMBER: 2025-12-04T09:31:06.2217557Z GITHUB_REPOSITORY: pytorch/pytorch 2025-12-04T09:31:06.2217985Z GITHUB_WORKFLOW: trunk-rocm-mi300 2025-12-04T09:31:06.2218557Z GITHUB_JOB: test 2025-12-04T09:31:06.2218888Z GITHUB_RUN_ID: 19922849170 2025-12-04T09:31:06.2219253Z GITHUB_RUN_NUMBER: 689 2025-12-04T09:31:06.2219599Z GITHUB_RUN_ATTEMPT: 1 2025-12-04T09:31:06.2219940Z JOB_ID: 57116213162 2025-12-04T09:31:06.2220580Z JOB_NAME: linux-jammy-rocm-py3.10 / test (default, 4, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:31:06.2221257Z BRANCH: main 2025-12-04T09:31:06.2221623Z SHA1: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:31:06.2222131Z BASE_SHA: ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:31:06.2222572Z TEST_CONFIG: default 2025-12-04T09:31:06.2222895Z SHARD_NUMBER: 4 2025-12-04T09:31:06.2223205Z NUM_TEST_SHARDS: 6 2025-12-04T09:31:06.2223532Z REENABLED_ISSUES: 2025-12-04T09:31:06.2223871Z CONTINUE_THROUGH_ERROR: True 2025-12-04T09:31:06.2224253Z VERBOSE_TEST_LOGS: False 2025-12-04T09:31:06.2224610Z TEST_SHOWLOCALS: False 2025-12-04T09:31:06.2224957Z NO_TEST_TIMEOUT: False 2025-12-04T09:31:06.2225285Z NO_TD: False 2025-12-04T09:31:06.2226147Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:31:06.2227117Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK: 1 2025-12-04T09:31:06.2227546Z PYTORCH_TEST_RERUN_DISABLED_TESTS: 0 2025-12-04T09:31:06.2227945Z TESTS_TO_INCLUDE: 2025-12-04T09:31:06.2228329Z DASHBOARD_TAG: 2025-12-04T09:31:06.2228796Z HUGGING_FACE_HUB_TOKEN: *** 2025-12-04T09:31:06.2229163Z ##[endgroup] 2025-12-04T09:31:06.2261002Z + [[ default == \m\u\l\t\i\g\p\u ]] 2025-12-04T09:31:06.2261542Z + [[ linux-jammy-rocm-py3.10 == *onnx* ]] 2025-12-04T09:31:06.2261996Z + TEST_COMMAND=.ci/pytorch/test.sh 2025-12-04T09:31:06.2271808Z +++ nproc --ignore=2 2025-12-04T09:31:06.2283461Z ++ docker run --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host -e BUILD_ENVIRONMENT -e PR_NUMBER -e GITHUB_ACTIONS -e GITHUB_REPOSITORY -e GITHUB_WORKFLOW -e GITHUB_JOB -e GITHUB_RUN_ID -e GITHUB_RUN_NUMBER -e GITHUB_RUN_ATTEMPT -e JOB_ID -e JOB_NAME -e BASE_SHA -e BRANCH -e SHA1 -e AWS_DEFAULT_REGION -e IN_WHEEL_TEST -e SHARD_NUMBER -e TEST_CONFIG -e NUM_TEST_SHARDS -e REENABLED_ISSUES -e CONTINUE_THROUGH_ERROR -e VERBOSE_TEST_LOGS -e TEST_SHOWLOCALS -e NO_TEST_TIMEOUT -e NO_TD -e MAX_JOBS=126 -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK -e PYTORCH_TEST_RERUN_DISABLED_TESTS -e TESTS_TO_INCLUDE -e HUGGING_FACE_HUB_TOKEN -e DASHBOARD_TAG --env-file=/home/runner/_work/_temp/github_env_19922849170 --ulimit stack=10485760:83886080 --ulimit core=0 --env-file=/tmp/github_env_19922849170 --security-opt seccomp=unconfined --cap-add=SYS_PTRACE --shm-size=8g --tty --detach --name= --user jenkins -v /home/runner/_work/pytorch/pytorch:/var/lib/jenkins/workspace -w /var/lib/jenkins/workspace 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/ci-image:pytorch-linux-jammy-rocm-n-py3-f0cd68561080d537ef3d3d6f81b25a6416ad600a 2025-12-04T09:31:06.4864069Z + container_name=2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63 2025-12-04T09:31:06.4866089Z + echo CONTAINER_NAME=2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63 2025-12-04T09:31:06.4867518Z + docker exec -t 2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63 sh -c 'cd .. && cp -R workspace pytorch && cd pytorch && pip install dist/*.whl && .ci/pytorch/test.sh' 2025-12-04T09:31:09.6361254Z Processing ./dist/torch-2.10.0a0+gitffd9b0f-cp310-cp310-linux_x86_64.whl 2025-12-04T09:31:10.1919066Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (3.18.0) 2025-12-04T09:31:10.1920792Z Requirement already satisfied: typing-extensions>=4.10.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (4.12.2) 2025-12-04T09:31:10.1922508Z Requirement already satisfied: sympy>=1.13.3 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (1.13.3) 2025-12-04T09:31:10.1925412Z Requirement already satisfied: networkx>=2.5.1 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (2.8.8) 2025-12-04T09:31:10.1926786Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (3.1.6) 2025-12-04T09:31:10.1928028Z Requirement already satisfied: fsspec>=0.8.5 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.10.0a0+gitffd9b0f) (2025.10.0) 2025-12-04T09:31:10.2094739Z Requirement already satisfied: mpmath<1.4,>=1.1.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from sympy>=1.13.3->torch==2.10.0a0+gitffd9b0f) (1.3.0) 2025-12-04T09:31:10.2120042Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from jinja2->torch==2.10.0a0+gitffd9b0f) (3.0.3) 2025-12-04T09:31:10.4149175Z Installing collected packages: torch 2025-12-04T09:31:15.9935005Z Successfully installed torch-2.10.0a0+gitffd9b0f 2025-12-04T09:31:16.0316851Z + export TERM=vt100 2025-12-04T09:31:16.0317211Z + TERM=vt100 2025-12-04T09:31:16.0321549Z ++ dirname .ci/pytorch/test.sh 2025-12-04T09:31:16.0334940Z + source .ci/pytorch/common.sh 2025-12-04T09:31:16.0340745Z +++ dirname .ci/pytorch/common.sh 2025-12-04T09:31:16.0357812Z ++ source .ci/pytorch/common_utils.sh 2025-12-04T09:31:16.0360251Z +++ declare -f -t trap_add 2025-12-04T09:31:16.0366211Z ++ set -ex -o pipefail 2025-12-04T09:31:16.0367952Z ++ [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:31:16.0368623Z ++ unset HIP_PLATFORM 2025-12-04T09:31:16.0369016Z ++ export PYTORCH_TEST_WITH_ROCM=1 2025-12-04T09:31:16.0369436Z ++ PYTORCH_TEST_WITH_ROCM=1 2025-12-04T09:31:16.0369812Z ++ BUILD_TEST_LIBTORCH=0 2025-12-04T09:31:16.0372390Z ++ dirname .ci/pytorch/test.sh 2025-12-04T09:31:16.0386118Z + source .ci/pytorch/common-build.sh 2025-12-04T09:31:16.0388895Z ++ [[ linux-jammy-rocm-py3.10 != *win-* ]] 2025-12-04T09:31:16.0399588Z ++++ dirname .ci/pytorch/common-build.sh 2025-12-04T09:31:16.0414938Z +++ cd .ci/pytorch 2025-12-04T09:31:16.0415677Z +++ pwd -P 2025-12-04T09:31:16.0419743Z ++ script_dir=/var/lib/jenkins/pytorch/.ci/pytorch 2025-12-04T09:31:16.0420338Z ++ [[ linux-jammy-rocm-py3.10 == *-pch* ]] 2025-12-04T09:31:16.0420750Z ++ which sccache 2025-12-04T09:31:16.0439372Z ++ [[ -z '' ]] 2025-12-04T09:31:16.0439703Z ++ unset SCCACHE_BUCKET 2025-12-04T09:31:16.0440035Z ++ unset SCCACHE_REGION 2025-12-04T09:31:16.0440378Z ++ sccache --stop-server 2025-12-04T09:31:16.0472725Z ++ true 2025-12-04T09:31:16.0473059Z ++ rm -f /var/lib/jenkins/sccache_error.log 2025-12-04T09:31:16.0490014Z ++ trap_add sccache_epilogue EXIT 2025-12-04T09:31:16.0490413Z ++ trap_add_cmd=sccache_epilogue 2025-12-04T09:31:16.0490774Z ++ shift 2025-12-04T09:31:16.0491074Z ++ for trap_add_name in "$@" 2025-12-04T09:31:16.0499330Z ++++ trap -p EXIT 2025-12-04T09:31:16.0503034Z +++ eval 'extract_trap_cmd ' 2025-12-04T09:31:16.0503401Z ++++ extract_trap_cmd 2025-12-04T09:31:16.0503729Z ++++ printf '%s\n' '' 2025-12-04T09:31:16.0504079Z +++ printf '%s\n' sccache_epilogue 2025-12-04T09:31:16.0506599Z ++ trap -- ' 2025-12-04T09:31:16.0506946Z sccache_epilogue' EXIT 2025-12-04T09:31:16.0507268Z ++ [[ -n '' ]] 2025-12-04T09:31:16.0507600Z ++ [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:31:16.0508168Z ++ SCCACHE_ERROR_LOG=/var/lib/jenkins/sccache_error.log 2025-12-04T09:31:16.0508627Z ++ SCCACHE_IDLE_TIMEOUT=0 2025-12-04T09:31:16.0508981Z ++ sccache --start-server 2025-12-04T09:31:16.0536706Z sccache: Starting the server... 2025-12-04T09:31:16.0757987Z sccache: Listening on address 127.0.0.1:4226 2025-12-04T09:31:16.0768042Z ++ sccache --zero-stats 2025-12-04T09:31:16.0788940Z Statistics zeroed. 2025-12-04T09:31:16.0792011Z ++ which ccache 2025-12-04T09:31:16.0800160Z + [[ linux-jammy-rocm-py3.10 != *rocm* ]] 2025-12-04T09:31:16.0800316Z + [[ linux-jammy-rocm-py3.10 == *cuda* ]] 2025-12-04T09:31:16.0800653Z + echo 'Environment variables:' 2025-12-04T09:31:16.0800774Z Environment variables: 2025-12-04T09:31:16.0800878Z + env 2025-12-04T09:31:16.0809069Z GITHUB_WORKSPACE=/home/runner/_work/pytorch/pytorch 2025-12-04T09:31:16.0809241Z CONTINUE_THROUGH_ERROR=True 2025-12-04T09:31:16.0809396Z BUILD_ENVIRONMENT=linux-jammy-rocm-py3.10 2025-12-04T09:31:16.0809573Z HOSTNAME=linux.rocm.gpu.gfx942.1.b-gwk9b-runner-ppkbp 2025-12-04T09:31:16.0809823Z GITHUB_PATH=/home/runner/_work/_temp/_runner_file_commands/add_path_bf6e8a29-af74-47e5-806f-526de4d01987 2025-12-04T09:31:16.0810034Z GITHUB_ACTION=__run_2 2025-12-04T09:31:16.0810150Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 2025-12-04T09:31:16.0810276Z GITHUB_RUN_NUMBER=689 2025-12-04T09:31:16.0810433Z TEST_CONFIG=default 2025-12-04T09:31:16.0810573Z RUNNER_NAME=linux.rocm.gpu.gfx942.1.b-gwk9b-runner-ppkbp 2025-12-04T09:31:16.0810731Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-12-04T09:31:16.0810863Z AWS_DEFAULT_REGION=us-east-1 2025-12-04T09:31:16.0811015Z RUNNER_ARTIFACT_DIR=/home/runner/_work/_temp/artifacts 2025-12-04T09:31:16.0811170Z GITHUB_TRIGGERING_ACTOR=pytorchmergebot 2025-12-04T09:31:16.0811299Z GITHUB_REF_TYPE=branch 2025-12-04T09:31:16.0811425Z BASE_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:31:16.0811730Z HUGGING_FACE_HUB_TOKEN=*** 2025-12-04T09:31:16.0815492Z *** 2025-12-04T09:31:16.0815595Z GITHUB_REPOSITORY_ID=65600975 2025-12-04T09:31:16.0815719Z GITHUB_ACTIONS=true 2025-12-04T09:31:16.0815842Z SHA1=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:31:16.0815998Z GITHUB_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:31:16.0816217Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/trunk-rocm-mi300.yml@refs/heads/main 2025-12-04T09:31:16.0816415Z UCC_HOME=/usr 2025-12-04T09:31:16.0816522Z RUNNER_ENVIRONMENT=self-hosted 2025-12-04T09:31:16.0816645Z VERBOSE_TEST_LOGS=False 2025-12-04T09:31:16.0816759Z GITHUB_REF=refs/heads/main 2025-12-04T09:31:16.0816871Z RUNNER_OS=Linux 2025-12-04T09:31:16.0816973Z SHARD_NUMBER=4 2025-12-04T09:31:16.0817075Z GITHUB_REF_PROTECTED=true 2025-12-04T09:31:16.0817194Z RUNNER_MANUALLY_TRAP_SIG=1 2025-12-04T09:31:16.0817308Z HOME=/var/lib/jenkins 2025-12-04T09:31:16.0817436Z GITHUB_API_URL=https://api.github.com 2025-12-04T09:31:16.0817673Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2025-12-04T09:31:16.0817814Z RUNNER_DOCS_DIR=/home/runner/_work/_temp/docs 2025-12-04T09:31:16.0817949Z LANG=C.UTF-8 2025-12-04T09:31:16.0818092Z UCX_COMMIT=29831d319e6be55cb8c768ca61de335c934ca39e 2025-12-04T09:31:16.0818235Z PYTORCH_TEST_WITH_ROCM=1 2025-12-04T09:31:16.0818380Z RUNNER_TRACKING_ID=github_91304fcc-5ca8-4114-964f-2b85ba81bbf4 2025-12-04T09:31:16.0818530Z RUNNER_ARCH=X64 2025-12-04T09:31:16.0818636Z RUNNER_TEMP=/home/runner/_work/_temp 2025-12-04T09:31:16.0818759Z NUM_TEST_SHARDS=6 2025-12-04T09:31:16.0818853Z UCX_HOME=/usr 2025-12-04T09:31:16.0819041Z GITHUB_STATE=/home/runner/_work/_temp/_runner_file_commands/save_state_bf6e8a29-af74-47e5-806f-526de4d01987 2025-12-04T09:31:16.0819347Z JOB_NAME=linux-jammy-rocm-py3.10 / test (default, 4, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:31:16.0819559Z MAGMA_HOME=/opt/rocm/magma 2025-12-04T09:31:16.0819760Z GITHUB_ENV=/home/runner/_work/_temp/_runner_file_commands/set_env_bf6e8a29-af74-47e5-806f-526de4d01987 2025-12-04T09:31:16.0820005Z GITHUB_EVENT_PATH=/home/runner/_work/_temp/_github_workflow/event.json 2025-12-04T09:31:16.0820171Z GITHUB_EVENT_NAME=schedule 2025-12-04T09:31:16.0820333Z GITHUB_ACTIONS_RUNNER_EXTRA_USER_AGENT=actions-runner-controller/0.12.1 2025-12-04T09:31:16.0820498Z DASHBOARD_TAG= 2025-12-04T09:31:16.0820599Z GITHUB_RUN_ID=19922849170 2025-12-04T09:31:16.0820811Z GITHUB_STEP_SUMMARY=/home/runner/_work/_temp/_runner_file_commands/step_summary_bf6e8a29-af74-47e5-806f-526de4d01987 2025-12-04T09:31:16.0821039Z GITHUB_ACTOR=pytorchmergebot 2025-12-04T09:31:16.0821155Z PR_NUMBER= 2025-12-04T09:31:16.0821253Z GITHUB_RUN_ATTEMPT=1 2025-12-04T09:31:16.0821364Z ANACONDA_PYTHON_VERSION=3.10 2025-12-04T09:31:16.0821551Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-12-04T09:31:16.0821689Z TERM=vt100 2025-12-04T09:31:16.0821785Z INSTALLED_VISION=yes 2025-12-04T09:31:16.0821888Z BRANCH=main 2025-12-04T09:31:16.0821989Z OPENSSL_ROOT_DIR=/opt/openssl 2025-12-04T09:31:16.0822112Z TESTS_TO_INCLUDE= 2025-12-04T09:31:16.0822279Z GITHUB_ACTION_PATH=/home/runner/_work/pytorch/pytorch/./.github/actions/setup-rocm 2025-12-04T09:31:16.0822471Z GITHUB_SERVER_URL=https://github.com 2025-12-04T09:31:16.0822613Z PYTORCH_ROCM_ARCH=gfx90a;gfx942;gfx950;gfx1100 2025-12-04T09:31:16.0822767Z UCC_COMMIT=9f4b242cbbd8b1462cbc732eb29316cdfa124b77 2025-12-04T09:31:16.0822920Z REENABLED_ISSUES= 2025-12-04T09:31:16.0823019Z SHLVL=1 2025-12-04T09:31:16.0823112Z MAX_JOBS=126 2025-12-04T09:31:16.0823247Z RUNNER_TEST_RESULTS_DIR=/home/runner/_work/_temp/test-results 2025-12-04T09:31:16.0823403Z GITHUB_ACTOR_ID=97764156 2025-12-04T09:31:16.0823524Z RUNNER_TOOL_CACHE=/home/runner/_work/_tool 2025-12-04T09:31:16.0823684Z GITHUB_WORKFLOW_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:31:16.0823842Z GITHUB_REF_NAME=main 2025-12-04T09:31:16.0823947Z ROCM_PATH=/opt/rocm 2025-12-04T09:31:16.0824050Z GITHUB_JOB=test 2025-12-04T09:31:16.0824146Z NO_TEST_TIMEOUT=False 2025-12-04T09:31:16.0824256Z GITHUB_REPOSITORY=pytorch/pytorch 2025-12-04T09:31:16.0824371Z LC_ALL=C.UTF-8 2025-12-04T09:31:16.0824470Z GITHUB_RETENTION_DAYS=90 2025-12-04T09:31:16.0824593Z RUNNER_WORKSPACE=/home/runner/_work/pytorch 2025-12-04T09:31:16.0824724Z OPENSSL_DIR=/opt/openssl 2025-12-04T09:31:16.0824836Z GITHUB_ACTION_REPOSITORY= 2025-12-04T09:31:16.0825197Z PATH=/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:31:16.0825547Z GITHUB_BASE_REF= 2025-12-04T09:31:16.0825646Z CI=true 2025-12-04T09:31:16.0825745Z GITHUB_REPOSITORY_OWNER=pytorch 2025-12-04T09:31:16.0825861Z JOB_ID=57116213162 2025-12-04T09:31:16.0825962Z GITHUB_HEAD_REF= 2025-12-04T09:31:16.0826060Z GITHUB_ACTION_REF= 2025-12-04T09:31:16.0826165Z TEST_SHOWLOCALS=False 2025-12-04T09:31:16.0826280Z GITHUB_WORKFLOW=trunk-rocm-mi300 2025-12-04T09:31:16.0826405Z DEBIAN_FRONTEND=noninteractive 2025-12-04T09:31:16.0826647Z GITHUB_OUTPUT=/home/runner/_work/_temp/_runner_file_commands/set_output_bf6e8a29-af74-47e5-806f-526de4d01987 2025-12-04T09:31:16.0826850Z NO_TD=False 2025-12-04T09:31:16.0826940Z OLDPWD=/var/lib/jenkins 2025-12-04T09:31:16.0827040Z _=/usr/bin/env 2025-12-04T09:31:16.0827174Z ++ python -c 'import site; print(site.getsitepackages()[0])' 2025-12-04T09:31:16.0874799Z + TORCH_INSTALL_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch 2025-12-04T09:31:16.0875039Z + TORCH_BIN_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2025-12-04T09:31:16.0875256Z + TORCH_LIB_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib 2025-12-04T09:31:16.0875474Z + TORCH_TEST_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/test 2025-12-04T09:31:16.0875648Z + BUILD_DIR=build 2025-12-04T09:31:16.0875752Z + BUILD_RENAMED_DIR=build_renamed 2025-12-04T09:31:16.0875875Z + BUILD_BIN_DIR=build/bin 2025-12-04T09:31:16.0875983Z + SHARD_NUMBER=4 2025-12-04T09:31:16.0876090Z + NUM_TEST_SHARDS=6 2025-12-04T09:31:16.0876199Z + export TORCH_SERIALIZATION_DEBUG=1 2025-12-04T09:31:16.0876328Z + TORCH_SERIALIZATION_DEBUG=1 2025-12-04T09:31:16.0876441Z + export VALGRIND=ON 2025-12-04T09:31:16.0876546Z + VALGRIND=ON 2025-12-04T09:31:16.0876666Z + [[ linux-jammy-rocm-py3.10 == *clang9* ]] 2025-12-04T09:31:16.0876805Z + [[ linux-jammy-rocm-py3.10 == *xpu* ]] 2025-12-04T09:31:16.0876931Z + detect_cuda_arch 2025-12-04T09:31:16.0877041Z + [[ linux-jammy-rocm-py3.10 == *cuda* ]] 2025-12-04T09:31:16.0877179Z + [[ linux-jammy-rocm-py3.10 == *s390x* ]] 2025-12-04T09:31:16.0877304Z + [[ 0 == \1 ]] 2025-12-04T09:31:16.0877401Z + [[ True == \1 ]] 2025-12-04T09:31:16.0877512Z + [[ linux-jammy-rocm-py3.10 != *bazel* ]] 2025-12-04T09:31:16.0879974Z ++ realpath build/custom_test_artifacts 2025-12-04T09:31:16.0893621Z + CUSTOM_TEST_ARTIFACT_BUILD_DIR=/var/lib/jenkins/pytorch/build/custom_test_artifacts 2025-12-04T09:31:16.0894238Z + [[ -n '' ]] 2025-12-04T09:31:16.0894573Z + echo 'Environment variables' 2025-12-04T09:31:16.0894966Z Environment variables 2025-12-04T09:31:16.0895280Z + env 2025-12-04T09:31:16.0901881Z GITHUB_WORKSPACE=/home/runner/_work/pytorch/pytorch 2025-12-04T09:31:16.0902352Z CONTINUE_THROUGH_ERROR=True 2025-12-04T09:31:16.0902848Z BUILD_ENVIRONMENT=linux-jammy-rocm-py3.10 2025-12-04T09:31:16.0903371Z HOSTNAME=linux.rocm.gpu.gfx942.1.b-gwk9b-runner-ppkbp 2025-12-04T09:31:16.0904152Z GITHUB_PATH=/home/runner/_work/_temp/_runner_file_commands/add_path_bf6e8a29-af74-47e5-806f-526de4d01987 2025-12-04T09:31:16.0904824Z GITHUB_ACTION=__run_2 2025-12-04T09:31:16.0905177Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 2025-12-04T09:31:16.0905569Z GITHUB_RUN_NUMBER=689 2025-12-04T09:31:16.0905892Z TEST_CONFIG=default 2025-12-04T09:31:16.0906327Z RUNNER_NAME=linux.rocm.gpu.gfx942.1.b-gwk9b-runner-ppkbp 2025-12-04T09:31:16.0906826Z GITHUB_REPOSITORY_OWNER_ID=21003710 2025-12-04T09:31:16.0907230Z AWS_DEFAULT_REGION=us-east-1 2025-12-04T09:31:16.0907672Z RUNNER_ARTIFACT_DIR=/home/runner/_work/_temp/artifacts 2025-12-04T09:31:16.0908212Z GITHUB_TRIGGERING_ACTOR=pytorchmergebot 2025-12-04T09:31:16.0908611Z GITHUB_REF_TYPE=branch 2025-12-04T09:31:16.0909004Z BASE_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:31:16.0909623Z HUGGING_FACE_HUB_TOKEN=*** 2025-12-04T09:31:16.0910038Z *** 2025-12-04T09:31:16.0910333Z GITHUB_REPOSITORY_ID=65600975 2025-12-04T09:31:16.0910689Z GITHUB_ACTIONS=true 2025-12-04T09:31:16.0911056Z SHA1=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:31:16.0911534Z GITHUB_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:31:16.0912229Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/trunk-rocm-mi300.yml@refs/heads/main 2025-12-04T09:31:16.0912846Z UCC_HOME=/usr 2025-12-04T09:31:16.0913163Z TORCH_SERIALIZATION_DEBUG=1 2025-12-04T09:31:16.0913536Z RUNNER_ENVIRONMENT=self-hosted 2025-12-04T09:31:16.0913901Z VERBOSE_TEST_LOGS=False 2025-12-04T09:31:16.0914245Z GITHUB_REF=refs/heads/main 2025-12-04T09:31:16.0914582Z RUNNER_OS=Linux 2025-12-04T09:31:16.0914878Z SHARD_NUMBER=4 2025-12-04T09:31:16.0915509Z GITHUB_REF_PROTECTED=true 2025-12-04T09:31:16.0915863Z RUNNER_MANUALLY_TRAP_SIG=1 2025-12-04T09:31:16.0916201Z HOME=/var/lib/jenkins 2025-12-04T09:31:16.0916576Z GITHUB_API_URL=https://api.github.com 2025-12-04T09:31:16.0916996Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2025-12-04T09:31:16.0917423Z RUNNER_DOCS_DIR=/home/runner/_work/_temp/docs 2025-12-04T09:31:16.0917827Z LANG=C.UTF-8 2025-12-04T09:31:16.0918239Z UCX_COMMIT=29831d319e6be55cb8c768ca61de335c934ca39e 2025-12-04T09:31:16.0918684Z PYTORCH_TEST_WITH_ROCM=1 2025-12-04T09:31:16.0919145Z RUNNER_TRACKING_ID=github_91304fcc-5ca8-4114-964f-2b85ba81bbf4 2025-12-04T09:31:16.0919613Z RUNNER_ARCH=X64 2025-12-04T09:31:16.0919935Z RUNNER_TEMP=/home/runner/_work/_temp 2025-12-04T09:31:16.0920318Z NUM_TEST_SHARDS=6 2025-12-04T09:31:16.0920616Z UCX_HOME=/usr 2025-12-04T09:31:16.0921233Z GITHUB_STATE=/home/runner/_work/_temp/_runner_file_commands/save_state_bf6e8a29-af74-47e5-806f-526de4d01987 2025-12-04T09:31:16.0922234Z JOB_NAME=linux-jammy-rocm-py3.10 / test (default, 4, 6, linux.rocm.gpu.gfx942.1.b, mem_leak_check, unstable) 2025-12-04T09:31:16.0922917Z MAGMA_HOME=/opt/rocm/magma 2025-12-04T09:31:16.0923532Z GITHUB_ENV=/home/runner/_work/_temp/_runner_file_commands/set_env_bf6e8a29-af74-47e5-806f-526de4d01987 2025-12-04T09:31:16.0924326Z GITHUB_EVENT_PATH=/home/runner/_work/_temp/_github_workflow/event.json 2025-12-04T09:31:16.0924848Z GITHUB_EVENT_NAME=schedule 2025-12-04T09:31:16.0925355Z GITHUB_ACTIONS_RUNNER_EXTRA_USER_AGENT=actions-runner-controller/0.12.1 2025-12-04T09:31:16.0925877Z DASHBOARD_TAG= 2025-12-04T09:31:16.0926180Z GITHUB_RUN_ID=19922849170 2025-12-04T09:31:16.0926854Z GITHUB_STEP_SUMMARY=/home/runner/_work/_temp/_runner_file_commands/step_summary_bf6e8a29-af74-47e5-806f-526de4d01987 2025-12-04T09:31:16.0927703Z GITHUB_ACTOR=pytorchmergebot 2025-12-04T09:31:16.0928061Z PR_NUMBER= 2025-12-04T09:31:16.0928414Z GITHUB_RUN_ATTEMPT=1 2025-12-04T09:31:16.0928732Z VALGRIND=ON 2025-12-04T09:31:16.0929039Z ANACONDA_PYTHON_VERSION=3.10 2025-12-04T09:31:16.0929488Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2025-12-04T09:31:16.0929920Z TERM=vt100 2025-12-04T09:31:16.0930209Z INSTALLED_VISION=yes 2025-12-04T09:31:16.0930524Z BRANCH=main 2025-12-04T09:31:16.0930832Z OPENSSL_ROOT_DIR=/opt/openssl 2025-12-04T09:31:16.0931189Z TESTS_TO_INCLUDE= 2025-12-04T09:31:16.0931707Z GITHUB_ACTION_PATH=/home/runner/_work/pytorch/pytorch/./.github/actions/setup-rocm 2025-12-04T09:31:16.0932325Z GITHUB_SERVER_URL=https://github.com 2025-12-04T09:31:16.0932776Z PYTORCH_ROCM_ARCH=gfx90a;gfx942;gfx950;gfx1100 2025-12-04T09:31:16.0933265Z UCC_COMMIT=9f4b242cbbd8b1462cbc732eb29316cdfa124b77 2025-12-04T09:31:16.0933694Z REENABLED_ISSUES= 2025-12-04T09:31:16.0933991Z SHLVL=1 2025-12-04T09:31:16.0934272Z MAX_JOBS=126 2025-12-04T09:31:16.0934685Z RUNNER_TEST_RESULTS_DIR=/home/runner/_work/_temp/test-results 2025-12-04T09:31:16.0935176Z GITHUB_ACTOR_ID=97764156 2025-12-04T09:31:16.0935548Z RUNNER_TOOL_CACHE=/home/runner/_work/_tool 2025-12-04T09:31:16.0936067Z GITHUB_WORKFLOW_SHA=ffd9b0fb4355e97af82fc42cf185c3ffa0fc0a32 2025-12-04T09:31:16.0936549Z GITHUB_REF_NAME=main 2025-12-04T09:31:16.0936865Z ROCM_PATH=/opt/rocm 2025-12-04T09:31:16.0937170Z GITHUB_JOB=test 2025-12-04T09:31:16.0937472Z NO_TEST_TIMEOUT=False 2025-12-04T09:31:16.0937825Z GITHUB_REPOSITORY=pytorch/pytorch 2025-12-04T09:31:16.0938312Z LC_ALL=C.UTF-8 2025-12-04T09:31:16.0938619Z GITHUB_RETENTION_DAYS=90 2025-12-04T09:31:16.0939000Z RUNNER_WORKSPACE=/home/runner/_work/pytorch 2025-12-04T09:31:16.0939413Z OPENSSL_DIR=/opt/openssl 2025-12-04T09:31:16.0939762Z GITHUB_ACTION_REPOSITORY= 2025-12-04T09:31:16.0940921Z PATH=/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:31:16.0942067Z GITHUB_BASE_REF= 2025-12-04T09:31:16.0942362Z CI=true 2025-12-04T09:31:16.0942665Z GITHUB_REPOSITORY_OWNER=pytorch 2025-12-04T09:31:16.0943142Z JOB_ID=57116213162 2025-12-04T09:31:16.0943444Z GITHUB_HEAD_REF= 2025-12-04T09:31:16.0943744Z GITHUB_ACTION_REF= 2025-12-04T09:31:16.0944053Z TEST_SHOWLOCALS=False 2025-12-04T09:31:16.0944409Z GITHUB_WORKFLOW=trunk-rocm-mi300 2025-12-04T09:31:16.0944801Z DEBIAN_FRONTEND=noninteractive 2025-12-04T09:31:16.0945470Z GITHUB_OUTPUT=/home/runner/_work/_temp/_runner_file_commands/set_output_bf6e8a29-af74-47e5-806f-526de4d01987 2025-12-04T09:31:16.0946136Z NO_TD=False 2025-12-04T09:31:16.0946431Z OLDPWD=/var/lib/jenkins 2025-12-04T09:31:16.0946749Z _=/usr/bin/env 2025-12-04T09:31:16.0947055Z + echo 'Testing pytorch' 2025-12-04T09:31:16.0947389Z Testing pytorch 2025-12-04T09:31:16.0947702Z + export LANG=C.UTF-8 2025-12-04T09:31:16.0948015Z + LANG=C.UTF-8 2025-12-04T09:31:16.0948372Z + PR_NUMBER= 2025-12-04T09:31:16.0948685Z + [[ default == \d\e\f\a\u\l\t ]] 2025-12-04T09:31:16.0949068Z + export CUDA_VISIBLE_DEVICES=0 2025-12-04T09:31:16.0949433Z + CUDA_VISIBLE_DEVICES=0 2025-12-04T09:31:16.0949785Z + export HIP_VISIBLE_DEVICES=0 2025-12-04T09:31:16.0950158Z + HIP_VISIBLE_DEVICES=0 2025-12-04T09:31:16.0950540Z + [[ default == \d\i\s\t\r\i\b\u\t\e\d ]] 2025-12-04T09:31:16.0950932Z + [[ default == \s\l\o\w ]] 2025-12-04T09:31:16.0951364Z + [[ linux-jammy-rocm-py3.10 == *slow-gradcheck* ]] 2025-12-04T09:31:16.0951836Z + [[ linux-jammy-rocm-py3.10 == *cuda* ]] 2025-12-04T09:31:16.0952260Z + [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:31:16.0952703Z + export PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2025-12-04T09:31:16.0953145Z + PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2025-12-04T09:31:16.0953546Z + [[ default == *crossref* ]] 2025-12-04T09:31:16.0953930Z + [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:31:16.0954325Z + export VALGRIND=OFF 2025-12-04T09:31:16.0954817Z + VALGRIND=OFF 2025-12-04T09:31:16.0955107Z + rocminfo 2025-12-04T09:31:16.1024871Z ROCk module version 6.12.12 is loaded 2025-12-04T09:31:16.1346496Z ===================== 2025-12-04T09:31:16.1346958Z HSA System Attributes 2025-12-04T09:31:16.1347329Z ===================== 2025-12-04T09:31:16.1347687Z Runtime Version: 1.18 2025-12-04T09:31:16.1348059Z Runtime Ext Version: 1.14 2025-12-04T09:31:16.1348519Z System Timestamp Freq.: 1000.000000MHz 2025-12-04T09:31:16.1349236Z Sig. Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count) 2025-12-04T09:31:16.1349927Z Machine Model: LARGE 2025-12-04T09:31:16.1350572Z System Endianness: LITTLE 2025-12-04T09:31:16.1351056Z Mwaitx: DISABLED 2025-12-04T09:31:16.1351449Z XNACK enabled: NO 2025-12-04T09:31:16.1351819Z DMAbuf Support: YES 2025-12-04T09:31:16.1352177Z VMM Support: YES 2025-12-04T09:31:16.1352421Z 2025-12-04T09:31:16.1352547Z ========== 2025-12-04T09:31:16.1352890Z HSA Agents 2025-12-04T09:31:16.1353217Z ========== 2025-12-04T09:31:16.1353622Z ******* 2025-12-04T09:31:16.1353952Z Agent 1 2025-12-04T09:31:16.1354270Z ******* 2025-12-04T09:31:16.1354674Z Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T09:31:16.1355172Z Uuid: CPU-XX 2025-12-04T09:31:16.1355694Z Marketing Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T09:31:16.1356224Z Vendor Name: CPU 2025-12-04T09:31:16.1356732Z Feature: None specified 2025-12-04T09:31:16.1357238Z Profile: FULL_PROFILE 2025-12-04T09:31:16.1357749Z Float Round Mode: NEAR 2025-12-04T09:31:16.1358361Z Max Queue Number: 0(0x0) 2025-12-04T09:31:16.1358870Z Queue Min Size: 0(0x0) 2025-12-04T09:31:16.1359360Z Queue Max Size: 0(0x0) 2025-12-04T09:31:16.1360092Z Queue Type: MULTI 2025-12-04T09:31:16.1360568Z Node: 0 2025-12-04T09:31:16.1361044Z Device Type: CPU 2025-12-04T09:31:16.1361487Z Cache Info: 2025-12-04T09:31:16.1361887Z L1: 49152(0xc000) KB 2025-12-04T09:31:16.1362354Z Chip ID: 0(0x0) 2025-12-04T09:31:16.1362847Z ASIC Revision: 0(0x0) 2025-12-04T09:31:16.1363366Z Cacheline Size: 64(0x40) 2025-12-04T09:31:16.1363883Z Max Clock Freq. (MHz): 3300 2025-12-04T09:31:16.1364374Z BDFID: 0 2025-12-04T09:31:16.1364864Z Internal Node ID: 0 2025-12-04T09:31:16.1365376Z Compute Unit: 64 2025-12-04T09:31:16.1365871Z SIMDs per CU: 0 2025-12-04T09:31:16.1366371Z Shader Engines: 0 2025-12-04T09:31:16.1366892Z Shader Arrs. per Eng.: 0 2025-12-04T09:31:16.1367423Z WatchPts on Addr. Ranges:1 2025-12-04T09:31:16.1367892Z Memory Properties: 2025-12-04T09:31:16.1368351Z Features: None 2025-12-04T09:31:16.1368712Z Pool Info: 2025-12-04T09:31:16.1369058Z Pool 1 2025-12-04T09:31:16.1369485Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T09:31:16.1370106Z Size: 1584733356(0x5e751cac) KB 2025-12-04T09:31:16.1370608Z Allocatable: TRUE 2025-12-04T09:31:16.1371131Z Alloc Granule: 4KB 2025-12-04T09:31:16.1371686Z Alloc Recommended Granule:4KB 2025-12-04T09:31:16.1372240Z Alloc Alignment: 4KB 2025-12-04T09:31:16.1372778Z Accessible by all: TRUE 2025-12-04T09:31:16.1373236Z Pool 2 2025-12-04T09:31:16.1373675Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T09:31:16.1374180Z Size: 1584733356(0x5e751cac) KB 2025-12-04T09:31:16.1374672Z Allocatable: TRUE 2025-12-04T09:31:16.1375187Z Alloc Granule: 4KB 2025-12-04T09:31:16.1375735Z Alloc Recommended Granule:4KB 2025-12-04T09:31:16.1376282Z Alloc Alignment: 4KB 2025-12-04T09:31:16.1376816Z Accessible by all: TRUE 2025-12-04T09:31:16.1377286Z Pool 3 2025-12-04T09:31:16.1377722Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2025-12-04T09:31:16.1378276Z Size: 1584733356(0x5e751cac) KB 2025-12-04T09:31:16.1378771Z Allocatable: TRUE 2025-12-04T09:31:16.1379286Z Alloc Granule: 4KB 2025-12-04T09:31:16.1379821Z Alloc Recommended Granule:4KB 2025-12-04T09:31:16.1380371Z Alloc Alignment: 4KB 2025-12-04T09:31:16.1380896Z Accessible by all: TRUE 2025-12-04T09:31:16.1381359Z Pool 4 2025-12-04T09:31:16.1381790Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T09:31:16.1382281Z Size: 1584733356(0x5e751cac) KB 2025-12-04T09:31:16.1382896Z Allocatable: TRUE 2025-12-04T09:31:16.1383417Z Alloc Granule: 4KB 2025-12-04T09:31:16.1383960Z Alloc Recommended Granule:4KB 2025-12-04T09:31:16.1384499Z Alloc Alignment: 4KB 2025-12-04T09:31:16.1385029Z Accessible by all: TRUE 2025-12-04T09:31:16.1385488Z ISA Info: 2025-12-04T09:31:16.1385829Z ******* 2025-12-04T09:31:16.1386164Z Agent 2 2025-12-04T09:31:16.1386485Z ******* 2025-12-04T09:31:16.1386880Z Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T09:31:16.1387373Z Uuid: CPU-XX 2025-12-04T09:31:16.1387891Z Marketing Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T09:31:16.1388475Z Vendor Name: CPU 2025-12-04T09:31:16.1388991Z Feature: None specified 2025-12-04T09:31:16.1389496Z Profile: FULL_PROFILE 2025-12-04T09:31:16.1390007Z Float Round Mode: NEAR 2025-12-04T09:31:16.1390527Z Max Queue Number: 0(0x0) 2025-12-04T09:31:16.1391035Z Queue Min Size: 0(0x0) 2025-12-04T09:31:16.1391539Z Queue Max Size: 0(0x0) 2025-12-04T09:31:16.1392041Z Queue Type: MULTI 2025-12-04T09:31:16.1392515Z Node: 1 2025-12-04T09:31:16.1393082Z Device Type: CPU 2025-12-04T09:31:16.1393533Z Cache Info: 2025-12-04T09:31:16.1393921Z L1: 49152(0xc000) KB 2025-12-04T09:31:16.1394389Z Chip ID: 0(0x0) 2025-12-04T09:31:16.1394876Z ASIC Revision: 0(0x0) 2025-12-04T09:31:16.1395386Z Cacheline Size: 64(0x40) 2025-12-04T09:31:16.1395901Z Max Clock Freq. (MHz): 3300 2025-12-04T09:31:16.1396391Z BDFID: 0 2025-12-04T09:31:16.1396879Z Internal Node ID: 1 2025-12-04T09:31:16.1397387Z Compute Unit: 64 2025-12-04T09:31:16.1397887Z SIMDs per CU: 0 2025-12-04T09:31:16.1398498Z Shader Engines: 0 2025-12-04T09:31:16.1399024Z Shader Arrs. per Eng.: 0 2025-12-04T09:31:16.1399555Z WatchPts on Addr. Ranges:1 2025-12-04T09:31:16.1400022Z Memory Properties: 2025-12-04T09:31:16.1400393Z Features: None 2025-12-04T09:31:16.1400754Z Pool Info: 2025-12-04T09:31:16.1401101Z Pool 1 2025-12-04T09:31:16.1401531Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T09:31:16.1402038Z Size: 1585355616(0x5e7e9b60) KB 2025-12-04T09:31:16.1402541Z Allocatable: TRUE 2025-12-04T09:31:16.1403057Z Alloc Granule: 4KB 2025-12-04T09:31:16.1403602Z Alloc Recommended Granule:4KB 2025-12-04T09:31:16.1404142Z Alloc Alignment: 4KB 2025-12-04T09:31:16.1404689Z Accessible by all: TRUE 2025-12-04T09:31:16.1405148Z Pool 2 2025-12-04T09:31:16.1405583Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T09:31:16.1406182Z Size: 1585355616(0x5e7e9b60) KB 2025-12-04T09:31:16.1406675Z Allocatable: TRUE 2025-12-04T09:31:16.1407188Z Alloc Granule: 4KB 2025-12-04T09:31:16.1407725Z Alloc Recommended Granule:4KB 2025-12-04T09:31:16.1408346Z Alloc Alignment: 4KB 2025-12-04T09:31:16.1408878Z Accessible by all: TRUE 2025-12-04T09:31:16.1409340Z Pool 3 2025-12-04T09:31:16.1409777Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2025-12-04T09:31:16.1410280Z Size: 1585355616(0x5e7e9b60) KB 2025-12-04T09:31:16.1410767Z Allocatable: TRUE 2025-12-04T09:31:16.1411280Z Alloc Granule: 4KB 2025-12-04T09:31:16.1411822Z Alloc Recommended Granule:4KB 2025-12-04T09:31:16.1412367Z Alloc Alignment: 4KB 2025-12-04T09:31:16.1412894Z Accessible by all: TRUE 2025-12-04T09:31:16.1413348Z Pool 4 2025-12-04T09:31:16.1413767Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T09:31:16.1414269Z Size: 1585355616(0x5e7e9b60) KB 2025-12-04T09:31:16.1414753Z Allocatable: TRUE 2025-12-04T09:31:16.1415269Z Alloc Granule: 4KB 2025-12-04T09:31:16.1415896Z Alloc Recommended Granule:4KB 2025-12-04T09:31:16.1416433Z Alloc Alignment: 4KB 2025-12-04T09:31:16.1416962Z Accessible by all: TRUE 2025-12-04T09:31:16.1417426Z ISA Info: 2025-12-04T09:31:16.1417765Z ******* 2025-12-04T09:31:16.1418167Z Agent 3 2025-12-04T09:31:16.1418491Z ******* 2025-12-04T09:31:16.1418872Z Name: gfx942 2025-12-04T09:31:16.1419373Z Uuid: GPU-2beed1e7736038cc 2025-12-04T09:31:16.1419881Z Marketing Name: 2025-12-04T09:31:16.1437375Z Vendor Name: AMD 2025-12-04T09:31:16.1437920Z Feature: KERNEL_DISPATCH 2025-12-04T09:31:16.1438517Z Profile: BASE_PROFILE 2025-12-04T09:31:16.1439051Z Float Round Mode: NEAR 2025-12-04T09:31:16.1439602Z Max Queue Number: 128(0x80) 2025-12-04T09:31:16.1440115Z Queue Min Size: 64(0x40) 2025-12-04T09:31:16.1440623Z Queue Max Size: 131072(0x20000) 2025-12-04T09:31:16.1441125Z Queue Type: MULTI 2025-12-04T09:31:16.1441607Z Node: 2 2025-12-04T09:31:16.1442086Z Device Type: GPU 2025-12-04T09:31:16.1442544Z Cache Info: 2025-12-04T09:31:16.1442947Z L1: 32(0x20) KB 2025-12-04T09:31:16.1443405Z L2: 4096(0x1000) KB 2025-12-04T09:31:16.1443836Z L3: 262144(0x40000) KB 2025-12-04T09:31:16.1444290Z Chip ID: 29861(0x74a5) 2025-12-04T09:31:16.1444793Z ASIC Revision: 1(0x1) 2025-12-04T09:31:16.1445314Z Cacheline Size: 128(0x80) 2025-12-04T09:31:16.1446027Z Max Clock Freq. (MHz): 2100 2025-12-04T09:31:16.1446526Z BDFID: 5376 2025-12-04T09:31:16.1447027Z Internal Node ID: 2 2025-12-04T09:31:16.1447542Z Compute Unit: 304 2025-12-04T09:31:16.1448042Z SIMDs per CU: 4 2025-12-04T09:31:16.1448623Z Shader Engines: 32 2025-12-04T09:31:16.1449146Z Shader Arrs. per Eng.: 1 2025-12-04T09:31:16.1449693Z WatchPts on Addr. Ranges:4 2025-12-04T09:31:16.1450236Z Coherent Host Access: FALSE 2025-12-04T09:31:16.1450725Z Memory Properties: 2025-12-04T09:31:16.1451122Z Features: KERNEL_DISPATCH 2025-12-04T09:31:16.1451607Z Fast F16 Operation: TRUE 2025-12-04T09:31:16.1452143Z Wavefront Size: 64(0x40) 2025-12-04T09:31:16.1452677Z Workgroup Max Size: 1024(0x400) 2025-12-04T09:31:16.1453163Z Workgroup Max Size per Dimension: 2025-12-04T09:31:16.1453597Z x 1024(0x400) 2025-12-04T09:31:16.1454040Z y 1024(0x400) 2025-12-04T09:31:16.1454467Z z 1024(0x400) 2025-12-04T09:31:16.1454935Z Max Waves Per CU: 32(0x20) 2025-12-04T09:31:16.1455462Z Max Work-item Per CU: 2048(0x800) 2025-12-04T09:31:16.1455987Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T09:31:16.1456557Z Grid Max Size per Dimension: 2025-12-04T09:31:16.1456954Z x 2147483647(0x7fffffff) 2025-12-04T09:31:16.1457397Z y 65535(0xffff) 2025-12-04T09:31:16.1457830Z z 65535(0xffff) 2025-12-04T09:31:16.1458396Z Max fbarriers/Workgrp: 32 2025-12-04T09:31:16.1459017Z Packet Processor uCode:: 185 2025-12-04T09:31:16.1459576Z SDMA engine uCode:: 24 2025-12-04T09:31:16.1460104Z IOMMU Support:: None 2025-12-04T09:31:16.1460561Z Pool Info: 2025-12-04T09:31:16.1460919Z Pool 1 2025-12-04T09:31:16.1461369Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2025-12-04T09:31:16.1461885Z Size: 268419072(0xfffc000) KB 2025-12-04T09:31:16.1462396Z Allocatable: TRUE 2025-12-04T09:31:16.1462919Z Alloc Granule: 4KB 2025-12-04T09:31:16.1463475Z Alloc Recommended Granule:2048KB 2025-12-04T09:31:16.1464031Z Alloc Alignment: 4KB 2025-12-04T09:31:16.1464567Z Accessible by all: FALSE 2025-12-04T09:31:16.1465031Z Pool 2 2025-12-04T09:31:16.1465481Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2025-12-04T09:31:16.1465993Z Size: 268419072(0xfffc000) KB 2025-12-04T09:31:16.1466489Z Allocatable: TRUE 2025-12-04T09:31:16.1467015Z Alloc Granule: 4KB 2025-12-04T09:31:16.1467559Z Alloc Recommended Granule:2048KB 2025-12-04T09:31:16.1468159Z Alloc Alignment: 4KB 2025-12-04T09:31:16.1468696Z Accessible by all: FALSE 2025-12-04T09:31:16.1469238Z Pool 3 2025-12-04T09:31:16.1469778Z Segment: GLOBAL; FLAGS: FINE GRAINED 2025-12-04T09:31:16.1470376Z Size: 268419072(0xfffc000) KB 2025-12-04T09:31:16.1470868Z Allocatable: TRUE 2025-12-04T09:31:16.1471391Z Alloc Granule: 4KB 2025-12-04T09:31:16.1471938Z Alloc Recommended Granule:2048KB 2025-12-04T09:31:16.1472485Z Alloc Alignment: 4KB 2025-12-04T09:31:16.1473016Z Accessible by all: FALSE 2025-12-04T09:31:16.1473482Z Pool 4 2025-12-04T09:31:16.1473910Z Segment: GROUP 2025-12-04T09:31:16.1474383Z Size: 64(0x40) KB 2025-12-04T09:31:16.1474884Z Allocatable: FALSE 2025-12-04T09:31:16.1475405Z Alloc Granule: 0KB 2025-12-04T09:31:16.1475958Z Alloc Recommended Granule:0KB 2025-12-04T09:31:16.1476506Z Alloc Alignment: 0KB 2025-12-04T09:31:16.1477037Z Accessible by all: FALSE 2025-12-04T09:31:16.1477501Z ISA Info: 2025-12-04T09:31:16.1477859Z ISA 1 2025-12-04T09:31:16.1478353Z Name: amdgcn-amd-amdhsa--gfx942:sramecc+:xnack- 2025-12-04T09:31:16.1478910Z Machine Models: HSA_MACHINE_MODEL_LARGE 2025-12-04T09:31:16.1479458Z Profiles: HSA_PROFILE_BASE 2025-12-04T09:31:16.1480089Z Default Rounding Mode: NEAR 2025-12-04T09:31:16.1480643Z Default Rounding Mode: NEAR 2025-12-04T09:31:16.1481170Z Fast f16: TRUE 2025-12-04T09:31:16.1481686Z Workgroup Max Size: 1024(0x400) 2025-12-04T09:31:16.1482168Z Workgroup Max Size per Dimension: 2025-12-04T09:31:16.1482618Z x 1024(0x400) 2025-12-04T09:31:16.1483073Z y 1024(0x400) 2025-12-04T09:31:16.1483501Z z 1024(0x400) 2025-12-04T09:31:16.1483987Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T09:31:16.1484464Z Grid Max Size per Dimension: 2025-12-04T09:31:16.1484876Z x 2147483647(0x7fffffff) 2025-12-04T09:31:16.1485331Z y 65535(0xffff) 2025-12-04T09:31:16.1485768Z z 65535(0xffff) 2025-12-04T09:31:16.1486260Z FBarrier Max Size: 32 2025-12-04T09:31:16.1486728Z ISA 2 2025-12-04T09:31:16.1487207Z Name: amdgcn-amd-amdhsa--gfx9-4-generic:sramecc+:xnack- 2025-12-04T09:31:16.1487796Z Machine Models: HSA_MACHINE_MODEL_LARGE 2025-12-04T09:31:16.1488386Z Profiles: HSA_PROFILE_BASE 2025-12-04T09:31:16.1488977Z Default Rounding Mode: NEAR 2025-12-04T09:31:16.1489563Z Default Rounding Mode: NEAR 2025-12-04T09:31:16.1490109Z Fast f16: TRUE 2025-12-04T09:31:16.1490628Z Workgroup Max Size: 1024(0x400) 2025-12-04T09:31:16.1491133Z Workgroup Max Size per Dimension: 2025-12-04T09:31:16.1491573Z x 1024(0x400) 2025-12-04T09:31:16.1492123Z y 1024(0x400) 2025-12-04T09:31:16.1492567Z z 1024(0x400) 2025-12-04T09:31:16.1493049Z Grid Max Size: 4294967295(0xffffffff) 2025-12-04T09:31:16.1493524Z Grid Max Size per Dimension: 2025-12-04T09:31:16.1493940Z x 2147483647(0x7fffffff) 2025-12-04T09:31:16.1494393Z y 65535(0xffff) 2025-12-04T09:31:16.1494837Z z 65535(0xffff) 2025-12-04T09:31:16.1495329Z FBarrier Max Size: 32 2025-12-04T09:31:16.1495794Z *** Done *** 2025-12-04T09:31:16.1496149Z + rocminfo 2025-12-04T09:31:16.1496477Z + grep -E 'Name:.*\sgfx|Marketing' 2025-12-04T09:31:16.1870430Z Marketing Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T09:31:16.1871067Z Marketing Name: AMD EPYC 9575F 64-Core Processor 2025-12-04T09:31:16.1871611Z Name: gfx942 2025-12-04T09:31:16.1872104Z Marketing Name: 2025-12-04T09:31:16.1916030Z + MAYBE_ROCM=rocm/ 2025-12-04T09:31:16.1916454Z + [[ linux-jammy-rocm-py3.10 == *xpu* ]] 2025-12-04T09:31:16.1916957Z + [[ linux-jammy-rocm-py3.10 != *-bazel-* ]] 2025-12-04T09:31:16.1917408Z + pip_install ninja==1.10.2 2025-12-04T09:31:16.1917917Z + pip_install_pkg='python3 -m pip install --progress-bar off' 2025-12-04T09:31:16.1918694Z + python3 -m pip install --progress-bar off ninja==1.10.2 2025-12-04T09:31:16.3947886Z Collecting ninja==1.10.2 2025-12-04T09:31:16.4209246Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl.metadata (5.0 kB) 2025-12-04T09:31:16.4287844Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl (108 kB) 2025-12-04T09:31:16.5911958Z Installing collected packages: ninja 2025-12-04T09:31:16.5912560Z Attempting uninstall: ninja 2025-12-04T09:31:16.5913016Z Found existing installation: ninja 1.11.1.4 2025-12-04T09:31:16.5922783Z Uninstalling ninja-1.11.1.4: 2025-12-04T09:31:16.5950482Z Successfully uninstalled ninja-1.11.1.4 2025-12-04T09:31:16.6056250Z Successfully installed ninja-1.10.2 2025-12-04T09:31:16.6398888Z + export PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:31:16.6401219Z + PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2025-12-04T09:31:16.6402588Z + [[ linux-jammy-rocm-py3.10 == *aarch64* ]] 2025-12-04T09:31:16.6403077Z + [[ linux-jammy-rocm-py3.10 == *asan* ]] 2025-12-04T09:31:16.6403529Z + [[ linux-jammy-rocm-py3.10 == *-debug* ]] 2025-12-04T09:31:16.6403973Z + [[ linux-jammy-rocm-py3.10 != *-bazel-* ]] 2025-12-04T09:31:16.6404600Z + echo 'We are not in debug mode: linux-jammy-rocm-py3.10. Expect the assertion to pass' 2025-12-04T09:31:16.6405359Z We are not in debug mode: linux-jammy-rocm-py3.10. Expect the assertion to pass 2025-12-04T09:31:16.6406934Z + cd test 2025-12-04T09:31:16.6407399Z + python -c 'import torch; torch._C._crash_if_debug_asserts_fail(424242)' 2025-12-04T09:31:17.5343807Z + [[ default == \n\o\g\p\u\_\N\O\_\A\V\X\2 ]] 2025-12-04T09:31:17.5344013Z + [[ default == \n\o\g\p\u\_\A\V\X\5\1\2 ]] 2025-12-04T09:31:17.5344170Z + [[ default == \l\e\g\a\c\y\_\n\v\i\d\i\a\_\d\r\i\v\e\r ]] 2025-12-04T09:31:17.5349828Z + DYNAMO_BENCHMARK_FLAGS=() 2025-12-04T09:31:17.5351145Z + [[ default == *pr_time_benchmarks* ]] 2025-12-04T09:31:17.5351289Z + [[ default == *dynamo_eager* ]] 2025-12-04T09:31:17.5351408Z + [[ default == *aot_eager* ]] 2025-12-04T09:31:17.5351887Z + [[ default == *aot_inductor* ]] 2025-12-04T09:31:17.5352008Z + [[ default == *max_autotune_inductor* ]] 2025-12-04T09:31:17.5352134Z + [[ default == *inductor* ]] 2025-12-04T09:31:17.5352246Z + [[ default == *dynamic* ]] 2025-12-04T09:31:17.5352356Z + [[ default == *cpu* ]] 2025-12-04T09:31:17.5352461Z + [[ default == *xpu* ]] 2025-12-04T09:31:17.5352596Z + DYNAMO_BENCHMARK_FLAGS+=(--device cuda) 2025-12-04T09:31:17.5385754Z + [[ linux-jammy-rocm-py3.10 == *libtorch* ]] 2025-12-04T09:31:17.5386207Z + [[ linux-jammy-rocm-py3.10 == *-bazel-* ]] 2025-12-04T09:31:17.5389950Z + cd test 2025-12-04T09:31:17.5390350Z + python -c 'import torch; print(torch.__config__.show())' 2025-12-04T09:31:18.2736412Z PyTorch built with: 2025-12-04T09:31:18.2736897Z - GCC 11.4 2025-12-04T09:31:18.2737278Z - C++ Version: 201703 2025-12-04T09:31:18.2738250Z - Intel(R) oneAPI Math Kernel Library Version 2024.2-Product Build 20240605 for Intel(R) 64 architecture applications 2025-12-04T09:31:18.2739157Z - Intel(R) MKL-DNN v3.7.1 (Git Hash 8d263e693366ef8db40acc569cc7d8edf644556d) 2025-12-04T09:31:18.2739724Z - OpenMP 201511 (a.k.a. OpenMP 4.5) 2025-12-04T09:31:18.2740168Z - LAPACK is enabled (usually provided by MKL) 2025-12-04T09:31:18.2740590Z - NNPACK is enabled 2025-12-04T09:31:18.2740953Z - CPU capability usage: AVX512 2025-12-04T09:31:18.2741328Z - HIP Runtime 7.1.25424 2025-12-04T09:31:18.2741664Z - MIOpen 3.5.1 2025-12-04T09:31:18.2741965Z - Magma 2.9.0 2025-12-04T09:31:18.2747249Z - Build settings: BLAS_INFO=mkl, BUILD_TYPE=Release, COMMIT_SHA=35b7a9a26c5923d98aebaa41a031dae21788a9ee, CXX_COMPILER=/opt/cache/bin/c++, CXX_FLAGS= -fvisibility-inlines-hidden -DUSE_PTHREADPOOL -DNDEBUG -DUSE_KINETO -DLIBKINETO_NOCUPTI -DLIBKINETO_NOXPUPTI=ON -DUSE_FBGEMM -DUSE_FBGEMM_GENAI -DUSE_PYTORCH_QNNPACK -DUSE_XNNPACK -DSYMBOLICATE_MOBILE_DEBUG_HANDLE -O2 -fPIC -DC10_NODEPRECATED -Wall -Wextra -Werror=return-type -Werror=non-virtual-dtor -Werror=range-loop-construct -Werror=bool-operation -Wnarrowing -Wno-missing-field-initializers -Wno-unknown-pragmas -Wno-unused-parameter -Wno-strict-overflow -Wno-strict-aliasing -Wno-stringop-overflow -Wsuggest-override -Wno-psabi -Wno-error=old-style-cast -faligned-new -Wno-maybe-uninitialized -fno-math-errno -fno-trapping-math -Werror=format -Wno-stringop-overflow, LAPACK_INFO=mkl, PERF_WITH_AVX=1, PERF_WITH_AVX2=1, TORCH_VERSION=2.10.0, USE_CUDA=OFF, USE_CUDNN=OFF, USE_CUSPARSELT=OFF, USE_GFLAGS=OFF, USE_GLOG=OFF, USE_GLOO=ON, USE_MKL=ON, USE_MKLDNN=ON, USE_MPI=OFF, USE_NCCL=ON, USE_NNPACK=ON, USE_OPENMP=ON, USE_ROCM=ON, USE_ROCM_KERNEL_ASSERT=OFF, USE_XCCL=OFF, USE_XPU=OFF, 2025-12-04T09:31:18.2753533Z 2025-12-04T09:31:18.5244753Z + cd test 2025-12-04T09:31:18.5245316Z + python -c 'import torch; print(torch.__config__.parallel_info())' 2025-12-04T09:31:19.1790363Z ATen/Parallel: 2025-12-04T09:31:19.1790903Z at::get_num_threads() : 128 2025-12-04T09:31:19.1791339Z at::get_num_interop_threads() : 128 2025-12-04T09:31:19.1791807Z OpenMP 201511 (a.k.a. OpenMP 4.5) 2025-12-04T09:31:19.1792272Z omp_get_max_threads() : 128 2025-12-04T09:31:19.1793001Z Intel(R) oneAPI Math Kernel Library Version 2024.2-Product Build 20240605 for Intel(R) 64 architecture applications 2025-12-04T09:31:19.1793733Z mkl_get_max_threads() : 128 2025-12-04T09:31:19.1794254Z Intel(R) MKL-DNN v3.7.1 (Git Hash 8d263e693366ef8db40acc569cc7d8edf644556d) 2025-12-04T09:31:19.1794837Z std::thread::hardware_concurrency() : 128 2025-12-04T09:31:19.1795251Z Environment variables: 2025-12-04T09:31:19.1795592Z OMP_NUM_THREADS : [not set] 2025-12-04T09:31:19.1795947Z MKL_NUM_THREADS : [not set] 2025-12-04T09:31:19.1796321Z ATen parallel backend: OpenMP 2025-12-04T09:31:19.1796560Z 2025-12-04T09:31:19.4229356Z + [[ default == *numpy_2* ]] 2025-12-04T09:31:19.4230015Z + [[ linux-jammy-rocm-py3.10 == *aarch64* ]] 2025-12-04T09:31:19.4230485Z + [[ default == *backward* ]] 2025-12-04T09:31:19.4230921Z + [[ default == *libtorch_agnostic_targetting* ]] 2025-12-04T09:31:19.4231369Z + [[ default == *xla* ]] 2025-12-04T09:31:19.4231710Z + [[ default == *vllm* ]] 2025-12-04T09:31:19.4232936Z + [[ default == *executorch* ]] 2025-12-04T09:31:19.4233321Z + [[ default == \j\i\t\_\l\e\g\a\c\y ]] 2025-12-04T09:31:19.4233734Z + [[ default == \q\u\a\n\t\i\z\a\t\i\o\n ]] 2025-12-04T09:31:19.4234168Z + [[ linux-jammy-rocm-py3.10 == *libtorch* ]] 2025-12-04T09:31:19.4234600Z + [[ default == distributed ]] 2025-12-04T09:31:19.4234986Z + [[ default == *operator_benchmark* ]] 2025-12-04T09:31:19.4235413Z + [[ default == *operator_microbenchmark* ]] 2025-12-04T09:31:19.4235850Z + [[ default == *attention_microbenchmark* ]] 2025-12-04T09:31:19.4236269Z + [[ default == *inductor_distributed* ]] 2025-12-04T09:31:19.4236685Z + [[ default == *inductor-halide* ]] 2025-12-04T09:31:19.4237085Z + [[ default == *inductor-pallas* ]] 2025-12-04T09:31:19.4237521Z + [[ default == *inductor-triton-cpu* ]] 2025-12-04T09:31:19.4237969Z + [[ default == *inductor-micro-benchmark* ]] 2025-12-04T09:31:19.4238509Z + [[ default == *aoti_cross_compile_for_windows* ]] 2025-12-04T09:31:19.4238965Z + [[ default == *huggingface* ]] 2025-12-04T09:31:19.4239334Z + [[ default == *timm* ]] 2025-12-04T09:31:19.4239685Z + [[ default == cachebench ]] 2025-12-04T09:31:19.4240058Z + [[ default == verify_cachebench ]] 2025-12-04T09:31:19.4240449Z + [[ default == *torchbench* ]] 2025-12-04T09:31:19.4240843Z + [[ default == *inductor_cpp_wrapper* ]] 2025-12-04T09:31:19.4241260Z + [[ default == *inductor_core* ]] 2025-12-04T09:31:19.4241645Z + [[ default == *inductor* ]] 2025-12-04T09:31:19.4242009Z + [[ default == *einops* ]] 2025-12-04T09:31:19.4242372Z + [[ default == *dynamo_core* ]] 2025-12-04T09:31:19.4242755Z + [[ default == *dynamo_wrapped* ]] 2025-12-04T09:31:19.4243175Z + [[ linux-jammy-rocm-py3.10 == *rocm* ]] 2025-12-04T09:31:19.4243789Z + [[ -n '' ]] 2025-12-04T09:31:19.4244082Z + [[ 4 == 1 ]] 2025-12-04T09:31:19.4244367Z + [[ 4 == 2 ]] 2025-12-04T09:31:19.4244649Z + [[ 4 -gt 2 ]] 2025-12-04T09:31:19.4244956Z + install_torchvision 2025-12-04T09:31:19.4245286Z + local orig_preload 2025-12-04T09:31:19.4245610Z + local commit 2025-12-04T09:31:19.4245967Z ++ get_pinned_commit vision 2025-12-04T09:31:19.4246358Z ++ cat .github/ci_commit_pins/vision.txt 2025-12-04T09:31:19.4246812Z + commit=617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:31:19.4247240Z + orig_preload= 2025-12-04T09:31:19.4247538Z + '[' -n '' ']' 2025-12-04T09:31:19.4247871Z + [[ linux-jammy-rocm-py3.10 == *cuda* ]] 2025-12-04T09:31:19.4248792Z + pip_build_and_install git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e dist/vision 2025-12-04T09:31:19.4249869Z + local build_target=git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:31:19.4250560Z + local wheel_dir=dist/vision 2025-12-04T09:31:19.4250932Z + local found_whl=0 2025-12-04T09:31:19.4251271Z + for file in "${wheel_dir}"/*.whl 2025-12-04T09:31:19.4251664Z + [[ -f dist/vision/*.whl ]] 2025-12-04T09:31:19.4252010Z + '[' 0 == 0 ']' 2025-12-04T09:31:19.4252850Z + python3 -m pip wheel --no-build-isolation --no-deps -w dist/vision git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:31:19.5743515Z Collecting git+https://github.com/pytorch/vision.git@617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:31:19.5746208Z Cloning https://github.com/pytorch/vision.git (to revision 617079d944b0e72632311c30ae2bbdf1168b901e) to /tmp/pip-req-build-wgeupkzn 2025-12-04T09:31:19.5773467Z Running command git clone --filter=blob:none --quiet https://github.com/pytorch/vision.git /tmp/pip-req-build-wgeupkzn 2025-12-04T09:31:23.0281672Z Running command git rev-parse -q --verify 'sha^617079d944b0e72632311c30ae2bbdf1168b901e' 2025-12-04T09:31:23.0305628Z Running command git fetch -q https://github.com/pytorch/vision.git 617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:31:23.6788844Z Resolved https://github.com/pytorch/vision.git to commit 617079d944b0e72632311c30ae2bbdf1168b901e 2025-12-04T09:31:25.3481998Z Preparing metadata (pyproject.toml) ... [?25l- \ | done 2025-12-04T09:31:25.3535618Z [?25hBuilding wheels for collected packages: torchvision 2025-12-04T09:32:04.4044764Z Building wheel for torchvision (pyproject.toml) ... [?25l- \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - \ | / - done 2025-12-04T09:32:04.4089159Z [?25h Created wheel for torchvision: filename=torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl size=1808989 sha256=f5e78fbe32672cef66d1bee163be7a969ca80beaa97ce0913bef5f3941ccc882 2025-12-04T09:32:04.4092396Z Stored in directory: /var/lib/jenkins/.cache/pip/wheels/12/b2/29/1f82685c5b5173629e1f36a9b93989ce92ce563e5fb91d27ac 2025-12-04T09:32:04.4144800Z Successfully built torchvision 2025-12-04T09:32:04.4801253Z + for file in "${wheel_dir}"/*.whl 2025-12-04T09:32:04.4801964Z + pip_install_whl dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:32:04.4802789Z + args=('dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl') 2025-12-04T09:32:04.4803371Z + local args 2025-12-04T09:32:04.4803887Z + [[ dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl == *\ * ]] 2025-12-04T09:32:04.4804500Z + for path in "${args[@]}" 2025-12-04T09:32:04.4805087Z + echo 'Installing dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl' 2025-12-04T09:32:04.4805946Z Installing dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:32:04.4807183Z + python3 -mpip install --no-index --no-deps dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:32:04.6350390Z Processing ./dist/vision/torchvision-0.25.0a0+617079d-cp310-cp310-linux_x86_64.whl 2025-12-04T09:32:04.6398761Z Installing collected packages: torchvision 2025-12-04T09:32:04.8491321Z Successfully installed torchvision-0.25.0a0+617079d 2025-12-04T09:32:04.8703437Z + '[' -n '' ']' 2025-12-04T09:32:04.8703801Z + test_python_shard 4 2025-12-04T09:32:04.8704149Z + [[ -z 6 ]] 2025-12-04T09:32:04.8705127Z + python test/run_test.py --exclude-jit-executor --exclude-distributed-tests --exclude-quantization-tests --shard 4 6 --verbose --upload-artifacts-while-running 2025-12-04T09:32:06.5344087Z Excluding inductor/test_max_autotune on ROCm 2025-12-04T09:32:06.5344609Z Excluding test_cuda_nvml_based_avail on ROCm 2025-12-04T09:32:07.5336388Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/pytorch/test/.pytorch-disabled-tests.json 2025-12-04T09:32:07.8791713Z Ignoring disabled issues: [''] 2025-12-04T09:32:07.8881263Z Found test times from artifacts 2025-12-04T09:32:07.9199829Z Found test times from artifacts 2025-12-04T09:32:07.9205479Z Running all tests 2025-12-04T09:32:07.9489151Z Running parallel tests on 1 processes 2025-12-04T09:32:07.9495457Z Name: tests to run (est. time: 180.8min) 2025-12-04T09:32:07.9495907Z Serial tests (98): 2025-12-04T09:32:07.9496321Z inductor/test_aot_inductor 3/3 2025-12-04T09:32:07.9496823Z inductor/test_torchinductor_codegen_dynamic_shapes 2/4 2025-12-04T09:32:07.9497359Z inductor/test_torchinductor_opinfo 3/12 2025-12-04T09:32:07.9497792Z inductor/test_torchinductor_opinfo 9/12 2025-12-04T09:32:07.9498294Z dynamo/test_dynamic_shapes 1/2 2025-12-04T09:32:07.9498689Z inductor/test_cpu_repro 5/5 2025-12-04T09:32:07.9499074Z inductor/test_inductor_freezing 1/1 2025-12-04T09:32:07.9499484Z inductor/test_flex_attention 2/4 2025-12-04T09:32:07.9499896Z inductor/test_cpu_cpp_wrapper 1/1 2025-12-04T09:32:07.9500315Z inductor/test_compile_subprocess 3/3 2025-12-04T09:32:07.9500755Z inductor/test_multi_kernel 1/1 2025-12-04T09:32:07.9501142Z inductor/test_analysis 1/1 2025-12-04T09:32:07.9501561Z inductor/test_pad_mm 1/1 2025-12-04T09:32:07.9501944Z inductor/test_triton_syntax 1/1 2025-12-04T09:32:07.9502339Z test_sparse_semi_structured 1/1 2025-12-04T09:32:07.9502742Z inductor/test_op_completeness 1/1 2025-12-04T09:32:07.9503147Z inductor/test_subgraph_choice 1/1 2025-12-04T09:32:07.9504005Z inductor/test_b2b_gemm 1/1 2025-12-04T09:32:07.9504391Z inductor/test_triton_heuristics 1/1 2025-12-04T09:32:07.9504794Z inductor/test_ck_backend 1/1 2025-12-04T09:32:07.9505148Z test_testing 1/1 2025-12-04T09:32:07.9505495Z inductor/test_autoheuristic 1/1 2025-12-04T09:32:07.9505888Z export/test_strict_export_v2 1/1 2025-12-04T09:32:07.9506287Z inductor/test_segmented_tree 1/1 2025-12-04T09:32:07.9506724Z inductor/test_kernel_optimization 1/1 2025-12-04T09:32:07.9507156Z inductor/test_inductor_scheduler 1/1 2025-12-04T09:32:07.9507568Z inductor/test_remote_cache 1/1 2025-12-04T09:32:07.9507989Z inductor/test_coordinate_descent_tuner 1/1 2025-12-04T09:32:07.9508499Z inductor/test_inplace_padding 1/1 2025-12-04T09:32:07.9508901Z inductor/test_minifier_utils 1/1 2025-12-04T09:32:07.9509291Z inductor/test_debug_trace 1/1 2025-12-04T09:32:07.9509679Z dynamo/test_recompiles 1/1 2025-12-04T09:32:07.9510052Z inductor/test_foreach 1/1 2025-12-04T09:32:07.9510433Z dynamo/test_buffers_override 1/1 2025-12-04T09:32:07.9510830Z inductor/test_inplacing_pass 1/1 2025-12-04T09:32:07.9511251Z inductor/test_aot_inductor_custom_ops 1/1 2025-12-04T09:32:07.9511683Z inductor/test_split_cat_fx_passes 1/1 2025-12-04T09:32:07.9512087Z export/test_cpp_serdes 1/1 2025-12-04T09:32:07.9512441Z dynamo/test_pgo 1/1 2025-12-04T09:32:07.9512778Z dynamo/test_config 1/1 2025-12-04T09:32:07.9513135Z dynamo/test_metrics_context 1/1 2025-12-04T09:32:07.9513523Z export/test_package 1/1 2025-12-04T09:32:07.9513885Z export/test_export_opinfo 1/1 2025-12-04T09:32:07.9514263Z dynamo/test_nops 1/1 2025-12-04T09:32:07.9514650Z inductor/test_graph_transform_observer 1/1 2025-12-04T09:32:07.9515261Z inductor/test_memory 1/1 2025-12-04T09:32:07.9515630Z inductor/test_control_flow 1/4 2025-12-04T09:32:07.9516007Z inductor/test_pallas 1/1 2025-12-04T09:32:07.9516363Z dynamo/test_callback 1/1 2025-12-04T09:32:07.9516778Z dynamo/test_cudagraphs_expandable_segments 1/1 2025-12-04T09:32:07.9517233Z dynamo/test_aot_autograd_cache 1/1 2025-12-04T09:32:07.9517636Z test_utils_config_module 1/1 2025-12-04T09:32:07.9517999Z test_hop_infra 1/1 2025-12-04T09:32:07.9518384Z test_transformers 1/1 2025-12-04T09:32:07.9518802Z torch_np/numpy_tests/core/test_scalarinherit 1/1 2025-12-04T09:32:07.9519252Z test_tensorexpr_pybind 1/1 2025-12-04T09:32:07.9519623Z test_fx_experimental 1/1 2025-12-04T09:32:07.9519979Z test_namedtensor 1/1 2025-12-04T09:32:07.9520310Z test_dataloader 1/1 2025-12-04T09:32:07.9520641Z test_decomp 6/12 2025-12-04T09:32:07.9520957Z test_decomp 12/12 2025-12-04T09:32:07.9521296Z test_ci_sanity_check_fail 1/1 2025-12-04T09:32:07.9521660Z test_meta 3/3 2025-12-04T09:32:07.9521976Z test_ops_gradients 2/2 2025-12-04T09:32:07.9522308Z test_linalg 2/2 2025-12-04T09:32:07.9522637Z nn/test_packed_sequence 1/1 2025-12-04T09:32:07.9523004Z test_numa_binding 1/1 2025-12-04T09:32:07.9523348Z test_pruning_op 1/1 2025-12-04T09:32:07.9523681Z test_jit_fuser_te 1/1 2025-12-04T09:32:07.9524020Z nn/test_dropout 1/1 2025-12-04T09:32:07.9524359Z functorch/dim/test_split 1/1 2025-12-04T09:32:07.9524772Z torch_np/numpy_tests/lib/test_type_check 1/1 2025-12-04T09:32:07.9525231Z cpp_extensions/test_libtorch_agnostic 1/1 2025-12-04T09:32:07.9525676Z test_cpp_extensions_stream_and_event 1/1 2025-12-04T09:32:07.9526102Z profiler/test_execution_trace 1/1 2025-12-04T09:32:07.9526510Z test_dispatch 1/1 2025-12-04T09:32:07.9526835Z test_datapipe 1/1 2025-12-04T09:32:07.9527148Z test_ops 1/5 2025-12-04T09:32:07.9527500Z torch_np/numpy_tests/linalg/test_linalg 1/1 2025-12-04T09:32:07.9527924Z nn/test_module_hooks 1/1 2025-12-04T09:32:07.9528368Z torch_np/numpy_tests/lib/test_twodim_base 1/1 2025-12-04T09:32:07.9528807Z profiler/test_memory_profiler 1/1 2025-12-04T09:32:07.9529200Z test_serialization 1/1 2025-12-04T09:32:07.9529651Z test_indexing 1/1 2025-12-04T09:32:07.9530015Z torch_np/numpy_tests/fft/test_pocketfft 1/1 2025-12-04T09:32:07.9530432Z functorch/test_ops 2/4 2025-12-04T09:32:07.9530805Z torch_np/numpy_tests/core/test_dlpack 1/1 2025-12-04T09:32:07.9531242Z test_multiprocessing_spawn 1/1 2025-12-04T09:32:07.9531611Z test_mkldnn 1/1 2025-12-04T09:32:07.9531969Z torch_np/numpy_tests/lib/test_index_tricks 1/1 2025-12-04T09:32:07.9532390Z test_jit_autocast 1/1 2025-12-04T09:32:07.9532732Z nn/test_pooling 1/1 2025-12-04T09:32:07.9533062Z lazy/test_reuse_ir 1/1 2025-12-04T09:32:07.9533438Z test_functional_autograd_benchmark 1/1 2025-12-04T09:32:07.9534032Z cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility 1/1 2025-12-04T09:32:07.9534626Z test_autoload_disable 1/1 2025-12-04T09:32:07.9535007Z test_cpp_extensions_aot_ninja 1/1 2025-12-04T09:32:07.9535416Z test_cpp_extensions_aot_no_ninja 1/1 2025-12-04T09:32:07.9535813Z Parallel tests (0): 2025-12-04T09:32:07.9536204Z Name: excluded (est. time: 0.0min) 2025-12-04T09:32:07.9536568Z Serial tests (0): 2025-12-04T09:32:07.9536877Z Parallel tests (0): 2025-12-04T09:32:07.9537410Z Running inductor/test_aot_inductor 3/3 ... [2025-12-04 09:32:07.949417][4965510.445606391] 2025-12-04T09:32:07.9538019Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T09:32:07.9539369Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_aot_inductor.py', '--shard-id=3', '--num-shards=3', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 09:32:07.949671] 2025-12-04T09:42:56.5626531Z 2025-12-04T09:42:56.5630071Z inductor/test_aot_inductor 3/3 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_aot_inductor_3.3_afaed714f7d2d072_.log 2025-12-04T09:42:56.5678759Z Running 308 items in this shard: test/inductor/test_aot_inductor.py::TestAOTInductorConfig::test_compile_standalone_cross_compile_windows_package_format, test/inductor/test_aot_inductor.py::TestAOTInductorConfig::test_compile_standalone_explicit_set, test/inductor/test_aot_inductor.py::TestAOTInductorConfig::test_compile_standalone_package_cpp_false_raises, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__int_mm_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_2_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__weight_int4pack_mm_m_32_n_64_q_group_64_num_groups_1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__weight_int4pack_mm_m_32_n_64_q_group_64_num_groups_2_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_32_num_groups_2_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_addmm_multiple_dynamic_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_amp_fallback_random_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_constant_tensor_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_debug_printing_model_inputs_codegen_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_profiler_enable_kernel_profile_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_profiler_enable_kernel_profile_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_aoti_runtime_asserts_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_assert_tensor_meta_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_autotune_with_constant_folding_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_autotuning_args_reuse_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_bool_input_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_buffer_mutation_2_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_buffer_mutation_4_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_buffer_mutation_and_force_mmap_weights_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_buffer_reuse_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_codegen_int_array_var_fix_memory_leak_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_composed_dynamic_size_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_mismatched_branch_output_dynamic_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_symint_input_disable_one_pass_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_unbacked_symint_closure_dynamic_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_with_outer_code_before_after_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_cond_with_parameters_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_constant_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_constant_folding_with_update_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_conv_freezing_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_convolution_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_copy_non_blocking_is_pinned_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_deconv_freezing_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_dup_unbacked_sym_decl_with_refinement_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_duplicate_constant_folding_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_embedding_bag_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_empty_cat_dtype_promotion_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_empty_graph_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_extract_constants_map_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fallback_kernel_with_symexpr_output_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_fallback_mem_leak_fix_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_index_put_fallback_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_index_put_with_none_index_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_large_mmaped_weights_on_disk_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_load_package_multiple_gpus_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_misc_1_max_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_mixed_device_1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_multiple_output_alias_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_narrow_fallback_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_non_contiguous_output_alias_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_non_tensor_input_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_on_gpu_device1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_output_misaligned_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_pad_fallback_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_quantized_linear_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_repeated_user_defined_triton_kernel_embed_kernel_binary_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_replicate_on_devices_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_rocm_triton_autotuning_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_run_with_grad_enabled_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_runtime_checks_device_type_failed_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_runtime_checks_dtype_failed_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_runtime_checks_large_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_runtime_checks_shape_failed_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_scaled_grouped_mm_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_scatter_reduce_fallback_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_seq_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_shifted_constraint_ranges_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_simple_embed_kernel_binary_False_max_autotune_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_size_with_unbacked_add_and_mul_expr_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_so_without_weight_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_stride_with_unbacked_expr_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_sym_expr_indexing_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_sympy_cpp_printer_min_max_minmax1_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_torchvision_transforms_functional_tensor_resize_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_dynamic_launcher_grid_infer_from_tensor_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_bool_param_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_dynamic_grid_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_dynamic_shape_with_div_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_equal_to_1_arg_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_extern_kernel_arg_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_1_dynamic_False_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_1_num_dims_2_dynamic_True_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_2_num_dims_2_dynamic_False_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_grid_type_3_num_dims_2_dynamic_False_autotune_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_multi_output_arg_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_on_device_tma_dynamic_True_tma_version_old_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_reinterpret_view_mem_leak_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_tma_descriptor_1d_dynamic_False_tma_version_new_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_tma_descriptor_2d_dynamic_True_tma_version_old_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_weird_param_order_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_triton_kernel_with_none_inputs_and_equal_to_1_arg_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_equals_input_size_runtime_assertion_mark_unbacked_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_2_use_static_size_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_unbacked_expr_replacements_shift_k_3_use_static_size_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_update_inactive_constant_buffer_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_view_outputs_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_with_conv_dynamic_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_with_mixed_device_dynamic_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_with_mixed_device_dynamic_True_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_with_parameters_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_while_loop_with_unbacked_symint_closure_dynamic_False_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_with_no_triton_profiler_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_with_offset_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_with_profiler_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_zero_grid_with_backed_symbols_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_zero_size_buffer_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleCpu::test_zero_size_weight_cpu, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__int_mm_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_2_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_m_32_n_64_q_group_64_num_groups_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_32_num_groups_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_32_num_groups_2_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_64_num_groups_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_addmm_multiple_dynamic_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aliased_buffer_reuse_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_constant_tensor_name_collision_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_debug_printer_cpp_kernel_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_debug_printer_sym_inputs_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_debug_printing_model_inputs_codegen_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_profiler_enable_kernel_profile_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_profiler_enable_kernel_profile_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_aoti_user_defined_triton_kernel_profiling_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_assert_tensor_meta_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_autotune_int64_user_defined_triton_kernel_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_autotuning_args_reuse_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_backward_no_op_logging_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_buffer_mutation_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_buffer_mutation_4_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_buffer_mutation_and_force_mmap_weights_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_buffer_reuse_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_cpu_predicate_cuda_operands_max_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_predicate_on_cpu_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_with_outer_code_before_after_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_cond_with_parameters_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_conv3d_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_copy_non_blocking_is_pinned_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_deconv_freezing_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_duplicate_constant_folding_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_dynamic_cat_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_extract_constants_map_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fake_tensor_device_validation_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_foreach_multiple_dynamic_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_fp8_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_index_put_fallback_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_inf_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_input_codegen_with_sympy_expr_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_libtorch_free_so_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_load_package_multiple_gpus_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_masked_select_dynamic_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_misaligned_input_1_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_misaligned_input_2_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_misc_1_max_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_multi_device_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_multiple_output_alias_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_nested_tensor_from_jagged_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_non_tensor_input_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_none_args_aot_codegen_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_normal_functional_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_output_path_2_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_pad_fallback_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_pad_non_zero_memory_leak_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_proxy_executor_squeeze_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_quantized_linear_bias_none_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_repeated_calling_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_repeated_user_defined_triton_kernel_embed_kernel_binary_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_replace_unbacked_symbol_with_backed_expr_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_return_view_constant_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_reuse_kernel_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_rocm_triton_autotuning_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_run_with_grad_enabled_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_runtime_checks_device_type_failed_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_scatter_fallback_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_seq_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_simple_embed_kernel_binary_False_max_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_simple_embed_kernel_binary_False_max_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_simple_embed_kernel_binary_True_max_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_simple_split_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_size_with_unbacked_add_expr_transitive_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_sym_expr_indexing_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_symbool_item_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_torchvision_transforms_functional_tensor_resize_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_dynamic_shape_with_div_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_equal_to_1_float_arg_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_1_num_dims_1_dynamic_False_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_2_num_dims_1_dynamic_False_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_3_num_dims_2_dynamic_False_autotune_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_grid_type_3_num_dims_2_dynamic_False_autotune_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_multi_output_arg_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_on_device_tma_dynamic_False_tma_version_new_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_reinterpret_view_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_sympy_fn_like_arg_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_unbacked_symint_in_grid_dynamic_False_autotuning_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_with_none_input_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_kernel_with_none_inputs_and_equal_to_1_arg_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_triton_next_power_of_2_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbacked_equals_input_size_runtime_assertion_mark_unbacked_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbacked_expr_replacements_shift_k_1_use_static_size_False_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_unbacked_expr_replacements_shift_k_1_use_static_size_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_update_inactive_constant_buffer_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_upper_bound_i64_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_view_outputs_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_weight_on_disk_legacy_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_mixed_device_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_outer_buffers_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_parameters_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_while_loop_with_sym_expr_cond_dynamic_True_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_with_cudagraphs_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_with_profiler_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleGpu::test_zero_size_buffer_cuda, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test__int_mm_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test__weight_int4pack_mm_m_32_n_64_q_group_32_num_groups_1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test__weight_int4pack_mm_with_scales_and_zeros_m_32_n_64_q_group_64_num_groups_2_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_addmm_multiple_dynamic_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_constant_tensor_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_debug_printing_model_inputs_codegen_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_runtime_asserts_backed_symint_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_aoti_runtime_asserts_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_assert_async_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_assert_tensor_meta_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_backward_no_op_logging_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_buffer_mutation_1_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_buffer_mutation_3_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_clamp_decomposition_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_non_tensor_predicates_dynamic_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_predicate_on_cpu_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_share_predicate_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_unbacked_symint_closure_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_unbacked_symint_closure_dynamic_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_use_buffers_from_outer_scope_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_with_parameters_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_cond_with_reinterpret_view_inputs_outputs_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_copy_non_blocking_is_pinned_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_d2h_copy_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_deconv_freezing_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_dup_unbacked_sym_decl_with_refinement_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_dynamic_cat_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_fft_c2c_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_fp8_view_of_param_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_fx_gm_return_tuple_validation_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_index_put_fallback_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_index_put_with_none_index_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_input_codegen_with_sympy_expr_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_large_dynamic_dim_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_large_mmaped_weights_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_large_mmaped_weights_on_disk_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_libtorch_free_so_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_linear_freezing_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_misc_1_max_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_multi_device_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_no_args_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_normal_functional_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_output_path_2_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_poi_multiple_dynamic_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_proxy_executor_abs_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_proxy_executor_squeeze_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_quanatized_int8_linear_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_repeat_interleave_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_repeat_output_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_repeated_calling_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_rocm_triton_autotuning_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_runtime_checks_device_type_failed_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_runtime_checks_shape_failed_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_sdpa_2_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_shifted_constraint_ranges_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_simple_embed_kernel_binary_False_max_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_simple_embed_kernel_binary_True_max_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_size_from_multi_output_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_size_with_unbacked_add_and_mul_expr_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_stft_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_subclasses_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_symbool_item_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_symfloat_item_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_symint_item_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_sympy_cpp_printer_min_max_minmax0_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_torchvision_transforms_functional_tensor_resize_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_bool_param_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_dynamic_shape_with_div_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_equal_to_1_arg_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_1_num_dims_1_dynamic_False_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_1_num_dims_1_dynamic_True_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_1_dynamic_False_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_1_dynamic_True_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_2_dynamic_False_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_2_dynamic_True_autotune_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_grid_type_3_num_dims_2_dynamic_True_autotune_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_on_device_tma_dynamic_True_tma_version_new_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_sympy_expr_arg_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_tma_descriptor_1d_dynamic_False_tma_version_new_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_tma_descriptor_1d_dynamic_False_tma_version_old_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_tma_descriptor_1d_dynamic_True_tma_version_old_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_tma_descriptor_2d_dynamic_False_tma_version_old_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_tma_descriptor_2d_dynamic_True_tma_version_new_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_unbacked_symint_in_grid_dynamic_True_autotuning_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_triton_kernel_weird_param_order_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_expr_replacements_shift_k_0_use_static_size_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_expr_replacements_shift_k_1_use_static_size_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_unbacked_expr_replacements_shift_k_3_use_static_size_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_update_constant_buffer_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_sym_expr_cond_dynamic_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_unbacked_symint_closure_dynamic_False_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_while_loop_with_unbacked_symint_closure_dynamic_True_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_zero_size_buffer_mps, test/inductor/test_aot_inductor.py::AOTInductorTestABICompatibleMps::test_zero_size_weight_mps 2025-12-04T09:42:56.5725199Z 2025-12-04T09:42:56.5725324Z Finished inductor/test_aot_inductor 3/3 ... [2025-12-04 09:42:56.563228][4966159.059413338], took 10.81min 2025-12-04T09:42:56.5725721Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T09:42:58.8284957Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T09:42:58.8285367Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T09:42:58.8285653Z Uploading artifacts took 0.00 seconds 2025-12-04T09:42:58.8289843Z Running inductor/test_torchinductor_codegen_dynamic_shapes 2/4 ... [2025-12-04 09:42:58.828553][4966161.32473728] 2025-12-04T09:42:58.8290212Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T09:42:58.8291622Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_codegen_dynamic_shapes.py', '--shard-id=2', '--num-shards=4', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 09:42:58.828849] 2025-12-04T09:48:22.3112237Z 2025-12-04T09:48:22.3113137Z inductor/test_torchinductor_codegen_dynamic_shapes 2/4 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_codegen_dynamic_shapes_2.4_58ba3c483cf885dd_.log 2025-12-04T09:48:22.3210427Z Running 453 items in this shard: test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_AllenaiLongformerBase_repro_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test__dyn_quant_matmul_4bit_fp32_input_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test__dyn_quant_pack_4bit_weight_bf16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test__dyn_quant_pack_4bit_weight_fp32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test__unsafe_masked_index_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test__unsafe_masked_index_put_accumulate_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_adaptive_avg_pool2d1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_add_complex10_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_add_complex4_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_add_complex8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_add_complex_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_add_inplace_permuted_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_adding_tensor_offsets_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_alexnet_prefix_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_allow_reuse_disable_if_exceed_peak_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_angle_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_any_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_aoti_eager_cache_hit_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_aoti_eager_with_persistent_cache_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_arange1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_argmax_argmin1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_as_strided_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_avg_pool2d_backward4_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bitwise2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bitwise3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bmm1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_int32_int8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_int64_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_bucketize_int_uint8_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_buffer_batch_norm_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_buffer_use_after_remove_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_builtins_round_float_ndigits_neg_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cat_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cat_empty_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cat_extern_kernel_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cat_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cat_unbacked_legacy_empty_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cauchy_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_check_stack_no_cycles_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_clamp_type_promotion_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_complex_from_real_imag_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_concat_add_inplace_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_consecutive_split_cumsum_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_constant_pad_1d_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_constant_pad_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_conv1d_depthwise_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_conv1d_with_permute_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_conv3d_channels_last_use_block_ptr_True_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_conv_backward_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_conv_inference_heuristics_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_conv_with_as_strided_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_convolution1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_convolution2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_convolution3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_copy_non_blocking_is_pinned_use_cat_True_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cpu_scalar_with_cpu_scalar_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_cumprod_zero_dim_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_custom_op_1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_custom_scan_op_compiled_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_deterministic_codegen_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_deterministic_codegen_on_graph_break_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_div2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_div_presicion_accuracy_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dont_constant_fold_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dropout2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dropout3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dropout_trivial_0_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtype_sympy_expr_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_bfloat16_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_bfloat16_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_bfloat16_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float16_int64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float16_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float64_bfloat16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float64_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float64_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float64_int64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_float64_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_fusion_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int16_bfloat16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int32_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int64_float16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int64_float32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int64_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int8_int16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_int8_int32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_uint8_float32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_uint8_int8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_dtypeview_uint8_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_elu_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_empty2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_erfinv_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_expanded_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fallback_mutable_op_with_return_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fill1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_floordiv_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fmod_zero_dim_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_forced_buffer_realize_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fractional_max_pool2d3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fractional_max_pool2d4_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_fractional_max_pool2d5_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_generated_code_has_size_stride_assert_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_getitem_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_glu_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_graph_partition_arange1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_graph_partition_arange2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_graph_partition_both_scalars_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_graph_partition_misaligned_input_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_hardsigmoid_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_hardswish_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_float_zero_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_propagation_floordiv_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_propagation_remainder_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_put3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_put_fallback1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_put_fallback2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_index_put_reinplace_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_inductor_triton_bucketize_respects_masking_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_inplace_activations_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_inplace_add_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_inplace_mixed_dtype_ops_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_inplace_resize_as_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_inplace_where_pointwise_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_input_mutation1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_int_input_dynamic_shapes_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_issue102546_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_large_broadcast_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_large_grid_use_block_ptr_False_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_large_offset_pointwise_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_layer_norm_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_leaky_relu_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_linear2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_linear_float64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_list_clearing_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_lite_mode_fallback_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_lite_mode_not_decompose_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_lite_regional_compile_flex_attention_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_lite_regional_compile_invoke_subgraph_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_log1p_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_log2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_log_fp64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_logcumsumexp_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_mark_dynamic_with_hint_override_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_pool2d1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_pool2d2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_pool2d6_dilation_2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_max_pool2d_with_indices_backward5_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_min_max_reduction_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_min_max_reduction_nan_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_mixed_mm2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_mixed_mm3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_mm_mixed_dtype_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_multilayer_sum_low_prec_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_mutable_custom_op_fixed_layout2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_nan_sort_stable_True_descending_True_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_neg_max_uint8_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_new_empty_strided_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_nll_loss_forward_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_one_hot_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pad_cast_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pattern_matcher_unbacked_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_bessel_j0_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_erf_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_erfc_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_exp2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_expm1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_gammaincc_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_modified_bessel_i0_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_multigammaln_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_ndtr_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_polygamma_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_scaled_modified_bessel_k1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_shifted_chebyshev_polynomial_v_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pointwise_xlogy_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pow3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_pow_int_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_prepare_softmax_with_fast_math_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_rand_like_deterministic_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_reduction1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_reduction2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_reduction_config_limit_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_remainder_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_remove_noop_slice_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_remove_noop_view_default_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_repeat_as_strided_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_repeat_interleave_Tensor_decomp_int32_nd_2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_repeat_interleave_decomposition_has_clamp_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_resize_as_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_round_correctness_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_rsqrt_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scalar_input_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scaled_dot_product_efficient_attention_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scatter2_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scatter6_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scatter_bf16_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_scheduler_vertical_fusion1_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_False_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_True_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_setitem_with_int_parameter_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sgn_extremal_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_should_pad_bench_for_bmm_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sigmoid_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sign_dtype_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sin_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_slice_scatter_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_softmax_backward_data_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_softmax_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_softmax_one_kernel_persist_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sort_stable_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_split_cumsum_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_split_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_squeeze_varargs_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sum3_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sum4_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_sum_int_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_to_device_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_to_dtype_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_torch_device_split_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_triu_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_uint_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unbacked_float_item_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unbacked_floordiv_simplify_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unbacked_floordiv_simplify_errors_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unbind_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unspec_inputs_float32_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unspec_inputs_int64_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_unsqueeze_inplace_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_upsample_nearest2d_backward_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_var_correction_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_var_mean_tile_reduction_True_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_views5_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_views7_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_weight_norm_conv2d_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_where_broadcast_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenCpuTests::test_xblock_divides_xnumel_dynamic_shapes_cpu, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_AllenaiLongformerBase_repro_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test__dyn_quant_matmul_4bit_fp32_input_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test__dyn_quant_pack_4bit_weight_fp32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test__unsafe_masked_index_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_adaptive_avg_pool_errors_with_long_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_adaptive_avg_pool_with_output_size_0_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_adaptive_max_pool2d1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_adaptive_max_pool2d3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_adaptive_pool_errors_with_long_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_add_complex5_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_add_complex_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_add_complex_strided_fallback_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_add_inplace_permuted_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_alexnet_prefix_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_any_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_arange2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_arange6_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_argmax_argmin3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_avg_pool2d2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_avg_pool3d_backward3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_default_kwargs_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int32_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int64_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int64_int8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_int_int8_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_bucketize_nd_tiling_False_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_buffer_batch_norm_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_builtins_round_float_ndigits_pos_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_builtins_round_float_ndigits_zero_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cat_negative_dim_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cat_of_loops_and_extern_kernel_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cat_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cauchy_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_chunk_recompiles_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_clamp_type_promotion_non_tensor_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_computed_buffer_inlining_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_config_option_dont_assume_alignment_recompiles_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_constant_pad_1d_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_constant_pad_float64_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_constant_pad_nd_inplace_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_conv_bn_fuse_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_conv_inference_heuristics_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_conv_with_as_strided_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_copy_non_blocking_is_pinned_use_cat_False_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_copy_non_blocking_is_pinned_use_cat_True_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cpu_scalar_with_cpu_scalar_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cummin_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_cumprod_zero_dim_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_custom_op_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_custom_scan_op_compiled_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_custom_scan_op_multi_input_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_custom_scan_would_split_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_deterministic_codegen_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dist_bf16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_div1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_div3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtype_mismatch_issue_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_bfloat16_float32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_bfloat16_float64_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_bfloat16_int8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float16_float32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float16_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float16_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float32_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float32_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_float64_float32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_fusion_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int16_bfloat16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int16_float64_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int16_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int32_bfloat16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int32_float16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int32_float32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int32_float64_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int64_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int64_int8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int8_int64_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_int8_uint8_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_uint8_bfloat16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_uint8_float16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_uint8_int16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_dtypeview_uint8_int32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_embedding_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_empty2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_empty_strided_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_emulate_precision_triton_fp_fusion_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_exp2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_fallback_mutable_op_no_mutated_tensors_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_fallback_mutable_op_with_return_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_fft_real_input_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_float_index_expression_type_promotion_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_floordiv_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_fuse_large_params_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_generated_code_has_alignment_assert_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_getitem_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_graph_partition_misaligned_input_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_graph_partition_refcount_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_hardsigmoid_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_horizonal_fusion2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_propagation_device_assert_masked_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_propagation_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_propagation_flip_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_propagation_floordiv_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_put1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_put_fallback1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_put_fallback2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_index_select_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_inplace_mixed_dtype_ops_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_int_input_dynamic_shapes_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_invalid_operand_issue1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_isinf2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_issue102546_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_kernel_names_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_large_block_sizes_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_large_grid_use_block_ptr_False_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_large_pointwise_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_large_tensor_reduction_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_lerp_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_like_rands_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_linear1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_linear2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_linear_float64_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_list_clearing_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_lite_regional_compile_flex_attention_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_lite_regional_compile_invoke_subgraph_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_log2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_low_memory_max_pool_dilation_1_dim_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_low_memory_max_pool_dilation_1_dim_3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_low_memory_max_pool_dilation_2_dim_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_max_min_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_max_pool2d4_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_max_pool2d6_dilation_1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_max_pool2d6_dilation_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_max_pool2d_with_indices_backward6_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mean_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_mul_index_expr_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_multi_gpu_device_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_multilayer_var_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_nan_assert_inside_triton_kernel_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_nan_sort_stable_False_descending_True_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_new_empty_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_nll_loss_forward_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pixel_shuffle_channels_last_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_bessel_j1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_chebyshev_polynomial_t_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_chebyshev_polynomial_w_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_expit_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_gammaincc_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_i0_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_log1p_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_modified_bessel_k0_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_psi_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_round_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pointwise_shifted_chebyshev_polynomial_v_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pow3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pow_int_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_pow_symfloat_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_randn_generator_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_randn_like_empty_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_reduction1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_reflection_pad2d_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_relu_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_remove_noop_view_dtype_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_repeat_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_repeat_interleave_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_repeat_interleave_Tensor_decomp_int32_nd_2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_require_stride_expanded_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_resize_as_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_roll_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_rsqrt_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_scalar_output_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_scatter5_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_scatter_reduce1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_scheduler_vertical_fusion1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sdpa_unaligned_mask_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_searchsorted_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_select_scatter_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sin_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_size_asserts_for_multi_output_fallback_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sizehint_issue1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_slice3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_slice_mutation3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_slice_scatter2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_slice_scatter_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_slice_scatter_reinplace_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_softmax_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_softmax_one_kernel_persist_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_split_cumprod_low_prec_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_split_cumsum_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_split_cumsum_low_prec_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sqrt_dynamic_shapes_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_squeeze1_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sum3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_sum5_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_tanh_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_tmp_not_defined_issue1_use_block_ptr_True_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_to_dtype_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_triton_argmin_argmax_transpose_logical_index_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_triton_kernel_bool_param_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unfold_zero_dimension_tensor_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unroll_small_reduction_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unspec_inputs_bfloat16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unspec_inputs_float16_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unspec_inputs_float32_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unspec_inputs_float64_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unsqueeze_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_unsqueeze_inplace_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_upsample_bilinear2d_a_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_vectorized_ops_masked_var_novec_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_view_as_complex_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_view_as_real_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_view_detach_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_views2_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_views3_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_views7_dynamic_shapes_cuda, test/inductor/test_torchinductor_codegen_dynamic_shapes.py::DynamicShapesCodegenGPUTests::test_zeros_dynamic_shapes_cuda 2025-12-04T09:48:22.3294062Z 2025-12-04T09:48:22.3294238Z Finished inductor/test_torchinductor_codegen_dynamic_shapes 2/4 ... [2025-12-04 09:48:22.312107][4966484.808290685], took 5.39min 2025-12-04T09:48:22.3294682Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T09:48:22.3295044Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T09:48:22.3297211Z Running inductor/test_torchinductor_opinfo 3/12 ... [2025-12-04 09:48:22.329470][4966484.82565614] 2025-12-04T09:48:22.3297424Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T09:48:22.3303293Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_opinfo.py', '--shard-id=3', '--num-shards=12', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 09:48:22.329956] 2025-12-04T09:58:05.0387609Z 2025-12-04T09:58:05.0390018Z inductor/test_torchinductor_opinfo 3/12 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_opinfo_3.12_3077f1dca5997b5b_.log 2025-12-04T09:58:05.0537936Z Running 278 items in this shard: test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_T_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___radd___cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rand___cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rxor___cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__segment_reduce_lengths_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__softmax_backward_data_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__unsafe_masked_index_put_accumulate_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_acos_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_acosh_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_add_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_addcmul_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_addmm_decomposed_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_allclose_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_allclose_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_aminmax_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_angle_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_angle_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_arange_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argmax_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argmax_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argwhere_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_partial_views_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_scatter_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_scatter_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_asinh_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atan_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atanh_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atanh_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_atleast_3d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bernoulli_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bincount_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bitwise_right_shift_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bool_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bool_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_broadcast_tensors_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cartesian_prod_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cat_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cdouble_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cholesky_solve_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_conj_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_conj_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_conj_physical_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_constant_pad_nd_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_constant_pad_nd_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_contiguous_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_contiguous_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cos_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cosh_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cumprod_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cumsum_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diff_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_digamma_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_div_floor_rounding_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_div_no_rounding_mode_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_double_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_dsplit_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_dstack_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_permuted_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_permuted_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_strided_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_eq_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_erfinv_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_exp_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_hfft_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifft_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifft_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifftn_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifftshift_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_irfft2_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_irfft_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_rfft2_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_rfft_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flatten_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flatten_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fliplr_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_floor_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gather_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gather_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gcd_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gradient_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gradient_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_grid_sampler_3d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_heaviside_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_histc_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_hsplit_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_hstack_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_i0_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_copy_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_amax_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_amax_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_prod_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_prod_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_select_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_int_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isfinite_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isin_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isnan_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_4inputs_with_extra_args_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lerp_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lgamma_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lgamma_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_cond_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_eigh_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_inv_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_matrix_norm_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_matrix_norm_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_norm_subgradients_at_zero_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_pinv_hermitian_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_slogdet_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log10_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log10_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log1p_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log2_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log2_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logaddexp2_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logdet_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logdet_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_and_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_not_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_or_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_xor_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logit_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logit_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logspace_tensor_overload_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lu_solve_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mH_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_amax_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_amax_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_cumsum_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_cumsum_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_fill_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_fill_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_logaddexp_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_logsumexp_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_logsumexp_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_logsumexp_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_median_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_normalize_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_scatter_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_select_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_select_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_softmin_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_std_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_sum_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_sum_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_pool2d_with_indices_backward_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_reduction_with_dim_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_max_reduction_with_dim_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_maximum_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_maximum_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_maximum_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_meshgrid_list_of_tensors_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_minimum_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_1_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nan_to_num_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nanmean_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nanquantile_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nansum_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_narrow_copy_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_narrow_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ne_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_adaptive_max_pool3d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_avg_pool2d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_batch_norm_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv_transpose3d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_cosine_embedding_loss_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_cosine_embedding_loss_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_fractional_max_pool2d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_grid_sample_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_grid_sample_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_hardswish_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_interpolate_bilinear_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_l1_loss_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_local_response_norm_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_logsigmoid_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_margin_ranking_loss_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_pool2d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_pool3d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_unpool1d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_unpool3d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_mse_loss_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_mse_loss_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_multi_margin_loss_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pixel_unshuffle_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_poisson_nll_loss_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_relu_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_selu_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_smooth_l1_loss_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_soft_margin_loss_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_softmin_with_dtype_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_threshold_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nonzero_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_norm_nuc_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_normal_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_normal_number_mean_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ones_like_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ones_like_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_permute_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polar_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_2_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_2_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_prod_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_put_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_randint_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_randint_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_repeat_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_repeat_interleave_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_resize__cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_roll_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_roll_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_roll_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_round_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_round_decimals_0_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_round_decimals_3_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_round_decimals_neg_3_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_add_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_amax_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_amin_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_prod_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sgn_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sgn_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_exponential_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signbit_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sin_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sort_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_airy_ai_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_bessel_j1_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_bessel_j1_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_t_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_u_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_chebyshev_polynomial_w_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_entr_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_i0e_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_laguerre_polynomial_l_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_log_ndtr_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_i0_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_i0_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_i1_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_k0_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_k1_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_ndtri_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_polygamma_special_polygamma_n_0_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_scaled_modified_bessel_k0_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_scaled_modified_bessel_k1_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_std_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_std_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_t_copy_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_t_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_take_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_take_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tan_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tensor_split_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tile_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_topk_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_torch_ops_aten__safe_softmax_default_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_trace_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_trapezoid_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_triu_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unique_consecutive_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unique_consecutive_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unique_consecutive_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsafe_chunk_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsafe_split_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsafe_split_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_var_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_var_mean_unbiased_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_var_unbiased_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_copy_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_vstack_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_xlogy_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_xlogy_cuda_uint8 2025-12-04T09:58:05.0682425Z 2025-12-04T09:58:05.0682883Z Finished inductor/test_torchinductor_opinfo 3/12 ... [2025-12-04 09:58:05.038972][4967067.535156612], took 9.71min 2025-12-04T09:58:05.0684229Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T09:58:05.0694714Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T09:58:05.0734079Z Running inductor/test_torchinductor_opinfo 9/12 ... [2025-12-04 09:58:05.072966][4967067.569099477] 2025-12-04T09:58:05.0734747Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T09:58:05.0753326Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor_opinfo.py', '--shard-id=9', '--num-shards=12', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 09:58:05.074944] 2025-12-04T10:06:24.5914210Z 2025-12-04T10:06:24.5915206Z inductor/test_torchinductor_opinfo 9/12 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_opinfo_9.12_3a8f84ec623d8187_.log 2025-12-04T10:06:24.6007717Z Running 275 items in this shard: test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___getitem___cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rdiv___cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___ror___cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive___rsub___cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__unsafe_masked_index_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__unsafe_masked_index_put_accumulate_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive__upsample_bilinear2d_aa_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_acos_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_add_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_add_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_addcdiv_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_addr_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_amin_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argmin_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argwhere_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_argwhere_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_partial_views_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_as_strided_scatter_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bfloat16_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_block_diag_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bmm_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_bool_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_broadcast_tensors_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_broadcast_to_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_byte_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ceil_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_chunk_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_max_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_max_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clamp_min_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_clone_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_column_stack_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_column_stack_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_combinations_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_conj_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_contiguous_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_copysign_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cross_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_cummin_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_deg2rad_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diag_embed_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_diagonal_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_digamma_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_digamma_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_div_trunc_rounding_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_dsplit_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_like_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_like_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_empty_strided_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_erfc_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_expand_as_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_expm1_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_exponential_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fft2_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_fft_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_hfft_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_hfftn_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_hfftn_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifft_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_ifftn_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_irfft_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_irfftn_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fft_rfft2_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fill_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flip_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fliplr_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_flipud_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_float_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_floor_divide_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_fmax_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_frexp_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_gather_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_geometric_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_geometric_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_half_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_half_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_heaviside_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_igamma_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_fill_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_amax_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_amin_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_index_reduce_mean_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isneginf_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_isreal_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_4inputs_with_extra_args_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_jiterator_unary_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_kron_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ldexp_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_le_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lgamma_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_lgamma_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_cholesky_ex_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_cholesky_ex_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_cross_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_cross_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_det_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_eigvals_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_ldl_factor_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_ldl_factor_ex_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_lu_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_matrix_rank_hermitian_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_multi_dot_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_solve_ex_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_svdvals_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_linalg_vector_norm_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log_softmax_with_dtype_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_log_softmax_with_dtype_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logaddexp_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_and_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_or_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_or_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logical_xor_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_logsumexp_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mH_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mH_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mT_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_argmax_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_argmin_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_mean_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_normalize_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_masked_prod_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_matrix_exp_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_meshgrid_variadic_tensors_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_min_binary_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_min_binary_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_min_reduction_no_dim_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_min_reduction_no_dim_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mm_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mode_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_movedim_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_msort_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mul_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_mul_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_multinomial_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nansum_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_narrow_copy_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_native_batch_norm_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ne_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_empty_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_empty_strided_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_zeros_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_new_zeros_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nextafter_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nextafter_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_adaptive_avg_pool2d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_channel_shuffle_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv2d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv2d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_conv3d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_dropout2d_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_dropout3d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_elu_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_fractional_max_pool2d_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_fractional_max_pool3d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_gaussian_nll_loss_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_glu_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_hardsigmoid_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_hardtanh_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_interpolate_nearest-exact_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_logsigmoid_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_margin_ranking_loss_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_max_unpool2d_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_multilabel_margin_loss_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_replicate_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_pad_replicate_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_relu6_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_relu6_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_smooth_l1_loss_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_softmin_with_dtype_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_tanhshrink_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_triplet_margin_loss_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nn_functional_upsample_nearest_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_nonzero_static_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_norm_fro_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_norm_inf_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ones_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ones_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_ones_like_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_permute_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_polygamma_polygamma_n_1_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_prod_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_prod_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_quantile_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_randint_like_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_randn_like_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_randn_like_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_reciprocal_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_renorm_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_reshape_as_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_reshape_as_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_reshape_as_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_resolve_conj_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_resolve_conj_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_resolve_conj_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_roll_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_rsub_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_rsub_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_mean_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_scatter_reduce_sum_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_searchsorted_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_searchsorted_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sgn_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_short_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_short_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sign_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_cosine_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_hann_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_signal_windows_kaiser_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sinc_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sinh_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_softmax_with_dtype_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sort_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sort_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sparse_mm_reduce_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sparse_sampled_addmm_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_bessel_j0_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_bessel_j1_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_bessel_y0_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_hermite_polynomial_h_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_i0e_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_i0e_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_i0e_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_i1e_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_i1e_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_i1e_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_i1e_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_laguerre_polynomial_l_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_log_ndtr_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_modified_bessel_k1_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_ndtri_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_polygamma_special_polygamma_n_0_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_t_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_t_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_t_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_w_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_xlog1py_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_special_zeta_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_square_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_squeeze_multiple_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_stack_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_std_mean_unbiased_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_stft_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sub_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sum_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_sum_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_svd_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_svd_cuda_float64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_t_copy_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_t_copy_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_take_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tensor_split_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tile_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_to_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_topk_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_torch_ops_aten__safe_softmax_default_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_transpose_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_tril_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_triu_cuda_int64, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_triu_indices_cuda_int32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unfold_copy_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unfold_copy_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unique_cuda_float32, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unravel_index_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsafe_chunk_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_unsqueeze_cuda_uint8, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_vdot_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_as_cuda_bool, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_view_as_cuda_float16, test/inductor/test_torchinductor_opinfo.py::TestInductorOpInfoCUDA::test_comprehensive_zeros_cuda_float64 2025-12-04T10:06:24.6097022Z 2025-12-04T10:06:24.6097303Z Finished inductor/test_torchinductor_opinfo 9/12 ... [2025-12-04 10:06:24.591881][4967567.088066172], took 8.33min 2025-12-04T10:06:24.6098214Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:06:24.6098944Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:06:24.6099398Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T10:06:24.6099766Z Uploading artifacts took 0.00 seconds 2025-12-04T10:06:24.6100135Z Running dynamo/test_dynamic_shapes 1/2 ... [2025-12-04 10:06:24.608989][4967567.105175889] 2025-12-04T10:06:24.6100512Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:06:24.6101314Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_dynamic_shapes.py', '--shard-id=1', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:06:24.609453] 2025-12-04T10:14:11.7281290Z 2025-12-04T10:14:11.7282234Z dynamo/test_dynamic_shapes 1/2 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_dynamic_shapes_1.2_3052625fa6a61a5f_.log 2025-12-04T10:14:11.7443269Z Running 1003 items in this shard: test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_311_resume_block_keyerror2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_autocast_arguments_binding_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_autocast_cpu_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_autocast_cpu_graph_break_2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_autocast_cpu_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_autocast_cpu_graph_break_inner_fn_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_autocast_decorator_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_autocast_device_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_autograd_profiler_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_autograd_profiler_enabled_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_context_wrapping_grad_mode_nested_function_decorator_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_cuda_amp_autocast_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_cuda_device_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_cuda_event_across_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_cuda_event_created_outside_of_graph_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_cuda_event_method_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_cuda_stream_compared_with_stream_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_cuda_stream_context_manager1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_cuda_stream_method_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_disable_saved_tensors_hooks_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_disable_saved_tensors_hooks_prev_disabled_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_generic_context_manager_customized_ctx_manager_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_generic_context_manager_with_graph_break_CustomizedCtxManager_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_generic_ctx_manager_with_graph_break_customized_ctx_manager_with_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_graph_break_inlining_autocast_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_graph_break_inlining_grad_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_inactive_context_graph_break_local_nullctx_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_inactive_context_graph_break_stack2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_nested_generic_context_manager_with_graph_break_CustomizedCtxManager_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_nested_grad_mode_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_no_grad_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_return_context_manager_with_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_sdpa_kernel_ctx_manager1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_sdpa_kernel_ctx_manager2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_sdpa_kernel_ctx_manager3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_sdpa_kernel_ctx_manager_as_decorator_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesCtxManagerTests::test_sdpa_kernel_ctx_manager_kwargs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_T_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_add_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_addcmul__dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_are_functorch_transforms_active_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_attrgetter_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_broadcast_foreach_pow_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_call_dict2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_call_dict3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_callable_class_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_callable_lambda_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_callable_list_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_class_dict_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_cls_eq_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_cls_hasattr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_cls_is_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_complex_closure_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_constant1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_constant2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_constant3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_constant4_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_constant_set_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_context_wrapping_nested_functions_no_closure_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_cublas_allow_tf32_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_custom_dict_kwargs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_default_dict_closure_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_default_dict_list_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_default_dict_tuple_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_defaultdict_setdefault2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_defaultdict_setdefault3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_del_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_device_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dict_fromkeys_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dict_hasattr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dict_key_set1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dict_key_set2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dict_key_set3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dict_kwargs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dict_mutable_map_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dict_ops_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dict_param_keys_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dict_setdefault1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dict_setdefault2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dict_setdefault3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dict_values_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_distributed_is_available_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_distributed_is_initialized_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_dtype_compare_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_enumerate_custom_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_filter_fallback_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_filter_graph_break_reconstruct_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_filter_reconstruct_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_filter_with_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_foreach_lerp__dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_fstrings1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_fstrings2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_functools_cache_guard_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_generic_namedtuple_subclass_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_generic_namedtuple_user_methods_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_get_autocast_gpu_dtype_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_get_default_dtype_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_get_device_properties_tensor_device_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_globalfn_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_import1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_indexed_range_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_indirect3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_inline_jit__unwrap_optional_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_inline_script_if_tracing_fn_with_default_args_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_inline_softmax_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_is_any_autocast_enabled_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_is_checkpoint_valid_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_is_complex_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_is_contiguous_memory_format_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_is_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_is_floating_point_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_is_fx_tracing_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_is_inference_recompilation_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_is_not_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_isinstance_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_islice_chain_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_itertools_chain_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_itertools_chain_from_iterable_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_itertools_compress_tensors_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_itertools_filterfalse_basic_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_itertools_permutations_basic_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_itertools_permutations_various_iterators_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_itertools_product_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_len_constant_misc_iterables_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_list_clear_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_list_convert_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_list_expand_lhs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_list_reversed_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_list_setitem_slice_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_list_slice_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_listarg2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_listarg4_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_load_global_bool_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_lru_cache_warning_issued_during_tracing_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_mT_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_map_call_function_ex_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_map_dict_fromkeys_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_map_list_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_map_list_slice_assign_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_map_max_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_map_reduce_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_map_set_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_map_sorted_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_map_str_join_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_map_tuple_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_map_unpack_twice_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_math_fma_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_math_radians_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_mean_sum_np_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_methodcall1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_methodcall3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_methodcaller_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_min_max_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_namedtuple_defaults_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_namedtuple_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_namedtuple_hasattr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_namedtuple_subclass_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_namedtuple_user_methods_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_ndarray_builtin_functions_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_ndarray_method_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_ndarray_transpose_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_ndim_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_no_recompile_inner_function_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_no_recompile_inner_lambda_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_not_list_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_np_constant_collections_as_input_int_or_float_float_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_np_constant_collections_guards_float_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_np_constant_collections_guards_int_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_np_iinfo_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_number_method_method_as_integer_ratio_num_type3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_number_method_method_conjugate_num_type2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_number_method_method_conjugate_num_type4_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_number_method_method_is_integer_num_type6_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_numpy_attributes_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_numpy_dtype_argument_to_function_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_numpy_dtype_call_in_function_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_numpy_fft_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_numpy_linalg_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_numpy_meshgrid_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_obj_eq_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partial_across_graph_break_uninvoked_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___annotations___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___class___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___code___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___dir___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___eq___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___ge___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___get___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___getattribute___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___globals___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___gt___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___init___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___init_subclass___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___kwdefaults___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___lt___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___module___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___qualname___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___reduce___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___reduce_ex___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr___sizeof___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_hasattr_attr_args_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_lambda_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_recompilation_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_torch_op_arg_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_torch_op_kwarg_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_udf_arg_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_udf_kwarg_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_partials_udf_kwarg_module_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_pop_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_pos_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_pos_only_args_with_same_name_in_star_kwargs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_pow_int_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_promote_types_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_rand_inlined_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_range1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_range_length_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_range_with_slice_index_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_reduce_with_initial_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_reduce_with_none_initial_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_reduce_with_single_with_initial_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_return_dict2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_return_multiple_numpy_ndarray_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_return_tuple2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_returning_recursive_func_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_set_add_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_shape2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_slice3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_slice4_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_slice5_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_sorted_const_key_non_const_items_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_sourceless_build_method_type_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_sum_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_sum_shortcut_with_start_arg_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_sum_with_start_arg_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_tensor_dim_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_tensor_len_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_tensor_new_with_size_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_tensor_size_indexed_by_symint_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_tensor_type4_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_tensor_type5_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_torch_from_numpy_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_torch_get_device_module_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_truth_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_tuple2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_tuple_contains_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_tuple_sorted_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_unary_fold_op_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_unpack1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_unpack2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_unpack3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_unpack_ex1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_unpack_ex3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFunctionTests::test_viamethod_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_312_binary_slice_with_graph_break1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_RAISE_VARARGS_0_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_T_tensor_attribute_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_add_to_set_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_anomaly_aot_autograd_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_aot_autograd_propagate_unbacked_symints_shape_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_arange_length_with_float32_dtype_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_argwhere_with_dynamic_shapes_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_assert_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_assert_size_stride_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_backend_match_guard_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_backend_match_guard_multi_threads_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_boolarg_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_builder_for_class_with_metaclass_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_builtin_abs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_builtin_bool_on_symint_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_builtin_complex_args_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_builtin_subclasses_as_method_on_var_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_call_parent_non_class_methods_from_child_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_callpacked_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_cat_unbacked_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_catch_watchings2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_cell_output2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_check_assert_error_at_runtime_when_predicate_true_and_message_has_closure_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_check_compiles_when_predicate_true_and_message_None_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_check_compiles_when_predicate_true_and_message_has_global_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_check_compiles_when_predicate_true_and_message_has_no_closure_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_check_compiles_when_predicate_true_constant_and_message_None_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_check_raises_at_runtime_when_predicate_false_and_message_has_global_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_check_raises_at_runtime_when_predicate_false_and_message_has_no_closure_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_check_raises_at_runtime_when_predicate_false_constant_and_message_None_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_check_simplification_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_class_binop_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_class_duner_flags_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_class_has_instancecheck_method_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_clone_sparse_input_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_closure_out_of_scope_cell_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_closure_out_of_scope_cell_with_mutation_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_closure_with_mutation_and_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_closure_write_across_functions_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_compare_shapes_eq_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_compare_shapes_neq_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_compare_shapes_tuple_eq_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_compare_tensor_with_none_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_compilation_metrics_size_limit_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_cond_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_cond_export_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_cond_nested_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_conditional_list_comp_in_context_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_config_getattr_default_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_constant_hasattr_returns_bool_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_cross_entropy_loss_fancy_ctor2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_cross_entropy_loss_simple_ctor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_data_access_in_inference_mode_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_data_ptr_graph_break_aten_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_data_ptr_graph_break_builtin_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_default_dtype_change_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_deque_input_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_deterministic_algorithms_mutated_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dictcomp_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dunder_methods_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dunder_new_function_inlining2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dunder_new_function_inlining3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dunder_weakref_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_duplicate_graph_break_log_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dynamic_one_hot_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dynamic_shapes_as_strided_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dynamic_sources_dynamic_override_regex_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dynamic_sources_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dynamic_sources_int_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dynamo_cache_invalidate_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dynamo_cache_move_to_front_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dynamo_compiling_fake_tensor_to_vararg_int_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_dynamo_disabled_in_custom_op_kernels_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_empty_list_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_enum_as_dict_key_with_overloaded_str_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_enum_guards_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_enum_no_graphbreaks_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_enum_subclass_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_error_on_nested_fx_trace_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_error_on_recompile_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_escaping_closure_var_with_backward_hook_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_escaping_closure_var_with_nonlocal_var_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_fail_on_recompile_error_message_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_flat_name_to_original_fqn_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_fn_hasattr__name__1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_fn_hasattr__name__2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_fn_hasattr__name__3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_free_var_and_local_name_collision_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_frozen_dataclass_default_factory_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_frozen_dataclass_default_value_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_frozen_dataclass_hashable_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_frozen_dataclass_kw_only_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_fullgraph_capture_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_funcname_cache_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_generate_trivial_abstract_impl_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_get_cache_entry_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_getset_descriptor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_global_state_guard_serialization_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_grad_none_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_graph_break_compilation_metrics_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_graph_break_compilation_metrics_on_failure_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_graph_break_correctly_when_passing_numpy_ndarray_to_torch_function_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_guard_failure_fn2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_guard_failure_fn_shape_control_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_guard_filter_fn_by_id_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_guard_filter_fn_by_name_and_value_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_guard_filter_globals_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_guard_size_oblivious_backed_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_guard_string_escaped_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_guards_cse_pass_multiple_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_guards_strip_function_call_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_hash_hop_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_id_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_if_cond_nn_mod1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_if_cond_nn_mod3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_infer_unbacked_size_gt_zero_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_inference_mode_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_inference_mode_param_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_inline_closure_not_loaded_by_parent_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_inline_closure_returned_by_another_function_and_captures_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_inline_dict_function_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_inline_dict_mutation_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_inline_func_jump_on_tensor_condition_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_inline_list_mutation_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_inline_local_dict_clear_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_inplace_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_inplace_view_on_graph_input_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_inspect_signature_bind_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_int_shape_comparisons_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_int_shape_inplace_binops_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_intermediary_tensor_grad_access_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_invalid_args_builtin_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_is_floating_point_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_is_tensor_like2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_is_tensor_like_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_item_changes_new_shape_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_iter_set_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_iter_type_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_iterator_limit_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_itertools_accumulate_symint_default_sum_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_itertools_accumulate_tensors_default_sum_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_itertools_accumulate_tensors_kwargs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_itertools_groupby_pure_python_default_identify_func_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_itertools_infinite_count_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_itertools_infinite_repeat_mutation_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_itertools_islice_default_step_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_itertools_islice_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_itertools_repeat_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_itertools_tee_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_large_reduction_list_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_linear_module_free_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_list_append_return_none_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_list_iadd_side_effect_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_list_iadd_with_shape_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_list_slice_mul_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_listcomp_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_load_fast_and_clear_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_map_with_quantization_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_mark_dynamic_with_ranges_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_module_complex_iter_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_module_deepcopy_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_module_not_callable_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_multiple_inheritance_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_mutable_mapping_multiple_inheritance_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_namedtuple2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_namedtuple_class_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_namedtuple_source_dynamic_attributes_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_namedtuple_sourceless_dynamic_attributes_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_namedtuple_with_custom_getitem_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_ne_operator_with_custom_eq_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_ne_operator_with_custom_ne_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_nested_closure_mutation_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_nested_dataclass_reconstruct_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_nested_frozen_dataclass_hashable_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_nested_function_resuming_with_correct_globals_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_nested_optimize_decorator_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_nested_optimize_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_nested_optimize_run_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_nested_sequential_try_with_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_nested_wraps_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_nn_functional_reduction_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_nn_module_getattr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_nn_sequential_invocation_reposition_indices_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_no_guard_for_unused_sym_node_fstring_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_no_raise_guard_partial_constraint_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_as_global_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_fallback_on_eager_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_force_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_gt_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_iter_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_ndarray_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_non_torch_dtype_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_random_config_to_numpy_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_readonly_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_recompilation_scalar_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_take_along_axis_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_tolist_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_torch_operators_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_variable_isinstance_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_numpy_with_builtin_type_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_object_classmethod_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_object_staticmethod_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_onnx_shape_as_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_optimize_on_module_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_ordered_dict_alias_reconstruct_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_os_environ_get_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_os_environ_set_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_out_variants_with_resizing_on_graph_inputs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_out_variants_with_resizing_on_graph_inputs_with_dynamic1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_out_variants_with_resizing_on_graph_inputs_with_dynamic_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_outside_linear_module_free_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_packaging_version_parse_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_pair_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_param_shape_binops_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_parameter_free_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_pep0479_convert_stopiteration_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_precompile_entries_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_precompile_fail_on_recompile_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_pt2_compliant_overload_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_pure_python_accumulate_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_py_guards_mark_dynamic_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_python_slice_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_raise_guard_full_constraint_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_raise_guard_partial_constraint_across_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_raise_guard_partial_constraint_no_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_raise_on_backend_error_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_raises_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_range___iter___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_range_input_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_range_iter_side_effects_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_range_with_shape_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_real_imag_tensor_attribute_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_recompile_message_on_parameter_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_recompile_on_disable_1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_recompile_on_global_state_change_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_reconstruct_frozen_dataclass_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_recursion_depth_guards_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_recursive_inline_list_mutation_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_release_input_memory_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_remove_set_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_replay_side_effects_model_attr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_repro_graph_breaks_in__get_item_by_idx_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_restore_graphstate_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_returning_func_with_captured_func_and_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_running_func_with_captured_func_and_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_sample_input_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_scalar_device_movement_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_scalar_tensor_is_equivalent_to_symint_argument_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_scalar_tensor_is_equivalent_to_symint_list_argument_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_set_aliasing_recompiles_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_set_descriptor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_set_update_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_setattr_mutation2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_setattr_mutation3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_shape_and_tuple_equality_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_shape_env_equal_empty_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_shape_env_equal_evaluate_expr_divisible_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_shape_env_equal_evaluate_expr_refinement_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_shape_env_equal_evaluate_expr_replacement_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_shape_env_equal_runtime_assert_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_shape_env_no_recording_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_shape_int_inplace_binops_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_side_effects_codegen_update_mutated_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_simple_set_usage_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_size_dim_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_sourceless_namedtuple_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_sparse_output_inductor_should_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_storage_return_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_str___iter___dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_str_format_assert2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_str_format_return2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_super_calling_with_metaclass_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_sym_and_terms_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_symint_as_device_kwarg_non_strict_export_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_sys_modules_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tagging_tensors_mix_used_unused_structure_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tensor_data_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tensor_dict1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tensor_dict2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tensor_dict3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tensor_dynamic_method_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tensor_interacts_with_numpy_ndarray_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tensor_is_contiguous_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tolist_kd_dynamic_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tolist_kd_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tolist_scalar_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_torch_check_nonnegative_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_torch_compile_ctx_on_forward_and_training_step_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_torch_guards_stack_frame_register_inlining_deep_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_torch_guards_stack_frame_register_inlining_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_torch_nn_parameter_isinstance_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_torch_package_working_with_trace_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_torch_seed_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_torch_size_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_torch_size_numel_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_torch_variable_hasattr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tuple_class_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tuple_from_tuple_iter_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tuple_hasattr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_tuple_mul_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_type_copy_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_typing_dict_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_typing_union_and_optional_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_typing_variable_isinstance_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_unbacked_2d_expand_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_unbacked_sources_scalar_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_unbacked_sources_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_unbacked_symint_split_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_unhandled_exception_in_dynamo_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_unpack5_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_unpack_tensor_shape_mismatch_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_user_code_statically_known_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_user_defined_binop_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_user_defined_class_python_type_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_user_defined_setattr1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_user_function_variable_supports_function_argument_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_user_function_variable_supports_type_abcmeta_argument_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_user_getattr1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_user_getattr2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_user_getattribute_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_usr_cls_classmethod_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_variable_tracker_recursively_contains_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_version_ci_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_write_to_cells_with_name_shadowing_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_write_to_closures_in_inlining_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_writes_to_cells_across_frames2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_yield_from_in_a_loop_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_yield_from_user_stop_iteration_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesMiscTests::test_yield_send_to_subgenerator_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_Size_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_add_sub_alpha_out_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_batch_encoding_clone_inputs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_batchnorm_e2e_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_cells_unsupported_step_exception_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_changing_stride_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_class_member_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_classmethod_with_slots_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_clone_not_memory_dense_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_compilation_metrics_on_error_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_compile_complex_conj_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_compile_copy__int_overload_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_compiled_module_truthiness_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_const_dict_keyerror_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_data_attr_mutation_after_saved_for_bw_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_dataclass_init_with_default_factory_with_inputs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_ddp_checkpoint_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_dedup_global_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_deleted_compile_wrapper_segfault_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_dont_dce_rand_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_dynamic_shapes_float_guard_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_dynamic_shapes_right_side_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_dynamo_default_lru_cache_behavior_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_dynamo_set_recursion_limit_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_dynamo_set_recursion_limit_usage_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_ellipsis_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_embedding_backward_broadcasting_decomp_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_empty_graph_nested_calls_fullgraph_False_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_empty_list_contains_with_jump_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_ephemeral_module_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_error_return_without_exception_set_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_exec_wildcard_import_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_for_loop_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_get_parameter_dtype_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_grad_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_grad_mode_carrying_correct_state_after_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_graph_break_on_jit_isinstance_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_graph_break_on_jit_isinstance_pep585_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_graph_break_unsupported_fake_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_guard_fail_nested_tuple_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_guard_fail_tensor_bool_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_guard_ordering_shape_fail_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_hf_bigbird_unsqueeze_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_hf_classinstantier_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_hf_gelu_inline_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_hf_model_output_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_hf_xsoftmax_inference_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_iadd_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_incompatible_configs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_indexing_with_list_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_inductor_rng_default_dtype_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_inlining_cornercase_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_is_make_fx_tracing_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_is_symbolic_tracing_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_issue111522_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_issue111918_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_issue1466_size_aot_autograd_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_issue164247_backend_eager_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_issue164247_backend_inductor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_jit_trace_errors_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_kwargs_out_list_variable_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_list_aliasing_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_list_index_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_listcomp_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_maml_item_capture_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_many_views_with_mutation_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_map_with_multiple_args_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_mem_leak_guards_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_merge_criteria_processor_list1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_method_overriding_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_multi_import_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_named_buffers_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_negative_floor_div_solve_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_nested_while_loop_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_nn_module_stack_bc_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_nonconst_issubclass_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_nullcontext1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_nullcontext2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_numpy_tobytes_no_error_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_omegaconf_listconfig_iter_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_ones_out_dynamic_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_optim_state_references_cleared_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_optimized_deepcopy_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_optimized_module_patched_init_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_optimized_module_training_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_out_nested_cell_shape_change_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_out_nested_cell_tuple_shape_change_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_out_root_cell_shape_change_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_out_root_cell_tuple_shape_change_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_overlapping_inputs_with_dynamic_shapes_error_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_partitioner_cse_respects_mutation_boundaries_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_primtorch_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_primtorch_no_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_recursive_map_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_reformer_eval_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_reformer_train_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_relative_import_no_modulename_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_requires_grad_guards_with_grad_mode1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_return_value_duplication_mixed_grad_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_return_value_duplication_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_rewrite_assert_noop_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_rewrite_assert_with_msg_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_rng_state_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_setattr_requires_grad_graph_breaks_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_setitem_boolean_mask_diff_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_setitem_tuple_boolean_mask_diff_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_sigmoid_out2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_size_typematch_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_slice_into_list_mutable_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_sort_out2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_sort_out_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_split_with_sizes_aot_autograd_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_stk_sdd_is_transposed_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_stop_iteration_reconstruct_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_super_classmethod_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_super_classmethod_inheritance_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_super_diamond_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_super_in_staticmethod_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_swin_base_tensor_attr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_symnode_is_not_op_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_symnode_is_op_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_sys_monitoring_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_tensor_data_kwarg_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_tensor_item_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_tensor_set_data_backend_aot_eager_func_name_func1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_tensor_set_data_backend_aot_eager_func_name_func2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_tensor_set_data_backend_aot_eager_func_name_func3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_tensor_set_data_backend_eager_func_name_func1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_tensor_set_data_backend_eager_func_name_func2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_tensor_set_data_backend_inductor_func_name_func1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_tensor_set_data_backend_inductor_func_name_func2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_tensor_set_data_backend_inductor_func_name_func3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_torch_ops_aten_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_torch_tensor_ops_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_torch_variable_type_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_torchname_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_trace_functional_tensor_with_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_tuple_enum_as_key_dict_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_typed_dict_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_typed_dict_total_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_udf_classes_reconstruction_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_unbind_copy_out_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_unpack_hooks_dont_run_during_tracing_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_user_ctor_ctx_manager_custom_init_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_user_ctor_ctx_manager_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_user_defined_object_callable_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_validate_model_kwargs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_weakref_callback_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_weakref_construction_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_weakref_del_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_weakref_proxy_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_weakref_reconstruct_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_while_loop_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesReproTests::test_while_loop_graph_break_inside_call_function_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_access_by_keys_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_children_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_constloop_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_conv_call_super_forward_directly_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_conv_transpose_call_forward_directly_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_conv_transpose_call_super_forward_directly_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_densenet_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_enumvalues_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_fnmembercmp1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_fnmembercmp2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_generation_tag_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_iseval1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_istraining1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_lazy_module2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_lazy_module4_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_lazy_module6_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_lazy_module_speculation_log_divergence_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_module_attribute_precedence_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_module_call_module_with_static_forward_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_module_name_string_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_module_static_method_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_moduledict_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_modulelist_custom_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_modulelist_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_modulelist_nested_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_modulemethod1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_modulemethod2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_named_children_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_nn_module_unspec_int_attr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_nn_moduledict_contains_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_parameters1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_parameters2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_self_mutating1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_sequential_with_duplicated_module2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_simple_torch_function_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_submodules1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_super1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesNNModuleTests::test_torch_function_with_closure_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_byte_tensor_does_not_crash_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_capture_symbolic_tracing_simple_within_fake_mode_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_capture_symbolic_tracing_within_fake_mode_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_cond_raise_user_error_on_mismatch_return_length_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_cond_raise_user_error_on_missing_args_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_cond_raise_user_error_on_non_list_operands_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_dataclass_input_output_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_dupes_2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_dupes_2_with_aten_graph_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_dupes_and_bypass_reorder_with_non_tensor_arg_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_dupes_and_bypass_with_non_tensor_arg_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_dupes_and_bypass_with_non_tensor_arg_with_aten_graph_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_dupes_and_bypass_with_non_tensor_output_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_dupes_and_bypass_with_non_tensor_output_with_aten_graph_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_dupes_with_aten_graph_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_dynamic_slicing_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_empty_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_compare_optimize_with_make_fx_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_control_flow_with_getattr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_decomp_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_dynamic_dim_cleanup_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_graph_bypass_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_graph_bypass_with_aten_graph_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_graph_with_complex_reorder_with_aten_graph_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_masking_with_no_grad_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_mismatched_out_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_multi_dynamic_dim_constraint_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_multi_dynamic_dim_unsafe_relationship_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_no_raise_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_pass_arg_by_name_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_pass_arg_by_name_star_args_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_persist_assert_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_preserve_constraints_as_metadata_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_preserves_nn_module_stack_for_get_attr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_raise_guard_full_constraint_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_raise_on_relationship_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_shape_control_flow_1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_specialized_int_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_symbolic_shape_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_args_with_default_float_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_args_with_default_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_aten_graph_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_cond_branches_calling_methods_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_cond_closure_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_constant_dict_values_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_constant_free_function_and_class_method_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_constant_method_on_module_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_constant_method_on_module_invoke_twice_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_constant_not_none_control_flow_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_constant_not_none_control_flow_pos_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_functools_wrapped_fn_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_kwargs_with_default_None_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_kwargs_with_default_float_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_kwargs_with_default_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_kwargs_with_default_tuple_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_map_cond_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_stack_trace_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_export_with_symbool_inputs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_immutable_list_dict_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_input_container_type_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_input_nonlocal_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_input_unused_nonlocal_ok_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_list_contains_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_list_unpack_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_multiple_outputs_op_with_evaluator_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_no_tensor_computation_2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_no_tensor_computation_2_with_aten_graph_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_predispatch_with_for_out_dtype_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_predispatch_with_for_out_dtype_nested_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_predispatch_with_higher_order_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_preserve_fx_node_metadata_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_preserve_fx_node_metadata_inline_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_preserve_fx_node_metadata_recompile_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_remove_redundant_dynamic_dim_in_error_message_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_retracibility_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_retracibility_nested_list_out_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_round_dynamic_shapes_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_sym_contains_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_symbolic_tracing_within_fake_mode_with_constraints_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_symbolic_tracing_within_fake_mode_with_constraints_with_parameters_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_symbool_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_torch_inference_mode_ctx_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_zeroes_in_and_out_different_shape_on_test_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_zeroes_in_new_shape_scalar_out_permute_dupe_and_bypass_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesExportTests::test_zeroes_in_new_shape_scalar_out_permute_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_control_flow1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_dynamic_getitem_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_extended_args_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_graph_break_on_item_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_indirect_unsupported3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_multigraph_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_restore_state_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_resume2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_resume3_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_resume_paths_join_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_resume_with_no_grad1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_stack_state1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_start1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_start4_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesSubGraphTests::test_tuple_iterator_mutate_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_access_module_attr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_capture_constants_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_capture_global_num_adds_guard_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_capture_input_num_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_capture_tracked_nested_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_capture_untracked_global_nested_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_concat_unbacked_shape_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_cond_branches_no_arguments_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_cond_free_variable_in_both_branches_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_cond_graph_break_in_one_branch_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_cond_source_fn_stack_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_cond_subgraph_name_is_valid_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_cond_with_empty_operands_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_dynamic_shapes_over_vmap_batch_size_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_fallback_on_graph_break_simple_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_hints_wrapper_incorrect_type_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_hopify_generic_wrap_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_lift_tensors_with_shared_symbols_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_make_closure_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_map_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_map_kwargs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_map_lowers_to_graph_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_map_side_effect_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_map_source_fn_stack_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_map_subgraph_name_is_valid_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_modules_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_nested_wrap_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_no_freevars_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_output_with_dict_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_register_subclass_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_same_freevar_twice_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_side_effect_del_existing_attr_global_obj_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_side_effect_del_existing_attr_nonlocal_obj_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_side_effect_in_body_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_side_effect_local_list_append_no_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_side_effect_mutate_global_num_builtin_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_side_effect_mutate_global_num_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_side_effect_mutate_nonlocal_num_builtin_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_side_effect_mutate_nonlocal_num_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_side_effect_mutate_nonlocal_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_side_effect_nested_nonlocal_list_append_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_side_effect_nonlocal_list_append_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_side_effect_set_new_attr_global_module_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_side_effect_set_new_attr_global_obj_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_support_float_in_output_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_tensor_with_unbacked_shape_closure_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_unbacked_symbol_closure_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_vmap_multiply_scalar_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_wrap_inductor_compiled_regions_option_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_wrap_inductor_compiled_regions_with_backward_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_wrap_kwarg_default_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_wrap_kwarg_default_if_branch_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_wrap_kwarg_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_wrap_kwarg_only_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_wrap_kwarg_recompile_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_wrap_pytree_args_nested_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_wrap_pytree_kwargs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_wrap_source_fn_stack_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesHigherOrderOpTests::test_wrap_subgraph_name_is_valid_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_functional_call_disable_inline_nn_module_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_grad_capture_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_grad_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_grad_freevar_python_scalar_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_grad_non_tensor_input_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_grad_pytree_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_grad_recompile_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_grad_with_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_grad_with_side_effect_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_jacfwd_randomness_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_jacrev_two_tensors_argnums_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_jvp_call_torch_compile_fn_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_jvp_has_aux_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_jvp_jvp_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_jvp_two_tensors_disable_grad_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_jvp_two_tensors_has_aux_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_linearize_jvp_fn_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vjp_call_compiled_backward_fn_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vjp_has_aux_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vjp_multiple_outputs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vjp_multiple_outputs_python_struct_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_call_compiled_backward_fn_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_call_torch_compile_fn_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_free_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_multiple_invocation_in_dims_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_multiple_outputs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_new_tensor_unused_in_body_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_over_vmap_captured_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_previous_illegal_op_no_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_pytree_inputs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_recompile_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_side_effects_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_two_inputs_tuple_in_dims_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_with_conditional_graph_break_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_with_graph_break_2_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesFuncTorchHigherOrderOpTests::test_vmap_with_graph_break_lambda_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_alias_inputs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_aot_autograd_expand_mutation_backwards_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_aot_autograd_expand_mutation_error_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_aot_autograd_raises_invalid_leaf_set_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_aot_autograd_stride_reconstruction_on_zero_dim_dynamic_shaped_tensor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_aot_sequence_nr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_arg_dupe_via_dynamo_recompiles_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_arg_dupe_via_dynamo_recompiles_many_with_global_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_autograd_function_tangent_mutation_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_data_ptr_access_copy_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_data_ptr_access_fails_in_backward_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_donated_buffer1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_donated_buffer4_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_donated_buffer_with_retain_or_create_graph1_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_double_backward_errors_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_eager_sequence_nr_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_grad_inputs_alias_inputs_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_inputs_overlapping_with_mutation_recompile_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_inputs_overlapping_with_mutation_stress_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_joint_custom_pass_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_multiple_aot_autograd_calls_dupe_args_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_mutation_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_negative_testing_mutation_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_nn_parameter_construction_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_no_storage_overlap_guards_no_aliasing_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_no_storage_overlap_guards_no_mutation_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_split_with_sizes_aot_autograd_cleans_up_traceback_meta_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesAotAutogradFallbackTests::test_unbacked_activation_specialized_in_inductor_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesTestSDPA::test_graph_break_SDPAParams_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesTestSDPA::test_input_SDPAParams_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesTestSDPA::test_returns_SDPAParams_dynamic_shapes, test/dynamo/test_dynamic_shapes.py::DynamicShapesTestSDPA::test_sdpa_c_functions_no_graph_break_dynamic_shapes 2025-12-04T10:14:11.7589726Z 2025-12-04T10:14:11.7589862Z Finished dynamo/test_dynamic_shapes 1/2 ... [2025-12-04 10:14:11.729601][4968034.225784955], took 7.79min 2025-12-04T10:14:11.7590270Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:14:11.7590626Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:14:11.7590850Z Running inductor/test_cpu_repro 5/5 ... [2025-12-04 10:14:11.746349][4968034.242536119] 2025-12-04T10:14:11.7591045Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:14:11.7591431Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_cpu_repro.py', '--shard-id=5', '--num-shards=5', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:14:11.746836] 2025-12-04T10:20:33.7870024Z 2025-12-04T10:20:33.7871448Z inductor/test_cpu_repro 5/5 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_cpu_repro_5.5_37b1aa99df7e8fd5_.log 2025-12-04T10:20:33.7972667Z Running 149 items in this shard: test/inductor/test_cpu_repro.py::CPUReproTests::test_add_layernorm, test/inductor/test_cpu_repro.py::CPUReproTests::test_argmax_argmin_with_nan_value, test/inductor/test_cpu_repro.py::CPUReproTests::test_argmin, test/inductor/test_cpu_repro.py::CPUReproTests::test_auto_simd, test/inductor/test_cpu_repro.py::CPUReproTests::test_broadcast_mul_lowp_fp, test/inductor/test_cpu_repro.py::CPUReproTests::test_channel_shuffle_cl_output, test/inductor/test_cpu_repro.py::CPUReproTests::test_channels_last_view_as_complex, test/inductor/test_cpu_repro.py::CPUReproTests::test_concat_inner_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_convert_double_to_fp32_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_convert_fp32_to_double_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_cpu_vec_cosim, test/inductor/test_cpu_repro.py::CPUReproTests::test_decomposed_dequant_relu_quant_uint8, test/inductor/test_cpu_repro.py::CPUReproTests::test_dequant_maxpool2d_lowering_uint8, test/inductor/test_cpu_repro.py::CPUReproTests::test_dequant_quant_lowering_uint8, test/inductor/test_cpu_repro.py::CPUReproTests::test_disabled_amp_is_inference_True, test/inductor/test_cpu_repro.py::CPUReproTests::test_expr_vec_non_contiguous, test/inductor/test_cpu_repro.py::CPUReproTests::test_for_loop_collapsed, test/inductor/test_cpu_repro.py::CPUReproTests::test_fp8_cast_float16_shape_15,3,13, test/inductor/test_cpu_repro.py::CPUReproTests::test_fp8_cast_float16_shape_4,2048,4096, test/inductor/test_cpu_repro.py::CPUReproTests::test_fused_node, test/inductor/test_cpu_repro.py::CPUReproTests::test_group_norm_large_input, test/inductor/test_cpu_repro.py::CPUReproTests::test_group_norm_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_index_propagation_issue_102065, test/inductor/test_cpu_repro.py::CPUReproTests::test_index_put, test/inductor/test_cpu_repro.py::CPUReproTests::test_index_put2, test/inductor/test_cpu_repro.py::CPUReproTests::test_inplace_add_alpha, test/inductor/test_cpu_repro.py::CPUReproTests::test_int32_pointwise_vec, test/inductor/test_cpu_repro.py::CPUReproTests::test_int_div, test/inductor/test_cpu_repro.py::CPUReproTests::test_linear_float64, test/inductor/test_cpu_repro.py::CPUReproTests::test_linear_packed, test/inductor/test_cpu_repro.py::CPUReproTests::test_load_inf_bf16, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_False_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_1_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_False_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_False_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_1_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_False_empty_state_True_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_False_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_False_empty_state_False_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_False_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_1_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_False_batch_first_True_batch_size_7_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_False_batch_size_7_seq_len_1, test/inductor/test_cpu_repro.py::CPUReproTests::test_lstm_packed_unbatched_True_input_size_7_hidden_size_7_num_layers_7_bidirectional_True_bias_True_empty_state_True_batch_first_True_batch_size_1_seq_len_7, test/inductor/test_cpu_repro.py::CPUReproTests::test_maxpool2d_cpu_only, test/inductor/test_cpu_repro.py::CPUReproTests::test_non_contiguous_index_with_constant_stride, test/inductor/test_cpu_repro.py::CPUReproTests::test_non_contiguous_load_buf_quant_uint8, test/inductor/test_cpu_repro.py::CPUReproTests::test_non_contiguous_reduction_store, test/inductor/test_cpu_repro.py::CPUReproTests::test_parallel_reduction_vectorization, test/inductor/test_cpu_repro.py::CPUReproTests::test_per_channel_fake_quant_module_uint8, test/inductor/test_cpu_repro.py::CPUReproTests::test_per_tensor_fake_quant_int8, test/inductor/test_cpu_repro.py::CPUReproTests::test_pow_cos, test/inductor/test_cpu_repro.py::CPUReproTests::test_redundant_to_node_elimination_lowp_fp, test/inductor/test_cpu_repro.py::CPUReproTests::test_require_stride_order_non_owning, test/inductor/test_cpu_repro.py::CPUReproTests::test_scalar_mul_bfloat16, test/inductor/test_cpu_repro.py::CPUReproTests::test_share_local_buffers_in_outer_loop_fusion, test/inductor/test_cpu_repro.py::CPUReproTests::test_sign_cpu_only, test/inductor/test_cpu_repro.py::CPUReproTests::test_skip_cpp_codegen, test/inductor/test_cpu_repro.py::CPUReproTests::test_symbolic_shape_scalar_value_reduction, test/inductor/test_cpu_repro.py::CPUReproTests::test_timed_cpu_only, test/inductor/test_cpu_repro.py::CPUReproTests::test_to_channels_last_lowp_fp, test/inductor/test_cpu_repro.py::CPUReproTests::test_to_dtype_float_bool, test/inductor/test_cpu_repro.py::CPUReproTests::test_to_uint8_rounding_method, test/inductor/test_cpu_repro.py::CPUReproTests::test_transpose_with_norm, test/inductor/test_cpu_repro.py::CPUReproTests::test_vec_bitwise, test/inductor/test_cpu_repro.py::CPUReproTests::test_vec_randn, test/inductor/test_cpu_repro.py::CPUReproTests::test_view_dtype 2025-12-04T10:20:33.8072745Z 2025-12-04T10:20:33.8073125Z Finished inductor/test_cpu_repro 5/5 ... [2025-12-04 10:20:33.786888][4968416.283072958], took 6.37min 2025-12-04T10:20:33.8074412Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:20:33.8075605Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:20:33.8076380Z Running inductor/test_inductor_freezing 1/1 ... [2025-12-04 10:20:33.803898][4968416.30008419] 2025-12-04T10:20:33.8077026Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:20:33.8078516Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_inductor_freezing.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:20:33.804348] 2025-12-04T10:21:15.9151538Z 2025-12-04T10:21:15.9153034Z inductor/test_inductor_freezing 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_inductor_freezing_1.1_0cb1f07bc4cec3c3_.log 2025-12-04T10:21:15.9174323Z Running 48 items in this shard: test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_aliased_param_return_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_autocast_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_bn_with_multi_bn_share_conv_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_functional_bn_with_multi_bn_share_conv_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_layout_convert_with_view_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_multiple_uses_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_weight_layout_convert_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_conv_with_as_strided_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_cpp_wrapper_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_dont_change_dtype_folding_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_error_on_eager_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_folded_conv_bn_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_folded_conv_bn_hardswish_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_folded_conv_bn_with_module_sharing_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_folded_conv_functional_bn_with_module_sharing_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_mm_concat_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_mutation_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_param_deallocated_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_redundant_clone_for_layout_convert_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_rng_op_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_static_indices_cudagraph_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_symint_not_folded_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_unequal_bias_horizontal_addmm_fusion_cpu, test/inductor/test_inductor_freezing.py::FreezingCpuTests::test_unfolded_bn_cpu, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_aliased_param_return_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_autocast_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_conv_bn_with_multi_bn_share_conv_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_conv_functional_bn_with_multi_bn_share_conv_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_conv_layout_convert_with_view_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_conv_multiple_uses_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_conv_weight_layout_convert_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_conv_with_as_strided_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_cpp_wrapper_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_dont_change_dtype_folding_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_error_on_eager_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_folded_conv_bn_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_folded_conv_bn_hardswish_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_folded_conv_bn_with_module_sharing_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_folded_conv_functional_bn_with_module_sharing_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_mm_concat_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_mutation_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_param_deallocated_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_redundant_clone_for_layout_convert_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_rng_op_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_static_indices_cudagraph_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_symint_not_folded_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_unequal_bias_horizontal_addmm_fusion_cuda, test/inductor/test_inductor_freezing.py::FreezingGpuTests::test_unfolded_bn_cuda 2025-12-04T10:21:15.9194464Z 2025-12-04T10:21:15.9194882Z Finished inductor/test_inductor_freezing 1/1 ... [2025-12-04 10:21:15.914840][4968458.41102337], took 0.70min 2025-12-04T10:21:15.9196194Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:21:15.9317639Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:21:15.9325634Z Running inductor/test_flex_attention 2/4 ... [2025-12-04 10:21:15.932124][4968458.428305186] 2025-12-04T10:21:15.9326302Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:21:15.9329264Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_flex_attention.py', '--shard-id=2', '--num-shards=4', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:21:15.932617] 2025-12-04T10:28:56.3091264Z 2025-12-04T10:28:56.3092155Z inductor/test_flex_attention 2/4 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_flex_attention_2.4_ff42703fa27499c0_.log 2025-12-04T10:28:56.3204856Z Running 191 items in this shard: test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_GQA_causal_mask_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_aot_eager_gradcheck_score_mod1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_aot_eager_gradcheck_score_mod4_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_aot_eager_gradcheck_score_mod5_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_backend_defaults_and_rejects_invalid_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_backend_rejects_legacy_force_use_flag_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_backend_triton_decode_errors_with_non_power_of_two_gqa_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_block_mask_non_divisible_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_automatic_dynamic_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_automatic_dynamic_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_automatic_dynamic_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod0_BLOCK_SIZE3_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod0_BLOCK_SIZE3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod0_BLOCK_SIZE3_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod0_BLOCK_SIZE_128_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod0_BLOCK_SIZE_128_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod0_BLOCK_SIZE_256_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod1_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod1_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod1_BLOCK_SIZE_128_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod1_BLOCK_SIZE_256_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod2_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod2_BLOCK_SIZE_128_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod2_BLOCK_SIZE_256_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod3_BLOCK_SIZE2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod3_BLOCK_SIZE_128_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod3_BLOCK_SIZE_256_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod5_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod5_BLOCK_SIZE2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod5_BLOCK_SIZE3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod5_BLOCK_SIZE3_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod5_BLOCK_SIZE_256_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod5_BLOCK_SIZE_256_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod6_BLOCK_SIZE_128_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod6_BLOCK_SIZE_256_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod7_BLOCK_SIZE2_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod7_BLOCK_SIZE2_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod7_BLOCK_SIZE3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod7_BLOCK_SIZE_128_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_different_block_size_score_mod7_BLOCK_SIZE_256_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_dynamic_score_mask_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_dynamic_score_mask_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_dynamic_score_mask_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod5_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_custom_sparse_block_size_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_custom_sparse_block_size_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_custom_sparse_block_size_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_default_sparse_block_size_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_builtin_score_mods_seqlen_lt_default_sparse_block_size_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_cant_lower_error_message_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_captured_score_mod_aot_eager_gradcheck_score_mod_name__head_offset_mode_aot_eager_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_cpu_error_message_return_lse_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_debug_flag_disables_internal_compilation_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_dependent_causal_bidirectional_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_document_masking_edge_case_mode_aot_eager_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_document_masking_edge_case_mode_eager_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_dynamic_shapes_bug_dynamic_batch_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_eager_backward_strides_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_inductor_permute_order0_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_inductor_permute_order1_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_inductor_permute_order2_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_backward_stride_ordering_mode_inductor_permute_order3_shape0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_eager_permute_order4_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_paged_attention_permute_order3_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_flex_attention_stride_ordering_mode_paged_attention_permute_order4_shape1_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_free_symbol_dynamic_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_fully_masked_out_rows_0_check_compile_False_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_function_composition_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_function_composition_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims0_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims0_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims0_head_dims1_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims0_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims0_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims1_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims1_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims1_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims1_score_mod3_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims1_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims1_head_dims1_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims0_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims0_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims0_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims1_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims1_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims1_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_batch_dims2_head_dims1_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims0_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims0_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims0_head_dims1_score_mod4_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims0_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims0_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims0_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims1_head_dims1_score_mod7_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims0_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims1_score_mod0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims1_score_mod2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_kv_batch_broadcast_causal_mask_batch_dims2_head_dims1_score_mod6_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_logsumexp_correctness_score_mod0_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_logsumexp_correctness_score_mod1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_logsumexp_only_return_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_lse_masked_output_backend_flex_decode_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_max_autotune_with_captured_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_mixed_device_error_message_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_mixed_dtypes_fails_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_modular_indexing_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_new_empty_mask_mod_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_njt_causal_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_njt_causal_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod0_head_dims1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod1_head_dims0_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod1_head_dims0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod1_head_dims1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod2_head_dims1_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod3_head_dims0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod3_head_dims1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod3_head_dims1_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod4_head_dims0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod4_head_dims1_cuda_bfloat16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_non_equal_head_dims_score_mod6_head_dims1_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_padded_dense_causal_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_aux__inverse_causal_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_aux__rel_causal_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_aux__times_two_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_aux_deprecation_warnings_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_max__rel_causal_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_return_max__times_two_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_selective_ac_ops_to_save0_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_seq_masking_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_skip_odd_keys_cuda_float32, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s0_k_s3_v_s3_do_s0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s0_k_s3_v_s3_do_s1_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s0_k_s3_v_s3_do_s2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s1_k_s0_v_s0_do_s2_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_strided_inputs_q_s1_k_s2_v_s2_do_s0_cuda_float16, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_tma_with_customer_kernel_options_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_triton_template_warp_specialization_cuda, test/inductor/test_flex_attention.py::TestFlexAttentionCUDA::test_validate_small_embedding_size_error_message_cuda, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod0_cuda_float32, test/inductor/test_flex_attention.py::TestPagedAttentionCUDA::test_paged_builtin_score_mods_score_mod5_cuda_float16, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_block_mask_attributes_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_block_size_changes_BLOCK_SIZE_256_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_block_size_changes_BLOCK_SIZE_64_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_forward_pass_with_none_q_indices_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_getitem_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_init_mismatched_full_q_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_pytree_flatten_unflatten_cuda, test/inductor/test_flex_attention.py::TestBlockMaskCUDA::test_pytree_preserves_new_attributes_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_backprop_error_case_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_batch_head_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_batch_head_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_batch_head_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_comparison_vs_sdpa_with_learnable_bias_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_distinct_biases_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_distinct_biases_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_distinct_biases_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_flipped_indexed_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_global_tokens_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_head_specific_gate_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_indirect_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_learnable_bias_global_compiled_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_learnable_bias_global_compiled_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_local_window_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_local_window_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_multiplicative_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_multiplicative_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_multiplicative_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_only_grad_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_only_grad_batch:2_head:4_seq_len:37_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_relative_1d_bias_only_grad_batch:2_head:4_seq_len:37_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:bfloat16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float32_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_mode_default_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_symmetric_bias_batch:2_head:4_seq_len:37_headdim:16_dtype:float32_mode_max-autotune-no-cudagraphs_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_weird_bias_batch:2_head:4_seq_len:256_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_weird_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:bfloat16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_weird_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float16_cuda, test/inductor/test_flex_attention.py::TestLearnableBiasesCUDA::test_weird_bias_batch:2_head:4_seq_len:277_headdim:16_dtype:float32_cuda 2025-12-04T10:28:56.3237419Z 2025-12-04T10:28:56.3237559Z Finished inductor/test_flex_attention 2/4 ... [2025-12-04 10:28:56.312736][4968918.808922994], took 7.67min 2025-12-04T10:28:56.3237983Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:28:56.3238408Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:28:56.3238643Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T10:28:56.3238835Z Uploading artifacts took 0.00 seconds 2025-12-04T10:28:56.3239032Z Running inductor/test_cpu_cpp_wrapper 1/1 ... [2025-12-04 10:28:56.319516][4968918.815699183] 2025-12-04T10:28:56.3239238Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:28:56.3239651Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_cpu_cpp_wrapper.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:28:56.319844] 2025-12-04T10:29:02.6529917Z 2025-12-04T10:29:02.6530726Z inductor/test_cpu_cpp_wrapper 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_cpu_cpp_wrapper_1.1_ea064cb6b9134938_.log 2025-12-04T10:29:02.6531027Z 2025-12-04T10:29:02.6531157Z Finished inductor/test_cpu_cpp_wrapper 1/1 ... [2025-12-04 10:29:02.652529][4968925.14871225], took 0.11min 2025-12-04T10:29:02.6537670Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:29:02.6690809Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:29:02.6697276Z Running inductor/test_compile_subprocess 3/3 ... [2025-12-04 10:29:02.669479][4968925.165665523] 2025-12-04T10:29:02.6697963Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:29:02.6702526Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_compile_subprocess.py', '--shard-id=3', '--num-shards=3', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:29:02.669966] 2025-12-04T10:38:46.7769620Z 2025-12-04T10:38:46.7771006Z PRINTING LOG FILE of inductor/test_compile_subprocess 3/3 (test/test-reports/inductor.test_compile_subprocess_3.3_8d9c6bb3e356ae25_.log) 2025-12-04T10:38:46.7772498Z Test results will be stored in test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-545f778f39a758b6.xml 2025-12-04T10:38:46.7773566Z ============================= test session starts ============================== 2025-12-04T10:38:46.7774369Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:38:46.7775024Z cachedir: .pytest_cache 2025-12-04T10:38:46.7775854Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:38:46.7776673Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T10:38:46.7777075Z configfile: pytest.ini 2025-12-04T10:38:46.7778191Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:38:46.7779006Z collecting ... collected 897 items 2025-12-04T10:38:46.7779492Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T10:38:46.7888313Z Running 297 items in this shard: test/inductor/test_compile_subprocess.py::GPUTests::test__dyn_quant_matmul_4bit_bf16_input_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test__unsafe_masked_index_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test__unsafe_masked_index_put_accumulate_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_adaptive_avg_pool2d_low_prec_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_adaptive_avg_pool_errors_with_long_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_add_complex4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_add_complex7_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_add_complex8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_add_complex_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_addmv_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_aoti_eager_dtype_device_layout_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_aoti_eager_override_registration_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_aoti_eager_support_out_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_arange2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_arange5_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_argmax_argmin_with_duplicates_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_argmax_argmin_with_nan_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_argmax_to_float_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d6_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d_backward2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d_backward_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_avg_pool3d_backward2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_baddbmm_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_batch_norm_2d_2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bernoulli2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bitwise2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bitwise_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bmm1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_broadcast_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_default_kwargs_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int32_uint8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int64_int16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int64_int32_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int64_uint8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int8_int32_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int8_int64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_uint8_int16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_uint8_int32_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_uint8_int8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_uint8_uint8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_buffer_batch_norm_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_buffer_copied_in_graph_with_different_shapes_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_buffer_use_after_remove_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_builtins_round_float_ndigits_neg_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_builtins_round_float_ndigits_pos_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_builtins_round_float_ndigits_zero_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_builtins_round_int_ndigits_zero_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cat_of_loops_and_extern_kernel_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cat_single_empty_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cat_uint8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cat_upcasting_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cauchy_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_chunk_recompiles_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_computed_buffer_inlining_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_consecutive_split_cumprod_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_const_int32_to_float_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_constant_pad_1d_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_constant_pad_float64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_constant_pad_nd_inplace_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_conv_functional_bn_fuse_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_convolution4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_copy_with_scalar_src_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cpu_scalar_with_cpu_scalar_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cpu_scalar_with_gpu_tensor_cpp_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cpu_scalar_with_gpu_tensor_dynamic_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cumsum_inf_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_cumsum_no_mask_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_custom_op_2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_custom_op_fixed_layout_sequential_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_custom_scan_op_multi_input_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_data_type_propogation_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dense_mask_index_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_div1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_div8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dropout_trivial_1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_bfloat16_float64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_bfloat16_int16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_bfloat16_int32_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_bfloat16_int8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_bfloat16_uint8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float16_int8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float32_bfloat16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float32_float32_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float32_float64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float64_int64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int16_bfloat16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int16_float16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int16_float32_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int16_uint8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int32_int8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int64_bfloat16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int64_int8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int64_uint8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_bfloat16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_float16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_float32_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_float64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_int16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_uint8_int16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_uint8_uint8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_elu_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_erfc_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_erfinv_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_exp_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_expanded_reduction_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_expm1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_fallback_mutable_op_with_return_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_fill1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_flip_cat_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_flip_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_float_index_expression_type_promotion_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_floordiv_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_fmod_zero_dim_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_forced_buffer_realize_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_fractional_max_pool2d2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_full_like_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_fuse_large_params_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_fusing_write_into_disjoint_read_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_gather1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_gather_scatter_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_generate_rand_fp8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_glu_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_both_scalars_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_constant_tensor2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_mutation_real_name_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_no_inputs_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_pad_dynamic_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_refcount_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_grid_sampler_2d_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_dynamic_shapes_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_propagation_abs_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_propagation_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_propagation_floordiv_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_propagation_remainder_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_put_failed_reinplace_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_put_fallback1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_put_index_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_index_select_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_indirect_load_broadcast_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_inductor_layout_optimization_input_mutations_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_inner_fn_str_and_stride_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_inplace_add_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_input_mutation2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_input_mutation3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_input_mutation4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_insignificant_strides_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_isinf2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_isinf_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_issue102546_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_kernel_names_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_l1_loss_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_large_grid_use_block_ptr_False_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_layer_norm_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_leaky_relu_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_lgamma_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_like_rands2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_linear_mixed_dtype_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_lite_regional_compile_repeated_blocks_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_lite_triton_kernel_wrapper_functional_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_log2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_log_fp64_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_logcumsumexp_zero_dim_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_low_memory_max_pool_dilation_1_dim_2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_low_memory_max_pool_dilation_2_dim_3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_mark_dynamic_with_hint_override_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_masked_fill_promotion_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_matmul_layer_norm_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_max_min_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d_with_indices_backward2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d_with_indices_backward4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d_with_indices_backward5_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d_with_indices_backward_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_mean_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_min_max_reduction_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_misaligned_address_issue1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_mixed_mm2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_mixed_mm3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_mm_mixed_dtype_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_mul_index_expr_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_multi_gpu_device_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_multi_threading_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_multilayer_any_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_multilayer_sum_low_prec_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_multilayer_var_lowp_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_mutable_custom_op_fixed_layout2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_nan_sort_stable_False_descending_False_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_new_empty_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_nll_loss_backward_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_nll_loss_forward_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_one_hot_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pattern_matcher_unbacked_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_bessel_j0_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_bessel_y0_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_bessel_y1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_chebyshev_polynomial_t_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_chebyshev_polynomial_w_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_entr_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_erfcx_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_expm1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_gammaincc_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_gammaln_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_laguerre_polynomial_l_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_log_ndtr_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_logit_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_multigammaln_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_ndtri_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_psi_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pointwise_scaled_modified_bessel_k1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pow3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_pow_symfloat_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_prod_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_progressive, test/inductor/test_compile_subprocess.py::GPUTests::test_rand_like_deterministic_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_randint_distribution_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_randn_generator_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_randn_like_empty_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_reduction2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_reduction5_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_remainder_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_slice_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_slice_scatter_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_repeat_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_repeat_interleave_decomposition_has_clamp_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_require_stride_expanded_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_resize_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_roi_align_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_roll_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_round_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_rsqrt_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_rsqrt_dynamic_shapes_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_scaled_dot_product_attention_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_scatter3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_scatter4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_scatter_add3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_scatter_reduce3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sdpa_prefer_nd_tiling_False_use_block_ptr_True_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_False_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sdpa_unaligned_mask_freezing_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_shape_padding_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_signbit_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_silu_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sin_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_slice1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_slice2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_slice_mutation1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_slice_scatter5_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sort_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sort_stable_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sort_transpose_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_special_polygamma_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_split_cumprod_low_prec_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_split_cumsum_low_prec_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_split_reduction_with_int64_size_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_split_with_unbacked_symints_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sqrt_dynamic_shapes_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_squeeze1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_squeeze2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_squeeze_varargs_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_stack_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_std_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_strided_inputs_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sum2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sum3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sum4_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sum5_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_sum_keepdims_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_tanh_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_tmp_not_defined_issue1_use_block_ptr_True_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_tmp_not_defined_issue3_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_to_dtype_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_triton_argmin_argmax_transpose_logical_index_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_uint4x2_mixed_mm_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_unbacked_floordiv_simplify_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_unbacked_floordiv_simplify_errors_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_unroll_small_reduction_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_unspec_inputs_float16_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_unspec_inputs_int32_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_unspec_inputs_int8_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_upsample_nearest2d_backward_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_var_correction_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_var_mean_div_by_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_var_mean_tile_reduction_False_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_var_mean_tile_reduction_True_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_vertical_fusion1_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_view_as_complex_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_views2_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_weight_norm_bwd_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_weight_norm_conv2d_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_where_broadcast_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_where_with_logical_op_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_xblock_divides_xnumel_cuda, test/inductor/test_compile_subprocess.py::GPUTests::test_zero_dim_reductions_cuda 2025-12-04T10:38:46.7994946Z 2025-12-04T10:38:46.7995814Z inductor/test_compile_subprocess.py::GPUTests::test__dyn_quant_matmul_4bit_bf16_input_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0022s] (No _dyn_quant_matmul_4bit implementation on CUDA) [ 0%] 2025-12-04T10:38:46.7997361Z inductor/test_compile_subprocess.py::GPUTests::test__unsafe_masked_index_cuda <- test/inductor/test_torchinductor.py PASSED [5.7467s] [ 0%] 2025-12-04T10:38:46.7999094Z inductor/test_compile_subprocess.py::GPUTests::test__unsafe_masked_index_put_accumulate_cuda <- test/inductor/test_torchinductor.py PASSED [1.1119s] [ 1%] 2025-12-04T10:38:46.8000586Z inductor/test_compile_subprocess.py::GPUTests::test_adaptive_avg_pool2d_low_prec_cuda <- test/inductor/test_torchinductor.py PASSED [0.8183s] [ 1%] 2025-12-04T10:38:46.8001944Z inductor/test_compile_subprocess.py::GPUTests::test_adaptive_avg_pool_errors_with_long_cuda <- test/inductor/test_torchinductor.py PASSED [0.4064s] [ 1%] 2025-12-04T10:38:46.8003226Z inductor/test_compile_subprocess.py::GPUTests::test_add_complex4_cuda <- test/inductor/test_torchinductor.py PASSED [1.0544s] [ 2%] 2025-12-04T10:38:46.8004417Z inductor/test_compile_subprocess.py::GPUTests::test_add_complex7_cuda <- test/inductor/test_torchinductor.py PASSED [0.5395s] [ 2%] 2025-12-04T10:38:46.8005723Z inductor/test_compile_subprocess.py::GPUTests::test_add_complex8_cuda <- test/inductor/test_torchinductor.py PASSED [0.4493s] [ 2%] 2025-12-04T10:38:46.8006922Z inductor/test_compile_subprocess.py::GPUTests::test_add_complex_cuda <- test/inductor/test_torchinductor.py PASSED [0.4675s] [ 3%] 2025-12-04T10:38:46.8008627Z inductor/test_compile_subprocess.py::GPUTests::test_addmv_cuda <- test/inductor/test_torchinductor.py W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8010221Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8011701Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8013188Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8014526Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8016016Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8017473Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8018821Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8020166Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8021636Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8023096Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8024461Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8025913Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8027366Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8028925Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8030316Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8031713Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8033138Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8034561Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8036008Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8037474Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8039020Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8040628Z W1204 10:29:18.500000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8041765Z PASSED [0.4772s] [ 3%] 2025-12-04T10:38:46.8043023Z inductor/test_compile_subprocess.py::GPUTests::test_aoti_eager_dtype_device_layout_cuda <- test/inductor/test_torchinductor.py W1204 10:29:18.948000 404399 site-packages/torch/_export/__init__.py:71] +============================+ 2025-12-04T10:38:46.8044467Z W1204 10:29:18.949000 404399 site-packages/torch/_export/__init__.py:72] | !!! WARNING !!! | 2025-12-04T10:38:46.8045335Z W1204 10:29:18.949000 404399 site-packages/torch/_export/__init__.py:73] +============================+ 2025-12-04T10:38:46.8047022Z W1204 10:29:18.949000 404399 site-packages/torch/_export/__init__.py:74] torch._export.aot_compile()/torch._export.aot_load() is being deprecated, please switch to directly calling torch._inductor.aoti_compile_and_package(torch.export.export())/torch._inductor.aoti_load_package() instead. 2025-12-04T10:38:46.8048859Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8049943Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8051392Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8052749Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8054059Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8055549Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8056991Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8058340Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8059770Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8061231Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8062675Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8064034Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8065367Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8066770Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8068209Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8069610Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8071003Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8072508Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8073947Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8075386Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8076840Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8078384Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8079923Z W1204 10:29:19.015000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8080982Z PASSED [3.9713s] [ 3%] 2025-12-04T10:38:46.8081840Z inductor/test_compile_subprocess.py::GPUTests::test_aoti_eager_override_registration_cuda <- test/inductor/test_torchinductor.py PASSED [19.6356s] [ 4%] 2025-12-04T10:38:46.8083339Z inductor/test_compile_subprocess.py::GPUTests::test_aoti_eager_support_out_cuda <- test/inductor/test_torchinductor.py PASSED [3.5028s] [ 4%] 2025-12-04T10:38:46.8085016Z inductor/test_compile_subprocess.py::GPUTests::test_arange2_cuda <- test/inductor/test_torchinductor.py W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8086588Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8088032Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8089542Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8090857Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8092335Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8093781Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8095066Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8096394Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8097844Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8099337Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8100685Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8102010Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8103491Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8104889Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8106285Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8107674Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8109145Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8110570Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8112006Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8113464Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8114969Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8116498Z W1204 10:29:46.089000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8117551Z PASSED [0.3370s] [ 4%] 2025-12-04T10:38:46.8124965Z inductor/test_compile_subprocess.py::GPUTests::test_arange5_cuda <- test/inductor/test_torchinductor.py W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8126535Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8127959Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8129399Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8130721Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8132204Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8133651Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8134936Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8136607Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8138243Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8139709Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8141158Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8142484Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8143878Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8145276Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8146734Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8148194Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8149618Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8151031Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8152461Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8154016Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8155500Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8157018Z W1204 10:29:46.423000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8158499Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8159575Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] Traceback (most recent call last): 2025-12-04T10:38:46.8161010Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8162349Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] ).serialize() 2025-12-04T10:38:46.8163652Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8165126Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8166546Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8167902Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] pickler.dump(obj) 2025-12-04T10:38:46.8169283Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8170737Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8172223Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8173568Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] cls(obj, pickler.options), 2025-12-04T10:38:46.8174901Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8176299Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8177698Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8179158Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8180549Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8181976Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8183476Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8184913Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8186375Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8187865Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8189425Z W1204 10:29:46.713000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/1] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8190487Z PASSED [0.5838s] [ 5%] 2025-12-04T10:38:46.8191326Z inductor/test_compile_subprocess.py::GPUTests::test_argmax_argmin_with_duplicates_cuda <- test/inductor/test_torchinductor.py PASSED [1.6641s] [ 5%] 2025-12-04T10:38:46.8192635Z inductor/test_compile_subprocess.py::GPUTests::test_argmax_argmin_with_nan_cuda <- test/inductor/test_torchinductor.py PASSED [2.6923s] [ 5%] 2025-12-04T10:38:46.8194328Z inductor/test_compile_subprocess.py::GPUTests::test_argmax_to_float_cuda <- test/inductor/test_torchinductor.py W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8195902Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8197401Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8198788Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8200094Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8201569Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8203003Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8204286Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8205593Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8207058Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8208555Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8209913Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8211246Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8212758Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8214171Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8215571Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8216964Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8218434Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8219862Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8221294Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8222744Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8224239Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8225846Z W1204 10:29:51.371000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8227042Z PASSED [0.3998s] [ 6%] 2025-12-04T10:38:46.8227801Z inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d2_cuda <- test/inductor/test_torchinductor.py PASSED [1.7415s] [ 6%] 2025-12-04T10:38:46.8229011Z inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d3_cuda <- test/inductor/test_torchinductor.py PASSED [1.0597s] [ 6%] 2025-12-04T10:38:46.8230178Z inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d6_cuda <- test/inductor/test_torchinductor.py PASSED [0.7131s] [ 7%] 2025-12-04T10:38:46.8231392Z inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d_backward2_cuda <- test/inductor/test_torchinductor.py PASSED [5.9807s] [ 7%] 2025-12-04T10:38:46.8232629Z inductor/test_compile_subprocess.py::GPUTests::test_avg_pool2d_backward_cuda <- test/inductor/test_torchinductor.py PASSED [1.3477s] [ 7%] 2025-12-04T10:38:46.8234144Z inductor/test_compile_subprocess.py::GPUTests::test_avg_pool3d_backward2_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 8%] 2025-12-04T10:38:46.8235610Z inductor/test_compile_subprocess.py::GPUTests::test_baddbmm_cuda <- test/inductor/test_torchinductor.py PASSED [2.1179s] [ 8%] 2025-12-04T10:38:46.8236792Z inductor/test_compile_subprocess.py::GPUTests::test_batch_norm_2d_2_cuda <- test/inductor/test_torchinductor.py PASSED [1.8178s] [ 8%] 2025-12-04T10:38:46.8238485Z inductor/test_compile_subprocess.py::GPUTests::test_bernoulli2_cuda <- test/inductor/test_torchinductor.py W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8240077Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8241524Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8242963Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8244273Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8245740Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8247176Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8248512Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8249843Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8251310Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8252762Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8254117Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8255436Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8256931Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8258403Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8259788Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8261174Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8262600Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8264025Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8265459Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8266918Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8268472Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8270043Z W1204 10:30:06.607000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.inductor_seeds.default 2025-12-04T10:38:46.8271509Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8272676Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8274108Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8275460Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8276760Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8278289Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8279734Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8281010Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8282343Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8283799Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8285243Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8286690Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8288019Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8289484Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8290885Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8292279Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8293695Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8295131Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8296573Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8298001Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8299540Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8301034Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8302685Z W1204 10:30:07.408000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.inductor_seeds.default 2025-12-04T10:38:46.8303779Z PASSED [1.0324s] [ 9%] 2025-12-04T10:38:46.8304521Z inductor/test_compile_subprocess.py::GPUTests::test_bitwise2_cuda <- test/inductor/test_torchinductor.py PASSED [0.2824s] [ 9%] 2025-12-04T10:38:46.8305480Z inductor/test_compile_subprocess.py::GPUTests::test_bitwise_cuda <- test/inductor/test_torchinductor.py PASSED [0.2380s] [ 9%] 2025-12-04T10:38:46.8306208Z inductor/test_compile_subprocess.py::GPUTests::test_bmm1_cuda <- test/inductor/test_torchinductor.py PASSED [0.5738s] [ 10%] 2025-12-04T10:38:46.8306965Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_broadcast_cuda <- test/inductor/test_torchinductor.py PASSED [0.5631s] [ 10%] 2025-12-04T10:38:46.8307783Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_default_kwargs_cuda <- test/inductor/test_torchinductor.py PASSED [0.2027s] [ 10%] 2025-12-04T10:38:46.8308663Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int32_uint8_cuda <- test/inductor/test_torchinductor.py PASSED [1.8496s] [ 11%] 2025-12-04T10:38:46.8309473Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int64_int16_cuda <- test/inductor/test_torchinductor.py PASSED [1.8527s] [ 11%] 2025-12-04T10:38:46.8310288Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int64_int32_cuda <- test/inductor/test_torchinductor.py PASSED [1.8232s] [ 11%] 2025-12-04T10:38:46.8311103Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int64_uint8_cuda <- test/inductor/test_torchinductor.py PASSED [1.7905s] [ 12%] 2025-12-04T10:38:46.8311974Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int8_int32_cuda <- test/inductor/test_torchinductor.py PASSED [1.8926s] [ 12%] 2025-12-04T10:38:46.8312789Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_int8_int64_cuda <- test/inductor/test_torchinductor.py PASSED [1.8365s] [ 12%] 2025-12-04T10:38:46.8313600Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_uint8_int16_cuda <- test/inductor/test_torchinductor.py PASSED [1.9004s] [ 13%] 2025-12-04T10:38:46.8314408Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_uint8_int32_cuda <- test/inductor/test_torchinductor.py PASSED [1.8587s] [ 13%] 2025-12-04T10:38:46.8315183Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_uint8_int8_cuda <- test/inductor/test_torchinductor.py PASSED [1.9283s] [ 13%] 2025-12-04T10:38:46.8315994Z inductor/test_compile_subprocess.py::GPUTests::test_bucketize_int_uint8_uint8_cuda <- test/inductor/test_torchinductor.py PASSED [1.9585s] [ 14%] 2025-12-04T10:38:46.8316794Z inductor/test_compile_subprocess.py::GPUTests::test_buffer_batch_norm_cuda <- test/inductor/test_torchinductor.py PASSED [0.8424s] [ 14%] 2025-12-04T10:38:46.8317651Z inductor/test_compile_subprocess.py::GPUTests::test_buffer_copied_in_graph_with_different_shapes_cuda <- test/inductor/test_torchinductor.py PASSED [3.4485s] [ 14%] 2025-12-04T10:38:46.8318618Z inductor/test_compile_subprocess.py::GPUTests::test_buffer_use_after_remove_cuda <- test/inductor/test_torchinductor.py PASSED [1.5010s] [ 15%] 2025-12-04T10:38:46.8319442Z inductor/test_compile_subprocess.py::GPUTests::test_builtins_round_float_ndigits_neg_cuda <- test/inductor/test_torchinductor.py PASSED [0.2612s] [ 15%] 2025-12-04T10:38:46.8320282Z inductor/test_compile_subprocess.py::GPUTests::test_builtins_round_float_ndigits_pos_cuda <- test/inductor/test_torchinductor.py PASSED [0.2119s] [ 15%] 2025-12-04T10:38:46.8321137Z inductor/test_compile_subprocess.py::GPUTests::test_builtins_round_float_ndigits_zero_cuda <- test/inductor/test_torchinductor.py PASSED [0.2117s] [ 16%] 2025-12-04T10:38:46.8322052Z inductor/test_compile_subprocess.py::GPUTests::test_builtins_round_int_ndigits_zero_cuda <- test/inductor/test_torchinductor.py PASSED [0.2159s] [ 16%] 2025-12-04T10:38:46.8323185Z inductor/test_compile_subprocess.py::GPUTests::test_cat_of_loops_and_extern_kernel_cuda <- test/inductor/test_torchinductor.py W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8324226Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8325136Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8326000Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8326828Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8327781Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8328723Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8329531Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8330362Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8331338Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8332262Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8333119Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8333954Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8334831Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8335728Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8336604Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8337480Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8338414Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8339308Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8340206Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8341188Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8342122Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8343151Z W1204 10:30:34.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T10:38:46.8343881Z PASSED [0.5112s] [ 16%] 2025-12-04T10:38:46.8344401Z inductor/test_compile_subprocess.py::GPUTests::test_cat_single_empty_cuda <- test/inductor/test_torchinductor.py PASSED [0.1329s] [ 17%] 2025-12-04T10:38:46.8345153Z inductor/test_compile_subprocess.py::GPUTests::test_cat_uint8_cuda <- test/inductor/test_torchinductor.py PASSED [0.3226s] [ 17%] 2025-12-04T10:38:46.8345828Z inductor/test_compile_subprocess.py::GPUTests::test_cat_upcasting_cuda <- test/inductor/test_torchinductor.py PASSED [0.9399s] [ 17%] 2025-12-04T10:38:46.8346576Z inductor/test_compile_subprocess.py::GPUTests::test_cauchy_cuda <- test/inductor/test_torchinductor.py PASSED [0.1778s] [ 18%] 2025-12-04T10:38:46.8347327Z inductor/test_compile_subprocess.py::GPUTests::test_chunk_recompiles_cuda <- test/inductor/test_torchinductor.py PASSED [0.4604s] [ 18%] 2025-12-04T10:38:46.8348467Z inductor/test_compile_subprocess.py::GPUTests::test_computed_buffer_inlining_cuda <- test/inductor/test_torchinductor.py W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8349567Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8350478Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8351340Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8352162Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8353093Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8354005Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8354820Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8355598Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8356513Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8357418Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8358303Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8359132Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8360052Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8360922Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8361793Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8362661Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8363551Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8364451Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8365281Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8366197Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8367306Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8368390Z W1204 10:30:37.419000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8369048Z PASSED [0.2434s] [ 18%] 2025-12-04T10:38:46.8369551Z inductor/test_compile_subprocess.py::GPUTests::test_consecutive_split_cumprod_cuda <- test/inductor/test_torchinductor.py PASSED [0.3399s] [ 19%] 2025-12-04T10:38:46.8370639Z inductor/test_compile_subprocess.py::GPUTests::test_const_int32_to_float_cuda <- test/inductor/test_torchinductor.py W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8371640Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8372540Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8373393Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8374219Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8375152Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8376085Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8376882Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8377711Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8378727Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8397994Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8398939Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8399794Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8400682Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8401565Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8402448Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8403331Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8404230Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8405252Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8406158Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8407075Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8408027Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8409110Z W1204 10:30:38.007000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8409830Z PASSED [0.3636s] [ 19%] 2025-12-04T10:38:46.8410336Z inductor/test_compile_subprocess.py::GPUTests::test_constant_pad_1d_cuda <- test/inductor/test_torchinductor.py PASSED [0.6292s] [ 19%] 2025-12-04T10:38:46.8411123Z inductor/test_compile_subprocess.py::GPUTests::test_constant_pad_float64_cuda <- test/inductor/test_torchinductor.py PASSED [0.2080s] [ 20%] 2025-12-04T10:38:46.8411929Z inductor/test_compile_subprocess.py::GPUTests::test_constant_pad_nd_inplace_cuda <- test/inductor/test_torchinductor.py PASSED [0.1572s] [ 20%] 2025-12-04T10:38:46.8412837Z inductor/test_compile_subprocess.py::GPUTests::test_conv_functional_bn_fuse_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0019s] (only support cpu conv bn test) [ 20%] 2025-12-04T10:38:46.8413722Z inductor/test_compile_subprocess.py::GPUTests::test_convolution4_cuda <- test/inductor/test_torchinductor.py PASSED [0.6331s] [ 21%] 2025-12-04T10:38:46.8414846Z inductor/test_compile_subprocess.py::GPUTests::test_copy_non_blocking_is_pinned_use_cat_True_cuda <- test/inductor/test_torchinductor.py W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8416000Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8416914Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8417775Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8418646Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8419596Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8420512Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8421328Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8422172Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8423094Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8424009Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8425324Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8426172Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8427057Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8427938Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8428871Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8429757Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8430656Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8431554Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8432457Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8433375Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8434406Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8435403Z W1204 10:30:40.325000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.device_put.default 2025-12-04T10:38:46.8436260Z W1204 10:30:40.379000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8436845Z W1204 10:30:40.380000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8437427Z W1204 10:30:40.380000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8438001Z W1204 10:30:40.380000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8438618Z W1204 10:30:40.381000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8439198Z W1204 10:30:40.381000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8439774Z W1204 10:30:40.381000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8440347Z W1204 10:30:40.381000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8440925Z W1204 10:30:40.382000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8441498Z W1204 10:30:40.382000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8442070Z W1204 10:30:40.382000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8442701Z W1204 10:30:40.382000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8443274Z W1204 10:30:40.382000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8443852Z W1204 10:30:40.383000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8444425Z W1204 10:30:40.383000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8445001Z W1204 10:30:40.383000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8445576Z W1204 10:30:40.383000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8446150Z W1204 10:30:40.383000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8446727Z W1204 10:30:40.384000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8447306Z W1204 10:30:40.384000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8447883Z W1204 10:30:40.384000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8448496Z W1204 10:30:40.384000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8449068Z W1204 10:30:40.384000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8449640Z W1204 10:30:40.385000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8450212Z W1204 10:30:40.385000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8450785Z W1204 10:30:40.385000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8451365Z W1204 10:30:40.385000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8451936Z W1204 10:30:40.385000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8452560Z W1204 10:30:40.386000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8453132Z W1204 10:30:40.386000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8453706Z W1204 10:30:40.386000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8454282Z W1204 10:30:40.386000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8454856Z W1204 10:30:40.386000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8455428Z W1204 10:30:40.387000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8456005Z W1204 10:30:40.387000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8456584Z W1204 10:30:40.387000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8457159Z W1204 10:30:40.387000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8457732Z W1204 10:30:40.387000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8458347Z W1204 10:30:40.388000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8458921Z W1204 10:30:40.388000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8459492Z W1204 10:30:40.388000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8460117Z W1204 10:30:40.388000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8460694Z W1204 10:30:40.388000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8461273Z W1204 10:30:40.389000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8461855Z W1204 10:30:40.389000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8462428Z W1204 10:30:40.389000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8463001Z W1204 10:30:40.389000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8463578Z W1204 10:30:40.389000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8464154Z W1204 10:30:40.390000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8464737Z W1204 10:30:40.390000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8465312Z W1204 10:30:40.390000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8465886Z W1204 10:30:40.390000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8466466Z W1204 10:30:40.390000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8467042Z W1204 10:30:40.391000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8467759Z W1204 10:30:40.391000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8468443Z W1204 10:30:40.391000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8469025Z W1204 10:30:40.391000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8469597Z W1204 10:30:40.391000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8470266Z W1204 10:30:40.392000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8470844Z W1204 10:30:40.392000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8471417Z W1204 10:30:40.392000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8471992Z W1204 10:30:40.392000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8472569Z W1204 10:30:40.392000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8473139Z W1204 10:30:40.393000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8473717Z W1204 10:30:40.393000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8474298Z W1204 10:30:40.393000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8474788Z W1204 10:30:40.393000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8475073Z W1204 10:30:40.393000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8475357Z W1204 10:30:40.394000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8475637Z W1204 10:30:40.394000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8475918Z W1204 10:30:40.394000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8476251Z W1204 10:30:40.394000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8476539Z W1204 10:30:40.394000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8476824Z W1204 10:30:40.395000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8477108Z W1204 10:30:40.395000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8477389Z W1204 10:30:40.395000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8477673Z W1204 10:30:40.395000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8477955Z W1204 10:30:40.395000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8478276Z W1204 10:30:40.396000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8478568Z W1204 10:30:40.396000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8478853Z W1204 10:30:40.396000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8479136Z W1204 10:30:40.396000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8479418Z W1204 10:30:40.396000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8479701Z W1204 10:30:40.397000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8479980Z W1204 10:30:40.397000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8480249Z W1204 10:30:40.397000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8480524Z W1204 10:30:40.397000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8480839Z W1204 10:30:40.397000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8481123Z W1204 10:30:40.398000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8481406Z W1204 10:30:40.398000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8481688Z W1204 10:30:40.398000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8481972Z W1204 10:30:40.398000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8482254Z W1204 10:30:40.399000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8482539Z W1204 10:30:40.399000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8482822Z W1204 10:30:40.399000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8483124Z W1204 10:30:40.399000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8483409Z W1204 10:30:40.399000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8483693Z W1204 10:30:40.400000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8483970Z W1204 10:30:40.400000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8484256Z W1204 10:30:40.400000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8484559Z [W1204 10:30:41.610846016 collection.cpp:1148] Warning: ROCTracer produced duplicate flow start: 1 (function operator()) 2025-12-04T10:38:46.8484819Z PASSED [1.2835s] [ 21%] 2025-12-04T10:38:46.8485066Z inductor/test_compile_subprocess.py::GPUTests::test_copy_with_scalar_src_cuda <- test/inductor/test_torchinductor.py PASSED [0.4744s] [ 21%] 2025-12-04T10:38:46.8485465Z inductor/test_compile_subprocess.py::GPUTests::test_cpu_scalar_with_cpu_scalar_cuda <- test/inductor/test_torchinductor.py PASSED [0.5106s] [ 22%] 2025-12-04T10:38:46.8485981Z inductor/test_compile_subprocess.py::GPUTests::test_cpu_scalar_with_gpu_tensor_cpp_cuda <- test/inductor/test_torchinductor.py W1204 10:30:42.309000 405259 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8486357Z PASSED [1.0344s] [ 22%] 2025-12-04T10:38:46.8486721Z inductor/test_compile_subprocess.py::GPUTests::test_cpu_scalar_with_gpu_tensor_dynamic_cuda <- test/inductor/test_torchinductor.py W1204 10:30:43.390000 405259 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8487102Z PASSED [0.2625s] [ 22%] 2025-12-04T10:38:46.8487330Z inductor/test_compile_subprocess.py::GPUTests::test_cumsum_inf_cuda <- test/inductor/test_torchinductor.py PASSED [0.4481s] [ 23%] 2025-12-04T10:38:46.8487694Z inductor/test_compile_subprocess.py::GPUTests::test_cumsum_no_mask_cuda <- test/inductor/test_torchinductor.py PASSED [0.3300s] [ 23%] 2025-12-04T10:38:46.8488229Z inductor/test_compile_subprocess.py::GPUTests::test_custom_op_2_cuda <- test/inductor/test_torchinductor.py W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8488712Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8489153Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8489574Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8490010Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8490461Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8490908Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8491299Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8491706Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8492162Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8492604Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8493025Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8493433Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8493859Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8494317Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8494742Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8495166Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8495597Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8496025Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8496458Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8496896Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8497346Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8497808Z W1204 10:30:44.367000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.test.foo2.default 2025-12-04T10:38:46.8498151Z PASSED [0.2703s] [ 23%] 2025-12-04T10:38:46.8498547Z inductor/test_compile_subprocess.py::GPUTests::test_custom_op_fixed_layout_sequential_cuda <- test/inductor/test_torchinductor.py W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8499082Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8499514Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8499923Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8500315Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8500767Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8501204Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8501588Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8501987Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8502431Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8502874Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8503322Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8503721Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8504140Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8504560Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8504980Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8505402Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8505831Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8506257Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8506686Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8507128Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8507603Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8508061Z W1204 10:30:44.656000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.test.bar.default 2025-12-04T10:38:46.8508705Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8509033Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8509463Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8509870Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8510264Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8510710Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8511142Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8511528Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8511977Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8512419Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8512859Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8513266Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8513666Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8514084Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8514510Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8514925Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8515346Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8515773Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8516202Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8516704Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8517145Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8517597Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8518049Z W1204 10:30:45.329000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.test.bar.default 2025-12-04T10:38:46.8518399Z PASSED [0.7857s] [ 24%] 2025-12-04T10:38:46.8518651Z inductor/test_compile_subprocess.py::GPUTests::test_custom_scan_op_multi_input_cuda <- test/inductor/test_torchinductor.py PASSED [0.0759s] [ 24%] 2025-12-04T10:38:46.8519075Z inductor/test_compile_subprocess.py::GPUTests::test_data_type_propogation_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0017s] (triton not supported) [ 24%] 2025-12-04T10:38:46.8519484Z inductor/test_compile_subprocess.py::GPUTests::test_dense_mask_index_cuda <- test/inductor/test_torchinductor.py PASSED [0.4270s] [ 25%] 2025-12-04T10:38:46.8519838Z inductor/test_compile_subprocess.py::GPUTests::test_div1_cuda <- test/inductor/test_torchinductor.py PASSED [0.5918s] [ 25%] 2025-12-04T10:38:46.8520178Z inductor/test_compile_subprocess.py::GPUTests::test_div8_cuda <- test/inductor/test_torchinductor.py PASSED [0.4033s] [ 25%] 2025-12-04T10:38:46.8520536Z inductor/test_compile_subprocess.py::GPUTests::test_dropout_trivial_1_cuda <- test/inductor/test_torchinductor.py PASSED [0.1580s] [ 26%] 2025-12-04T10:38:46.8520955Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_bfloat16_float64_cuda <- test/inductor/test_torchinductor.py PASSED [1.0383s] [ 26%] 2025-12-04T10:38:46.8521349Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_bfloat16_int16_cuda <- test/inductor/test_torchinductor.py PASSED [0.5041s] [ 26%] 2025-12-04T10:38:46.8521739Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_bfloat16_int32_cuda <- test/inductor/test_torchinductor.py PASSED [0.5242s] [ 27%] 2025-12-04T10:38:46.8522130Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_bfloat16_int8_cuda <- test/inductor/test_torchinductor.py PASSED [0.5457s] [ 27%] 2025-12-04T10:38:46.8522518Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_bfloat16_uint8_cuda <- test/inductor/test_torchinductor.py PASSED [0.5422s] [ 27%] 2025-12-04T10:38:46.8522905Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float16_int8_cuda <- test/inductor/test_torchinductor.py PASSED [0.5300s] [ 28%] 2025-12-04T10:38:46.8523295Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float32_bfloat16_cuda <- test/inductor/test_torchinductor.py PASSED [0.2501s] [ 28%] 2025-12-04T10:38:46.8523690Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float32_float32_cuda <- test/inductor/test_torchinductor.py PASSED [0.2665s] [ 28%] 2025-12-04T10:38:46.8524081Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float32_float64_cuda <- test/inductor/test_torchinductor.py PASSED [0.2966s] [ 29%] 2025-12-04T10:38:46.8524470Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_float64_int64_cuda <- test/inductor/test_torchinductor.py PASSED [0.3044s] [ 29%] 2025-12-04T10:38:46.8524859Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int16_bfloat16_cuda <- test/inductor/test_torchinductor.py PASSED [0.9957s] [ 29%] 2025-12-04T10:38:46.8525248Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int16_float16_cuda <- test/inductor/test_torchinductor.py PASSED [0.5585s] [ 30%] 2025-12-04T10:38:46.8525637Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int16_float32_cuda <- test/inductor/test_torchinductor.py PASSED [0.5398s] [ 30%] 2025-12-04T10:38:46.8526052Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int16_uint8_cuda <- test/inductor/test_torchinductor.py PASSED [0.5582s] [ 30%] 2025-12-04T10:38:46.8526430Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int32_int8_cuda <- test/inductor/test_torchinductor.py PASSED [0.0024s] [ 31%] 2025-12-04T10:38:46.8526815Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int64_bfloat16_cuda <- test/inductor/test_torchinductor.py PASSED [0.0019s] [ 31%] 2025-12-04T10:38:46.8527198Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int64_int8_cuda <- test/inductor/test_torchinductor.py PASSED [0.0017s] [ 31%] 2025-12-04T10:38:46.8527579Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int64_uint8_cuda <- test/inductor/test_torchinductor.py PASSED [0.0017s] [ 32%] 2025-12-04T10:38:46.8527967Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_bfloat16_cuda <- test/inductor/test_torchinductor.py PASSED [0.0017s] [ 32%] 2025-12-04T10:38:46.8528397Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_float16_cuda <- test/inductor/test_torchinductor.py PASSED [0.0018s] [ 32%] 2025-12-04T10:38:46.8528783Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_float32_cuda <- test/inductor/test_torchinductor.py PASSED [0.0016s] [ 33%] 2025-12-04T10:38:46.8529171Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_float64_cuda <- test/inductor/test_torchinductor.py PASSED [0.0017s] [ 33%] 2025-12-04T10:38:46.8529552Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_int8_int16_cuda <- test/inductor/test_torchinductor.py PASSED [0.0016s] [ 34%] 2025-12-04T10:38:46.8529930Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_uint8_int16_cuda <- test/inductor/test_torchinductor.py PASSED [0.0016s] [ 34%] 2025-12-04T10:38:46.8530345Z inductor/test_compile_subprocess.py::GPUTests::test_dtypeview_uint8_uint8_cuda <- test/inductor/test_torchinductor.py PASSED [0.0016s] [ 34%] 2025-12-04T10:38:46.8530845Z inductor/test_compile_subprocess.py::GPUTests::test_elu_cuda <- test/inductor/test_torchinductor.py W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8531311Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8531743Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8532150Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8532549Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8533001Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8533433Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8533817Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8534215Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8534656Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8535121Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8535529Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8535929Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8536351Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8536770Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8537191Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8537607Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8538034Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8538493Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8538922Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8539411Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8539859Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8540343Z W1204 10:30:54.925000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8540682Z PASSED [0.6755s] [ 35%] 2025-12-04T10:38:46.8540901Z inductor/test_compile_subprocess.py::GPUTests::test_erfc_cuda <- test/inductor/test_torchinductor.py PASSED [0.7389s] [ 35%] 2025-12-04T10:38:46.8541247Z inductor/test_compile_subprocess.py::GPUTests::test_erfinv_cuda <- test/inductor/test_torchinductor.py PASSED [0.6779s] [ 35%] 2025-12-04T10:38:46.8541593Z inductor/test_compile_subprocess.py::GPUTests::test_exp_cuda <- test/inductor/test_torchinductor.py PASSED [0.5104s] [ 36%] 2025-12-04T10:38:46.8541950Z inductor/test_compile_subprocess.py::GPUTests::test_expanded_reduction_cuda <- test/inductor/test_torchinductor.py PASSED [0.4589s] [ 36%] 2025-12-04T10:38:46.8542310Z inductor/test_compile_subprocess.py::GPUTests::test_expm1_cuda <- test/inductor/test_torchinductor.py PASSED [3.0932s] [ 36%] 2025-12-04T10:38:46.8542821Z inductor/test_compile_subprocess.py::GPUTests::test_fallback_mutable_op_with_return_cuda <- test/inductor/test_torchinductor.py W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8543461Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] Traceback (most recent call last): 2025-12-04T10:38:46.8543885Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8544312Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] ).serialize() 2025-12-04T10:38:46.8544698Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8545136Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8545562Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8545940Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] pickler.dump(obj) 2025-12-04T10:38:46.8546333Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8546766Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8547198Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8547599Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] cls(obj, pickler.options), 2025-12-04T10:38:46.8547990Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8548540Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8548958Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8549372Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8549784Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8550207Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8550634Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8551060Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8551492Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8552017Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8552480Z W1204 10:31:00.727000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.mylib.inplace_.default 2025-12-04T10:38:46.8552803Z PASSED [0.0306s] [ 37%] 2025-12-04T10:38:46.8553023Z inductor/test_compile_subprocess.py::GPUTests::test_fill1_cuda <- test/inductor/test_torchinductor.py PASSED [0.4948s] [ 37%] 2025-12-04T10:38:46.8553551Z inductor/test_compile_subprocess.py::GPUTests::test_flip_cat_cuda <- test/inductor/test_torchinductor.py W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8554022Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8554452Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8554859Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8555254Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8555702Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8556135Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8556520Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8556920Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8557386Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8557825Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8558276Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8558678Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8559097Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8559516Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8559941Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8560362Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8560790Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8561214Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8561647Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8562116Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8562566Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8563026Z W1204 10:31:01.289000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.rev.default 2025-12-04T10:38:46.8563454Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8563780Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8564211Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8564630Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8565029Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8565479Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8565914Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8566334Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8566740Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8567181Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8567623Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8568036Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8568498Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8568923Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8569345Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8569767Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8570192Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8570624Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8571102Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8571540Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8571982Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8572440Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8572911Z W1204 10:31:01.563000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.rev.default 2025-12-04T10:38:46.8573237Z PASSED [0.5301s] [ 37%] 2025-12-04T10:38:46.8573594Z inductor/test_compile_subprocess.py::GPUTests::test_flip_cuda <- test/inductor/test_torchinductor.py W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8574065Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8574501Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8574916Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8575358Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8575808Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8576241Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8576625Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8577027Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8577470Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8577912Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8578354Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8578756Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8579177Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8579598Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8580052Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8580470Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8580898Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8581333Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8581774Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8582220Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8582670Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8583128Z W1204 10:31:01.806000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.rev.default 2025-12-04T10:38:46.8583557Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8583885Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8584355Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8584764Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8585161Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8585612Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8586047Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8586437Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8586841Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8587280Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8587717Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8588170Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8588573Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8589030Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8589455Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8589876Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8590302Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8590734Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8591165Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8591601Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8592043Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8592495Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8592953Z W1204 10:31:02.086000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.rev.default 2025-12-04T10:38:46.8593302Z PASSED [0.5506s] [ 38%] 2025-12-04T10:38:46.8593566Z inductor/test_compile_subprocess.py::GPUTests::test_float_index_expression_type_promotion_cuda <- test/inductor/test_torchinductor.py PASSED [0.2417s] [ 38%] 2025-12-04T10:38:46.8593959Z inductor/test_compile_subprocess.py::GPUTests::test_floordiv_cuda <- test/inductor/test_torchinductor.py PASSED [0.5280s] [ 38%] 2025-12-04T10:38:46.8594317Z inductor/test_compile_subprocess.py::GPUTests::test_fmod_zero_dim_cuda <- test/inductor/test_torchinductor.py PASSED [0.8740s] [ 39%] 2025-12-04T10:38:46.8594830Z inductor/test_compile_subprocess.py::GPUTests::test_forced_buffer_realize_cuda <- test/inductor/test_torchinductor.py W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8595322Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8595757Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8596167Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8596564Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8597012Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8597449Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8597835Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8598304Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8598746Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8599193Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8599601Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8600002Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8600425Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8600848Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8601270Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8601693Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8602152Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8602585Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8603016Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8603457Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8603911Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8604389Z W1204 10:31:03.996000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops._inductor_test.realize.default 2025-12-04T10:38:46.8604836Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8605165Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8605596Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8606006Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8606407Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8606890Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8607325Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8607715Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8608146Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8608586Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8609030Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8609439Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8609839Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8610258Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8610680Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8611153Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8611577Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8612006Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8612437Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8612871Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8613313Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8613766Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8614242Z W1204 10:31:04.205000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops._inductor_test.realize.default 2025-12-04T10:38:46.8614571Z PASSED [0.4017s] [ 39%] 2025-12-04T10:38:46.8614816Z inductor/test_compile_subprocess.py::GPUTests::test_fractional_max_pool2d2_cuda <- test/inductor/test_torchinductor.py PASSED [0.8797s] [ 39%] 2025-12-04T10:38:46.8615187Z inductor/test_compile_subprocess.py::GPUTests::test_full_like_cuda <- test/inductor/test_torchinductor.py PASSED [0.3485s] [ 40%] 2025-12-04T10:38:46.8615660Z inductor/test_compile_subprocess.py::GPUTests::test_fuse_large_params_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0016s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 40%] 2025-12-04T10:38:46.8616270Z inductor/test_compile_subprocess.py::GPUTests::test_fusing_write_into_disjoint_read_cuda <- test/inductor/test_torchinductor.py W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8616775Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8617210Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8617621Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8618018Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8618500Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8618936Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8619324Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8619728Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8620211Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8620650Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8621060Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8621461Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8621881Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8622305Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8622728Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8623159Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8623593Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8624028Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8624473Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8624943Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8625396Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8625862Z W1204 10:31:05.638000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.rev.default 2025-12-04T10:38:46.8626296Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8626629Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8627060Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8627468Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8627861Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8628351Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8628788Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8629215Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8629618Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8630063Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8630502Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8630908Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8631312Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8631731Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8632154Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8632574Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8632995Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8633428Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8633887Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8634323Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8634762Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8635215Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8635679Z W1204 10:31:05.861000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.rev.default 2025-12-04T10:38:46.8636001Z PASSED [1.3787s] [ 40%] 2025-12-04T10:38:46.8636232Z inductor/test_compile_subprocess.py::GPUTests::test_gather1_cuda <- test/inductor/test_torchinductor.py PASSED [0.5325s] [ 41%] 2025-12-04T10:38:46.8636594Z inductor/test_compile_subprocess.py::GPUTests::test_gather_scatter_cuda <- test/inductor/test_torchinductor.py PASSED [0.6659s] [ 41%] 2025-12-04T10:38:46.8636968Z inductor/test_compile_subprocess.py::GPUTests::test_generate_rand_fp8_cuda <- test/inductor/test_torchinductor.py PASSED [0.0018s] [ 41%] 2025-12-04T10:38:46.8637327Z inductor/test_compile_subprocess.py::GPUTests::test_glu_cuda <- test/inductor/test_torchinductor.py PASSED [0.8031s] [ 42%] 2025-12-04T10:38:46.8637733Z inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_both_scalars_cuda <- test/inductor/test_torchinductor.py PASSED [0.3947s] [ 42%] 2025-12-04T10:38:46.8638187Z inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_constant_tensor2_cuda <- test/inductor/test_torchinductor.py PASSED [0.2386s] [ 42%] 2025-12-04T10:38:46.8638751Z inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_mutation_real_name_cuda <- test/inductor/test_torchinductor.py W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8639269Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8639707Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8640128Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8640529Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8640982Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8641418Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8641808Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8642217Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8642693Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8643134Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8643547Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8643955Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8644381Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8644811Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8645234Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8645657Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8646088Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8646520Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8646992Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8647439Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8647896Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8648448Z W1204 10:31:09.696000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8648877Z W1204 10:31:09.708000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8649167Z W1204 10:31:09.708000 404399 site-packages/torch/_inductor/utils.py:2565] [0/0] DeviceCopy in input program 2025-12-04T10:38:46.8649369Z PASSED [0.2902s] [ 43%] 2025-12-04T10:38:46.8649759Z inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_no_inputs_cuda <- test/inductor/test_torchinductor.py W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8650257Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] Traceback (most recent call last): 2025-12-04T10:38:46.8650692Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8651109Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] ).serialize() 2025-12-04T10:38:46.8651511Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8651992Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8652430Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8652821Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] pickler.dump(obj) 2025-12-04T10:38:46.8653225Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8653673Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8654116Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8654527Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8654934Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8655359Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8655786Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8656241Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8656666Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8657098Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8657531Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8657975Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8658461Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8658920Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8659401Z W1204 10:31:09.968000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [1/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.inductor_seeds.default 2025-12-04T10:38:46.8659736Z PASSED [0.3840s] [ 43%] 2025-12-04T10:38:46.8659989Z inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_pad_dynamic_cuda <- test/inductor/test_torchinductor.py PASSED [2.3882s] [ 43%] 2025-12-04T10:38:46.8660394Z inductor/test_compile_subprocess.py::GPUTests::test_graph_partition_refcount_cuda <- test/inductor/test_torchinductor.py PASSED [268.3422s] [ 44%] 2025-12-04T10:38:46.8660947Z inductor/test_compile_subprocess.py::GPUTests::test_grid_sampler_2d_cuda <- test/inductor/test_torchinductor.py W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8661435Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8661873Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8662286Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8662686Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8663140Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8663582Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8663973Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8664378Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8664856Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8665301Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8665714Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8666120Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8666544Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8666969Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8667398Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8667823Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8668301Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8668733Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8669168Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8669639Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8670096Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8670561Z W1204 10:35:41.216000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8670884Z PASSED [2.9892s] [ 44%] 2025-12-04T10:38:46.8671111Z inductor/test_compile_subprocess.py::GPUTests::test_index2_cuda <- test/inductor/test_torchinductor.py PASSED [0.5450s] [ 44%] 2025-12-04T10:38:46.8671618Z inductor/test_compile_subprocess.py::GPUTests::test_index_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8672115Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8672552Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8672964Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8673365Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8673817Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8674296Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8674688Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8675092Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8675536Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8675979Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8676397Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8676805Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8677236Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8677663Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8678135Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8678564Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8679027Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8679466Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8679902Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8680344Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8680801Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8681267Z W1204 10:35:44.653000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8681702Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8682036Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8682472Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8682931Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8683333Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8683784Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8684221Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8684610Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8685014Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8685461Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8685907Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8686318Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8686721Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8687144Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8687592Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8688017Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8688482Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8688915Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8689348Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8689787Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8690233Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8690692Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8691157Z W1204 10:35:45.231000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8691479Z PASSED [1.1235s] [ 45%] 2025-12-04T10:38:46.8691864Z inductor/test_compile_subprocess.py::GPUTests::test_index_propagation_abs_cuda <- test/inductor/test_torchinductor.py W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8692404Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8692840Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8693252Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8693653Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8694112Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8694548Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8694933Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8695340Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8695786Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8696228Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8696674Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8697080Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8697508Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8697937Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8698408Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8698836Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8699269Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8699700Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8700141Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8700585Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8701076Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8701543Z W1204 10:35:45.738000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8701866Z PASSED [0.1877s] [ 45%] 2025-12-04T10:38:46.8702245Z inductor/test_compile_subprocess.py::GPUTests::test_index_propagation_cuda <- test/inductor/test_torchinductor.py W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8702736Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8703175Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8703589Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8703984Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8704433Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8704869Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8705261Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8705701Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8706144Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8706587Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8707004Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8707409Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8712297Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8712744Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8713165Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8713582Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8714011Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8714501Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8714934Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8715369Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8715822Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8716293Z W1204 10:35:45.923000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8716615Z PASSED [0.1596s] [ 45%] 2025-12-04T10:38:46.8717008Z inductor/test_compile_subprocess.py::GPUTests::test_index_propagation_floordiv_cuda <- test/inductor/test_torchinductor.py W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8717507Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8717939Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8718399Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8718800Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8719286Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8719723Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8720112Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8720512Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8720954Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8721404Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8721815Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8722219Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8722639Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8723061Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8723521Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8723947Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8724381Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8724810Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8725241Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8725687Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8726140Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8726606Z W1204 10:35:46.088000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8726923Z PASSED [0.2137s] [ 46%] 2025-12-04T10:38:46.8727313Z inductor/test_compile_subprocess.py::GPUTests::test_index_propagation_remainder_cuda <- test/inductor/test_torchinductor.py W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8727817Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8728326Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8728735Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8729131Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8729580Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8730016Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8730411Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8730816Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8731252Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8731688Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8732092Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8732525Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8732955Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8733379Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8733799Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8734220Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8734647Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8735079Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8735513Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8735955Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8736408Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8737509Z W1204 10:35:46.300000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8737828Z PASSED [0.2484s] [ 46%] 2025-12-04T10:38:46.8738114Z inductor/test_compile_subprocess.py::GPUTests::test_index_put_failed_reinplace_cuda <- test/inductor/test_torchinductor.py PASSED [0.5983s] [ 46%] 2025-12-04T10:38:46.8738503Z inductor/test_compile_subprocess.py::GPUTests::test_index_put_fallback1_cuda <- test/inductor/test_torchinductor.py PASSED [0.6049s] [ 47%] 2025-12-04T10:38:46.8738881Z inductor/test_compile_subprocess.py::GPUTests::test_index_put_index_cuda <- test/inductor/test_torchinductor.py PASSED [0.5197s] [ 47%] 2025-12-04T10:38:46.8739244Z inductor/test_compile_subprocess.py::GPUTests::test_index_select_cuda <- test/inductor/test_torchinductor.py PASSED [1.2820s] [ 47%] 2025-12-04T10:38:46.8739617Z inductor/test_compile_subprocess.py::GPUTests::test_indirect_load_broadcast_cuda <- test/inductor/test_torchinductor.py PASSED [0.7527s] [ 48%] 2025-12-04T10:38:46.8740041Z inductor/test_compile_subprocess.py::GPUTests::test_inductor_layout_optimization_input_mutations_cuda <- test/inductor/test_torchinductor.py PASSED [0.7998s] [ 48%] 2025-12-04T10:38:46.8740598Z inductor/test_compile_subprocess.py::GPUTests::test_inner_fn_str_and_stride_cuda <- test/inductor/test_torchinductor.py W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8741091Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8741530Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8741987Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8742388Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8742838Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8743270Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8743656Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8744055Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8744502Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8744940Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8745348Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8745750Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8746173Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8746643Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8747061Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8747478Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8747907Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8748373Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8748808Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8749247Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8749698Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8750176Z W1204 10:35:51.111000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops._inductor_test.realize.default 2025-12-04T10:38:46.8750509Z PASSED [0.2033s] [ 48%] 2025-12-04T10:38:46.8750785Z inductor/test_compile_subprocess.py::GPUTests::test_inplace_add_cuda <- test/inductor/test_torchinductor.py PASSED [0.1991s] [ 49%] 2025-12-04T10:38:46.8751150Z inductor/test_compile_subprocess.py::GPUTests::test_input_mutation2_cuda <- test/inductor/test_torchinductor.py PASSED [0.2455s] [ 49%] 2025-12-04T10:38:46.8751520Z inductor/test_compile_subprocess.py::GPUTests::test_input_mutation3_cuda <- test/inductor/test_torchinductor.py PASSED [0.2923s] [ 49%] 2025-12-04T10:38:46.8751885Z inductor/test_compile_subprocess.py::GPUTests::test_input_mutation4_cuda <- test/inductor/test_torchinductor.py PASSED [0.2072s] [ 50%] 2025-12-04T10:38:46.8752265Z inductor/test_compile_subprocess.py::GPUTests::test_insignificant_strides_cuda <- test/inductor/test_torchinductor.py PASSED [0.2323s] [ 50%] 2025-12-04T10:38:46.8752636Z inductor/test_compile_subprocess.py::GPUTests::test_isinf2_cuda <- test/inductor/test_torchinductor.py PASSED [0.4741s] [ 50%] 2025-12-04T10:38:46.8752978Z inductor/test_compile_subprocess.py::GPUTests::test_isinf_cuda <- test/inductor/test_torchinductor.py PASSED [0.9038s] [ 51%] 2025-12-04T10:38:46.8753331Z inductor/test_compile_subprocess.py::GPUTests::test_issue102546_cuda <- test/inductor/test_torchinductor.py PASSED [0.3700s] [ 51%] 2025-12-04T10:38:46.8753694Z inductor/test_compile_subprocess.py::GPUTests::test_kernel_names_cuda <- test/inductor/test_torchinductor.py PASSED [0.1791s] [ 51%] 2025-12-04T10:38:46.8754045Z inductor/test_compile_subprocess.py::GPUTests::test_l1_loss_cuda <- test/inductor/test_torchinductor.py PASSED [0.2410s] [ 52%] 2025-12-04T10:38:46.8754420Z inductor/test_compile_subprocess.py::GPUTests::test_large_grid_use_block_ptr_False_cuda <- test/inductor/test_torchinductor.py PASSED [0.9315s] [ 52%] 2025-12-04T10:38:46.8754818Z inductor/test_compile_subprocess.py::GPUTests::test_layer_norm_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 52%] 2025-12-04T10:38:46.8755339Z inductor/test_compile_subprocess.py::GPUTests::test_leaky_relu_cuda <- test/inductor/test_torchinductor.py W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8755861Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8756302Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8756716Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8757112Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8757563Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8758004Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8758439Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8758837Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8759276Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8759712Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8760161Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8760569Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8760996Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8761420Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8761842Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8762272Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8762706Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8763130Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8763568Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8764008Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8764459Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8764974Z W1204 10:35:55.866000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8765311Z PASSED [0.5269s] [ 53%] 2025-12-04T10:38:46.8765531Z inductor/test_compile_subprocess.py::GPUTests::test_lgamma_cuda <- test/inductor/test_torchinductor.py PASSED [1.1767s] [ 53%] 2025-12-04T10:38:46.8766015Z inductor/test_compile_subprocess.py::GPUTests::test_like_rands2_cuda <- test/inductor/test_torchinductor.py W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8766492Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8766928Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8767334Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8767738Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8768301Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8768735Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8769162Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8769565Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8770004Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8770448Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8770855Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8771260Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8771680Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8772104Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8772522Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8772945Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8773378Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8773834Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8774268Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8774709Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8775158Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8775631Z W1204 10:35:57.318000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.inductor_seeds.default 2025-12-04T10:38:46.8775670Z PASSED [0.3140s] [ 53%] 2025-12-04T10:38:46.8775847Z inductor/test_compile_subprocess.py::GPUTests::test_linear_mixed_dtype_cuda <- test/inductor/test_torchinductor.py PASSED [0.2228s] [ 54%] 2025-12-04T10:38:46.8776043Z inductor/test_compile_subprocess.py::GPUTests::test_lite_regional_compile_repeated_blocks_cuda <- test/inductor/test_torchinductor.py PASSED [0.1721s] [ 54%] 2025-12-04T10:38:46.8776239Z inductor/test_compile_subprocess.py::GPUTests::test_lite_triton_kernel_wrapper_functional_cuda <- test/inductor/test_torchinductor.py PASSED [0.2754s] [ 54%] 2025-12-04T10:38:46.8776392Z inductor/test_compile_subprocess.py::GPUTests::test_log2_cuda <- test/inductor/test_torchinductor.py PASSED [0.4243s] [ 55%] 2025-12-04T10:38:46.8776547Z inductor/test_compile_subprocess.py::GPUTests::test_log_fp64_cuda <- test/inductor/test_torchinductor.py PASSED [0.4057s] [ 55%] 2025-12-04T10:38:46.8776754Z inductor/test_compile_subprocess.py::GPUTests::test_logcumsumexp_zero_dim_cuda <- test/inductor/test_torchinductor.py PASSED [0.3318s] [ 55%] 2025-12-04T10:38:46.8777082Z inductor/test_compile_subprocess.py::GPUTests::test_low_memory_max_pool_dilation_1_dim_2_cuda <- test/inductor/test_torchinductor.py W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8777230Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8777488Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8777609Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8777852Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8778025Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8778300Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8778423Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8778663Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8778829Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8779090Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8779226Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8779457Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8779614Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8779847Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8780005Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8780232Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8780398Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8780624Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8780826Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8781056Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8781244Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8781525Z W1204 10:35:59.496000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T10:38:46.8781678Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8781823Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8782078Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8782197Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8782436Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8782608Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8782831Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8782982Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8783222Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8783383Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8783621Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8783757Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8783993Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8784149Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8784377Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8784529Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8784756Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8784946Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8785173Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8785341Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8785573Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8785758Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8786041Z W1204 10:36:00.152000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T10:38:46.8786083Z PASSED [1.3252s] [ 56%] 2025-12-04T10:38:46.8786410Z inductor/test_compile_subprocess.py::GPUTests::test_low_memory_max_pool_dilation_2_dim_3_cuda <- test/inductor/test_torchinductor.py W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8786550Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8786803Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8786920Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8787186Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8787354Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8787580Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8787704Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8787943Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8788154Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8788394Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8788529Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8788755Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8788911Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8789177Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8789333Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8789559Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8789724Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8789949Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8790117Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8790351Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8790534Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8790811Z W1204 10:36:00.830000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T10:38:46.8790962Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8791102Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8791379Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8791497Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8791738Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8791907Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8792133Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8792257Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8792497Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8792660Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8792896Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8793059Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8793290Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8793445Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8793673Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8793828Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8794053Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8794220Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8794447Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8794617Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8794847Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8795033Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8795330Z W1204 10:36:02.616000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T10:38:46.8795369Z PASSED [3.7243s] [ 56%] 2025-12-04T10:38:46.8795558Z inductor/test_compile_subprocess.py::GPUTests::test_mark_dynamic_with_hint_override_cuda <- test/inductor/test_torchinductor.py PASSED [0.6137s] [ 56%] 2025-12-04T10:38:46.8795730Z inductor/test_compile_subprocess.py::GPUTests::test_masked_fill_promotion_cuda <- test/inductor/test_torchinductor.py PASSED [0.4869s] [ 57%] 2025-12-04T10:38:46.8795895Z inductor/test_compile_subprocess.py::GPUTests::test_matmul_layer_norm_cuda <- test/inductor/test_torchinductor.py PASSED [0.3089s] [ 57%] 2025-12-04T10:38:46.8796049Z inductor/test_compile_subprocess.py::GPUTests::test_max_min_cuda <- test/inductor/test_torchinductor.py PASSED [0.4774s] [ 57%] 2025-12-04T10:38:46.8796352Z inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d3_cuda <- test/inductor/test_torchinductor.py W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8796493Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8796749Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8796868Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8797105Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8797304Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8797528Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8797653Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8797890Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8798053Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8798339Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8798475Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8798704Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8798859Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8799091Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8799247Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8799504Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8799668Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8799894Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8800061Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8800295Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8800478Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8800753Z W1204 10:36:06.429000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T10:38:46.8800906Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8801045Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8801326Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8801443Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8801689Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8801858Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8802082Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8802209Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8802450Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8802614Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8802848Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8802984Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8803215Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8803397Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8803625Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8803781Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8804010Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8804173Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8804401Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8804568Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8804801Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8804982Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8805258Z W1204 10:36:07.042000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims._low_memory_max_pool_with_offsets.default 2025-12-04T10:38:46.8805318Z PASSED [1.2422s] [ 58%] 2025-12-04T10:38:46.8805512Z inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d_with_indices_backward2_cuda <- test/inductor/test_torchinductor.py PASSED [6.3543s] [ 58%] 2025-12-04T10:38:46.8805701Z inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d_with_indices_backward4_cuda <- test/inductor/test_torchinductor.py PASSED [7.1842s] [ 58%] 2025-12-04T10:38:46.8805886Z inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d_with_indices_backward5_cuda <- test/inductor/test_torchinductor.py PASSED [0.1425s] [ 59%] 2025-12-04T10:38:46.8806071Z inductor/test_compile_subprocess.py::GPUTests::test_max_pool2d_with_indices_backward_cuda <- test/inductor/test_torchinductor.py PASSED [1.3940s] [ 59%] 2025-12-04T10:38:46.8806221Z inductor/test_compile_subprocess.py::GPUTests::test_mean_cuda <- test/inductor/test_torchinductor.py PASSED [0.5794s] [ 59%] 2025-12-04T10:38:46.8806389Z inductor/test_compile_subprocess.py::GPUTests::test_min_max_reduction_cuda <- test/inductor/test_torchinductor.py PASSED [0.6506s] [ 60%] 2025-12-04T10:38:46.8806568Z inductor/test_compile_subprocess.py::GPUTests::test_misaligned_address_issue1_cuda <- test/inductor/test_torchinductor.py PASSED [0.3969s] [ 60%] 2025-12-04T10:38:46.8806865Z inductor/test_compile_subprocess.py::GPUTests::test_mixed_mm2_cuda <- test/inductor/test_torchinductor.py W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8807007Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8807265Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8807386Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8807656Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8807830Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8808058Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8808214Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8808452Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8808621Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8808858Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8808993Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8809223Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8809377Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8809640Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8809795Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8810025Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8810187Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8810414Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8810585Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8810817Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8811001Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8811262Z W1204 10:36:24.381000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8811415Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8811554Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8811836Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8811952Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8812194Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8812364Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8812592Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8812718Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8812954Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8813119Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8813353Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8813505Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8813734Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8813889Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8814115Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8814270Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8814498Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8814663Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8814891Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8815061Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8815294Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8815476Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8815763Z W1204 10:36:24.612000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8815803Z PASSED [0.4757s] [ 60%] 2025-12-04T10:38:46.8816099Z inductor/test_compile_subprocess.py::GPUTests::test_mixed_mm3_cuda <- test/inductor/test_torchinductor.py W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8816240Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8816494Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8816614Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8816853Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8817024Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8817248Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8817372Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8817627Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8817793Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8818028Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8818196Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8818424Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8818585Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8818819Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8818972Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8819199Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8819362Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8819589Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8819791Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8820021Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8820204Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8820594Z W1204 10:36:24.856000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8820749Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8820892Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8821148Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8821264Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8821502Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8821673Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8821929Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8822054Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8822291Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8822455Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8822688Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8822825Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8823056Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8823212Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8823441Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8823593Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8823823Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8824013Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8824241Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8824408Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8824640Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8824822Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8825086Z W1204 10:36:25.118000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8825127Z PASSED [0.5162s] [ 61%] 2025-12-04T10:38:46.8825293Z inductor/test_compile_subprocess.py::GPUTests::test_mm_mixed_dtype_cuda <- test/inductor/test_torchinductor.py PASSED [0.0882s] [ 61%] 2025-12-04T10:38:46.8825599Z inductor/test_compile_subprocess.py::GPUTests::test_mul_index_expr_cuda <- test/inductor/test_torchinductor.py W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8825741Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8826020Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8826137Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8826378Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8826547Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8826773Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8826900Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8827139Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8827302Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8827540Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8827675Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8827903Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8828153Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8828380Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8828537Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8828764Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8828930Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8829157Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8829326Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8829558Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8829739Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8829984Z W1204 10:36:25.456000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8830051Z PASSED [0.2683s] [ 61%] 2025-12-04T10:38:46.8830268Z inductor/test_compile_subprocess.py::GPUTests::test_multi_gpu_device_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (requires multiple cuda devices) [ 62%] 2025-12-04T10:38:46.8830435Z inductor/test_compile_subprocess.py::GPUTests::test_multi_threading_cuda <- test/inductor/test_torchinductor.py PASSED [0.1541s] [ 62%] 2025-12-04T10:38:46.8830596Z inductor/test_compile_subprocess.py::GPUTests::test_multilayer_any_cuda <- test/inductor/test_torchinductor.py PASSED [0.6366s] [ 62%] 2025-12-04T10:38:46.8830769Z inductor/test_compile_subprocess.py::GPUTests::test_multilayer_sum_low_prec_cuda <- test/inductor/test_torchinductor.py PASSED [0.2781s] [ 63%] 2025-12-04T10:38:46.8830938Z inductor/test_compile_subprocess.py::GPUTests::test_multilayer_var_lowp_cuda <- test/inductor/test_torchinductor.py PASSED [1.2065s] [ 63%] 2025-12-04T10:38:46.8831266Z inductor/test_compile_subprocess.py::GPUTests::test_mutable_custom_op_fixed_layout2_cuda <- test/inductor/test_torchinductor.py W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8831407Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8831663Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8831779Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8832021Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8832218Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8832444Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8832567Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8832805Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8832969Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8833204Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8833343Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8833571Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8833728Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8833959Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8834135Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8834366Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8834527Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8834752Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8834919Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8835153Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8835336Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8835577Z W1204 10:36:28.019000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.mylib.bar.default 2025-12-04T10:38:46.8835730Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8835868Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8836119Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8836256Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8836499Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8836668Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8836892Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8837015Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8837259Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8837423Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8837657Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8837793Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8838021Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8838240Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8838474Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8838634Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8838861Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8839026Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8839252Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8839422Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8839654Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8839836Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8840076Z W1204 10:36:28.298000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.mylib.bar.default 2025-12-04T10:38:46.8840117Z PASSED [0.3735s] [ 63%] 2025-12-04T10:38:46.8840348Z inductor/test_compile_subprocess.py::GPUTests::test_nan_sort_stable_False_descending_False_cuda <- test/inductor/test_torchinductor.py PASSED [0.3859s] [ 64%] 2025-12-04T10:38:46.8840508Z inductor/test_compile_subprocess.py::GPUTests::test_new_empty_cuda <- test/inductor/test_torchinductor.py PASSED [0.1399s] [ 64%] 2025-12-04T10:38:46.8840820Z inductor/test_compile_subprocess.py::GPUTests::test_nll_loss_backward_cuda <- test/inductor/test_torchinductor.py W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8840959Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8841211Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8841331Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8841570Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8841743Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8841967Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8842092Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8842354Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8842520Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8842755Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8842889Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8843118Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8843275Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8843505Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8843659Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8843887Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8844051Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8844279Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8844468Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8844698Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8844879Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8845120Z W1204 10:36:28.951000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8845275Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8845415Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8845668Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8845782Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8846024Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8846195Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8846438Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8846562Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8846800Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8846965Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8847200Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8847340Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8847570Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8847727Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8847952Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8848143Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8848372Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8848559Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8848787Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8848954Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8849186Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8849373Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8849639Z W1204 10:36:29.198000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8849679Z PASSED [0.6059s] [ 64%] 2025-12-04T10:38:46.8849984Z inductor/test_compile_subprocess.py::GPUTests::test_nll_loss_forward_cuda <- test/inductor/test_torchinductor.py W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8850123Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8850374Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8850519Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8850756Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8850926Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8851151Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8851275Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8851518Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8851685Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8851923Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8852057Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8852286Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8852443Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8852693Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8852848Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8853075Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8853240Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8853466Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8853639Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8853874Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8854060Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8854320Z W1204 10:36:29.527000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8854502Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8854644Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8854897Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8855016Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8855253Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8855426Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8855649Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8855774Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8856017Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8856183Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8856418Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8856555Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8856801Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8856955Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8857183Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8857336Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8857564Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8857727Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8857954Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8858161Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8858395Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8858608Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8858872Z W1204 10:36:29.813000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8858913Z PASSED [0.6383s] [ 65%] 2025-12-04T10:38:46.8859205Z inductor/test_compile_subprocess.py::GPUTests::test_one_hot_cuda <- test/inductor/test_torchinductor.py W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8859348Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8859600Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8859717Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8859954Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8860122Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8860349Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8860472Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8860740Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8860902Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8861137Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8861271Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8861500Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8861660Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8861890Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8862045Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8862271Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8862436Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8862683Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8862855Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8863084Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8863269Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8863511Z W1204 10:36:30.160000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8863553Z PASSED [0.2883s] [ 65%] 2025-12-04T10:38:46.8863875Z inductor/test_compile_subprocess.py::GPUTests::test_pattern_matcher_unbacked_cuda <- test/inductor/test_torchinductor.py W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8864016Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8864268Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8864383Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8864621Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8864814Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8865041Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8865169Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8865406Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8865572Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8865810Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8865946Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8866173Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8866329Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8866555Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8866731Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8866961Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8867125Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8867349Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8867517Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8867751Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8867935Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8868249Z W1204 10:36:30.482000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8868289Z PASSED [0.4373s] [ 65%] 2025-12-04T10:38:46.8868464Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_bessel_j0_cuda <- test/inductor/test_torchinductor.py PASSED [0.4753s] [ 66%] 2025-12-04T10:38:46.8868635Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_bessel_y0_cuda <- test/inductor/test_torchinductor.py PASSED [0.4994s] [ 66%] 2025-12-04T10:38:46.8868804Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_bessel_y1_cuda <- test/inductor/test_torchinductor.py PASSED [0.5585s] [ 67%] 2025-12-04T10:38:46.8869027Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_chebyshev_polynomial_t_cuda <- test/inductor/test_torchinductor.py PASSED [0.0897s] [ 67%] 2025-12-04T10:38:46.8869213Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_chebyshev_polynomial_w_cuda <- test/inductor/test_torchinductor.py PASSED [0.0724s] [ 67%] 2025-12-04T10:38:46.8869518Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_entr_cuda <- test/inductor/test_torchinductor.py W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8869657Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8869918Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8870036Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8870277Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8870448Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8870672Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8870826Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8871064Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8871231Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8871468Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8871606Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8871834Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8871994Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8872225Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8872378Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8872607Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8872773Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8873021Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8873189Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8873423Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8873607Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8873873Z W1204 10:36:33.070000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8873916Z PASSED [0.6951s] [ 68%] 2025-12-04T10:38:46.8874085Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_erfcx_cuda <- test/inductor/test_torchinductor.py PASSED [0.3958s] [ 68%] 2025-12-04T10:38:46.8874252Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_expm1_cuda <- test/inductor/test_torchinductor.py PASSED [0.4435s] [ 68%] 2025-12-04T10:38:46.8874421Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_gammaincc_cuda <- test/inductor/test_torchinductor.py PASSED [0.1023s] [ 69%] 2025-12-04T10:38:46.8874590Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_gammaln_cuda <- test/inductor/test_torchinductor.py PASSED [0.5563s] [ 69%] 2025-12-04T10:38:46.8874775Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_laguerre_polynomial_l_cuda <- test/inductor/test_torchinductor.py PASSED [0.2157s] [ 69%] 2025-12-04T10:38:46.8875105Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_log_ndtr_cuda <- test/inductor/test_torchinductor.py W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8875246Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8875502Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8875618Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8875861Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8876037Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8876264Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8876388Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8876629Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8876794Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8877032Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8877189Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8877416Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8877573Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8877802Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8877957Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8878233Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8878396Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8878623Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8878792Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8879024Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8879241Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8879504Z W1204 10:36:35.462000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8879546Z PASSED [0.7166s] [ 70%] 2025-12-04T10:38:46.8879848Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_logit_cuda <- test/inductor/test_torchinductor.py W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8879990Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8880245Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8880366Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8880604Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8880775Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8880999Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8881125Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8881390Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8881553Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8881790Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8881924Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8882157Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8882313Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8882546Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8882700Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8882928Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8883123Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8883348Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8883520Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8883750Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8883934Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8884198Z W1204 10:36:35.957000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8884239Z PASSED [0.4436s] [ 70%] 2025-12-04T10:38:46.8884556Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_multigammaln_cuda <- test/inductor/test_torchinductor.py W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8884699Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8884957Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8885074Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8885318Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8885514Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8885741Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8885863Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8886102Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8886268Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8886502Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8886637Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8886866Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8887024Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8887273Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8887430Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8887657Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8887822Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8888049Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8888258Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8888493Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8888674Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8888915Z W1204 10:36:36.172000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8889067Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8889209Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8889489Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8889606Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8889844Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8890013Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8890242Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8890366Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8890606Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8890768Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8891003Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8891136Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8891395Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8891552Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8891778Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8891934Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8892158Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8892325Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8892550Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8892721Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8892949Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8893132Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8893401Z W1204 10:36:36.483000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8893462Z PASSED [0.6275s] [ 70%] 2025-12-04T10:38:46.8893633Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_ndtri_cuda <- test/inductor/test_torchinductor.py PASSED [0.0776s] [ 71%] 2025-12-04T10:38:46.8893794Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_psi_cuda <- test/inductor/test_torchinductor.py PASSED [0.3563s] [ 71%] 2025-12-04T10:38:46.8893989Z inductor/test_compile_subprocess.py::GPUTests::test_pointwise_scaled_modified_bessel_k1_cuda <- test/inductor/test_torchinductor.py PASSED [0.0802s] [ 71%] 2025-12-04T10:38:46.8894140Z inductor/test_compile_subprocess.py::GPUTests::test_pow3_cuda <- test/inductor/test_torchinductor.py PASSED [0.4555s] [ 72%] 2025-12-04T10:38:46.8894440Z inductor/test_compile_subprocess.py::GPUTests::test_pow_symfloat_cuda <- test/inductor/test_torchinductor.py W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8894584Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8894839Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8894956Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8895198Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8895389Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8895614Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8895738Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8895976Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8896140Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8896373Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8896511Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8896741Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8896899Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8897129Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8897283Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8897513Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8897696Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8897925Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8898131Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8898364Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8898549Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8898813Z W1204 10:36:37.783000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8898855Z PASSED [0.4019s] [ 72%] 2025-12-04T10:38:46.8899008Z inductor/test_compile_subprocess.py::GPUTests::test_prod_cuda <- test/inductor/test_torchinductor.py PASSED [1.7421s] [ 72%] 2025-12-04T10:38:46.8899518Z inductor/test_compile_subprocess.py::GPUTests::test_progressive SKIPPED [0.0005s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/157724 for platform(s) rocm. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 73%] 2025-12-04T10:38:46.8899876Z inductor/test_compile_subprocess.py::GPUTests::test_rand_like_deterministic_cuda <- test/inductor/test_torchinductor.py W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8900019Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8900276Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8900395Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8900636Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8900807Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8901035Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8901158Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8901400Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8901561Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8901823Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8901958Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8902189Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8902347Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8902577Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8902737Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8902965Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8903130Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8903354Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8903525Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8903777Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8903961Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8904215Z W1204 10:36:39.921000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.inductor_seeds.default 2025-12-04T10:38:46.8904253Z PASSED [0.4952s] [ 73%] 2025-12-04T10:38:46.8904570Z inductor/test_compile_subprocess.py::GPUTests::test_randint_distribution_cuda <- test/inductor/test_torchinductor.py W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8904714Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8904971Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8905087Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8905329Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8905498Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8905724Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8905850Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8906109Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8906274Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8906506Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8906641Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8906873Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8907030Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8907257Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8907412Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8907640Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8907825Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8908052Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8908261Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8908493Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8908676Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8908934Z W1204 10:36:40.412000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.inductor_seeds.default 2025-12-04T10:38:46.8908979Z PASSED [0.7376s] [ 73%] 2025-12-04T10:38:46.8909286Z inductor/test_compile_subprocess.py::GPUTests::test_randn_generator_cuda <- test/inductor/test_torchinductor.py W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8909428Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8909679Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8909799Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8910064Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8910238Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8910461Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8910586Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8910826Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8910991Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8911231Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8911366Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8911597Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8911751Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8912003Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8912159Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8912389Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8912554Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8912780Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8912952Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8913184Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8913369Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8913623Z W1204 10:36:41.156000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.inductor_seeds.default 2025-12-04T10:38:46.8913665Z PASSED [0.6757s] [ 74%] 2025-12-04T10:38:46.8913971Z inductor/test_compile_subprocess.py::GPUTests::test_randn_like_empty_cuda <- test/inductor/test_torchinductor.py W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8914133Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8914389Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8914506Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8914748Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8914918Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8915148Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8915271Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8915512Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8915677Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8915913Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8916074Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8916302Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8916460Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8916687Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8916845Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8917073Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8917243Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8917469Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8917638Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8917875Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8918058Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8918379Z W1204 10:36:41.832000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.inductor_seeds.default 2025-12-04T10:38:46.8918529Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8918670Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8918923Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8919044Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8919286Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8919458Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8919685Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8919811Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8920054Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8920243Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8920480Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8920612Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8920842Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8920996Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8921231Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8921385Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8921612Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8921778Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8922003Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8922177Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8922436Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8922622Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8922874Z W1204 10:36:41.885000 404399 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.inductor_seeds.default 2025-12-04T10:38:46.8922915Z PASSED [0.1111s] [ 74%] 2025-12-04T10:38:46.8923081Z inductor/test_compile_subprocess.py::GPUTests::test_reduction2_cuda <- test/inductor/test_torchinductor.py PASSED [0.5564s] [ 74%] 2025-12-04T10:38:46.8923242Z inductor/test_compile_subprocess.py::GPUTests::test_reduction5_cuda <- test/inductor/test_torchinductor.py PASSED [0.3800s] [ 75%] 2025-12-04T10:38:46.8923402Z inductor/test_compile_subprocess.py::GPUTests::test_remainder_cuda <- test/inductor/test_torchinductor.py PASSED [0.5582s] [ 75%] 2025-12-04T10:38:46.8923596Z inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_slice_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.2854s] [ 75%] 2025-12-04T10:38:46.8923791Z inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_slice_cuda <- test/inductor/test_torchinductor.py ('RERUN', {'yellow': True}) [0.2158s] [ 75%] 2025-12-04T10:38:46.8923955Z inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_slice_cuda <- test/inductor/test_torchinductor.py FAILED [0.2204s] [ 75%] 2025-12-04T10:38:46.8923981Z 2025-12-04T10:38:46.8924041Z ==================================== RERUNS ==================================== 2025-12-04T10:38:46.8924121Z _____________________ GPUTests.test_remove_noop_slice_cuda _____________________ 2025-12-04T10:38:46.8924172Z Traceback (most recent call last): 2025-12-04T10:38:46.8924299Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T10:38:46.8924341Z return value(self) 2025-12-04T10:38:46.8924483Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 6708, in test_remove_noop_slice 2025-12-04T10:38:46.8924537Z self.assertExpectedInline( 2025-12-04T10:38:46.8924713Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3272, in assertExpectedInline 2025-12-04T10:38:46.8924851Z return super().assertExpectedInline(actual if isinstance(actual, str) else str(actual), expect, skip + 1) 2025-12-04T10:38:46.8925002Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/expecttest/__init__.py", line 413, in assertExpectedInline 2025-12-04T10:38:46.8925051Z assert_expected_inline( 2025-12-04T10:38:46.8925201Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/expecttest/__init__.py", line 378, in assert_expected_inline 2025-12-04T10:38:46.8925257Z assert_eq(expect, actual, msg=help_text) 2025-12-04T10:38:46.8925424Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/expecttest/__init__.py", line 450, in assertMultiLineEqualMaybeCppStack 2025-12-04T10:38:46.8925502Z self.assertMultiLineEqual(expect, actual, *args, **kwargs) 2025-12-04T10:38:46.8925622Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 1226, in assertMultiLineEqual 2025-12-04T10:38:46.8925685Z self.fail(self._formatMessage(msg, standardMsg)) 2025-12-04T10:38:46.8925783Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 675, in fail 2025-12-04T10:38:46.8925832Z raise self.failureException(msg) 2025-12-04T10:38:46.8925929Z AssertionError: 'def forward(self, arg0_1: "Sym(s77)", arg[333 chars]_9,)' != '' 2025-12-04T10:38:46.8926053Z - def forward(self, arg0_1: "Sym(s77)", arg1_1: "Sym(s27)", arg2_1: "Sym(s53)", arg3_1: "f32[s77, s27, s53][s27*s53, s53, 1]cuda:0"): 2025-12-04T10:38:46.8926189Z - add: "f32[s77, s27, s53][s27*s53, s53, 1]cuda:0" = torch.ops.aten.add.Tensor(arg3_1, 1); arg3_1 = None 2025-12-04T10:38:46.8926288Z - add_9: "f32[s77, s27, s53][s27*s53, s53, 1]cuda:0" = torch.ops.aten.add.Tensor(add, 1); add = None 2025-12-04T10:38:46.8926470Z - return (add_9,) : To accept the new output, re-run test with envvar EXPECTTEST_ACCEPT=1 (we recommend staging/committing your changes before doing this) 2025-12-04T10:38:46.8926472Z 2025-12-04T10:38:46.8926549Z To execute this test, run the following from the base repo dir: 2025-12-04T10:38:46.8926734Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_remove_noop_slice_cuda 2025-12-04T10:38:46.8926738Z 2025-12-04T10:38:46.8926828Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:38:46.8926911Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:38:46.8926957Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:38:46.8927018Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:38:46.8927121Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T10:38:46.8927446Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark', 3), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T10:38:46.8927522Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:38:46.8927739Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T10:38:46.8927854Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T10:38:46.8928102Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__setstate__, please add missing op schema 2025-12-04T10:38:46.8928193Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T10:38:46.8928272Z _____________________ GPUTests.test_remove_noop_slice_cuda _____________________ 2025-12-04T10:38:46.8928322Z Traceback (most recent call last): 2025-12-04T10:38:46.8928551Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T10:38:46.8928592Z return value(self) 2025-12-04T10:38:46.8928730Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 6708, in test_remove_noop_slice 2025-12-04T10:38:46.8928782Z self.assertExpectedInline( 2025-12-04T10:38:46.8928964Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3272, in assertExpectedInline 2025-12-04T10:38:46.8929104Z return super().assertExpectedInline(actual if isinstance(actual, str) else str(actual), expect, skip + 1) 2025-12-04T10:38:46.8929252Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/expecttest/__init__.py", line 413, in assertExpectedInline 2025-12-04T10:38:46.8929296Z assert_expected_inline( 2025-12-04T10:38:46.8929444Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/expecttest/__init__.py", line 378, in assert_expected_inline 2025-12-04T10:38:46.8929500Z assert_eq(expect, actual, msg=help_text) 2025-12-04T10:38:46.8929663Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/expecttest/__init__.py", line 450, in assertMultiLineEqualMaybeCppStack 2025-12-04T10:38:46.8929741Z self.assertMultiLineEqual(expect, actual, *args, **kwargs) 2025-12-04T10:38:46.8929862Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 1226, in assertMultiLineEqual 2025-12-04T10:38:46.8929925Z self.fail(self._formatMessage(msg, standardMsg)) 2025-12-04T10:38:46.8930061Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 675, in fail 2025-12-04T10:38:46.8930112Z raise self.failureException(msg) 2025-12-04T10:38:46.8930205Z AssertionError: 'def forward(self, arg0_1: "Sym(s77)", arg[333 chars]_9,)' != '' 2025-12-04T10:38:46.8930328Z - def forward(self, arg0_1: "Sym(s77)", arg1_1: "Sym(s27)", arg2_1: "Sym(s53)", arg3_1: "f32[s77, s27, s53][s27*s53, s53, 1]cuda:0"): 2025-12-04T10:38:46.8930432Z - add: "f32[s77, s27, s53][s27*s53, s53, 1]cuda:0" = torch.ops.aten.add.Tensor(arg3_1, 1); arg3_1 = None 2025-12-04T10:38:46.8930529Z - add_9: "f32[s77, s27, s53][s27*s53, s53, 1]cuda:0" = torch.ops.aten.add.Tensor(add, 1); add = None 2025-12-04T10:38:46.8930706Z - return (add_9,) : To accept the new output, re-run test with envvar EXPECTTEST_ACCEPT=1 (we recommend staging/committing your changes before doing this) 2025-12-04T10:38:46.8930712Z 2025-12-04T10:38:46.8930788Z To execute this test, run the following from the base repo dir: 2025-12-04T10:38:46.8930976Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_remove_noop_slice_cuda 2025-12-04T10:38:46.8930978Z 2025-12-04T10:38:46.8931066Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:38:46.8931143Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:38:46.8931187Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:38:46.8931247Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:38:46.8931348Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T10:38:46.8931667Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark', 3), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T10:38:46.8931775Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:38:46.8931990Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T10:38:46.8932079Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T10:38:46.8932291Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__setstate__, please add missing op schema 2025-12-04T10:38:46.8932379Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T10:38:46.8932452Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:38:46.8932500Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:38:46.8932560Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:38:46.8932661Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T10:38:46.8932978Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark', 3), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T10:38:46.8933055Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:38:46.8933266Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T10:38:46.8933355Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T10:38:46.8933563Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__setstate__, please add missing op schema 2025-12-04T10:38:46.8933655Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T10:38:46.8933732Z =================================== FAILURES =================================== 2025-12-04T10:38:46.8933813Z _____________________ GPUTests.test_remove_noop_slice_cuda _____________________ 2025-12-04T10:38:46.8933860Z Traceback (most recent call last): 2025-12-04T10:38:46.8933981Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 14842, in new_test 2025-12-04T10:38:46.8934019Z return value(self) 2025-12-04T10:38:46.8934159Z File "/var/lib/jenkins/pytorch/test/inductor/test_torchinductor.py", line 6708, in test_remove_noop_slice 2025-12-04T10:38:46.8934207Z self.assertExpectedInline( 2025-12-04T10:38:46.8934384Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3272, in assertExpectedInline 2025-12-04T10:38:46.8934522Z return super().assertExpectedInline(actual if isinstance(actual, str) else str(actual), expect, skip + 1) 2025-12-04T10:38:46.8934675Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/expecttest/__init__.py", line 413, in assertExpectedInline 2025-12-04T10:38:46.8934718Z assert_expected_inline( 2025-12-04T10:38:46.8934868Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/expecttest/__init__.py", line 378, in assert_expected_inline 2025-12-04T10:38:46.8934920Z assert_eq(expect, actual, msg=help_text) 2025-12-04T10:38:46.8935085Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/expecttest/__init__.py", line 450, in assertMultiLineEqualMaybeCppStack 2025-12-04T10:38:46.8935159Z self.assertMultiLineEqual(expect, actual, *args, **kwargs) 2025-12-04T10:38:46.8935282Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 1226, in assertMultiLineEqual 2025-12-04T10:38:46.8935343Z self.fail(self._formatMessage(msg, standardMsg)) 2025-12-04T10:38:46.8935462Z File "/opt/conda/envs/py_3.10/lib/python3.10/unittest/case.py", line 675, in fail 2025-12-04T10:38:46.8935509Z raise self.failureException(msg) 2025-12-04T10:38:46.8935605Z AssertionError: 'def forward(self, arg0_1: "Sym(s77)", arg[333 chars]_9,)' != '' 2025-12-04T10:38:46.8935726Z - def forward(self, arg0_1: "Sym(s77)", arg1_1: "Sym(s27)", arg2_1: "Sym(s53)", arg3_1: "f32[s77, s27, s53][s27*s53, s53, 1]cuda:0"): 2025-12-04T10:38:46.8935827Z - add: "f32[s77, s27, s53][s27*s53, s53, 1]cuda:0" = torch.ops.aten.add.Tensor(arg3_1, 1); arg3_1 = None 2025-12-04T10:38:46.8935924Z - add_9: "f32[s77, s27, s53][s27*s53, s53, 1]cuda:0" = torch.ops.aten.add.Tensor(add, 1); add = None 2025-12-04T10:38:46.8936098Z - return (add_9,) : To accept the new output, re-run test with envvar EXPECTTEST_ACCEPT=1 (we recommend staging/committing your changes before doing this) 2025-12-04T10:38:46.8936101Z 2025-12-04T10:38:46.8936177Z To execute this test, run the following from the base repo dir: 2025-12-04T10:38:46.8936358Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_remove_noop_slice_cuda 2025-12-04T10:38:46.8936360Z 2025-12-04T10:38:46.8936450Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:38:46.8936525Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:38:46.8936569Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:38:46.8936625Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:38:46.8936726Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T10:38:46.8937040Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark', 3), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T10:38:46.8937117Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:38:46.8937350Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T10:38:46.8937441Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T10:38:46.8937650Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__setstate__, please add missing op schema 2025-12-04T10:38:46.8937740Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T10:38:46.8937815Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:38:46.8937858Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:38:46.8937916Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:38:46.8938015Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T10:38:46.8938373Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark', 3), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T10:38:46.8938447Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:38:46.8938656Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T10:38:46.8938741Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T10:38:46.8938948Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__setstate__, please add missing op schema 2025-12-04T10:38:46.8939035Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T10:38:46.8939142Z ----------------------------- Captured stdout call ----------------------------- 2025-12-04T10:38:46.8939184Z frames [('total', 1), ('ok', 1)] 2025-12-04T10:38:46.8939241Z stats [('calls_captured', 3), ('unique_graphs', 1)] 2025-12-04T10:38:46.8939342Z aot_autograd [('total', 1), ('autograd_cache_miss', 1), ('autograd_cache_saved', 1), ('ok', 1)] 2025-12-04T10:38:46.8939663Z inductor [('triton_bundler_save_kernel', 24), ('benchmarking.InductorBenchmarker.benchmark', 3), ('benchmarking.InductorBenchmarker.benchmark_gpu', 3), ('fxgraph_cache_miss', 1), ('async_compile_cache_miss', 1), ('triton_bundler_save_static_autotuner', 1)] 2025-12-04T10:38:46.8939738Z ----------------------------- Captured stderr call ----------------------------- 2025-12-04T10:38:46.8939948Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__getstate__, please add missing op schema 2025-12-04T10:38:46.8940036Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T10:38:46.8940244Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/ops_handler.py:772: UserWarning: undefined OpHandler.__setstate__, please add missing op schema 2025-12-04T10:38:46.8940334Z warnings.warn(f"undefined OpHandler.{name}, please add missing op schema") 2025-12-04T10:38:46.8940571Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-545f778f39a758b6.xml - 2025-12-04T10:38:46.8940634Z =========================== short test summary info ============================ 2025-12-04T10:38:46.8940848Z FAILED [0.2204s] inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_slice_cuda - AssertionError: 'def forward(self, arg0_1: "Sym(s77)", arg[333 chars]_9,)' != '' 2025-12-04T10:38:46.8940971Z - def forward(self, arg0_1: "Sym(s77)", arg1_1: "Sym(s27)", arg2_1: "Sym(s53)", arg3_1: "f32[s77, s27, s53][s27*s53, s53, 1]cuda:0"): 2025-12-04T10:38:46.8941071Z - add: "f32[s77, s27, s53][s27*s53, s53, 1]cuda:0" = torch.ops.aten.add.Tensor(arg3_1, 1); arg3_1 = None 2025-12-04T10:38:46.8941194Z - add_9: "f32[s77, s27, s53][s27*s53, s53, 1]cuda:0" = torch.ops.aten.add.Tensor(add, 1); add = None 2025-12-04T10:38:46.8941371Z - return (add_9,) : To accept the new output, re-run test with envvar EXPECTTEST_ACCEPT=1 (we recommend staging/committing your changes before doing this) 2025-12-04T10:38:46.8941373Z 2025-12-04T10:38:46.8941449Z To execute this test, run the following from the base repo dir: 2025-12-04T10:38:46.8941629Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/inductor/test_compile_subprocess.py GPUTests.test_remove_noop_slice_cuda 2025-12-04T10:38:46.8941631Z 2025-12-04T10:38:46.8941719Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T10:38:46.8941780Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T10:38:46.8941863Z ======== 1 failed, 216 passed, 8 skipped, 2 rerun in 456.31s (0:07:36) ========= 2025-12-04T10:38:46.8941902Z Got exit code 1 2025-12-04T10:38:46.8941946Z Retrying single test... 2025-12-04T10:38:46.8942139Z Test results will be stored in test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-06d560f9a985c464.xml 2025-12-04T10:38:46.8942200Z ============================= test session starts ============================== 2025-12-04T10:38:46.8942316Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:38:46.8942358Z cachedir: .pytest_cache 2025-12-04T10:38:46.8942519Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:38:46.8942566Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T10:38:46.8942608Z configfile: pytest.ini 2025-12-04T10:38:46.8942772Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:38:46.8942874Z collecting ... collected 897 items / 296 deselected / 601 selected 2025-12-04T10:38:46.8943052Z stepcurrent: skipping 224 already run items. Running only test/inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_slice_cuda 2025-12-04T10:38:46.8943099Z Running 1 items in this shard 2025-12-04T10:38:46.8943101Z 2025-12-04T10:38:46.8943277Z inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_slice_cuda <- test/inductor/test_torchinductor.py PASSED [5.9060s] [100%] 2025-12-04T10:38:46.8943279Z 2025-12-04T10:38:46.8943512Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-06d560f9a985c464.xml - 2025-12-04T10:38:46.8943577Z ====================== 1 passed, 296 deselected in 5.93s ======================= 2025-12-04T10:38:46.8943616Z Got exit code 0 2025-12-04T10:38:46.8943700Z Test succeeded in new process, continuing with the rest of the tests 2025-12-04T10:38:46.8943889Z Test results will be stored in test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-af8fb06866c88361.xml 2025-12-04T10:38:46.8943950Z ============================= test session starts ============================== 2025-12-04T10:38:46.8944062Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T10:38:46.8944102Z cachedir: .pytest_cache 2025-12-04T10:38:46.8944258Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T10:38:46.8944304Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T10:38:46.8944344Z configfile: pytest.ini 2025-12-04T10:38:46.8944503Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T10:38:46.8944583Z collecting ... collected 897 items / 225 deselected / 672 selected 2025-12-04T10:38:46.8944638Z stepcurrent: skipping 225 already run items. 2025-12-04T10:38:46.8944685Z Running 72 items in this shard 2025-12-04T10:38:46.8944687Z 2025-12-04T10:38:46.8945288Z inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_slice_scatter_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0005s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/151378 for platform(s) linux, rocm, slow. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 1%] 2025-12-04T10:38:46.8945451Z inductor/test_compile_subprocess.py::GPUTests::test_repeat_cuda <- test/inductor/test_torchinductor.py PASSED [6.5828s] [ 2%] 2025-12-04T10:38:46.8945795Z inductor/test_compile_subprocess.py::GPUTests::test_repeat_interleave_decomposition_has_clamp_cuda <- test/inductor/test_torchinductor.py W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8945940Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8946204Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8946325Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8946569Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8946744Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8947004Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8947128Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8947365Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8947531Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8947765Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8947906Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8948186Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8948346Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8948582Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8948738Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8948970Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8949165Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8949393Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8949563Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8949797Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8949984Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8950230Z W1204 10:37:19.285000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8950273Z PASSED [0.8170s] [ 4%] 2025-12-04T10:38:46.8950451Z inductor/test_compile_subprocess.py::GPUTests::test_require_stride_expanded_cuda <- test/inductor/test_torchinductor.py PASSED [0.5650s] [ 5%] 2025-12-04T10:38:46.8950609Z inductor/test_compile_subprocess.py::GPUTests::test_resize_cuda <- test/inductor/test_torchinductor.py PASSED [7.0981s] [ 6%] 2025-12-04T10:38:46.8950904Z inductor/test_compile_subprocess.py::GPUTests::test_roi_align_cuda <- test/inductor/test_torchinductor.py W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8951073Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8951328Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8951448Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8951689Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8951862Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8952091Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8952217Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8952456Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8952618Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8952857Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8952995Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8953244Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8953399Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8953628Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8953786Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8954014Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8954182Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8954405Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8954577Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8954808Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8954991Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8955272Z W1204 10:37:27.801000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.torchvision.roi_align.default 2025-12-04T10:38:46.8955427Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8955569Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8955821Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8955938Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8956177Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8956350Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8956578Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8956706Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8956946Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8957113Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8957373Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8957508Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8957736Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8957890Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8958152Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8958310Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8958539Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8958702Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8958931Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8959139Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8959371Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8959555Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8959808Z W1204 10:37:27.861000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.torchvision.roi_align.default 2025-12-04T10:38:46.8959850Z PASSED [0.1900s] [ 8%] 2025-12-04T10:38:46.8960139Z inductor/test_compile_subprocess.py::GPUTests::test_roll_cuda <- test/inductor/test_torchinductor.py W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8960282Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8960535Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8960652Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8960894Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8961065Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8961293Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8961443Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8961682Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8964253Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8964499Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8964640Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8964879Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8965036Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8965264Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8965420Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8965648Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8965853Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8966078Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8966249Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8966480Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8966663Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8966908Z W1204 10:37:27.964000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8967059Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8967197Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8967450Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8967571Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8967818Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8968011Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8968277Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8968400Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8968639Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8968807Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8969045Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8969180Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8969409Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8969566Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8969832Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8969991Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8970219Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8970383Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8970608Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8970784Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8971017Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8971200Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8971442Z W1204 10:37:28.377000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8971483Z PASSED [1.0377s] [ 9%] 2025-12-04T10:38:46.8971647Z inductor/test_compile_subprocess.py::GPUTests::test_round_cuda <- test/inductor/test_torchinductor.py PASSED [0.5868s] [ 11%] 2025-12-04T10:38:46.8971803Z inductor/test_compile_subprocess.py::GPUTests::test_rsqrt_cuda <- test/inductor/test_torchinductor.py PASSED [0.5019s] [ 12%] 2025-12-04T10:38:46.8972143Z inductor/test_compile_subprocess.py::GPUTests::test_rsqrt_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8972286Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8972546Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8972662Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8972904Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8973078Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8973306Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8973432Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8973671Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8973836Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8974096Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8974233Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8974464Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8974623Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8974856Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8975013Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8975244Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8975414Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8975642Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8975809Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8976066Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8976249Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8976511Z W1204 10:37:30.122000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8976668Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8976807Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8977062Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8977179Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8977418Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8977589Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8977815Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8977971Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8978262Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8978427Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8978661Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8978798Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8979028Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8979191Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8979420Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8979574Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8979803Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8979970Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8980232Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8980401Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8980638Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8980822Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8981087Z W1204 10:37:32.017000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8981128Z PASSED [3.0611s] [ 13%] 2025-12-04T10:38:46.8981319Z inductor/test_compile_subprocess.py::GPUTests::test_scaled_dot_product_attention_cuda <- test/inductor/test_torchinductor.py PASSED [0.8655s] [ 15%] 2025-12-04T10:38:46.8981477Z inductor/test_compile_subprocess.py::GPUTests::test_scatter3_cuda <- test/inductor/test_torchinductor.py PASSED [0.6035s] [ 16%] 2025-12-04T10:38:46.8981635Z inductor/test_compile_subprocess.py::GPUTests::test_scatter4_cuda <- test/inductor/test_torchinductor.py PASSED [1.1815s] [ 18%] 2025-12-04T10:38:46.8981794Z inductor/test_compile_subprocess.py::GPUTests::test_scatter_add3_cuda <- test/inductor/test_torchinductor.py PASSED [0.9226s] [ 19%] 2025-12-04T10:38:46.8981961Z inductor/test_compile_subprocess.py::GPUTests::test_scatter_reduce3_cuda <- test/inductor/test_torchinductor.py PASSED [0.9561s] [ 20%] 2025-12-04T10:38:46.8982195Z inductor/test_compile_subprocess.py::GPUTests::test_sdpa_prefer_nd_tiling_False_use_block_ptr_True_cuda <- test/inductor/test_torchinductor.py PASSED [0.8742s] [ 22%] 2025-12-04T10:38:46.8982398Z inductor/test_compile_subprocess.py::GPUTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_False_cuda <- test/inductor/test_torchinductor.py PASSED [0.6564s] [ 23%] 2025-12-04T10:38:46.8982726Z inductor/test_compile_subprocess.py::GPUTests::test_sdpa_unaligned_mask_freezing_cuda <- test/inductor/test_torchinductor.py W1204 10:37:39.243000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8982867Z W1204 10:37:39.243000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8983127Z W1204 10:37:39.243000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8983248Z W1204 10:37:39.243000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8983494Z W1204 10:37:39.243000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8983669Z W1204 10:37:39.243000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8983895Z W1204 10:37:39.243000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8984022Z W1204 10:37:39.243000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8984253Z W1204 10:37:39.243000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] AttributeError: Can't pickle local object 'CommonTemplate.test_sdpa_unaligned_mask_freezing..Mod' 2025-12-04T10:38:46.8984295Z PASSED [0.0955s] [ 25%] 2025-12-04T10:38:46.8984482Z inductor/test_compile_subprocess.py::GPUTests::test_shape_padding_cuda <- test/inductor/test_torchinductor.py PASSED [1.5707s] [ 26%] 2025-12-04T10:38:46.8984640Z inductor/test_compile_subprocess.py::GPUTests::test_signbit_cuda <- test/inductor/test_torchinductor.py PASSED [0.5913s] [ 27%] 2025-12-04T10:38:46.8984926Z inductor/test_compile_subprocess.py::GPUTests::test_silu_cuda <- test/inductor/test_torchinductor.py W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8985068Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8985320Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8985442Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8985686Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8985860Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8986086Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8986209Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8986469Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8986632Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8986869Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8987002Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8987231Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8987392Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8987620Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8987776Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8988003Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8988202Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8988427Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8988628Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8988860Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8989043Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8989307Z W1204 10:37:41.677000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.8989347Z PASSED [0.4316s] [ 29%] 2025-12-04T10:38:46.8989504Z inductor/test_compile_subprocess.py::GPUTests::test_sin_cuda <- test/inductor/test_torchinductor.py PASSED [0.8059s] [ 30%] 2025-12-04T10:38:46.8989657Z inductor/test_compile_subprocess.py::GPUTests::test_slice1_cuda <- test/inductor/test_torchinductor.py PASSED [0.6862s] [ 31%] 2025-12-04T10:38:46.8989811Z inductor/test_compile_subprocess.py::GPUTests::test_slice2_cuda <- test/inductor/test_torchinductor.py PASSED [0.4519s] [ 33%] 2025-12-04T10:38:46.8989977Z inductor/test_compile_subprocess.py::GPUTests::test_slice_mutation1_cuda <- test/inductor/test_torchinductor.py PASSED [0.6194s] [ 34%] 2025-12-04T10:38:46.8990145Z inductor/test_compile_subprocess.py::GPUTests::test_slice_scatter5_cuda <- test/inductor/test_torchinductor.py PASSED [0.5377s] [ 36%] 2025-12-04T10:38:46.8990296Z inductor/test_compile_subprocess.py::GPUTests::test_sort_cuda <- test/inductor/test_torchinductor.py PASSED [1.1947s] [ 37%] 2025-12-04T10:38:46.8990567Z inductor/test_compile_subprocess.py::GPUTests::test_sort_stable_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0003s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 38%] 2025-12-04T10:38:46.8990730Z inductor/test_compile_subprocess.py::GPUTests::test_sort_transpose_cuda <- test/inductor/test_torchinductor.py PASSED [20.7722s] [ 40%] 2025-12-04T10:38:46.8990898Z inductor/test_compile_subprocess.py::GPUTests::test_special_polygamma_cuda <- test/inductor/test_torchinductor.py PASSED [0.3718s] [ 41%] 2025-12-04T10:38:46.8991068Z inductor/test_compile_subprocess.py::GPUTests::test_split_cumprod_low_prec_cuda <- test/inductor/test_torchinductor.py PASSED [0.4211s] [ 43%] 2025-12-04T10:38:46.8991238Z inductor/test_compile_subprocess.py::GPUTests::test_split_cumsum_low_prec_cuda <- test/inductor/test_torchinductor.py PASSED [0.2363s] [ 44%] 2025-12-04T10:38:46.8991422Z inductor/test_compile_subprocess.py::GPUTests::test_split_reduction_with_int64_size_cuda <- test/inductor/test_torchinductor.py PASSED [0.6999s] [ 45%] 2025-12-04T10:38:46.8991743Z inductor/test_compile_subprocess.py::GPUTests::test_split_with_unbacked_symints_cuda <- test/inductor/test_torchinductor.py W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8991885Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8992138Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8992260Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8992501Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8992707Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8992935Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8993059Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8993301Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8993465Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8993702Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8993837Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8994066Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8994221Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.8994453Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.8994631Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.8994860Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.8995025Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.8995248Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.8995419Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.8995650Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.8995833Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.8996074Z W1204 10:38:08.741000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.8996114Z PASSED [0.5063s] [ 47%] 2025-12-04T10:38:46.8996341Z inductor/test_compile_subprocess.py::GPUTests::test_sqrt_dynamic_shapes_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0018s] (sqrt dynamic shapes only supports cpu) [ 48%] 2025-12-04T10:38:46.8996501Z inductor/test_compile_subprocess.py::GPUTests::test_squeeze1_cuda <- test/inductor/test_torchinductor.py PASSED [0.4538s] [ 50%] 2025-12-04T10:38:46.8996661Z inductor/test_compile_subprocess.py::GPUTests::test_squeeze2_cuda <- test/inductor/test_torchinductor.py PASSED [0.3970s] [ 51%] 2025-12-04T10:38:46.8996848Z inductor/test_compile_subprocess.py::GPUTests::test_squeeze_varargs_cuda <- test/inductor/test_torchinductor.py PASSED [0.4796s] [ 52%] 2025-12-04T10:38:46.8997001Z inductor/test_compile_subprocess.py::GPUTests::test_stack_cuda <- test/inductor/test_torchinductor.py PASSED [0.6075s] [ 54%] 2025-12-04T10:38:46.8997288Z inductor/test_compile_subprocess.py::GPUTests::test_std_cuda <- test/inductor/test_torchinductor.py W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.8997430Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.8997684Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.8997806Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.8998044Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.8998258Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.8998484Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.8998606Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.8998882Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.8999047Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.8999282Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.8999416Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.8999651Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.8999806Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.9000039Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.9000195Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.9000420Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.9000586Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.9000813Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.9001012Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.9001245Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.9001430Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.9001693Z W1204 10:38:11.678000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.9001737Z PASSED [1.0077s] [ 55%] 2025-12-04T10:38:46.9001908Z inductor/test_compile_subprocess.py::GPUTests::test_strided_inputs_cuda <- test/inductor/test_torchinductor.py PASSED [0.2640s] [ 56%] 2025-12-04T10:38:46.9002060Z inductor/test_compile_subprocess.py::GPUTests::test_sum2_cuda <- test/inductor/test_torchinductor.py PASSED [1.5991s] [ 58%] 2025-12-04T10:38:46.9002210Z inductor/test_compile_subprocess.py::GPUTests::test_sum3_cuda <- test/inductor/test_torchinductor.py PASSED [0.6401s] [ 59%] 2025-12-04T10:38:46.9002358Z inductor/test_compile_subprocess.py::GPUTests::test_sum4_cuda <- test/inductor/test_torchinductor.py PASSED [0.8799s] [ 61%] 2025-12-04T10:38:46.9002505Z inductor/test_compile_subprocess.py::GPUTests::test_sum5_cuda <- test/inductor/test_torchinductor.py PASSED [1.3060s] [ 62%] 2025-12-04T10:38:46.9002665Z inductor/test_compile_subprocess.py::GPUTests::test_sum_keepdims_cuda <- test/inductor/test_torchinductor.py PASSED [0.4161s] [ 63%] 2025-12-04T10:38:46.9002846Z inductor/test_compile_subprocess.py::GPUTests::test_tanh_cuda <- test/inductor/test_torchinductor.py PASSED [0.5912s] [ 65%] 2025-12-04T10:38:46.9003184Z inductor/test_compile_subprocess.py::GPUTests::test_tmp_not_defined_issue1_use_block_ptr_True_cuda <- test/inductor/test_torchinductor.py W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.9003325Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.9003583Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.9003705Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.9003947Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.9004121Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.9004351Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.9004473Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.9004711Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.9004876Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.9005136Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.9005271Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.9005500Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.9005657Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.9005886Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.9006046Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.9006274Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.9006438Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.9006661Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.9006854Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.9007087Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.9007272Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.9007534Z W1204 10:38:18.157000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.9007574Z PASSED [0.5230s] [ 66%] 2025-12-04T10:38:46.9007892Z inductor/test_compile_subprocess.py::GPUTests::test_tmp_not_defined_issue3_cuda <- test/inductor/test_torchinductor.py W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.9008033Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.9008324Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.9008441Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.9008684Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.9008854Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.9009085Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.9009251Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.9009489Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.9009654Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.9009888Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.9010024Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.9010254Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.9010410Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.9010638Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.9010794Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.9011051Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.9011216Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.9011440Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.9011608Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.9011838Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.9012024Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.9012270Z W1204 10:38:19.825000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.iota.default 2025-12-04T10:38:46.9012311Z PASSED [3.3521s] [ 68%] 2025-12-04T10:38:46.9012607Z inductor/test_compile_subprocess.py::GPUTests::test_to_dtype_cuda <- test/inductor/test_torchinductor.py W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.9012747Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.9012998Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.9013119Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.9013379Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.9013551Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.9013775Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.9013899Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.9014140Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.9014306Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.9014547Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.9014682Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.9014909Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.9015088Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.9015318Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.9015472Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.9015698Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.9015860Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.9016085Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.9016257Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.9016493Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.9016676Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.9016941Z W1204 10:38:21.766000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.9017094Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.9018458Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.9018713Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.9018830Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.9019070Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.9019241Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.9019468Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.9019591Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.9019831Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.9019995Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.9020232Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.9020398Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.9020627Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.9020782Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.9021011Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.9021167Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.9021394Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.9021559Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.9021784Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.9021951Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.9022186Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.9022370Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.9022657Z W1204 10:38:21.935000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.9022698Z PASSED [0.3681s] [ 69%] 2025-12-04T10:38:46.9022904Z inductor/test_compile_subprocess.py::GPUTests::test_triton_argmin_argmax_transpose_logical_index_cuda <- test/inductor/test_torchinductor.py PASSED [2.1703s] [ 70%] 2025-12-04T10:38:46.9023207Z inductor/test_compile_subprocess.py::GPUTests::test_uint4x2_mixed_mm_cuda <- test/inductor/test_torchinductor.py W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.9023349Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.9023607Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.9023723Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.9023961Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.9024131Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.9024359Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.9024514Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.9024753Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.9024915Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.9025148Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.9025284Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.9025513Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.9025672Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.9025900Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.9026055Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.9026281Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.9026451Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.9026696Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.9026866Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.9027098Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.9027279Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.9027544Z W1204 10:38:24.329000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.9027696Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.9027837Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] Traceback (most recent call last): 2025-12-04T10:38:46.9028121Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.9028239Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] ).serialize() 2025-12-04T10:38:46.9028476Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.9028676Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.9028903Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.9029024Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] pickler.dump(obj) 2025-12-04T10:38:46.9029263Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.9029425Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.9029661Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.9029796Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] cls(obj, pickler.options), 2025-12-04T10:38:46.9030025Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.9030179Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.9030405Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.9030560Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.9030813Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.9030979Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.9031203Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.9031374Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.9031605Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.9031791Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.9032054Z W1204 10:38:24.635000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.9032094Z PASSED [0.5961s] [ 72%] 2025-12-04T10:38:46.9032278Z inductor/test_compile_subprocess.py::GPUTests::test_unbacked_floordiv_simplify_cuda <- test/inductor/test_torchinductor.py PASSED [0.9079s] [ 73%] 2025-12-04T10:38:46.9032465Z inductor/test_compile_subprocess.py::GPUTests::test_unbacked_floordiv_simplify_errors_cuda <- test/inductor/test_torchinductor.py PASSED [0.0131s] [ 75%] 2025-12-04T10:38:46.9032658Z inductor/test_compile_subprocess.py::GPUTests::test_unroll_small_reduction_cuda <- test/inductor/test_torchinductor.py PASSED [2.1181s] [ 76%] 2025-12-04T10:38:46.9032829Z inductor/test_compile_subprocess.py::GPUTests::test_unspec_inputs_float16_cuda <- test/inductor/test_torchinductor.py PASSED [0.4519s] [ 77%] 2025-12-04T10:38:46.9033001Z inductor/test_compile_subprocess.py::GPUTests::test_unspec_inputs_int32_cuda <- test/inductor/test_torchinductor.py PASSED [0.3841s] [ 79%] 2025-12-04T10:38:46.9033168Z inductor/test_compile_subprocess.py::GPUTests::test_unspec_inputs_int8_cuda <- test/inductor/test_torchinductor.py PASSED [0.4446s] [ 80%] 2025-12-04T10:38:46.9033350Z inductor/test_compile_subprocess.py::GPUTests::test_upsample_nearest2d_backward_cuda <- test/inductor/test_torchinductor.py PASSED [1.5347s] [ 81%] 2025-12-04T10:38:46.9033511Z inductor/test_compile_subprocess.py::GPUTests::test_var_correction_cuda <- test/inductor/test_torchinductor.py PASSED [0.9064s] [ 83%] 2025-12-04T10:38:46.9033674Z inductor/test_compile_subprocess.py::GPUTests::test_var_mean_div_by_cuda <- test/inductor/test_torchinductor.py PASSED [0.4038s] [ 84%] 2025-12-04T10:38:46.9033857Z inductor/test_compile_subprocess.py::GPUTests::test_var_mean_tile_reduction_False_cuda <- test/inductor/test_torchinductor.py PASSED [0.5446s] [ 86%] 2025-12-04T10:38:46.9034430Z inductor/test_compile_subprocess.py::GPUTests::test_var_mean_tile_reduction_True_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0009s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/165879 for platform(s) inductor, rocm. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 87%] 2025-12-04T10:38:46.9034599Z inductor/test_compile_subprocess.py::GPUTests::test_vertical_fusion1_cuda <- test/inductor/test_torchinductor.py PASSED [0.7349s] [ 88%] 2025-12-04T10:38:46.9034763Z inductor/test_compile_subprocess.py::GPUTests::test_view_as_complex_cuda <- test/inductor/test_torchinductor.py PASSED [0.6651s] [ 90%] 2025-12-04T10:38:46.9034940Z inductor/test_compile_subprocess.py::GPUTests::test_views2_cuda <- test/inductor/test_torchinductor.py PASSED [2.3453s] [ 91%] 2025-12-04T10:38:46.9035242Z inductor/test_compile_subprocess.py::GPUTests::test_weight_norm_bwd_cuda <- test/inductor/test_torchinductor.py W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.9035386Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] Traceback (most recent call last): 2025-12-04T10:38:46.9035644Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.9035766Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] ).serialize() 2025-12-04T10:38:46.9036008Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.9036181Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.9036408Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.9036535Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] pickler.dump(obj) 2025-12-04T10:38:46.9036774Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.9036963Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.9037202Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.9037341Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] cls(obj, pickler.options), 2025-12-04T10:38:46.9037575Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.9037736Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.9037970Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.9038165Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.9038392Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.9038557Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.9038782Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.9038986Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.9039220Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.9039405Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.9039671Z W1204 10:38:37.546000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.9039825Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.9039967Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] Traceback (most recent call last): 2025-12-04T10:38:46.9040219Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.9040342Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] ).serialize() 2025-12-04T10:38:46.9040581Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.9040753Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.9041013Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.9041138Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] pickler.dump(obj) 2025-12-04T10:38:46.9041380Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.9041548Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.9041785Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.9041922Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] cls(obj, pickler.options), 2025-12-04T10:38:46.9042153Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.9042308Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.9042538Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.9042693Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.9042920Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.9043108Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.9043335Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.9043506Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.9043738Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.9043927Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.9044192Z W1204 10:38:37.794000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] [0/0_1] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.9044232Z PASSED [1.7871s] [ 93%] 2025-12-04T10:38:46.9044422Z inductor/test_compile_subprocess.py::GPUTests::test_weight_norm_conv2d_cuda <- test/inductor/test_torchinductor.py SKIPPED [0.0002s] (Skipped!) [ 94%] 2025-12-04T10:38:46.9044720Z inductor/test_compile_subprocess.py::GPUTests::test_where_broadcast_cuda <- test/inductor/test_torchinductor.py W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] Unable to pickle input graph or example inputs 2025-12-04T10:38:46.9044856Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] Traceback (most recent call last): 2025-12-04T10:38:46.9045129Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 486, in serialize_compile 2025-12-04T10:38:46.9045243Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] ).serialize() 2025-12-04T10:38:46.9045478Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_inductor/compile_fx_ext.py", line 210, in serialize 2025-12-04T10:38:46.9045643Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] return _WireProtocolPickledInput(GraphPickler.dumps(self)) 2025-12-04T10:38:46.9045870Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 127, in dumps 2025-12-04T10:38:46.9045990Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] pickler.dump(obj) 2025-12-04T10:38:46.9046228Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 94, in reducer_override 2025-12-04T10:38:46.9046387Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] return _GraphModulePickleData.reduce_helper(self, obj) 2025-12-04T10:38:46.9046619Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 352, in reduce_helper 2025-12-04T10:38:46.9046747Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] cls(obj, pickler.options), 2025-12-04T10:38:46.9046974Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 364, in __init__ 2025-12-04T10:38:46.9047153Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] self.graph = _GraphPickleData(gm._graph, options) 2025-12-04T10:38:46.9047379Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 571, in __init__ 2025-12-04T10:38:46.9047531Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] nodes[node] = _NodePickleData(node, nodes, options) 2025-12-04T10:38:46.9047757Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 387, in __init__ 2025-12-04T10:38:46.9047919Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] self.target = _OpPickleData.pickle(node.target, options) 2025-12-04T10:38:46.9048180Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 435, in pickle 2025-12-04T10:38:46.9048346Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] return cls._pickle_op(name, _OpOverloadPickleData, options) 2025-12-04T10:38:46.9048573Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/fx/_graph_pickler.py", line 456, in _pickle_op 2025-12-04T10:38:46.9048755Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] raise BypassFxGraphCache(f"Unable to pickle non-standard op: {name}") 2025-12-04T10:38:46.9049013Z W1204 10:38:38.691000 431225 site-packages/torch/_inductor/compile_fx_ext.py:493] torch._inductor.codecache.BypassFxGraphCache: Unable to pickle non-standard op: torch.ops.prims.convert_element_type.default 2025-12-04T10:38:46.9049086Z PASSED [0.9210s] [ 95%] 2025-12-04T10:38:46.9049259Z inductor/test_compile_subprocess.py::GPUTests::test_where_with_logical_op_cuda <- test/inductor/test_torchinductor.py PASSED [0.8332s] [ 97%] 2025-12-04T10:38:46.9049433Z inductor/test_compile_subprocess.py::GPUTests::test_xblock_divides_xnumel_cuda <- test/inductor/test_torchinductor.py PASSED [1.0249s] [ 98%] 2025-12-04T10:38:46.9049601Z inductor/test_compile_subprocess.py::GPUTests::test_zero_dim_reductions_cuda <- test/inductor/test_torchinductor.py PASSED [0.2665s] [100%] 2025-12-04T10:38:46.9049604Z 2025-12-04T10:38:46.9049838Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_compile_subprocess/inductor.test_compile_subprocess-af8fb06866c88361.xml - 2025-12-04T10:38:46.9049919Z =========== 67 passed, 5 skipped, 225 deselected in 88.55s (0:01:28) =========== 2025-12-04T10:38:46.9050109Z The following tests failed and then succeeded when run in a new process['test/inductor/test_compile_subprocess.py::GPUTests::test_remove_noop_slice_cuda'] 2025-12-04T10:38:46.9050111Z 2025-12-04T10:38:46.9050298Z FINISHED PRINTING LOG FILE of inductor/test_compile_subprocess 3/3 (test/test-reports/inductor.test_compile_subprocess_3.3_8d9c6bb3e356ae25_.log) 2025-12-04T10:38:46.9050300Z 2025-12-04T10:38:46.9050425Z Finished inductor/test_compile_subprocess 3/3 ... [2025-12-04 10:38:46.780956][4969509.277140132], took 9.74min 2025-12-04T10:38:46.9050655Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:38:46.9050744Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:38:46.9050845Z Running inductor/test_multi_kernel 1/1 ... [2025-12-04 10:38:46.798037][4969509.294223648] 2025-12-04T10:38:46.9050894Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:38:46.9051232Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_multi_kernel.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:38:46.798499] 2025-12-04T10:39:07.3483175Z 2025-12-04T10:39:07.3484598Z inductor/test_multi_kernel 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_multi_kernel_1.1_0468563e79beeb73_.log 2025-12-04T10:39:07.3493272Z Running 19 items in this shard: test/inductor/test_multi_kernel.py::MultiKernelTest::test_batchnorm_training, test/inductor/test_multi_kernel.py::MultiKernelTest::test_inplace_update, test/inductor/test_multi_kernel.py::MultiKernelTest::test_layernorm, test/inductor/test_multi_kernel.py::MultiKernelTest::test_pass_same_arg_multi_times, test/inductor/test_multi_kernel.py::MultiKernelTest::test_reduction_scratch_buffer, test/inductor/test_multi_kernel.py::MultiKernelTest::test_reduction_scratch_buffer_cpp_wrapper, test/inductor/test_multi_kernel.py::MultiKernelTest::test_reduction_scratch_buffer_cpp_wrapper_non_persistent_reduction, test/inductor/test_multi_kernel.py::MultiKernelTest::test_reduction_scratch_buffer_cpp_wrapper_persistent_reduction, test/inductor/test_multi_kernel.py::MultiKernelTest::test_softmax, test/inductor/test_multi_kernel.py::MultiKernelTest::test_softmax_cpp_wrapper, test/inductor/test_multi_kernel.py::MultiKernelTest::test_softmax_force_non_persistent_reduction_force_kernel_0, test/inductor/test_multi_kernel.py::MultiKernelTest::test_softmax_force_non_persistent_reduction_force_kernel_1, test/inductor/test_multi_kernel.py::MultiKernelTest::test_softmax_warn_mixed_layout, test/inductor/test_multi_kernel.py::MultiKernelTest::test_sort_disables_multi_kernel, test/inductor/test_multi_kernel.py::MultiKernelTest::test_split_scan, test/inductor/test_multi_kernel.py::MultiKernelTest::test_transformer_snippet, test/inductor/test_multi_kernel.py::MultiKernelTest::test_transformer_snippet_with_fallback_random, test/inductor/test_multi_kernel.py::MultiKernelTest::test_triton_gemm, test/inductor/test_multi_kernel.py::MultiKernelTest::test_triton_relu_fused_gemm 2025-12-04T10:39:07.3502136Z 2025-12-04T10:39:07.3502547Z Finished inductor/test_multi_kernel 1/1 ... [2025-12-04 10:39:07.347975][4969529.844158772], took 0.34min 2025-12-04T10:39:07.3503840Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:39:07.3649664Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:39:07.3655249Z Running inductor/test_analysis 1/1 ... [2025-12-04 10:39:07.365203][4969529.861389365] 2025-12-04T10:39:07.3655894Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:39:07.3659633Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_analysis.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:39:07.365714] 2025-12-04T10:39:13.5405337Z 2025-12-04T10:39:13.5406869Z inductor/test_analysis 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_analysis_1.1_19a6de530984529e_.log 2025-12-04T10:39:13.5420590Z Running 28 items in this shard: test/inductor/test_analysis.py::TestUtils::test_tabulate2d, test/inductor/test_analysis.py::TestUtils::test_zip_dicts, test/inductor/test_analysis.py::TestAnalysisCUDA::test_augment_trace_against_flop_counter_maxat0_cuda_float16, test/inductor/test_analysis.py::TestAnalysisCUDA::test_augment_trace_against_flop_counter_maxat0_cuda_float32, test/inductor/test_analysis.py::TestAnalysisCUDA::test_augment_trace_against_flop_counter_maxat1_cuda_float16, test/inductor/test_analysis.py::TestAnalysisCUDA::test_augment_trace_against_flop_counter_maxat1_cuda_float32, test/inductor/test_analysis.py::TestAnalysisCUDA::test_augment_trace_against_flop_counter_maxat2_cuda_float16, test/inductor/test_analysis.py::TestAnalysisCUDA::test_augment_trace_against_flop_counter_maxat2_cuda_float32, test/inductor/test_analysis.py::TestAnalysisCUDA::test_augment_trace_against_flop_counter_maxat3_cuda_float16, test/inductor/test_analysis.py::TestAnalysisCUDA::test_augment_trace_against_flop_counter_maxat3_cuda_float32, test/inductor/test_analysis.py::TestAnalysisCUDA::test_augment_trace_helper_unit_cuda, test/inductor/test_analysis.py::TestAnalysisCUDA::test_combine_profiles_cuda_float16, test/inductor/test_analysis.py::TestAnalysisCUDA::test_combine_profiles_cuda_float32, test/inductor/test_analysis.py::TestAnalysisCUDA::test_diff_cuda_float16, test/inductor/test_analysis.py::TestAnalysisCUDA::test_diff_cuda_float32, test/inductor/test_analysis.py::TestAnalysisCUDA::test_diff_cuda_float64, test/inductor/test_analysis.py::TestAnalysisCUDA::test_noop_cuda, test/inductor/test_analysis.py::TestAnalysisCUDA::test_pointwise_bandwidth_maxat0_cuda_float16, test/inductor/test_analysis.py::TestAnalysisCUDA::test_pointwise_bandwidth_maxat0_cuda_float32, test/inductor/test_analysis.py::TestAnalysisCUDA::test_pointwise_bandwidth_maxat1_cuda_float16, test/inductor/test_analysis.py::TestAnalysisCUDA::test_pointwise_bandwidth_maxat1_cuda_float32, test/inductor/test_analysis.py::TestAnalysisCUDA::test_pointwise_bandwidth_maxat2_cuda_float16, test/inductor/test_analysis.py::TestAnalysisCUDA::test_pointwise_bandwidth_maxat2_cuda_float32, test/inductor/test_analysis.py::TestAnalysisCUDA::test_pointwise_bandwidth_maxat3_cuda_float16, test/inductor/test_analysis.py::TestAnalysisCUDA::test_pointwise_bandwidth_maxat3_cuda_float32, test/inductor/test_analysis.py::TestAnalysisCUDA::test_triton_has_metadata_maxat0_cuda_float16, test/inductor/test_analysis.py::TestAnalysisCUDA::test_triton_has_metadata_maxat0_cuda_float32, test/inductor/test_analysis.py::TestAnalysisCUDA::test_triton_has_metadata_maxat0_cuda_float64 2025-12-04T10:39:13.5432933Z 2025-12-04T10:39:13.5433318Z Finished inductor/test_analysis 1/1 ... [2025-12-04 10:39:13.540262][4969536.036446637], took 0.10min 2025-12-04T10:39:13.5434634Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:39:13.5572235Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:39:13.5577769Z Running inductor/test_pad_mm 1/1 ... [2025-12-04 10:39:13.557450][4969536.053636182] 2025-12-04T10:39:13.5578459Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:39:13.5581816Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_pad_mm.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:39:13.557940] 2025-12-04T10:39:51.7953489Z 2025-12-04T10:39:51.7955045Z inductor/test_pad_mm 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_pad_mm_1.1_482da2915636e459_.log 2025-12-04T10:39:51.7961815Z Running 19 items in this shard: test/inductor/test_pad_mm.py::PadMMTest::test_cat_pad_mm_dyn_m, test/inductor/test_pad_mm.py::PadMMTest::test_exclude_cat_padding, test/inductor/test_pad_mm.py::PadMMTest::test_exclude_padding, test/inductor/test_pad_mm.py::PadMMTest::test_no_autocast_in_pad_bmm_joint_graph_pass, test/inductor/test_pad_mm.py::PadMMTest::test_original_aten_preserved_pad_mm, test/inductor/test_pad_mm.py::PadMMTest::test_pad_addmm_2d_bias, test/inductor/test_pad_mm.py::PadMMTest::test_pad_addmm_dyn_m, test/inductor/test_pad_mm.py::PadMMTest::test_pad_addmm_dyn_mn, test/inductor/test_pad_mm.py::PadMMTest::test_pad_batch, test/inductor/test_pad_mm.py::PadMMTest::test_pad_bmm_dyn_b, test/inductor/test_pad_mm.py::PadMMTest::test_pad_bmm_dyn_bm, test/inductor/test_pad_mm.py::PadMMTest::test_pad_bmm_dyn_k, test/inductor/test_pad_mm.py::PadMMTest::test_pad_mm_bf16, test/inductor/test_pad_mm.py::PadMMTest::test_pad_mm_dyn_k, test/inductor/test_pad_mm.py::PadMMTest::test_pad_mm_dyn_m, test/inductor/test_pad_mm.py::PadMMTest::test_pad_mm_dyn_mnk, test/inductor/test_pad_mm.py::PadMMTest::test_pad_mm_dyn_n, test/inductor/test_pad_mm.py::PadMMTest::test_pad_single_cat, test/inductor/test_pad_mm.py::PadMMTest::test_zero_dim 2025-12-04T10:39:51.7967240Z 2025-12-04T10:39:51.7967628Z Finished inductor/test_pad_mm 1/1 ... [2025-12-04 10:39:51.794768][4969574.290956839], took 0.64min 2025-12-04T10:39:51.7968980Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:39:51.8010429Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:39:51.8012447Z Running inductor/test_triton_syntax 1/1 ... [2025-12-04 10:39:51.801037][4969574.297227125] 2025-12-04T10:39:51.8013086Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:39:51.8014413Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_triton_syntax.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:39:51.801234] 2025-12-04T10:40:07.6439245Z 2025-12-04T10:40:07.6440752Z inductor/test_triton_syntax 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_triton_syntax_1.1_1e39380a29d2c169_.log 2025-12-04T10:40:07.6442170Z Running 1 items in this shard: test/inductor/test_triton_syntax.py::TestTritonSyntacticallyValid::test_triton_sqrt 2025-12-04T10:40:07.6442777Z 2025-12-04T10:40:07.6443182Z Finished inductor/test_triton_syntax 1/1 ... [2025-12-04 10:40:07.643454][4969590.139639201], took 0.26min 2025-12-04T10:40:07.6449525Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:40:07.6603326Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:40:07.6611680Z Running test_sparse_semi_structured 1/1 ... [2025-12-04 10:40:07.660669][4969590.156854565] 2025-12-04T10:40:07.6612385Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:40:07.6614428Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_sparse_semi_structured.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:40:07.661142] 2025-12-04T10:40:12.9853818Z 2025-12-04T10:40:12.9855305Z test_sparse_semi_structured 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_sparse_semi_structured_1.1_32543d081f92b663_.log 2025-12-04T10:40:12.9875863Z Running 42 items in this shard: test/test_sparse_semi_structured.py::SparseSemiStructuredTensorCompileTest::test_mlp_contiguous_relu_compile_cusparselt, test/test_sparse_semi_structured.py::SparseSemiStructuredTensorCompileTest::test_mlp_contiguous_relu_compile_cutlass, test/test_sparse_semi_structured.py::SparseSemiStructuredTensorCompileTest::test_sp24_compile, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_indices, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_linear, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_min_sparse_shape, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_mlp, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_mm_sparse_first_NN, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_mm_sparse_first_NT, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_mm_sparse_first_TN, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_mm_sparse_second_NN, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_mm_sparse_second_NT, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_to_sparse_semi_structured, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_unsupported_dim, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_unsupported_dtype, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_unsupported_shape, test/test_sparse_semi_structured.py::TestSparseSemiStructured::test_values, test/test_sparse_semi_structured.py::TestSparseSemiStructuredTraining::test_gemm, test/test_sparse_semi_structured.py::TestSparseSemiStructuredTraining::test_pack_both_ways_edge_case1, test/test_sparse_semi_structured.py::TestSparseSemiStructuredTraining::test_pack_both_ways_id, test/test_sparse_semi_structured.py::TestSparseSemiStructuredTraining::test_pack_both_ways_meta_correctness, test/test_sparse_semi_structured.py::TestSparseSemiStructuredTraining::test_prune_dense_static_sort, test/test_sparse_semi_structured.py::TestSparseSemiStructuredTraining::test_pruning_algo_largest_abs_values_greedy, test/test_sparse_semi_structured.py::TestSparseSemiStructuredTraining::test_sp24_apply, test/test_sparse_semi_structured.py::TestSparseSemiStructuredTraining::test_sp24_apply_dense, test/test_sparse_semi_structured.py::TestSparseSemiStructuredTraining::test_sp24_matmuls, test/test_sparse_semi_structured.py::TestSparseSemiStructuredTraining::test_sp24_matmuls_bmm, test/test_sparse_semi_structured.py::TestSparseSemiStructuredTraining::test_sp24_matmuls_mat_vec, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUTLASS::test_conversions, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUTLASS::test_conversions_all_patterns, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUTLASS::test_linear_cutlass, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUTLASS::test_sparse_semi_structured_ops_cutlass, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUSPARSELT::test_cslt_sparse_mm_alpha, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUSPARSELT::test_cslt_sparse_mm_alpha_compile_autotune, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUSPARSELT::test_cslt_sparse_mm_alpha_mixed_dtype, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUSPARSELT::test_cslt_sparse_mm_mixed_dtype, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUSPARSELT::test_cslt_sparse_mm_search, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUSPARSELT::test_csrc_cslt_sparse_mm_search, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUSPARSELT::test_cusparselt_backend, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUSPARSELT::test_sparse_fp8fp8_mm, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUSPARSELT::test_sparse_semi_structured_scaled_mm, test/test_sparse_semi_structured.py::TestSparseSemiStructuredCUSPARSELT::test_sparse_semi_structured_scaled_mm_fp8 2025-12-04T10:40:12.9894940Z 2025-12-04T10:40:12.9895336Z Finished test_sparse_semi_structured 1/1 ... [2025-12-04 10:40:12.984960][4969595.481144752], took 0.09min 2025-12-04T10:40:12.9896628Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:40:13.0018679Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:40:13.0025467Z Running inductor/test_op_completeness 1/1 ... [2025-12-04 10:40:13.002170][4969595.498356116] 2025-12-04T10:40:13.0026138Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:40:13.0028683Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_op_completeness.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:40:13.002637] 2025-12-04T10:40:15.3215844Z 2025-12-04T10:40:15.3217351Z inductor/test_op_completeness 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_op_completeness_1.1_3c9591f185e33f4f_.log 2025-12-04T10:40:15.3221629Z Running 5 items in this shard: test/inductor/test_op_completeness.py::TestOpCompleteness::test_cpp_overrides, test/inductor/test_op_completeness.py::TestOpCompleteness::test_cpp_vec_overrides, test/inductor/test_op_completeness.py::TestOpCompleteness::test_halide_overrides, test/inductor/test_op_completeness.py::TestOpCompleteness::test_metal_overrides, test/inductor/test_op_completeness.py::TestOpCompleteness::test_triton_overrides 2025-12-04T10:40:15.3223733Z 2025-12-04T10:40:15.3224171Z Finished inductor/test_op_completeness 1/1 ... [2025-12-04 10:40:15.321146][4969597.817330456], took 0.04min 2025-12-04T10:40:15.3226776Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:40:15.3382187Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:40:15.3387073Z Running inductor/test_subgraph_choice 1/1 ... [2025-12-04 10:40:15.338423][4969597.834609608] 2025-12-04T10:40:15.3387760Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:40:15.3392388Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_subgraph_choice.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:40:15.338904] 2025-12-04T10:40:20.8155550Z 2025-12-04T10:40:20.8157205Z inductor/test_subgraph_choice 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_subgraph_choice_1.1_c782e30a68658e2d_.log 2025-12-04T10:40:20.8159289Z Running 2 items in this shard: test/inductor/test_subgraph_choice.py::TestSubgraphChoice::test_subgraph_decompose_k, test/inductor/test_subgraph_choice.py::TestSubgraphChoice::test_subgraph_freeze_layout 2025-12-04T10:40:20.8160301Z 2025-12-04T10:40:20.8160722Z Finished inductor/test_subgraph_choice 1/1 ... [2025-12-04 10:40:20.815009][4969603.311193385], took 0.09min 2025-12-04T10:40:20.8165509Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:40:20.8320033Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:40:20.8323843Z Running inductor/test_b2b_gemm 1/1 ... [2025-12-04 10:40:20.832172][4969603.328358759] 2025-12-04T10:40:20.8324462Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:40:20.8328820Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_b2b_gemm.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:40:20.832648] 2025-12-04T10:40:26.2069320Z 2025-12-04T10:40:26.2070723Z inductor/test_b2b_gemm 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_b2b_gemm_1.1_147c1277513ac35e_.log 2025-12-04T10:40:26.2071756Z Running 0 items in this shard: 2025-12-04T10:40:26.2072010Z 2025-12-04T10:40:26.2072379Z Finished inductor/test_b2b_gemm 1/1 ... [2025-12-04 10:40:26.206461][4969608.702644712], took 0.09min 2025-12-04T10:40:26.2080286Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:40:26.2233289Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:40:26.2238594Z Running inductor/test_triton_heuristics 1/1 ... [2025-12-04 10:40:26.223619][4969608.719805497] 2025-12-04T10:40:26.2239286Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:40:26.2244027Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_triton_heuristics.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:40:26.224132] 2025-12-04T10:40:41.5183162Z 2025-12-04T10:40:41.5184106Z inductor/test_triton_heuristics 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_triton_heuristics_1.1_5ab27cce500b5a2a_.log 2025-12-04T10:40:41.5191190Z Running 12 items in this shard: test/inductor/test_triton_heuristics.py::TestTritonHeuristics::test_amd_special_config_args, test/inductor/test_triton_heuristics.py::TestTritonHeuristics::test_artificial_grid_cpp_wrapper, test/inductor/test_triton_heuristics.py::TestTritonHeuristics::test_artificial_zgrid, test/inductor/test_triton_heuristics.py::TestTritonHeuristics::test_autotune_hints_to_configs, test/inductor/test_triton_heuristics.py::TestTritonHeuristics::test_pre_hook_assert, test/inductor/test_triton_heuristics.py::TestTritonHeuristics::test_prune_configs_over_shared_memory_limit_do_pruning_False, test/inductor/test_triton_heuristics.py::TestTritonHeuristics::test_prune_configs_over_shared_memory_limit_do_pruning_True, test/inductor/test_triton_heuristics.py::TestTritonHeuristics::test_template_function_ws, test/inductor/test_triton_heuristics.py::TestTritonHeuristics::test_triton_config, test/inductor/test_triton_heuristics.py::TestArgumentCloneAndRestore::test_clone_args_with_non_zero_offset, test/inductor/test_triton_heuristics.py::TestArgumentCloneAndRestore::test_clone_contiguous_args, test/inductor/test_triton_heuristics.py::TestArgumentCloneAndRestore::test_clone_non_contiguous_args 2025-12-04T10:40:41.5197041Z 2025-12-04T10:40:41.5197477Z Finished inductor/test_triton_heuristics 1/1 ... [2025-12-04 10:40:41.517977][4969624.014161898], took 0.25min 2025-12-04T10:40:41.5198931Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:40:41.5349031Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:40:41.5356901Z Running inductor/test_ck_backend 1/1 ... [2025-12-04 10:40:41.535326][4969624.031513079] 2025-12-04T10:40:41.5357805Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:40:41.5361086Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_ck_backend.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:40:41.535814] 2025-12-04T10:44:28.7471515Z 2025-12-04T10:44:28.7472959Z inductor/test_ck_backend 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_ck_backend_1.1_68619127a77bd768_.log 2025-12-04T10:44:28.7496294Z Running 34 items in this shard: test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_addmm_max_autotune_gemm_backends_ATen,Triton,CK_x_shape0, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_addmm_max_autotune_gemm_backends_ATen,Triton,CK_x_shape1, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_addmm_max_autotune_gemm_backends_ATen,Triton,CK_x_shape2, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_addmm_max_autotune_gemm_backends_CK_x_shape0, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_addmm_max_autotune_gemm_backends_CK_x_shape1, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_addmm_max_autotune_gemm_backends_CK_x_shape2, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_conv2d_max_autotune_conv_backends_ATEN,CK,TRITON, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_conv2d_max_autotune_conv_backends_CK, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_bmm_max_autotune_gemm_backends_ATen,Triton,CK, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_bmm_max_autotune_gemm_backends_CK, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_matmul_dynamic_max_autotune_gemm_backends_CK_autotune_in_subproc_True, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_matmul_max_autotune_gemm_backends_ATen,Triton,CK_autotune_in_subproc_False_use_aoti_False, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_matmul_max_autotune_gemm_backends_ATen,Triton,CK_autotune_in_subproc_False_use_aoti_True, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_matmul_max_autotune_gemm_backends_ATen,Triton,CK_autotune_in_subproc_True_use_aoti_False, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_matmul_max_autotune_gemm_backends_ATen,Triton,CK_autotune_in_subproc_True_use_aoti_True, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_matmul_max_autotune_gemm_backends_CKTILE_autotune_in_subproc_False_use_aoti_False, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_matmul_max_autotune_gemm_backends_CKTILE_autotune_in_subproc_False_use_aoti_True, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_matmul_max_autotune_gemm_backends_CKTILE_autotune_in_subproc_True_use_aoti_False, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_matmul_max_autotune_gemm_backends_CKTILE_autotune_in_subproc_True_use_aoti_True, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_matmul_max_autotune_gemm_backends_CK_autotune_in_subproc_False_use_aoti_False, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_matmul_max_autotune_gemm_backends_CK_autotune_in_subproc_False_use_aoti_True, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_matmul_max_autotune_gemm_backends_CK_autotune_in_subproc_True_use_aoti_False, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_matmul_max_autotune_gemm_backends_CK_autotune_in_subproc_True_use_aoti_True, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_non_contiguous_max_autotune_gemm_backends_Aten,CK, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_preselected_max_autotune_gemm_backends_ATen,Triton,CK, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_precompile_preselected_max_autotune_gemm_backends_CK, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_scaled_mm_max_autotune_gemm_backends_ATen,Triton,CK_quantize_type_rowwise_has_bias_False, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_scaled_mm_max_autotune_gemm_backends_ATen,Triton,CK_quantize_type_rowwise_has_bias_True, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_scaled_mm_max_autotune_gemm_backends_ATen,Triton,CK_quantize_type_tensorwise_has_bias_False, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_scaled_mm_max_autotune_gemm_backends_ATen,Triton,CK_quantize_type_tensorwise_has_bias_True, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_scaled_mm_max_autotune_gemm_backends_CK_quantize_type_rowwise_has_bias_False, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_scaled_mm_max_autotune_gemm_backends_CK_quantize_type_rowwise_has_bias_True, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_scaled_mm_max_autotune_gemm_backends_CK_quantize_type_tensorwise_has_bias_False, test/inductor/test_ck_backend.py::TestCKBackend::test_max_autotune_scaled_mm_max_autotune_gemm_backends_CK_quantize_type_tensorwise_has_bias_True 2025-12-04T10:44:28.7517514Z 2025-12-04T10:44:28.7517903Z Finished inductor/test_ck_backend 1/1 ... [2025-12-04 10:44:28.746672][4969851.242856804], took 3.79min 2025-12-04T10:44:28.7519263Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:44:28.7640876Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:44:28.7646446Z Running test_testing 1/1 ... [2025-12-04 10:44:28.764416][4969851.260601348] 2025-12-04T10:44:28.7646623Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:44:28.7650854Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_testing.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:44:28.764907] 2025-12-04T10:44:54.2324510Z 2025-12-04T10:44:54.2325455Z test_testing 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_testing_1.1_f73a7a8ca29fb98a_.log 2025-12-04T10:44:54.2991997Z Running 2074 items in this shard: test/test_testing.py::TestTestingCUDA::test_assertEqual_longMessage_cuda, test/test_testing.py::TestTestingCUDA::test_assertEqual_numpy_cuda_bool, test/test_testing.py::TestTestingCUDA::test_assertEqual_numpy_cuda_complex128, test/test_testing.py::TestTestingCUDA::test_assertEqual_numpy_cuda_complex64, test/test_testing.py::TestTestingCUDA::test_assertEqual_numpy_cuda_float16, test/test_testing.py::TestTestingCUDA::test_assertEqual_numpy_cuda_float32, test/test_testing.py::TestTestingCUDA::test_assertEqual_numpy_cuda_float64, test/test_testing.py::TestTestingCUDA::test_assertEqual_numpy_cuda_int16, test/test_testing.py::TestTestingCUDA::test_assertEqual_numpy_cuda_int32, test/test_testing.py::TestTestingCUDA::test_assertEqual_numpy_cuda_int64, test/test_testing.py::TestTestingCUDA::test_assertEqual_numpy_cuda_int8, test/test_testing.py::TestTestingCUDA::test_assertEqual_numpy_cuda_uint8, test/test_testing.py::TestTestingCUDA::test_cuda_assert_should_not_stop_common_distributed_test_suite_cuda, test/test_testing.py::TestTestingCUDA::test_cuda_assert_should_stop_common_device_type_test_suite_cuda, test/test_testing.py::TestTestingCUDA::test_cuda_assert_should_stop_common_utils_test_suite_cuda, test/test_testing.py::TestTestingCUDA::test_get_supported_dtypes_cuda, test/test_testing.py::TestTestingCUDA::test_isclose_atol_rtol_greater_than_zero_cuda_bool, test/test_testing.py::TestTestingCUDA::test_isclose_atol_rtol_greater_than_zero_cuda_float16, test/test_testing.py::TestTestingCUDA::test_isclose_atol_rtol_greater_than_zero_cuda_float32, test/test_testing.py::TestTestingCUDA::test_isclose_atol_rtol_greater_than_zero_cuda_float64, test/test_testing.py::TestTestingCUDA::test_isclose_atol_rtol_greater_than_zero_cuda_int16, test/test_testing.py::TestTestingCUDA::test_isclose_atol_rtol_greater_than_zero_cuda_int32, test/test_testing.py::TestTestingCUDA::test_isclose_atol_rtol_greater_than_zero_cuda_int64, test/test_testing.py::TestTestingCUDA::test_isclose_atol_rtol_greater_than_zero_cuda_int8, test/test_testing.py::TestTestingCUDA::test_isclose_atol_rtol_greater_than_zero_cuda_uint8, test/test_testing.py::TestTestingCUDA::test_isclose_bool_cuda, test/test_testing.py::TestTestingCUDA::test_isclose_complex_cuda_complex128, test/test_testing.py::TestTestingCUDA::test_isclose_complex_cuda_complex64, test/test_testing.py::TestTestingCUDA::test_isclose_equality_shortcut_cuda, test/test_testing.py::TestTestingCUDA::test_isclose_float_cuda_float16, test/test_testing.py::TestTestingCUDA::test_isclose_float_cuda_float32, test/test_testing.py::TestTestingCUDA::test_isclose_float_cuda_float64, test/test_testing.py::TestTestingCUDA::test_isclose_integer_cuda_int16, test/test_testing.py::TestTestingCUDA::test_isclose_integer_cuda_int32, test/test_testing.py::TestTestingCUDA::test_isclose_integer_cuda_int64, test/test_testing.py::TestTestingCUDA::test_isclose_integer_cuda_int8, test/test_testing.py::TestTestingCUDA::test_isclose_integer_cuda_uint8, test/test_testing.py::TestTestingCUDA::test_isclose_nan_equality_shortcut_cuda_complex128, test/test_testing.py::TestTestingCUDA::test_isclose_nan_equality_shortcut_cuda_complex64, test/test_testing.py::TestTestingCUDA::test_isclose_nan_equality_shortcut_cuda_float16, test/test_testing.py::TestTestingCUDA::test_isclose_nan_equality_shortcut_cuda_float32, test/test_testing.py::TestTestingCUDA::test_isclose_nan_equality_shortcut_cuda_float64, test/test_testing.py::TestTestingCUDA::test_setup_and_teardown_run_for_device_specific_tests_cuda, test/test_testing.py::TestTestingCUDA::test_supported_dtypes_abs_cuda, test/test_testing.py::TestFrameworkUtils::test_filtering_env_var, test/test_testing.py::TestAssertClose::test_bool, test/test_testing.py::TestAssertClose::test_default_tolerance_selection_mismatching_dtypes, test/test_testing.py::TestAssertClose::test_docstring_examples, test/test_testing.py::TestAssertClose::test_matching, test/test_testing.py::TestAssertClose::test_matching_atol, test/test_testing.py::TestAssertClose::test_matching_conjugate_bit, test/test_testing.py::TestAssertClose::test_matching_nan, test/test_testing.py::TestAssertClose::test_matching_nan_with_equal_nan, test/test_testing.py::TestAssertClose::test_matching_rtol, test/test_testing.py::TestAssertClose::test_meta, test/test_testing.py::TestAssertClose::test_mismatching_dtype, test/test_testing.py::TestAssertClose::test_mismatching_dtype_no_check, test/test_testing.py::TestAssertClose::test_mismatching_layout, test/test_testing.py::TestAssertClose::test_mismatching_layout_no_check, test/test_testing.py::TestAssertClose::test_mismatching_shape, test/test_testing.py::TestAssertClose::test_mismatching_stride, test/test_testing.py::TestAssertClose::test_mismatching_stride_no_check, test/test_testing.py::TestAssertClose::test_mismatching_types, test/test_testing.py::TestAssertClose::test_mismatching_types_subclasses, test/test_testing.py::TestAssertClose::test_mismatching_types_type_equality, test/test_testing.py::TestAssertClose::test_mismatching_values, test/test_testing.py::TestAssertClose::test_mismatching_values_atol, test/test_testing.py::TestAssertClose::test_mismatching_values_rtol, test/test_testing.py::TestAssertClose::test_none, test/test_testing.py::TestAssertClose::test_none_mismatch, test/test_testing.py::TestAssertClose::test_numpy, test/test_testing.py::TestAssertClose::test_only_atol, test/test_testing.py::TestAssertClose::test_only_rtol, test/test_testing.py::TestAssertClose::test_scalar, test/test_testing.py::TestAssertClose::test_unexpected_error_compare, test/test_testing.py::TestAssertClose::test_unexpected_error_originate, test/test_testing.py::TestAssertClose::test_unknown_layout, test/test_testing.py::TestAssertClose::test_unknown_type, test/test_testing.py::TestAssertCloseMultiDeviceCUDA::test_mismatching_device_cuda, test/test_testing.py::TestAssertCloseMultiDeviceCUDA::test_mismatching_device_no_check_cuda, test/test_testing.py::TestAssertCloseErrorMessage::test_abs_diff, test/test_testing.py::TestAssertCloseErrorMessage::test_abs_diff_scalar, test/test_testing.py::TestAssertCloseErrorMessage::test_atol, test/test_testing.py::TestAssertCloseErrorMessage::test_identifier_scalars, test/test_testing.py::TestAssertCloseErrorMessage::test_identifier_tensor_likes, test/test_testing.py::TestAssertCloseErrorMessage::test_mismatched_elements, test/test_testing.py::TestAssertCloseErrorMessage::test_msg_callable, test/test_testing.py::TestAssertCloseErrorMessage::test_msg_str, test/test_testing.py::TestAssertCloseErrorMessage::test_not_close, test/test_testing.py::TestAssertCloseErrorMessage::test_not_equal, test/test_testing.py::TestAssertCloseErrorMessage::test_rel_diff, test/test_testing.py::TestAssertCloseErrorMessage::test_rel_diff_scalar, test/test_testing.py::TestAssertCloseErrorMessage::test_rtol, test/test_testing.py::TestAssertCloseErrorMessage::test_small_float_dtype, test/test_testing.py::TestAssertCloseErrorMessage::test_zero_div_zero, test/test_testing.py::TestAssertCloseContainer::test_mapping_mismatching_keys, test/test_testing.py::TestAssertCloseContainer::test_mapping_mismatching_values_msg, test/test_testing.py::TestAssertCloseContainer::test_sequence_mismatching_len, test/test_testing.py::TestAssertCloseContainer::test_sequence_mismatching_values_msg, test/test_testing.py::TestAssertCloseSparseCOO::test_matching_coalesced, test/test_testing.py::TestAssertCloseSparseCOO::test_matching_uncoalesced, test/test_testing.py::TestAssertCloseSparseCOO::test_mismatching_indices_msg, test/test_testing.py::TestAssertCloseSparseCOO::test_mismatching_nnz, test/test_testing.py::TestAssertCloseSparseCOO::test_mismatching_sparse_dims, test/test_testing.py::TestAssertCloseSparseCOO::test_mismatching_values_msg, test/test_testing.py::TestAssertCloseSparseCSR::test_matching, test/test_testing.py::TestAssertCloseSparseCSR::test_mismatching_col_indices_msg, test/test_testing.py::TestAssertCloseSparseCSR::test_mismatching_crow_indices_msg, test/test_testing.py::TestAssertCloseSparseCSR::test_mismatching_values_msg, test/test_testing.py::TestAssertCloseSparseCSC::test_matching, test/test_testing.py::TestAssertCloseSparseCSC::test_mismatching_ccol_indices_msg, test/test_testing.py::TestAssertCloseSparseCSC::test_mismatching_row_indices_msg, test/test_testing.py::TestAssertCloseSparseCSC::test_mismatching_values_msg, test/test_testing.py::TestAssertCloseSparseBSR::test_matching, test/test_testing.py::TestAssertCloseSparseBSR::test_mismatching_col_indices_msg, test/test_testing.py::TestAssertCloseSparseBSR::test_mismatching_crow_indices_msg, test/test_testing.py::TestAssertCloseSparseBSR::test_mismatching_values_msg, test/test_testing.py::TestAssertCloseSparseBSC::test_matching, test/test_testing.py::TestAssertCloseSparseBSC::test_mismatching_ccol_indices_msg, test/test_testing.py::TestAssertCloseSparseBSC::test_mismatching_row_indices_msg, test/test_testing.py::TestAssertCloseSparseBSC::test_mismatching_values_msg, test/test_testing.py::TestAssertCloseQuantized::test_matching_per_channel, test/test_testing.py::TestAssertCloseQuantized::test_matching_per_tensor, test/test_testing.py::TestAssertCloseQuantized::test_mismatching_is_quantized, test/test_testing.py::TestAssertCloseQuantized::test_mismatching_qscheme, test/test_testing.py::TestMakeTensorCUDA::test_exclude_zero_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_exclude_zero_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_exclude_zero_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_exclude_zero_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_exclude_zero_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_exclude_zero_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_exclude_zero_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_exclude_zero_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_exclude_zero_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_exclude_zero_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_exclude_zero_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_exclude_zero_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_exclude_zero_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types0_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types0_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types0_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types0_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types0_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types0_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types0_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types0_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types0_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types0_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types0_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types0_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types0_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types1_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types1_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types1_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types1_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types1_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types1_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types1_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types1_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types1_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types1_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types1_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types1_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types1_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types2_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types2_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types2_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types2_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types2_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types2_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types2_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types2_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types2_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types2_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types2_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types2_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types2_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types3_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types3_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types3_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types3_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types3_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types3_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types3_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types3_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types3_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types3_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types3_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types3_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high0_value_types3_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types0_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types0_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types0_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types0_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types0_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types0_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types0_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types0_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types0_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types0_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types0_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types0_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types0_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types1_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types1_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types1_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types1_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types1_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types1_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types1_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types1_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types1_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types1_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types1_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types1_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types1_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types2_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types2_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types2_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types2_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types2_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types2_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types2_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types2_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types2_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types2_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types2_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types2_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types2_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types3_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types3_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types3_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types3_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types3_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types3_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types3_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types3_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types3_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types3_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types3_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types3_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high1_value_types3_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types0_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types0_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types0_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types0_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types0_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types0_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types0_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types0_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types0_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types0_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types0_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types0_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types0_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types1_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types1_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types1_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types1_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types1_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types1_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types1_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types1_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types1_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types1_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types1_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types1_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types1_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types2_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types2_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types2_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types2_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types2_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types2_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types2_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types2_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types2_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types2_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types2_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types2_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types2_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types3_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types3_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types3_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types3_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types3_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types3_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types3_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types3_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types3_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types3_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types3_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types3_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_ge_high_low_high2_value_types3_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_boolean_integral1_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_high_boolean_integral1_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_boolean_integral1_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_boolean_integral1_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_boolean_integral1_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_boolean_integral1_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_boolean_integral2_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_high_boolean_integral2_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_boolean_integral2_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_boolean_integral2_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_boolean_integral2_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_boolean_integral2_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_default_smoke_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_default_smoke_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_high_default_smoke_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_high_default_smoke_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_default_smoke_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_default_smoke_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_default_smoke_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_default_smoke_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_default_smoke_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_default_smoke_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_default_smoke_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_default_smoke_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_default_smoke_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high0_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high0_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high0_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high0_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high0_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high0_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high0_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high0_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high0_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high0_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high0_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high0_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high0_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high1_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high1_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high1_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high1_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high1_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high1_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high1_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high1_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high1_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high1_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high1_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high1_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high1_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high2_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high2_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high2_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high2_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high2_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high2_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high2_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high2_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high2_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high2_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high2_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high2_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_nan_low_high2_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_outside_valid_range_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_outside_valid_range_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_high_outside_valid_range_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_high_outside_valid_range_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_outside_valid_range_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_outside_valid_range_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_outside_valid_range_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_outside_valid_range_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_outside_valid_range_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_outside_valid_range_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_outside_valid_range_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_outside_valid_range_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_outside_valid_range_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_smoke_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_smoke_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_low_high_smoke_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_low_high_smoke_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_smoke_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_smoke_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_smoke_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_smoke_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_smoke_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_low_high_smoke_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_low_high_smoke_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_low_high_smoke_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_low_high_smoke_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape0_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape0_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape0_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape0_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape0_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape0_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape0_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape0_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape0_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape0_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape0_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape0_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape0_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape1_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape1_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape1_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape1_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape1_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape1_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape1_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape1_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape1_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape1_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape1_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape1_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape1_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape2_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape2_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape2_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape2_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape2_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape2_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape2_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape2_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape2_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape2_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape2_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape2_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape2_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape3_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape3_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape3_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape3_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape3_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape3_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape3_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape3_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape3_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape3_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape3_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape3_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape3_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape4_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape4_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape4_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape4_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape4_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape4_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape4_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape4_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape4_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape4_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape4_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape4_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_memory_format_memory_format_and_shape4_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_memory_format_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_memory_format_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_memory_format_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_memory_format_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_memory_format_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_memory_format_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_memory_format_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_memory_format_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_memory_format_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_memory_format_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_memory_format_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_memory_format_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_memory_format_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape0_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape0_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape0_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape0_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape0_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape0_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape0_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape0_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape0_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape0_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape0_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape0_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape0_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape1_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape1_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape1_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape1_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape1_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape1_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape1_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape1_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape1_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape1_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape1_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape1_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape1_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape2_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape2_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape2_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape2_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape2_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape2_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape2_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape2_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape2_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape2_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape2_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape2_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape2_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape3_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape3_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape3_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape3_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape3_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape3_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape3_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape3_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape3_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape3_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape3_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape3_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape3_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape4_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape4_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape4_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape4_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape4_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape4_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape4_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape4_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape4_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape4_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape4_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape4_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape4_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape5_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape5_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape5_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape5_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape5_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape5_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape5_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape5_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape5_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape5_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape5_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape5_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape5_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape6_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape6_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape6_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape6_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape6_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape6_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape6_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape6_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape6_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape6_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape6_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape6_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_False_shape6_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape0_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape0_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape0_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape0_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape0_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape0_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape0_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape0_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape0_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape0_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape0_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape0_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape0_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape1_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape1_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape1_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape1_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape1_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape1_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape1_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape1_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape1_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape1_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape1_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape1_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape1_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape2_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape2_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape2_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape2_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape2_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape2_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape2_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape2_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape2_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape2_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape2_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape2_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape2_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape3_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape3_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape3_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape3_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape3_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape3_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape3_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape3_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape3_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape3_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape3_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape3_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape3_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape4_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape4_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape4_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape4_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape4_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape4_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape4_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape4_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape4_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape4_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape4_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape4_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape4_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape5_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape5_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape5_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape5_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape5_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape5_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape5_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape5_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape5_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape5_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape5_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape5_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape5_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape6_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape6_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape6_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape6_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape6_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape6_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape6_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape6_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape6_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape6_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape6_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape6_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_noncontiguous_noncontiguous_True_shape6_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_False_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_False_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_False_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_False_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_False_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_False_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_False_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_False_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_False_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_False_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_False_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_False_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_False_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_True_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_True_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_True_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_True_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_True_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_True_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_True_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_True_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_True_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_True_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_True_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_True_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_requires_grad_requires_grad_True_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_False_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_False_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_False_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_False_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_False_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_False_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_False_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_False_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_False_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_False_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_False_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_False_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_False_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_True_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_True_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_True_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_True_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_True_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_True_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_True_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_True_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_True_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_True_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_True_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_True_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape0_splat_shape_True_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_False_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_False_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_False_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_False_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_False_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_False_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_False_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_False_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_False_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_False_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_False_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_False_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_False_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_True_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_True_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_True_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_True_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_True_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_True_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_True_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_True_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_True_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_True_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_True_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_True_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape1_splat_shape_True_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_False_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_False_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_False_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_False_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_False_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_False_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_False_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_False_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_False_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_False_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_False_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_False_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_False_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_True_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_True_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_True_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_True_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_True_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_True_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_True_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_True_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_True_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_True_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_True_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_True_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape2_splat_shape_True_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_False_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_False_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_False_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_False_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_False_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_False_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_False_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_False_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_False_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_False_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_False_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_False_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_False_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_True_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_True_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_True_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_True_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_True_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_True_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_True_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_True_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_True_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_True_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_True_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_True_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape3_splat_shape_True_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_False_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_False_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_False_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_False_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_False_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_False_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_False_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_False_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_False_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_False_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_False_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_False_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_False_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_True_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_True_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_True_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_True_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_True_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_True_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_True_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_True_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_True_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_True_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_True_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_True_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape4_splat_shape_True_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_False_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_False_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_False_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_False_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_False_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_False_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_False_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_False_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_False_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_False_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_False_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_False_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_False_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_True_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_True_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_True_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_True_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_True_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_True_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_True_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_True_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_True_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_True_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_True_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_True_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape5_splat_shape_True_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_False_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_False_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_False_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_False_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_False_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_False_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_False_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_False_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_False_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_False_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_False_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_False_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_False_cuda_uint8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_True_cuda_bfloat16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_True_cuda_bool, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_True_cuda_complex128, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_True_cuda_complex32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_True_cuda_complex64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_True_cuda_float16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_True_cuda_float32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_True_cuda_float64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_True_cuda_int16, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_True_cuda_int32, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_True_cuda_int64, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_True_cuda_int8, test/test_testing.py::TestMakeTensorCUDA::test_smoke_shape6_splat_shape_True_cuda_uint8, test/test_testing.py::TestTestParametrization::test_apply_param_specific_decorators, test/test_testing.py::TestTestParametrization::test_compose_param_specific_decorators, test/test_testing.py::TestTestParametrization::test_default_names, test/test_testing.py::TestTestParametrization::test_modules_decorator_misuse_error, test/test_testing.py::TestTestParametrization::test_multiple_handling_of_same_param_error, test/test_testing.py::TestTestParametrization::test_name_fn, test/test_testing.py::TestTestParametrization::test_ops_decorator_misuse_error, test/test_testing.py::TestTestParametrization::test_reparametrize, test/test_testing.py::TestTestParametrization::test_subtest_expected_failure_x_1, test/test_testing.py::TestTestParametrization::test_subtest_expected_failure_x_2, test/test_testing.py::TestTestParametrization::test_subtest_expected_failure_x_3, test/test_testing.py::TestTestParametrization::test_subtest_names, test/test_testing.py::TestTestParametrization::test_two_things_subtest_expected_failure_x_1_y_4, test/test_testing.py::TestTestParametrization::test_two_things_subtest_expected_failure_x_1_y_5, test/test_testing.py::TestTestParametrization::test_two_things_subtest_expected_failure_x_1_y_6, test/test_testing.py::TestTestParametrization::test_two_things_subtest_expected_failure_x_2_y_4, test/test_testing.py::TestTestParametrization::test_two_things_subtest_expected_failure_x_2_y_5, test/test_testing.py::TestTestParametrization::test_two_things_subtest_expected_failure_x_2_y_6, test/test_testing.py::TestTestParametrization::test_two_things_subtest_expected_failure_x_3_y_4, test/test_testing.py::TestTestParametrization::test_two_things_subtest_expected_failure_x_3_y_5, test/test_testing.py::TestTestParametrization::test_two_things_subtest_expected_failure_x_3_y_6, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_default_name_non_primitive_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_default_names_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_dtypes_composition_invalid_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_dtypes_composition_valid_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_empty_param_list_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_empty_param_names_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_modules_composition_names_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_modules_decorator_applies_module_and_param_specific_decorators_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_multiple_handling_of_same_param_error_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_name_fn_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_ops_composition_names_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_ops_decorator_applies_op_and_param_specific_decorators_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_param_specific_decoration_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_subtest_expected_failure_x_1_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_subtest_expected_failure_x_2_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_subtest_expected_failure_x_3_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_subtest_names_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_two_things_subtest_expected_failure_x_1_y_4_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_two_things_subtest_expected_failure_x_1_y_5_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_two_things_subtest_expected_failure_x_1_y_6_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_two_things_subtest_expected_failure_x_2_y_4_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_two_things_subtest_expected_failure_x_2_y_5_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_two_things_subtest_expected_failure_x_2_y_6_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_two_things_subtest_expected_failure_x_3_y_4_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_two_things_subtest_expected_failure_x_3_y_5_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_two_things_subtest_expected_failure_x_3_y_6_cuda, test/test_testing.py::TestTestParametrizationDeviceTypeCUDA::test_unparametrized_names_cuda, test/test_testing.py::TestImports::test_circular_dependencies, test/test_testing.py::TestImports::test_lazy_imports_are_lazy, test/test_testing.py::TestImports::test_no_mutate_global_logging_on_import_path_functorch, test/test_testing.py::TestImports::test_no_mutate_global_logging_on_import_path_torch, test/test_testing.py::TestImports::test_no_warning_on_import, test/test_testing.py::TestImports::test_not_import_sympy, test/test_testing.py::TestOpInfos::test_sample_input, test/test_testing.py::TestOpInfos::test_sample_input_metadata, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_T_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators___radd___cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators___rand___cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators___rdiv___cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators___rmod___cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators___rmul___cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators___ror___cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators___rpow___cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators___rsub___cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators___rxor___cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators__chunk_cat_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_add_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_amax_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_amin_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_aminmax_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_arange_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_as_strided_scatter_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_atan2_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_bernoulli_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_bitwise_and_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_bitwise_left_shift_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_bitwise_or_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_bitwise_right_shift_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_bitwise_xor_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_bucketize_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_cat_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_cauchy_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_clamp_max_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_clamp_min_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_complex_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_copysign_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_cov_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_diag_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_diag_embed_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_diagonal_copy_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_diagonal_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_diff_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_div_floor_rounding_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_div_no_rounding_mode_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_div_trunc_rounding_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_dot_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_dsplit_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_dstack_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_empty_permuted_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_eq_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_exponential_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_eye_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_fft2_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_fft_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_fftn_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_hfft2_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_hfft_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_hfftn_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_ifft2_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_ifft_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_ifftn_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_ihfft2_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_ihfft_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_ihfftn_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_irfft2_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_irfft_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_irfftn_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_rfft2_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_rfft_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fft_rfftn_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fliplr_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_flipud_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_float_power_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_floor_divide_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fmax_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fmin_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_fmod_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_gather_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_gcd_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_ge_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_geometric_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_gradient_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_gt_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_heaviside_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_histogramdd_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_hsplit_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_hstack_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_hypot_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_igamma_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_igammac_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_index_add_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_index_select_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_isclose_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_item_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_jiterator_binary_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_jiterator_binary_return_by_ref_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_kthvalue_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_lcm_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_ldexp_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_le_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_linalg_cross_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_linalg_diagonal_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_linalg_lstsq_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_linalg_lstsq_grad_oriented_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_linspace_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_linspace_tensor_overload_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_log_normal_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_logaddexp_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_logcumsumexp_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_logical_and_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_logical_or_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_logical_xor_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_logspace_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_logspace_tensor_overload_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_lt_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_masked_fill_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_masked_scatter_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_masked_select_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_max_binary_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_maximum_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_mean_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_median_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_min_binary_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_minimum_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_movedim_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_mul_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_multinomial_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_narrow_copy_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_narrow_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_native_layer_norm_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_ne_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_neg_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nextafter_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_adaptive_avg_pool1d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_adaptive_avg_pool2d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_adaptive_avg_pool3d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_adaptive_max_pool1d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_adaptive_max_pool2d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_adaptive_max_pool3d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_avg_pool1d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_avg_pool2d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_avg_pool3d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_conv1d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_conv2d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_conv3d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_embedding_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_gaussian_nll_loss_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_gelu_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_group_norm_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_hardtanh_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_hinge_embedding_loss_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_huber_loss_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_l1_loss_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_margin_ranking_loss_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_max_pool1d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_max_pool2d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_max_pool3d_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_multi_margin_loss_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_multilabel_margin_loss_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_poisson_nll_loss_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_prelu_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_rms_norm_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_rrelu_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_soft_margin_loss_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_softshrink_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_triplet_margin_loss_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_nn_functional_triplet_margin_with_distance_loss_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_normal_in_place_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_ormqr_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_polar_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_pow_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_remainder_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_renorm_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_reshape_as_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_reshape_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_roll_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_rot90_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_rsub_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_scatter_add_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_scatter_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_signal_windows_bartlett_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_signal_windows_blackman_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_signal_windows_cosine_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_signal_windows_exponential_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_signal_windows_gaussian_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_signal_windows_general_cosine_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_signal_windows_general_hamming_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_signal_windows_hamming_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_signal_windows_hann_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_signal_windows_kaiser_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_signal_windows_nuttall_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_chebyshev_polynomial_t_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_chebyshev_polynomial_u_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_chebyshev_polynomial_v_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_chebyshev_polynomial_w_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_hermite_polynomial_h_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_hermite_polynomial_he_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_laguerre_polynomial_l_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_legendre_polynomial_p_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_shifted_chebyshev_polynomial_t_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_shifted_chebyshev_polynomial_u_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_shifted_chebyshev_polynomial_v_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_shifted_chebyshev_polynomial_w_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_xlog1py_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_special_zeta_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_sub_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_sum_to_size_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_t_copy_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_t_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_take_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_trace_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_tril_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_triu_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_true_divide_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_unbind_copy_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_unbind_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_uniform_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_vdot_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_view_as_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_view_copy_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_view_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_vsplit_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_vstack_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_where_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_error_generators_xlogy_cuda, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators___radd___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators___rand___cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators___rdiv___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators___rmod___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators___rmul___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators___ror___cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators___rpow___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators___rsub___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators___rxor___cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_abs_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_acos_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_acosh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_add_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_addcdiv_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_addcmul_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_angle_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_asin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_asinh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_atan2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_atan_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_atanh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_bfloat16_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_bitwise_and_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_bitwise_left_shift_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_bitwise_not_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_bitwise_or_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_bitwise_right_shift_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_bitwise_xor_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_bool_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_broadcast_tensors_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_bucketize_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_byte_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_cat_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_cdouble_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_ceil_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_cfloat_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_chalf_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_char_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_chunk_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_clamp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_clamp_max_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_clamp_min_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_clone_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_complex_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_conj_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_conj_physical_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_contiguous_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_copysign_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_cos_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_cosh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_deg2rad_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_diag_embed_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_diagonal_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_diagonal_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_digamma_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_div_floor_rounding_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_div_no_rounding_mode_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_div_trunc_rounding_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_double_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_empty_like_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_eq_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_erf_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_erfc_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_erfinv_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_exp2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_exp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_expm1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_fill_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_flatten_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_float_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_float_power_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_floor_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_floor_divide_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_fmax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_fmin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_fmod_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_frac_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_frexp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_gcd_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_ge_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_gt_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_half_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_heaviside_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_hypot_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_i0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_igamma_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_igammac_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_imag_cuda_complex64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_index_add_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_index_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_index_fill_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_index_select_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_int_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_isclose_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_isfinite_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_isinf_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_isnan_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_isneginf_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_isposinf_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_isreal_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_jiterator_binary_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_jiterator_binary_return_by_ref_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_jiterator_unary_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_lcm_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_ldexp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_le_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_lgamma_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_log10_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_log1p_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_log2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_log_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_logaddexp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_logical_and_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_logical_not_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_logical_or_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_logical_xor_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_logit_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_logsumexp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_long_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_lt_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_max_binary_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_maximum_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_min_binary_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_minimum_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_movedim_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_mul_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nan_to_num_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_narrow_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_narrow_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_ne_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_neg_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nextafter_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_celu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_elu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_grid_sample_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_group_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_hardshrink_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_hardsigmoid_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_hardtanh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_hinge_embedding_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_interpolate_bicubic_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_interpolate_bilinear_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_logsigmoid_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_margin_ranking_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_mish_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_multi_margin_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_multilabel_margin_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_prelu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_relu6_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_relu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_rrelu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_selu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_silu_complex_cuda_complex64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_silu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_softplus_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_softshrink_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_softsign_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_tanhshrink_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_threshold_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_nn_functional_upsample_bilinear_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_permute_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_permute_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_polar_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_polygamma_polygamma_n_0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_polygamma_polygamma_n_1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_polygamma_polygamma_n_2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_polygamma_polygamma_n_3_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_polygamma_polygamma_n_4_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_positive_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_pow_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_rad2deg_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_real_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_reciprocal_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_remainder_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_reshape_as_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_reshape_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_round_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_round_decimals_0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_round_decimals_3_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_round_decimals_neg_3_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_rsqrt_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_rsub_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_sgn_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_short_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_sigmoid_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_sign_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_signal_windows_bartlett_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_signal_windows_blackman_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_signal_windows_cosine_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_signal_windows_exponential_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_signal_windows_gaussian_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_signal_windows_general_cosine_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_signal_windows_general_hamming_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_signal_windows_hamming_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_signal_windows_hann_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_signal_windows_kaiser_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_signal_windows_nuttall_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_signbit_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_sin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_sinc_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_sinh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_airy_ai_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_bessel_j0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_bessel_j1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_bessel_y0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_bessel_y1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_chebyshev_polynomial_t_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_chebyshev_polynomial_u_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_chebyshev_polynomial_v_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_chebyshev_polynomial_w_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_entr_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_erfcx_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_hermite_polynomial_h_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_hermite_polynomial_he_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_i0e_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_i1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_i1e_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_laguerre_polynomial_l_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_legendre_polynomial_p_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_log_ndtr_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_modified_bessel_i0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_modified_bessel_i1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_modified_bessel_k0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_modified_bessel_k1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_ndtr_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_ndtri_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_scaled_modified_bessel_k0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_scaled_modified_bessel_k1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_spherical_bessel_j0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_xlog1py_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_special_zeta_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_sqrt_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_square_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_sub_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_tan_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_tanh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_true_divide_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_trunc_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_unsafe_chunk_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_view_as_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_view_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_where_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_reference_generators_xlogy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_H_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_T_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators___getitem___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators___radd___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators___rand___cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators___rdiv___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators___rmatmul___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators___rmod___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators___rmul___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators___ror___cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators___rpow___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators___rsub___cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators___rxor___cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators__batch_norm_with_update_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators__chunk_cat_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators__native_batch_norm_legit_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators__segment_reduce_lengths_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators__segment_reduce_offsets_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators__softmax_backward_data_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators__unsafe_masked_index_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators__unsafe_masked_index_put_accumulate_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators__upsample_bilinear2d_aa_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_abs_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_acos_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_acosh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_add_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_addbmm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_addcdiv_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_addcmul_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_addmm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_addmm_decomposed_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_addmv_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_addr_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_alias_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_all_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_allclose_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_amax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_amin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_aminmax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_angle_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_any_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_arange_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_argmax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_argmin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_argsort_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_argwhere_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_as_strided_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_as_strided_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_as_strided_partial_views_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_as_strided_scatter_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_asin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_asinh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_atan2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_atan_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_atanh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_atleast_1d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_atleast_2d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_atleast_3d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_baddbmm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_bernoulli_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_bfloat16_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_bincount_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_bitwise_and_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_bitwise_left_shift_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_bitwise_not_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_bitwise_or_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_bitwise_right_shift_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_bitwise_xor_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_block_diag_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_bmm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_bool_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_broadcast_shapes_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_broadcast_tensors_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_broadcast_to_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_bucketize_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_byte_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cartesian_prod_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cat_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cauchy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cdist_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cdouble_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_ceil_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cfloat_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_chalf_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_char_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cholesky_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cholesky_inverse_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cholesky_solve_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_chunk_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_clamp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_clamp_max_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_clamp_min_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_clone_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_column_stack_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_combinations_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_complex_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_conj_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_conj_physical_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_constant_pad_nd_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_contiguous_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_copysign_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_corrcoef_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cos_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cosh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_count_nonzero_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cov_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cross_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cummax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cummin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cumprod_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cumsum_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_cumulative_trapezoid_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_deg2rad_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_diag_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_diag_embed_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_diagflat_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_diagonal_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_diagonal_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_diagonal_scatter_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_diff_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_digamma_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_dist_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_div_floor_rounding_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_div_no_rounding_mode_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_div_trunc_rounding_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_dot_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_double_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_dsplit_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_dstack_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_einsum_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_empty_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_empty_like_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_empty_permuted_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_empty_strided_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_eq_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_equal_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_erf_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_erfc_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_erfinv_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_exp2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_exp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_expand_as_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_expand_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_expand_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_expm1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_exponential_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_eye_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_fft2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_fft_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_fftn_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_fftshift_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_hfft2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_hfft_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_hfftn_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_ifft2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_ifft_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_ifftn_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_ifftshift_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_ihfft2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_ihfft_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_ihfftn_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_irfft2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_irfft_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_irfftn_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_rfft2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_rfft_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fft_rfftn_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fill_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_flatten_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_flip_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fliplr_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_flipud_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_float_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_float_power_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_floor_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_floor_divide_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fmax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fmin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_fmod_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_frac_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_frexp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_full_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_full_like_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_gather_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_gcd_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_ge_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_geometric_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_geqrf_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_gradient_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_grid_sampler_2d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_grid_sampler_3d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_gt_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_half_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_hash_tensor_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_heaviside_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_histc_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_hsplit_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_hstack_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_hypot_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_i0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_igamma_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_igammac_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_imag_cuda_complex64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_index_add_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_index_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_index_fill_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_index_put_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_index_reduce_amax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_index_reduce_amin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_index_reduce_mean_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_index_reduce_prod_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_index_select_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_inner_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_int_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_isclose_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_isfinite_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_isin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_isinf_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_isnan_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_isneginf_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_isposinf_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_isreal_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_istft_cuda_complex64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_item_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_jiterator_2inputs_2outputs_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_jiterator_4inputs_with_extra_args_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_jiterator_binary_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_jiterator_binary_return_by_ref_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_jiterator_unary_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_kron_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_kthvalue_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_lcm_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_ldexp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_le_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_lerp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_lgamma_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_cholesky_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_cholesky_ex_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_cond_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_cross_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_det_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_diagonal_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_eig_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_eigh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_eigvals_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_eigvalsh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_householder_product_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_inv_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_inv_ex_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_ldl_factor_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_ldl_factor_ex_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_ldl_solve_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_lstsq_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_lstsq_grad_oriented_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_lu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_lu_factor_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_lu_factor_ex_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_lu_solve_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_matrix_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_matrix_power_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_matrix_rank_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_matrix_rank_hermitian_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_multi_dot_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_norm_subgradients_at_zero_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_pinv_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_pinv_hermitian_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_pinv_singular_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_qr_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_slogdet_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_solve_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_solve_ex_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_solve_triangular_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_svd_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_svdvals_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_tensorinv_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_tensorsolve_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_vander_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_vecdot_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linalg_vector_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linspace_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_linspace_tensor_overload_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_log10_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_log1p_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_log2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_log_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_log_normal_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_log_softmax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_log_softmax_with_dtype_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_logaddexp2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_logaddexp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_logcumsumexp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_logdet_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_logical_and_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_logical_not_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_logical_or_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_logical_xor_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_logit_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_logspace_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_logspace_tensor_overload_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_logsumexp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_long_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_lt_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_lu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_lu_solve_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_lu_unpack_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_mH_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_mT_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_amax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_amin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_argmax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_argmin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_cumprod_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_cumsum_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_fill_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_log_softmax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_logaddexp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_logsumexp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_mean_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_median_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_normalize_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_prod_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_scatter_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_select_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_softmax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_softmin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_std_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_sum_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_masked_var_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_matmul_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_matrix_exp_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_max_binary_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_max_pool2d_with_indices_backward_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_max_reduction_no_dim_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_max_reduction_with_dim_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_maximum_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_mean_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_median_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_meshgrid_list_of_tensors_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_meshgrid_variadic_tensors_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_min_binary_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_min_reduction_no_dim_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_min_reduction_with_dim_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_minimum_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_mm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_mode_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_movedim_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_msort_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_mul_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_multinomial_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_mv_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nan_to_num_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nanmean_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nanmedian_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nanquantile_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nansum_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_narrow_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_narrow_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_native_batch_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_native_dropout_backward_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_native_layer_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_ne_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_neg_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_new_empty_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_new_empty_strided_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_new_full_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_new_ones_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_new_zeros_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nextafter_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_alpha_dropout_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_avg_pool1d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_avg_pool2d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_avg_pool3d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_batch_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_bilinear_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_binary_cross_entropy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_celu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_channel_shuffle_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_conv1d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_conv2d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_conv3d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_conv_transpose1d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_conv_transpose2d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_conv_transpose3d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_cosine_embedding_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_cosine_similarity_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_cross_entropy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_ctc_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_dropout2d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_dropout3d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_dropout_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_elu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_embedding_bag_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_embedding_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_fractional_max_pool2d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_fractional_max_pool3d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_gaussian_nll_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_gelu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_glu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_grid_sample_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_group_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_hardshrink_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_hardsigmoid_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_hardswish_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_hardtanh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_hinge_embedding_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_huber_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_instance_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_interpolate_area_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_interpolate_bicubic_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_interpolate_bilinear_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_interpolate_linear_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_interpolate_nearest_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_interpolate_trilinear_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_kl_div_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_l1_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_layer_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_leaky_relu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_linear_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_local_response_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_logsigmoid_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_margin_ranking_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_max_pool1d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_max_pool2d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_max_pool3d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_max_unpool1d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_max_unpool1d_grad_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_max_unpool2d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_max_unpool2d_grad_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_max_unpool3d_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_max_unpool3d_grad_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_mish_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_mse_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_multi_head_attention_forward_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_multi_margin_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_multilabel_margin_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_nll_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_normalize_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_one_hot_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_pad_circular_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_pad_constant_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_pad_reflect_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_pad_replicate_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_pad_replicate_negative_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_pairwise_distance_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_pdist_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_pixel_shuffle_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_pixel_unshuffle_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_poisson_nll_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_prelu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_relu6_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_relu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_rms_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_rrelu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_selu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_silu_complex_cuda_complex64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_silu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_smooth_l1_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_soft_margin_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_softmin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_softmin_with_dtype_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_softplus_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_softshrink_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_softsign_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_tanhshrink_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_threshold_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_triplet_margin_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_unfold_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_upsample_bilinear_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nn_functional_upsample_nearest_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nonzero_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_nonzero_static_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_norm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_norm_fro_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_norm_inf_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_norm_nuc_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_normal_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_normal_in_place_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_normal_number_mean_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_ones_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_ones_like_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_ormqr_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_outer_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_pca_lowrank_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_permute_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_permute_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_pinverse_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_polar_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_polygamma_polygamma_n_0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_polygamma_polygamma_n_1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_polygamma_polygamma_n_2_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_polygamma_polygamma_n_3_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_polygamma_polygamma_n_4_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_positive_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_pow_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_prod_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_put_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_qr_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_quantile_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_rad2deg_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_rand_like_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_randint_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_randint_like_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_randn_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_randn_like_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_ravel_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_real_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_reciprocal_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_remainder_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_renorm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_repeat_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_repeat_interleave_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_reshape_as_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_reshape_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_resize__cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_resize_as__cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_resolve_conj_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_resolve_neg_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_roll_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_rot90_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_round_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_round_decimals_0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_round_decimals_3_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_round_decimals_neg_3_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_rsqrt_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_rsub_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_scalar_tensor_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_scatter_add_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_scatter_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_scatter_reduce_amax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_scatter_reduce_amin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_scatter_reduce_mean_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_scatter_reduce_prod_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_scatter_reduce_sum_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_searchsorted_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_select_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_select_scatter_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_sgn_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_short_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_sigmoid_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_sign_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_signal_windows_bartlett_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_signal_windows_blackman_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_signal_windows_cosine_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_signal_windows_exponential_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_signal_windows_gaussian_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_signal_windows_general_cosine_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_signal_windows_general_hamming_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_signal_windows_hamming_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_signal_windows_hann_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_signal_windows_kaiser_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_signal_windows_nuttall_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_signbit_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_sin_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_sinc_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_sinh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_slice_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_slice_scatter_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_softmax_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_softmax_with_dtype_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_sort_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_sparse_mm_reduce_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_sparse_sampled_addmm_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_airy_ai_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_bessel_j0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_bessel_j1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_bessel_y0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_bessel_y1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_chebyshev_polynomial_t_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_chebyshev_polynomial_u_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_chebyshev_polynomial_v_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_chebyshev_polynomial_w_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_entr_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_erfcx_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_hermite_polynomial_h_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_hermite_polynomial_he_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_i0e_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_i1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_i1e_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_laguerre_polynomial_l_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_legendre_polynomial_p_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_log_ndtr_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_modified_bessel_i0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_modified_bessel_i1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_modified_bessel_k0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_modified_bessel_k1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_ndtr_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_ndtri_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_scaled_modified_bessel_k0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_scaled_modified_bessel_k1_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_spherical_bessel_j0_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_xlog1py_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_special_zeta_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_split_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_split_list_args_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_split_with_sizes_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_split_with_sizes_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_sqrt_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_square_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_squeeze_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_squeeze_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_squeeze_multiple_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_stack_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_std_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_std_mean_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_std_mean_unbiased_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_std_unbiased_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_stft_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_sub_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_sum_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_sum_to_size_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_svd_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_svd_lowrank_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_t_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_t_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_take_along_dim_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_take_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_tan_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_tanh_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_tensor_split_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_tensordot_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_tile_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_to_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_to_sparse_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_topk_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_torch__scaled_mm_cuda_float8_e4m3fn, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_trace_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_transpose_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_transpose_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_trapezoid_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_trapz_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_triangular_solve_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_tril_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_tril_indices_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_triu_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_triu_indices_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_true_divide_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_trunc_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_unbind_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_unbind_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_unflatten_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_unfold_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_unfold_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_uniform_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_unique_consecutive_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_unique_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_unravel_index_cuda_int64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_unsafe_chunk_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_unsafe_split_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_unsqueeze_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_unsqueeze_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_var_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_var_mean_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_var_mean_unbiased_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_var_unbiased_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_vdot_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_view_as_complex_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_view_as_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_view_as_real_cuda_complex64, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_view_copy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_view_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_vsplit_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_vstack_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_where_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_xlogy_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_zero__cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_zeros_cuda_float32, test/test_testing.py::TestOpInfoSampleFunctionsCUDA::test_opinfo_sample_generators_zeros_like_cuda_float32 2025-12-04T10:44:54.3291709Z 2025-12-04T10:44:54.3291828Z Finished test_testing 1/1 ... [2025-12-04 10:44:54.235341][4969876.731525601], took 0.42min 2025-12-04T10:44:54.3292206Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:44:54.3292565Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:44:54.3292795Z Running inductor/test_autoheuristic 1/1 ... [2025-12-04 10:44:54.252736][4969876.748922211] 2025-12-04T10:44:54.3292987Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:44:54.3293382Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_autoheuristic.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:44:54.253214] 2025-12-04T10:44:59.6778372Z 2025-12-04T10:44:59.6780556Z inductor/test_autoheuristic 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_autoheuristic_1.1_b6832b3b6853e056_.log 2025-12-04T10:44:59.6781670Z Running 0 items in this shard: 2025-12-04T10:44:59.6781944Z 2025-12-04T10:44:59.6782359Z Finished inductor/test_autoheuristic 1/1 ... [2025-12-04 10:44:59.677399][4969882.173583924], took 0.09min 2025-12-04T10:44:59.6791187Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:44:59.6948514Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:44:59.6954856Z Running export/test_strict_export_v2 1/1 ... [2025-12-04 10:44:59.695240][4969882.191422787] 2025-12-04T10:44:59.6955555Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:44:59.6963519Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_strict_export_v2.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:44:59.695816] 2025-12-04T10:46:05.4479879Z 2025-12-04T10:46:05.4481379Z export/test_strict_export_v2 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_strict_export_v2_1.1_ea71eea237c680c9_.log 2025-12-04T10:46:05.4704386Z Running 440 items in this shard: test/export/test_strict_export_v2.py::StrictExportV2TestDynamismExpression::test_export_assume_static_by_default_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestDynamismExpression::test_export_constraints_error_not_in_range_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestDynamismExpression::test_export_constraints_error_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestDynamismExpression::test_export_inline_constraints_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestDynamismExpression::test_export_slice_maxsize_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestDynamismExpression::test_export_slice_unbacked_dim1_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestDynamismExpression::test_export_strict_narrow_unbacked_expr_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestDynamismExpression::test_no_grad_param_inplace_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestDynamismExpression::test_reshape_view_backed_size_oblivious_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test__scaled_dot_product_flash_attention_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_additional_inputs_constants_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_allow_explicit_guards_as_runtime_asserts_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_annotate_on_assert_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_args_type_checked_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_aten_lift_fresh_copy_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_attention_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_attr_assignment_extra_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_automatic_constrain_size_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_automatic_dynamic_shapes_constant_relation_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_automatic_dynamic_shapes_linear_relation_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_automatic_dynamic_shapes_simple_equality_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_baddbmm_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_basic_non_strict_fake_tensor_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_basic_non_strict_real_tensor_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_basic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_bincount_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_buffer_util_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_capture_subclass_constructor_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_capture_subclass_constructor_torch_ir_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_capture_subclass_wrong_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_ccode_python_mod_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_cdist_forward_compute_mode_zero_export_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_check_specialized_int_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_checks_to_constrain_range_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_cleanup_dynamic_markers_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_colin_unbacked_backed_vr_sub_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_colon_parameter_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_compiling_state_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_cond_access_identical_symint_closure_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_cond_branches_return_constant_int_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_cond_branches_return_same_int_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_cond_buffers_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_cond_contains_unbacked_no_escape_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_cond_int_closure_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_cond_unflatten_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_cond_with_module_stack_export_with_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_cond_with_module_stack_export_with_unflatten_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constant_aliasing_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constant_input_naming_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constant_no_user_inp_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constant_output_dup_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constant_output_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constant_requires_grad_const_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constant_return_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constant_tensor_mutation_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constant_tensor_with_non_functional_nested_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constant_tensor_with_non_functional_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constrain_decomp_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constrain_size_in_eager_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constrain_size_with_constrain_value_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_constrain_size_with_various_cases_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_conv_dynamic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_crop_like_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_cse_for_symint_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_custom_op_auto_functionalize_pre_dispatch_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_custom_op_auto_functionalize_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_custom_op_auto_warn_pre_dispatch_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_custom_op_preserve_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_custom_pytree_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_custom_tag_metadata_re_export_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_decomp_batch_norm_functional_predispatch_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_decomp_item_in_prim_after_decomposition_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_decomp_item_in_prim_before_decomposition_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_default_decomposition_core_cia_ops_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_derived_dim_1_2_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_derived_dim_basic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_derived_dim_integer_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_derived_dim_nested_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_derived_dim_out_of_order_repeat_derived_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_derived_dim_out_of_order_simplified_repeat_non_derived_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_derived_dim_out_of_order_simplified_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_derived_dim_out_of_order_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_derived_dim_repeat_derived_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_detect_leak_nonstrict_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_detect_leak_nonstrict_with_stacktrace_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_detect_leak_strict_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_device_to_dynamic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_device_to_gpu_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_device_to_mutation_float_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_device_to_mutation_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_device_to_static_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dim_1_2_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dim_auto_and_dim_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dim_dynamic_divisibility_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dim_dynamic_specialization_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dim_dynamic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dim_hint_range_violations_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dim_hint_ranges_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_disable_forced_specializations_errors_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_disable_forced_specializations_ok_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_distributed_all_gather_into_tensor_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_distributed_all_gather_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_distributed_all_reduce_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_distributed_all_to_all_single_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_distributed_reduce_scatter_tensor_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dont_duck_size_for_auto_dynamic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_double_lifted_constants_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_draft_export_checks_aliasing_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_draft_export_checks_mutation_list_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_draft_export_checks_mutation_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_draft_export_checks_mutation_with_nan_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_draft_export_fake_kernel_inference_errors_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_draft_export_infers_fake_kernel_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_duplicate_modules_with_non_persistent_buffers_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dynamic_lr_shift_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dynamic_shapes_bounds_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dynamic_shapes_builder_basic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dynamic_shapes_builder_kwargs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dynamic_shapes_builder_pytree_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dynamic_shapes_dataclass_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dynamic_shapes_inferred_basic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dynamic_shapes_serdes_generic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dynamic_shapes_serdes_user_errors_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dynamic_shapes_serdes_various_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dynamic_shapes_spec_with_pytree_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dynamic_shapes_wrapped_with_shape_guards_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_dynamic_sym_round_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_ends_of_bounds_oblivious_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_enum_str_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_error_does_not_reference_eager_fallback_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_error_when_passing_mutating_primitive_op_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_exception_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_expand_copy_export_handles_implicit_true_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_api_with_dynamic_shapes_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_as_backend_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_associative_scan_lifted_buffers_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_associative_scan_symbol_dim_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_associative_scan_symbol_scandim_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_aten_to_unflatten_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_aten_to_unflatten_subclass_pre_dispatch_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_aten_to_unflatten_subclass_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_cond_preserve_torch_fn_for_subgraphs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_cond_symbool_pred_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_cond_warns_constant_pred_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_custom_decomp_table_basic_pop_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_custom_decomp_table_container_methods_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_custom_op_lib_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_custom_triton_kernel_mutable_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_custom_triton_kernel_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_cyclic_reference_leak_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_decomp_torture_case_1_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_decomp_torture_case_2_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_decomps_dynamic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_decomps_simple_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_dynamo_config_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_for_training_run_decomp_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_for_training_with_container_type_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_for_training_with_dynamic_shapes_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_for_training_with_mutation_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_for_training_with_state_dict_hooks_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_func_with_default_kwargs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_func_with_keyword_only_args_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_func_with_kwargs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_func_with_pytree_kwargs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_func_with_var_keyword_args_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_func_with_var_keyword_pytree_args_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_func_with_var_postional_args_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_function_schema_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_graph_with_no_inputs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_input_mutation_bug_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_input_mutation_dynamic_shape_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_input_mutation_static_shape_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_leak_compile_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_linear_preserve_dynamic_shape_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_max_nonstrict_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_max_onnx_reported_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_method_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_mod_constraints_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_module_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_preserve_linear_at_aot_level_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_preserve_linear_but_not_custom_op_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_rnn_variants_with_warning_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_scan_pytree_output_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_script_module_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_statically_known_true_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_then_compile_tensor_ctor_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_with_autocast_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_with_fake_tensor_inputs_on_cuda_devices_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_with_fake_tensor_inputs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_with_inline_constraints_complex_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_with_inline_constraints_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_with_set_grad_enabled_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_export_with_wrong_inputs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_external_call_non_strict_real_tensor_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_fake_inputs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_fake_weights_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_filter_traceback_frames_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_flex_attention_export_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_float_conversion_from_int_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_float_conversion_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_fqn_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_from_node_metadata_export_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_full_on_scalar_tensor_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_function_holding_tensor_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_hints_wrapper_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_hoo_inline_users_issue_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_if_functional_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_if_post_autograd_op_preserved_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_inductor_backend_inside_nonstrict_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_inline_script_class_method_recursive_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_inline_script_class_method_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_inline_script_function_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_inline_script_method_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_int_shape_specialization_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_intermediate_shape_comp_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_invalid_pytree_dynamo_graph_capture_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_is_exporting_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_is_nonzero_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_isnonzero_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_issue_113041_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_issue_157289_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_issue_161902_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_istft_op_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_keep_composite_ops_invalid_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_keep_composite_ops_linear_convd_for_training_ir_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_keep_composite_ops_linear_convd_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_kwarg_dynamic_shapes_diff_order_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_kwargs_reorder_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_layer_norm_unbacked_normalized_shape_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_layer_sharing_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_lazy_module_kwargs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_lifted_constants_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_linear_conv_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_malformed_fqn_from_source_name_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_map_buffers_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_map_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_mask_nonzero_static_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_masked_select_dynamic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_math_pow_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_mismatched_dynamic_shapes_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_mixed_input_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_module_dict_key_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_module_input_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_module_input_subclasses_parameterization_nested_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_module_list_slice_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_module_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_module_with_dict_container_inp_out_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_modules_access_for_deleted_submodule_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_more_multidimensional_slicing_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_multidimensional_slicing_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_multinomial_dynamic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_multiple_definitions_same_name_dim_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_namedtuple_input_export_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_native_multi_attention_head_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_nested_dynamic_shapes_spec_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_nested_module_fake_tensor_leak_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_nested_module_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_nested_module_with_constant_buffer_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_nested_module_with_init_buffer_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_nested_module_with_parameter_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_nn_module_stack_shared_submodule_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_nn_module_stack_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_no_check_is_size_error_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_no_suggested_fixes_for_data_dependent_errors_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_no_tensor_computation_2_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_no_tensor_computation_3_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_no_tensor_computation_4_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_no_tensor_computation_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_non_arg_name_dynamic_shapes_api_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_non_arg_name_dynamic_shapes_api_with_container_type_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_non_arg_name_dynamic_shapes_api_with_kwarg_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_non_persistent_buffer_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_non_strict_dynamic_shapes_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_non_strict_dynamic_shapes_suggested_fixes_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_none_buffers_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_nonstrict_retrace_preserves_metadata_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_nonzero_2_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_nonzero_dynamic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_not_registered_parameter_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_operator_aten_tensor_mode_variant_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_output_node_name_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_pad_sequence_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_param_util_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_partial_patched_forward_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_placeholder_naming_collisions_hoo_subgraphs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_placeholder_naming_collisions_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_placeholder_naming_order_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_placeholder_naming_order_variadic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_placeholder_update_preserving_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_predispatch_cond_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_predispatch_grad_wrappers_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_preserve_annotation_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_preserve_module_call_signature_unflatten_specialization_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_preserve_requires_grad_placeholders_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_preserve_shape_dynamism_for_unused_inputs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_profiling_code_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_python_asserts_with_sym_int_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_pytree_register_data_class_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_pytree_register_nested_data_class_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_raise_user_error_when_guard_on_data_dependent_operation_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_range_constraints_with_replacement_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_real_tensor_alias_dtype_mismatch_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_real_tensor_bool_cast_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_real_tensor_errors_on_aliasing_custom_op_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_real_tensor_for_max_op_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_real_tensor_size_mismatch_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_redundant_assert_max_upper_bound_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_redundant_asserts_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_refine_dynamic_shapes_from_suggested_fixes_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_register_constant_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_repeat_interleave_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_replace_unbacked_with_very_large_upperbound_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_replaced_unbacked_bindings_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_reshape_view_helper_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_retracable_ep_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_retrace_pre_autograd_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_run_decomposition_supports_user_input_mutation_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_run_decompositions_keep_metadata_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_run_decompositions_keep_tensor_constant_metadata_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_runtime_assert_for_prim_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_runtime_assert_for_prm_str_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_runtime_assert_with_size_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_sdpa_gqa_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_sequential_slicing_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_set_example_inputs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_set_grad_as_side_effect_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_set_grad_empty_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_set_grad_unflatten_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_setgrad_lifted_tensor_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_shared_submodule_nn_module_stack_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_simple_export_for_training_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_simple_unbacked_view_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_size_input_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_slice_nn_module_stack_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_solver_unsupported_sympy_function_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_specialize_derived_dim_roots_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_split_const_gm_with_lifted_constants_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_stack_trace_make_fx_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_stack_trace_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_state_primitives_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_state_shape_attribute_assignment_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_state_tensors_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_static_dim_constraints_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_subclass_context_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_subclass_nested_attr_access_complicated_metadata_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_subclass_nested_attr_access_const_metadata_not_top_level_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_subclass_nested_attr_access_const_metadata_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_subclass_nested_attr_access_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_subclass_nested_attr_access_submodule_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_subclasses_parameterization_nested_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_subclasses_parameterization_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_suggest_torch_checks_with_non_negative_check_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_suggest_torch_checks_with_regular_check_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_suggested_fixes_for_data_dependent_errors_basic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_suggested_fixes_for_data_dependent_errors_puzzlers_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_suggested_fixes_new_roots_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_sym_float_operators_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_sym_or_sym_and_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_sym_sqrt_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_symbool_item_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_symfloat_item_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_symint_input_additional_inputs_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_symint_input_basic_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_symint_input_ranges_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_symint_input_shapes_collection_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_symint_input_specialization_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_symint_item_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_symint_output_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_symint_tensor_return_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_tag_ac_export_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_tensor_attribute_zero_args_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_tensor_constant_aten_to_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_tensor_constant_with_wrapped_method_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_to_module_with_mutated_buffer_multiple_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_to_module_with_mutated_buffer_multiple_update_sub_later_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_to_module_with_mutated_buffer_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_tolist_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_torch_check_eq_commutativity_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_torch_fn_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_trace_under_fake_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_train_eval_on_exported_preautograd_module_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_tril_dynamic_diagonal_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_triu_dynamic_diagonal_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_3d_matmul_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_bincount_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_bindings_for_divisible_u_symint_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_deferred_runtime_retrace_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_expand_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_infer_size_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_kth_value_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_linear_layer_norm_input_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_noncontig_lin_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_pad_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_scalar_constructor_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_slice_forward_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_slice_simple_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_stack_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_to_cond_passthrough_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_to_cond_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unbacked_unsqueeze_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_asserts_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_buffer_update_child2parent_swap_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_closure_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_isinstance_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_multiple_graphs_dispatch_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_multiple_graphs_preserve_signature_no_error_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_multiple_graphs_shared_submodule_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_multiple_graphs_state_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_no_unroll_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_placeholder_update_child2parent_swap_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_placeholder_update_grandchild2cousin_swap_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_5_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_6_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_buf_8_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_const_preserving_3_1_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_const_preserving_3_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_mutating_buf_4_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_mutating_buf_6_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_mutating_buf_9_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_mutating_buf_preserving_10_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_mutating_buf_preserving_4_1_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_mutating_buf_preserving_4_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_mutating_buf_preserving_5_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_mutating_buf_preserving_7_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unflatten_random_dag_preserving_4_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unused_aliases_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_unused_constant_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_uplift_common_custom_meta_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_uplift_common_custom_meta_with_multiple_calls_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_use_embedding_twice_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_user_input_and_buffer_mutation_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_vmap_custom_autograd_function_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_vmap_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_vmap_to_assert_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_where_decomp_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_while_loop_assert_separation_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_while_loop_index_assertions_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_while_loop_simple_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_while_loop_tensor_constant_idx_strict_export_v2, test/export/test_strict_export_v2.py::StrictExportV2TestExport::test_wrapper_module_strict_export_v2 2025-12-04T10:46:05.4899115Z 2025-12-04T10:46:05.4899383Z Finished export/test_strict_export_v2 1/1 ... [2025-12-04 10:46:05.448968][4969947.945151602], took 1.10min 2025-12-04T10:46:05.4900214Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:46:05.4900968Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:46:05.4901512Z Running inductor/test_segmented_tree 1/1 ... [2025-12-04 10:46:05.466340][4969947.962526761] 2025-12-04T10:46:05.4901914Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:46:05.4902740Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_segmented_tree.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:46:05.466822] 2025-12-04T10:46:07.7361882Z 2025-12-04T10:46:07.7363111Z inductor/test_segmented_tree 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_segmented_tree_1.1_379e8da6fc9e804b_.log 2025-12-04T10:46:07.7369236Z Running 12 items in this shard: test/inductor/test_segmented_tree.py::TestSegmentedTree::test_basic_construction, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_boundary_conditions, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_empty_array, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_full_array_range, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_invalid_ranges, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_max_query_matches_naive, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_multiple_operations, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_out_of_bounds, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_overlapping_updates, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_range_update, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_sequential_updates_and_queries, test/inductor/test_segmented_tree.py::TestSegmentedTree::test_single_element_ranges 2025-12-04T10:46:07.7373866Z 2025-12-04T10:46:07.7374291Z Finished inductor/test_segmented_tree 1/1 ... [2025-12-04 10:46:07.735883][4969950.232067219], took 0.04min 2025-12-04T10:46:07.7376611Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:46:07.7531138Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:46:07.7537144Z Running inductor/test_kernel_optimization 1/1 ... [2025-12-04 10:46:07.753530][4969950.249716194] 2025-12-04T10:46:07.7537826Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:46:07.7542663Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_kernel_optimization.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:46:07.754029] 2025-12-04T10:46:22.1454788Z 2025-12-04T10:46:22.1456525Z inductor/test_kernel_optimization 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_kernel_optimization_1.1_59e620ee0cf4558d_.log 2025-12-04T10:46:22.1458056Z Running 1 items in this shard: test/inductor/test_kernel_optimization.py::TestKernelOptimization::test_einsum_to_pointwise 2025-12-04T10:46:22.1458966Z 2025-12-04T10:46:22.1459393Z Finished inductor/test_kernel_optimization 1/1 ... [2025-12-04 10:46:22.145226][4969964.641409946], took 0.24min 2025-12-04T10:46:22.1469564Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:46:22.1624445Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:46:22.1629248Z Running inductor/test_inductor_scheduler 1/1 ... [2025-12-04 10:46:22.162629][4969964.658815654] 2025-12-04T10:46:22.1629928Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:46:22.1633499Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_inductor_scheduler.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:46:22.163114] 2025-12-04T10:46:40.6142639Z 2025-12-04T10:46:40.6144186Z inductor/test_inductor_scheduler 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_inductor_scheduler_1.1_8bbde3da16ff753f_.log 2025-12-04T10:46:40.6150237Z Running 8 items in this shard: test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_disable_get_estimated_runtime_logging_cuda_float16, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_disable_get_estimated_runtime_logging_cuda_float32, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_flop_counter_op_options0_cuda_float16, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_flop_counter_op_options0_cuda_float32, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_flop_counter_op_options1_cuda_float16, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_flop_counter_op_options1_cuda_float32, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_fusion_prevent_too_many_reads_and_writes_allows_fusion_cuda, test/inductor/test_inductor_scheduler.py::TestSchedulerCUDA::test_fusion_prevent_too_many_reads_and_writes_prevents_fusion_cuda 2025-12-04T10:46:40.6154603Z 2025-12-04T10:46:40.6155051Z Finished inductor/test_inductor_scheduler 1/1 ... [2025-12-04 10:46:40.613800][4969983.109984271], took 0.31min 2025-12-04T10:46:40.6157417Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:46:40.6309505Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:46:40.6317445Z Running inductor/test_remote_cache 1/1 ... [2025-12-04 10:46:40.631324][4969983.127510226] 2025-12-04T10:46:40.6318150Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:46:40.6320633Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_remote_cache.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:46:40.631818] 2025-12-04T10:46:43.2016466Z 2025-12-04T10:46:43.2017867Z inductor/test_remote_cache 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_remote_cache_1.1_e3b942eada16ec90_.log 2025-12-04T10:46:43.2020002Z Running 3 items in this shard: test/inductor/test_remote_cache.py::TestRemoteCache::test_failure_logging, test/inductor/test_remote_cache.py::TestRemoteCache::test_failure_no_sample, test/inductor/test_remote_cache.py::TestRemoteCache::test_normal_logging 2025-12-04T10:46:43.2021282Z 2025-12-04T10:46:43.2021687Z Finished inductor/test_remote_cache 1/1 ... [2025-12-04 10:46:43.201378][4969985.697561265], took 0.04min 2025-12-04T10:46:43.2034553Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:46:43.2188674Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:46:43.2194417Z Running inductor/test_coordinate_descent_tuner 1/1 ... [2025-12-04 10:46:43.219272][4969985.715458384] 2025-12-04T10:46:43.2195136Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:46:43.2201529Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_coordinate_descent_tuner.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:46:43.219769] 2025-12-04T10:46:51.8515663Z 2025-12-04T10:46:51.8518021Z inductor/test_coordinate_descent_tuner 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_coordinate_descent_tuner_1.1_a4b00e21f05d3cba_.log 2025-12-04T10:46:51.8521734Z Running 5 items in this shard: test/inductor/test_coordinate_descent_tuner.py::TestCoordinateDescentTuner::test_abs_function, test/inductor/test_coordinate_descent_tuner.py::TestCoordinateDescentTuner::test_get_neighbour_values, test/inductor/test_coordinate_descent_tuner.py::TestCoordinateDescentTuner::test_no_neighbors, test/inductor/test_coordinate_descent_tuner.py::TestCoordinateDescentTuner::test_persistent_reduction, test/inductor/test_coordinate_descent_tuner.py::TestCoordinateDescentTuner::test_value_too_large 2025-12-04T10:46:51.8525179Z 2025-12-04T10:46:51.8525652Z Finished inductor/test_coordinate_descent_tuner 1/1 ... [2025-12-04 10:46:51.851203][4969994.347388346], took 0.14min 2025-12-04T10:46:51.8556322Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:46:51.8684572Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:46:51.8690751Z Running inductor/test_inplace_padding 1/1 ... [2025-12-04 10:46:51.868870][4969994.365056668] 2025-12-04T10:46:51.8691770Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:46:51.8695968Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_inplace_padding.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:46:51.869345] 2025-12-04T10:47:09.6654774Z 2025-12-04T10:47:09.6656241Z inductor/test_inplace_padding 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_inplace_padding_1.1_9dd72fbdd294c684_.log 2025-12-04T10:47:09.6660981Z Running 9 items in this shard: test/inductor/test_inplace_padding.py::InplacePaddingTest::test_linear_and_cel, test/inductor/test_inplace_padding.py::InplacePaddingTest::test_linear_and_cel_max_autotune, test/inductor/test_inplace_padding.py::InplacePaddingTest::test_mutating_padding_input, test/inductor/test_inplace_padding.py::InplacePaddingTest::test_mutating_padding_output, test/inductor/test_inplace_padding.py::InplacePaddingTest::test_pad_non_zero, test/inductor/test_inplace_padding.py::InplacePaddingTest::test_pad_non_zero_cpp_wrapper, test/inductor/test_inplace_padding.py::InplacePaddingTest::test_pad_too_large, test/inductor/test_inplace_padding.py::InplacePaddingTest::test_skip_pad_due_to_fusion, test/inductor/test_inplace_padding.py::InplacePaddingTest::test_skip_pad_input 2025-12-04T10:47:09.6664675Z 2025-12-04T10:47:09.6665115Z Finished inductor/test_inplace_padding 1/1 ... [2025-12-04 10:47:09.665078][4970012.161262935], took 0.30min 2025-12-04T10:47:09.6669927Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:47:09.6823659Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:47:09.6830012Z Running inductor/test_minifier_utils 1/1 ... [2025-12-04 10:47:09.682726][4970012.178911907] 2025-12-04T10:47:09.6830710Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:47:09.6834620Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_minifier_utils.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:47:09.683206] 2025-12-04T10:47:12.5025325Z 2025-12-04T10:47:12.5026845Z inductor/test_minifier_utils 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_minifier_utils_1.1_24b6480b0e221e9a_.log 2025-12-04T10:47:12.5029106Z Running 3 items in this shard: test/inductor/test_minifier_utils.py::MinifierUtilsTests::test_convert_module_to_string, test/inductor/test_minifier_utils.py::MinifierUtilsTests::test_invalid_output, test/inductor/test_minifier_utils.py::MinifierUtilsTests::test_non_exportable 2025-12-04T10:47:12.5030532Z 2025-12-04T10:47:12.5030975Z Finished inductor/test_minifier_utils 1/1 ... [2025-12-04 10:47:12.502232][4970014.998416916], took 0.05min 2025-12-04T10:47:12.5042129Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:47:12.5198671Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:47:12.5203202Z Running inductor/test_debug_trace 1/1 ... [2025-12-04 10:47:12.519961][4970015.016147447] 2025-12-04T10:47:12.5203876Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:47:12.5206805Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_debug_trace.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:47:12.520437] 2025-12-04T10:47:25.8077042Z 2025-12-04T10:47:25.8078256Z inductor/test_debug_trace 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_debug_trace_1.1_b9889104d3384cdf_.log 2025-12-04T10:47:25.8080022Z Running 3 items in this shard: test/inductor/test_debug_trace.py::TestDebugTrace::test_debug_multi_tempalte, test/inductor/test_debug_trace.py::TestDebugTrace::test_debug_printer_const, test/inductor/test_debug_trace.py::TestDebugTrace::test_debug_trace 2025-12-04T10:47:25.8080722Z 2025-12-04T10:47:25.8080951Z Finished inductor/test_debug_trace 1/1 ... [2025-12-04 10:47:25.807300][4970028.303485112], took 0.22min 2025-12-04T10:47:25.8094731Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:47:25.8247188Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:47:25.8253189Z Running dynamo/test_recompiles 1/1 ... [2025-12-04 10:47:25.825074][4970028.321260331] 2025-12-04T10:47:25.8253865Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:47:25.8257968Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_recompiles.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:47:25.825542] 2025-12-04T10:47:29.6971031Z 2025-12-04T10:47:29.6972609Z dynamo/test_recompiles 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_recompiles_1.1_febdaffa8518667b_.log 2025-12-04T10:47:29.6980944Z Running 18 items in this shard: test/dynamo/test_recompiles.py::RecompileTests::test_aliasing_guard_failures, test/dynamo/test_recompiles.py::RecompileTests::test_aliasing_guard_failures_with_globals, test/dynamo/test_recompiles.py::RecompileTests::test_ambient_autocast_recompile, test/dynamo/test_recompiles.py::RecompileTests::test_autocast_constant_fold, test/dynamo/test_recompiles.py::RecompileTests::test_automatic_dynamic_on_closed_ints, test/dynamo/test_recompiles.py::RecompileTests::test_automatic_dynamic_reduce_recompiles, test/dynamo/test_recompiles.py::RecompileTests::test_automatic_dynamic_shapes_mark_as_oblivious, test/dynamo/test_recompiles.py::RecompileTests::test_automatic_dynamic_shapes_mark_as_oblivious_fail_counterfactual, test/dynamo/test_recompiles.py::RecompileTests::test_automatic_dynamic_shapes_mark_as_unbacked, test/dynamo/test_recompiles.py::RecompileTests::test_automatic_dynamic_tensor_scalar_change, test/dynamo/test_recompiles.py::RecompileTests::test_dunder_call_recompile, test/dynamo/test_recompiles.py::RecompileTests::test_dynamic_shape_parameter_recompile, test/dynamo/test_recompiles.py::RecompileTests::test_inline_inbuilt_nn_modules_candidate, test/dynamo/test_recompiles.py::RecompileTests::test_no_recompile_over_unused_objects, test/dynamo/test_recompiles.py::RecompileTests::test_no_recursive_compile_after_cache_limit_hit, test/dynamo/test_recompiles.py::RecompileTests::test_recompiles_true_false_flop, test/dynamo/test_recompiles.py::RecompileTests::test_run_mode_after_cache_limit_hit, test/dynamo/test_recompiles.py::RecompileTests::test_simple_module_recompile 2025-12-04T10:47:29.6989323Z 2025-12-04T10:47:29.6989714Z Finished dynamo/test_recompiles 1/1 ... [2025-12-04 10:47:29.696781][4970032.192967621], took 0.06min 2025-12-04T10:47:29.6991001Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:47:29.7134953Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:47:29.7141110Z Running inductor/test_foreach 1/1 ... [2025-12-04 10:47:29.713875][4970032.210061121] 2025-12-04T10:47:29.7141835Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:47:29.7145542Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_foreach.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:47:29.714337] 2025-12-04T10:51:50.6365815Z 2025-12-04T10:51:50.6368295Z inductor/test_foreach 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_foreach_1.1_6378f598bfcf2ef2_.log 2025-12-04T10:51:50.6582821Z Running 536 items in this shard: test/inductor/test_foreach.py::ForeachTests::test_2d_block_mixed_sizes_with_mask, test/inductor/test_foreach.py::ForeachTests::test_2d_block_no_mixed_sizes_no_mask, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_elems_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_2d_blocking_partitioning_mixed_sizes_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_aliasing, test/inductor/test_foreach.py::ForeachTests::test_broadcasting__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_broadcasting__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_broadcasting__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_broadcasting__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_broadcasting__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_broadcasting__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_broadcasting__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_broadcasting__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_broadcasting_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_broadcasting_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_broadcasting_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_broadcasting_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_broadcasting_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_broadcasting_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_broadcasting_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_broadcasting_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_cpu_cpp_fallback_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_decomp__foreach_addcdiv, test/inductor/test_foreach.py::ForeachTests::test_decomp__foreach_addcmul, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_dynamic_shapes_fallback_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_enable_dynamic_shapes_cpp_wrapper_cuda, test/inductor/test_foreach.py::ForeachTests::test_enable_dynamic_shapes_cpp_wrapper_xpu, test/inductor/test_foreach.py::ForeachTests::test_enable_dynamic_shapes_python_wrapper, test/inductor/test_foreach.py::ForeachTests::test_foreach_cpp_wrapper_cuda, test/inductor/test_foreach.py::ForeachTests::test_foreach_cpp_wrapper_xpu, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_binary_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_binary_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_binary_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_binary_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_binary_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_binary_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_binary_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_binary_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_binary_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_binary_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_unary_foreach_map_abs, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_unary_foreach_map_neg, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_unary_foreach_map_reciprocal, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_backward_unary_foreach_map_sign, test/inductor/test_foreach.py::ForeachTests::test_foreach_map_input_mutation, test/inductor/test_foreach.py::ForeachTests::test_fuse_concat, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_fusion_duplicate_buffer_list_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_list_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_kernel_split_arg_limit_scalar_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_multi_device, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_abs, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_neg, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_reciprocal, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_rsqrt, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_sign, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_sqrt, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_abs, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_addcmul_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_neg, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_recipaddmul_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_reciprocal, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_sign, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_list_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_abs, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_neg, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_reciprocal, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_rsqrt, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_sign, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_sqrt, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_abs, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_addcmul_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_neg, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_recipaddmul_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_reciprocal, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_sign, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_list_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_producer_scalar_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_consumer_scalar_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_abs, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_neg, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_reciprocal, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_rsqrt, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_sign, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_sqrt, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_abs, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_addcmul_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_neg, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_recipaddmul_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_reciprocal, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_sign, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_list_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_non_foreach_producer_scalar_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_reinplacing__foreach_add_, test/inductor/test_foreach.py::ForeachTests::test_reinplacing__foreach_div_, test/inductor/test_foreach.py::ForeachTests::test_reinplacing__foreach_mul_, test/inductor/test_foreach.py::ForeachTests::test_reinplacing__foreach_sub_, test/inductor/test_foreach.py::ForeachTests::test_reinplacing_mut_after__foreach_add_, test/inductor/test_foreach.py::ForeachTests::test_reinplacing_mut_after__foreach_div_, test/inductor/test_foreach.py::ForeachTests::test_reinplacing_mut_after__foreach_mul_, test/inductor/test_foreach.py::ForeachTests::test_reinplacing_mut_after__foreach_sub_, test/inductor/test_foreach.py::ForeachTests::test_reinplacing_mut_before__foreach_add_, test/inductor/test_foreach.py::ForeachTests::test_reinplacing_mut_before__foreach_div_, test/inductor/test_foreach.py::ForeachTests::test_reinplacing_mut_before__foreach_mul_, test/inductor/test_foreach.py::ForeachTests::test_reinplacing_mut_before__foreach_sub_, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_abs, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_neg, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_reciprocal, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_rsqrt, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_sign, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_sqrt, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_abs, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_addcmul_op, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_neg, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_recipaddmul_op, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_reciprocal, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_sign, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_list_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_scheduler_fusion_scalar_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_abs, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_neg, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_reciprocal, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_rsqrt, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_sign, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_sqrt, test/inductor/test_foreach.py::ForeachTests::test_single_list__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_abs, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_addcmul_op, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_neg, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_recipaddmul_op, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_reciprocal, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_sign, test/inductor/test_foreach.py::ForeachTests::test_single_list_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_single_scalar__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_single_scalar__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_single_scalar__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_single_scalar__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_single_scalar__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_single_scalar__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_single_scalar__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_single_scalar__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_single_scalar_tensor_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_abs, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_neg, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_reciprocal, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_rsqrt, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_sign, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_sqrt, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_abs, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_addcmul_op, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_neg, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_recipaddmul_op, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_reciprocal, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_sign, test/inductor/test_foreach.py::ForeachTests::test_singleton_lists_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_type_promotion__foreach_add, test/inductor/test_foreach.py::ForeachTests::test_type_promotion__foreach_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_type_promotion__foreach_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_type_promotion__foreach_copy, test/inductor/test_foreach.py::ForeachTests::test_type_promotion__foreach_div, test/inductor/test_foreach.py::ForeachTests::test_type_promotion__foreach_maximum, test/inductor/test_foreach.py::ForeachTests::test_type_promotion__foreach_minimum, test/inductor/test_foreach.py::ForeachTests::test_type_promotion__foreach_mul, test/inductor/test_foreach.py::ForeachTests::test_type_promotion__foreach_sub, test/inductor/test_foreach.py::ForeachTests::test_type_promotion_foreach_map_add, test/inductor/test_foreach.py::ForeachTests::test_type_promotion_foreach_map_add_op, test/inductor/test_foreach.py::ForeachTests::test_type_promotion_foreach_map_addrecip_op, test/inductor/test_foreach.py::ForeachTests::test_type_promotion_foreach_map_clamp_max, test/inductor/test_foreach.py::ForeachTests::test_type_promotion_foreach_map_clamp_min, test/inductor/test_foreach.py::ForeachTests::test_type_promotion_foreach_map_copy, test/inductor/test_foreach.py::ForeachTests::test_type_promotion_foreach_map_div, test/inductor/test_foreach.py::ForeachTests::test_type_promotion_foreach_map_maximum, test/inductor/test_foreach.py::ForeachTests::test_type_promotion_foreach_map_minimum, test/inductor/test_foreach.py::ForeachTests::test_type_promotion_foreach_map_mul, test/inductor/test_foreach.py::ForeachTests::test_type_promotion_foreach_map_sub, test/inductor/test_foreach.py::ForeachTests::test_zero_elems 2025-12-04T10:51:50.6783555Z 2025-12-04T10:51:50.6783860Z Finished inductor/test_foreach 1/1 ... [2025-12-04 10:51:50.637175][4970293.133359457], took 4.35min 2025-12-04T10:51:50.6784689Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:51:50.6785434Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:51:50.6785904Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T10:51:50.6786292Z Uploading artifacts took 0.00 seconds 2025-12-04T10:51:50.6786696Z Running dynamo/test_buffers_override 1/1 ... [2025-12-04 10:51:50.654910][4970293.151096738] 2025-12-04T10:51:50.6787088Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:51:50.6787917Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_buffers_override.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:51:50.655393] 2025-12-04T10:51:58.0339369Z 2025-12-04T10:51:58.0340006Z dynamo/test_buffers_override 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_buffers_override_1.1_300d4db83503cb5e_.log 2025-12-04T10:51:58.0340573Z Running 2 items in this shard: test/dynamo/test_buffers_override.py::TestBuffersOverride::test_buffers_override, test/dynamo/test_buffers_override.py::TestBuffersOverride::test_named_buffers_override 2025-12-04T10:51:58.0341077Z 2025-12-04T10:51:58.0341217Z Finished dynamo/test_buffers_override 1/1 ... [2025-12-04 10:51:58.033545][4970300.529730155], took 0.12min 2025-12-04T10:51:58.0358860Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:51:58.0512990Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:51:58.0517690Z Running inductor/test_inplacing_pass 1/1 ... [2025-12-04 10:51:58.051488][4970300.547674473] 2025-12-04T10:51:58.0518490Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:51:58.0522751Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_inplacing_pass.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:51:58.051977] 2025-12-04T10:52:10.1880028Z 2025-12-04T10:52:10.1881474Z inductor/test_inplacing_pass 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_inplacing_pass_1.1_a535776e9eb731a6_.log 2025-12-04T10:52:10.1894455Z Running 22 items in this shard: test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_counters_functionalize_old, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_counters_functionalize_v2, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_dont_modify_input, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_dont_modify_live, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_dont_modify_view_of_live, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_generalized_scatter, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_lists_functionalize_v2, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_lists_old_functionalize, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_multi_output_intermediate, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_multiple_intermediate, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_multiple_mutations, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_partitioner_recomputes_factory_empty_like_sin_op, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_partitioner_recomputes_factory_empty_like_sin_triton, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_partitioner_recomputes_factory_ones_like_sin_op, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_partitioner_recomputes_factory_ones_like_sin_triton, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_should_modify_inner, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_should_modify_input, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_view_inplaced2_functionalize_v2, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_view_inplaced_functionalize_v2, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_views_not_inplaced2_functionalize_v2, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_views_not_inplaced3_functionalize_v2, test/inductor/test_inplacing_pass.py::TestReinplacingPassCorrectness::test_views_not_inplaced_functionalize_v2 2025-12-04T10:52:10.1906039Z 2025-12-04T10:52:10.1906477Z Finished inductor/test_inplacing_pass 1/1 ... [2025-12-04 10:52:10.187667][4970312.683851395], took 0.20min 2025-12-04T10:52:10.1907813Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:52:10.2053293Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:52:10.2058048Z Running inductor/test_aot_inductor_custom_ops 1/1 ... [2025-12-04 10:52:10.205618][4970312.701804791] 2025-12-04T10:52:10.2058708Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:52:10.2063088Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_aot_inductor_custom_ops.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:52:10.206099] 2025-12-04T10:53:51.7528205Z 2025-12-04T10:53:51.7529221Z inductor/test_aot_inductor_custom_ops 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_aot_inductor_custom_ops_1.1_0f36fec512ec4fef_.log 2025-12-04T10:53:51.7549864Z Running 35 items in this shard: test/inductor/test_aot_inductor_custom_ops.py::AOTInductorLoggingTest::test_shape_env_reuse, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_boxed_run_inputs_clearing_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_custom_op_add_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_custom_op_add_output_path_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_custom_op_all_inputs_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_custom_op_missing_arg_with_default_value_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_custom_op_out_variant_without_return_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_custom_op_return_list_of_single_tensor_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_custom_op_return_single_tensor_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_custom_op_square_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_custom_op_with_concat_inputs_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_custom_op_with_multiple_outputs_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_custom_op_with_reinterpret_view_inputs_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_fn_with_int_output_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_fn_with_optional_tensor_nullopt_output_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_fn_with_optional_tensor_output_2_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_fn_with_optional_tensor_output_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleCpu::test_incorrect_custom_op_schema_cpu, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_boxed_run_inputs_clearing_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_custom_op_add_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_custom_op_add_output_path_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_custom_op_all_inputs_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_custom_op_missing_arg_with_default_value_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_custom_op_out_variant_without_return_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_custom_op_return_list_of_single_tensor_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_custom_op_return_single_tensor_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_custom_op_square_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_custom_op_with_concat_inputs_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_custom_op_with_multiple_outputs_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_custom_op_with_reinterpret_view_inputs_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_fn_with_int_output_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_fn_with_optional_tensor_nullopt_output_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_fn_with_optional_tensor_output_2_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_fn_with_optional_tensor_output_cuda, test/inductor/test_aot_inductor_custom_ops.py::AOTInductorTestABICompatibleGpu::test_incorrect_custom_op_schema_cuda 2025-12-04T10:53:51.7569157Z 2025-12-04T10:53:51.7569599Z Finished inductor/test_aot_inductor_custom_ops 1/1 ... [2025-12-04 10:53:51.752598][4970414.248782744], took 1.69min 2025-12-04T10:53:51.7570935Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:53:51.7701406Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:53:51.7708262Z Running inductor/test_split_cat_fx_passes 1/1 ... [2025-12-04 10:53:51.770486][4970414.266672979] 2025-12-04T10:53:51.7708930Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:53:51.7711899Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_split_cat_fx_passes.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:53:51.770956] 2025-12-04T10:54:30.6472067Z 2025-12-04T10:54:30.6473315Z inductor/test_split_cat_fx_passes 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_split_cat_fx_passes_1.1_b75d3faf0dab0778_.log 2025-12-04T10:54:30.6479724Z Running 11 items in this shard: test/inductor/test_split_cat_fx_passes.py::TestSplitCatFxPasses::test_cat_normalization, test/inductor/test_split_cat_fx_passes.py::TestSplitCatFxPasses::test_config_flag_is_respected, test/inductor/test_split_cat_fx_passes.py::TestSplitCatFxPasses::test_consecutive_split_merge, test/inductor/test_split_cat_fx_passes.py::TestSplitCatFxPasses::test_numpy_compat_normalization, test/inductor/test_split_cat_fx_passes.py::TestSplitCatFxPasses::test_split_cat_merge, test/inductor/test_split_cat_fx_passes.py::TestSplitCatFxPasses::test_split_cat_merge_mutation, test/inductor/test_split_cat_fx_passes.py::TestSplitCatFxPasses::test_split_cat_new_patterns, test/inductor/test_split_cat_fx_passes.py::TestSplitCatFxPasses::test_split_normalization, test/inductor/test_split_cat_fx_passes.py::TestSplitCatFxPasses::test_split_squeeze, test/inductor/test_split_cat_fx_passes.py::TestSplitCatFxPasses::test_stack_normalization_axis_kwarg, test/inductor/test_split_cat_fx_passes.py::TestSplitCatFxPasses::test_unbind_stack 2025-12-04T10:54:30.6484571Z 2025-12-04T10:54:30.6484987Z Finished inductor/test_split_cat_fx_passes 1/1 ... [2025-12-04 10:54:30.646835][4970453.143017988], took 0.65min 2025-12-04T10:54:30.6491209Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:54:30.6645468Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:54:30.6650378Z Running export/test_cpp_serdes 1/1 ... [2025-12-04 10:54:30.664789][4970453.160975031] 2025-12-04T10:54:30.6651012Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:54:30.6655112Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_cpp_serdes.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:54:30.665263] 2025-12-04T10:55:23.1601617Z 2025-12-04T10:55:23.1602509Z export/test_cpp_serdes 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_cpp_serdes_1.1_f9c1634fdcdede66_.log 2025-12-04T10:55:23.1685185Z Running 431 items in this shard: test/export/test_cpp_serdes.py::CppSerdesTestExport::test__scaled_dot_product_flash_attention_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_additional_inputs_constants_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_allow_explicit_guards_as_runtime_asserts_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_annotate_on_assert_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_args_type_checked_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_aten_lift_fresh_copy_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_attention_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_attr_assignment_extra_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_automatic_constrain_size_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_automatic_dynamic_shapes_constant_relation_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_automatic_dynamic_shapes_linear_relation_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_automatic_dynamic_shapes_simple_equality_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_baddbmm_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_basic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_basic_non_strict_fake_tensor_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_basic_non_strict_real_tensor_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_bincount_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_buffer_util_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_capture_subclass_constructor_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_capture_subclass_constructor_torch_ir_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_capture_subclass_wrong_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_ccode_python_mod_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_cdist_forward_compute_mode_zero_export_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_check_specialized_int_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_checks_to_constrain_range_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_cleanup_dynamic_markers_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_colin_unbacked_backed_vr_sub_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_colon_parameter_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_compiling_state_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_cond_access_identical_symint_closure_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_cond_branches_return_constant_int_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_cond_branches_return_same_int_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_cond_buffers_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_cond_contains_unbacked_no_escape_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_cond_int_closure_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_cond_unflatten_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_cond_with_module_stack_export_with_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_cond_with_module_stack_export_with_unflatten_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constant_aliasing_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constant_input_naming_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constant_no_user_inp_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constant_output_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constant_output_dup_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constant_requires_grad_const_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constant_return_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constant_tensor_mutation_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constant_tensor_with_non_functional_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constant_tensor_with_non_functional_nested_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constrain_decomp_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constrain_size_in_eager_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constrain_size_with_constrain_value_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_constrain_size_with_various_cases_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_conv_dynamic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_crop_like_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_cse_for_symint_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_custom_op_auto_functionalize_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_custom_op_auto_functionalize_pre_dispatch_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_custom_op_auto_warn_pre_dispatch_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_custom_op_preserve_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_custom_pytree_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_custom_tag_metadata_re_export_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_decomp_batch_norm_functional_predispatch_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_decomp_item_in_prim_after_decomposition_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_decomp_item_in_prim_before_decomposition_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_default_decomposition_core_cia_ops_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_derived_dim_1_2_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_derived_dim_basic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_derived_dim_integer_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_derived_dim_nested_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_derived_dim_out_of_order_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_derived_dim_out_of_order_repeat_derived_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_derived_dim_out_of_order_simplified_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_derived_dim_out_of_order_simplified_repeat_non_derived_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_derived_dim_repeat_derived_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_detect_leak_nonstrict_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_detect_leak_nonstrict_with_stacktrace_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_detect_leak_strict_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_device_to_dynamic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_device_to_gpu_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_device_to_mutation_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_device_to_mutation_float_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_device_to_static_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dim_1_2_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dim_auto_and_dim_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dim_dynamic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dim_dynamic_divisibility_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dim_dynamic_specialization_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dim_hint_range_violations_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dim_hint_ranges_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_disable_forced_specializations_errors_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_disable_forced_specializations_ok_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_distributed_all_gather_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_distributed_all_gather_into_tensor_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_distributed_all_reduce_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_distributed_all_to_all_single_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_distributed_reduce_scatter_tensor_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dont_duck_size_for_auto_dynamic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_double_lifted_constants_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_draft_export_checks_aliasing_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_draft_export_checks_mutation_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_draft_export_checks_mutation_list_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_draft_export_checks_mutation_with_nan_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_draft_export_fake_kernel_inference_errors_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_draft_export_infers_fake_kernel_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_duplicate_modules_with_non_persistent_buffers_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dynamic_lr_shift_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dynamic_shapes_bounds_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dynamic_shapes_builder_basic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dynamic_shapes_builder_kwargs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dynamic_shapes_builder_pytree_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dynamic_shapes_dataclass_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dynamic_shapes_inferred_basic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dynamic_shapes_serdes_generic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dynamic_shapes_serdes_user_errors_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dynamic_shapes_serdes_various_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dynamic_shapes_spec_with_pytree_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dynamic_shapes_wrapped_with_shape_guards_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_dynamic_sym_round_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_ends_of_bounds_oblivious_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_enum_str_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_error_does_not_reference_eager_fallback_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_error_when_passing_mutating_primitive_op_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_exception_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_expand_copy_export_handles_implicit_true_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_api_with_dynamic_shapes_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_as_backend_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_associative_scan_lifted_buffers_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_associative_scan_symbol_dim_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_associative_scan_symbol_scandim_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_aten_to_unflatten_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_aten_to_unflatten_subclass_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_aten_to_unflatten_subclass_pre_dispatch_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_cond_preserve_torch_fn_for_subgraphs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_cond_symbool_pred_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_cond_warns_constant_pred_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_custom_decomp_table_basic_pop_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_custom_decomp_table_container_methods_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_custom_op_lib_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_custom_triton_kernel_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_custom_triton_kernel_mutable_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_cyclic_reference_leak_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_decomp_torture_case_1_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_decomp_torture_case_2_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_decomps_dynamic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_decomps_simple_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_dynamo_config_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_for_training_run_decomp_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_for_training_with_container_type_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_for_training_with_dynamic_shapes_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_for_training_with_mutation_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_for_training_with_state_dict_hooks_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_func_with_default_kwargs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_func_with_keyword_only_args_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_func_with_kwargs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_func_with_pytree_kwargs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_func_with_var_keyword_args_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_func_with_var_keyword_pytree_args_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_func_with_var_postional_args_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_function_schema_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_graph_with_no_inputs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_input_mutation_bug_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_input_mutation_dynamic_shape_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_input_mutation_static_shape_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_leak_compile_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_linear_preserve_dynamic_shape_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_max_nonstrict_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_max_onnx_reported_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_method_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_mod_constraints_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_module_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_preserve_linear_at_aot_level_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_preserve_linear_but_not_custom_op_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_rnn_variants_with_warning_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_scan_pytree_output_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_script_module_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_statically_known_true_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_then_compile_tensor_ctor_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_with_autocast_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_with_fake_tensor_inputs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_with_fake_tensor_inputs_on_cuda_devices_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_with_inline_constraints_complex_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_with_inline_constraints_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_with_set_grad_enabled_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_export_with_wrong_inputs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_external_call_non_strict_real_tensor_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_fake_inputs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_fake_weights_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_filter_traceback_frames_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_flex_attention_export_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_float_conversion_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_float_conversion_from_int_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_fqn_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_from_node_metadata_export_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_full_on_scalar_tensor_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_function_holding_tensor_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_hints_wrapper_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_hoo_inline_users_issue_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_if_functional_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_if_post_autograd_op_preserved_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_inductor_backend_inside_nonstrict_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_inline_script_class_method_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_inline_script_class_method_recursive_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_inline_script_function_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_inline_script_method_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_int_shape_specialization_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_intermediate_shape_comp_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_invalid_pytree_dynamo_graph_capture_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_is_exporting_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_is_nonzero_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_isnonzero_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_issue_113041_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_issue_157289_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_issue_161902_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_istft_op_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_keep_composite_ops_invalid_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_keep_composite_ops_linear_convd_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_keep_composite_ops_linear_convd_for_training_ir_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_kwarg_dynamic_shapes_diff_order_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_kwargs_reorder_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_layer_norm_unbacked_normalized_shape_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_layer_sharing_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_lazy_module_kwargs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_lifted_constants_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_linear_conv_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_malformed_fqn_from_source_name_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_map_buffers_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_map_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_mask_nonzero_static_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_masked_select_dynamic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_math_pow_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_mismatched_dynamic_shapes_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_mixed_input_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_module_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_module_dict_key_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_module_input_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_module_input_subclasses_parameterization_nested_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_module_list_slice_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_module_with_dict_container_inp_out_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_modules_access_for_deleted_submodule_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_more_multidimensional_slicing_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_multidimensional_slicing_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_multinomial_dynamic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_multiple_definitions_same_name_dim_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_namedtuple_input_export_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_native_multi_attention_head_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_nested_dynamic_shapes_spec_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_nested_module_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_nested_module_fake_tensor_leak_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_nested_module_with_constant_buffer_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_nested_module_with_init_buffer_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_nested_module_with_parameter_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_nn_module_stack_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_nn_module_stack_shared_submodule_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_no_check_is_size_error_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_no_suggested_fixes_for_data_dependent_errors_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_no_tensor_computation_2_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_no_tensor_computation_3_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_no_tensor_computation_4_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_no_tensor_computation_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_non_arg_name_dynamic_shapes_api_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_non_arg_name_dynamic_shapes_api_with_container_type_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_non_arg_name_dynamic_shapes_api_with_kwarg_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_non_persistent_buffer_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_non_strict_dynamic_shapes_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_non_strict_dynamic_shapes_suggested_fixes_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_none_buffers_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_nonstrict_retrace_preserves_metadata_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_nonzero_2_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_nonzero_dynamic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_not_registered_parameter_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_operator_aten_tensor_mode_variant_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_output_node_name_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_pad_sequence_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_param_util_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_partial_patched_forward_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_placeholder_naming_collisions_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_placeholder_naming_collisions_hoo_subgraphs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_placeholder_naming_order_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_placeholder_naming_order_variadic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_placeholder_update_preserving_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_predispatch_cond_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_predispatch_grad_wrappers_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_preserve_annotation_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_preserve_module_call_signature_unflatten_specialization_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_preserve_requires_grad_placeholders_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_preserve_shape_dynamism_for_unused_inputs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_profiling_code_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_python_asserts_with_sym_int_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_pytree_register_data_class_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_pytree_register_nested_data_class_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_raise_user_error_when_guard_on_data_dependent_operation_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_range_constraints_with_replacement_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_real_tensor_alias_dtype_mismatch_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_real_tensor_bool_cast_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_real_tensor_errors_on_aliasing_custom_op_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_real_tensor_for_max_op_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_real_tensor_size_mismatch_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_redundant_assert_max_upper_bound_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_redundant_asserts_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_refine_dynamic_shapes_from_suggested_fixes_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_register_constant_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_repeat_interleave_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_replace_unbacked_with_very_large_upperbound_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_replaced_unbacked_bindings_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_reshape_view_helper_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_retracable_ep_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_retrace_pre_autograd_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_run_decomposition_supports_user_input_mutation_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_run_decompositions_keep_metadata_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_run_decompositions_keep_tensor_constant_metadata_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_runtime_assert_for_prim_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_runtime_assert_for_prm_str_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_runtime_assert_with_size_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_sdpa_gqa_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_sequential_slicing_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_set_example_inputs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_set_grad_as_side_effect_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_set_grad_empty_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_set_grad_unflatten_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_setgrad_lifted_tensor_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_shared_submodule_nn_module_stack_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_simple_export_for_training_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_simple_unbacked_view_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_size_input_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_slice_nn_module_stack_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_solver_unsupported_sympy_function_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_specialize_derived_dim_roots_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_split_const_gm_with_lifted_constants_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_stack_trace_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_stack_trace_make_fx_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_state_primitives_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_state_shape_attribute_assignment_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_state_tensors_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_static_dim_constraints_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_subclass_context_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_subclass_nested_attr_access_complicated_metadata_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_subclass_nested_attr_access_const_metadata_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_subclass_nested_attr_access_const_metadata_not_top_level_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_subclass_nested_attr_access_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_subclass_nested_attr_access_submodule_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_subclasses_parameterization_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_subclasses_parameterization_nested_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_suggest_torch_checks_with_non_negative_check_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_suggest_torch_checks_with_regular_check_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_suggested_fixes_for_data_dependent_errors_basic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_suggested_fixes_for_data_dependent_errors_puzzlers_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_suggested_fixes_new_roots_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_sym_float_operators_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_sym_or_sym_and_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_sym_sqrt_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_symbool_item_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_symfloat_item_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_symint_input_additional_inputs_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_symint_input_basic_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_symint_input_ranges_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_symint_input_shapes_collection_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_symint_input_specialization_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_symint_item_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_symint_output_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_symint_tensor_return_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_tag_ac_export_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_tensor_attribute_zero_args_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_tensor_constant_aten_to_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_tensor_constant_with_wrapped_method_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_to_module_with_mutated_buffer_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_to_module_with_mutated_buffer_multiple_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_to_module_with_mutated_buffer_multiple_update_sub_later_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_tolist_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_torch_check_eq_commutativity_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_torch_fn_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_trace_under_fake_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_train_eval_on_exported_preautograd_module_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_tril_dynamic_diagonal_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_triu_dynamic_diagonal_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_3d_matmul_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_bincount_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_bindings_for_divisible_u_symint_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_deferred_runtime_retrace_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_expand_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_infer_size_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_kth_value_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_linear_layer_norm_input_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_noncontig_lin_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_pad_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_scalar_constructor_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_slice_forward_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_slice_simple_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_stack_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_to_cond_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_to_cond_passthrough_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unbacked_unsqueeze_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_asserts_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_buffer_update_child2parent_swap_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_closure_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_isinstance_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_multiple_graphs_dispatch_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_multiple_graphs_preserve_signature_no_error_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_multiple_graphs_shared_submodule_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_multiple_graphs_state_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_no_unroll_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_placeholder_update_child2parent_swap_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_placeholder_update_grandchild2cousin_swap_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_5_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_6_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_buf_8_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_const_preserving_3_1_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_const_preserving_3_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_mutating_buf_4_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_mutating_buf_6_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_mutating_buf_9_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_mutating_buf_preserving_10_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_mutating_buf_preserving_4_1_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_mutating_buf_preserving_4_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_mutating_buf_preserving_5_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_mutating_buf_preserving_7_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unflatten_random_dag_preserving_4_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unused_aliases_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_unused_constant_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_uplift_common_custom_meta_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_uplift_common_custom_meta_with_multiple_calls_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_use_embedding_twice_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_user_input_and_buffer_mutation_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_vmap_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_vmap_custom_autograd_function_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_vmap_to_assert_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_where_decomp_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_while_loop_assert_separation_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_while_loop_index_assertions_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_while_loop_simple_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_while_loop_tensor_constant_idx_cpp_serdes, test/export/test_cpp_serdes.py::CppSerdesTestExport::test_wrapper_module_cpp_serdes 2025-12-04T10:55:23.1739845Z 2025-12-04T10:55:23.1739973Z Finished export/test_cpp_serdes 1/1 ... [2025-12-04 10:55:23.160725][4970505.656910043], took 0.87min 2025-12-04T10:55:23.1740403Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:55:23.1780572Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:55:23.1787350Z Running dynamo/test_pgo 1/1 ... [2025-12-04 10:55:23.178489][4970505.674675368] 2025-12-04T10:55:23.1787971Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:55:23.1791876Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_pgo.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:55:23.178966] 2025-12-04T10:55:26.8006923Z 2025-12-04T10:55:26.8008439Z dynamo/test_pgo 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_pgo_1.1_631f6eb7f1076edd_.log 2025-12-04T10:55:26.8012847Z Running 11 items in this shard: test/dynamo/test_pgo.py::PgoTest::test_basic, test/dynamo/test_pgo.py::PgoTest::test_different_file_paths_local_pgo, test/dynamo/test_pgo.py::PgoTest::test_distinct_compile_id, test/dynamo/test_pgo.py::PgoTest::test_njt, test/dynamo/test_pgo.py::PgoTest::test_no_empty_graph_allowlist, test/dynamo/test_pgo.py::PgoTest::test_pgo_dynamic_false, test/dynamo/test_pgo.py::PgoTest::test_pgo_dynamic_params, test/dynamo/test_pgo.py::PgoTest::test_remote_basic, test/dynamo/test_pgo.py::PgoTest::test_sticky_pgo_read_write, test/dynamo/test_pgo.py::PgoTest::test_whitelist_ints_floats, test/dynamo/test_pgo.py::PgoTest::test_whitelist_suggestion 2025-12-04T10:55:26.8015830Z 2025-12-04T10:55:26.8016207Z Finished dynamo/test_pgo 1/1 ... [2025-12-04 10:55:26.800356][4970509.296539975], took 0.06min 2025-12-04T10:55:26.8028646Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:55:26.8179764Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:55:26.8185363Z Running dynamo/test_config 1/1 ... [2025-12-04 10:55:26.818289][4970509.314475878] 2025-12-04T10:55:26.8185983Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:55:26.8190120Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_config.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:55:26.818785] 2025-12-04T10:55:29.3384735Z 2025-12-04T10:55:29.3385886Z dynamo/test_config 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_config_1.1_4178b2cc5da494ce_.log 2025-12-04T10:55:29.3389134Z Running 5 items in this shard: test/dynamo/test_config.py::ConfigTests::test_automatic_dynamic, test/dynamo/test_config.py::ConfigTests::test_config_compile_ignored, test/dynamo/test_config.py::ConfigTests::test_config_hash, test/dynamo/test_config.py::ConfigTests::test_no_assume_static_by_default, test/dynamo/test_config.py::ConfigTests::test_no_automatic_dynamic 2025-12-04T10:55:29.3391026Z 2025-12-04T10:55:29.3391397Z Finished dynamo/test_config 1/1 ... [2025-12-04 10:55:29.338250][4970511.834434521], took 0.04min 2025-12-04T10:55:29.3406326Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:55:29.3559813Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:55:29.3565879Z Running dynamo/test_metrics_context 1/1 ... [2025-12-04 10:55:29.356422][4970511.85260905] 2025-12-04T10:55:29.3566508Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:55:29.3571410Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_metrics_context.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:55:29.356918] 2025-12-04T10:55:31.6263173Z 2025-12-04T10:55:31.6264384Z dynamo/test_metrics_context 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_metrics_context_1.1_48dd8921b8097b23_.log 2025-12-04T10:55:31.6269176Z Running 9 items in this shard: test/dynamo/test_metrics_context.py::TestMetricsContext::test_add_to_set, test/dynamo/test_metrics_context.py::TestMetricsContext::test_context_exists, test/dynamo/test_metrics_context.py::TestMetricsContext::test_nested_context, test/dynamo/test_metrics_context.py::TestMetricsContext::test_set, test/dynamo/test_metrics_context.py::TestMetricsContext::test_set_disallow_overwrite, test/dynamo/test_metrics_context.py::TestMetricsContext::test_set_key_value, test/dynamo/test_metrics_context.py::TestMetricsContext::test_top_n, test/dynamo/test_metrics_context.py::TestMetricsContext::test_update_allow_overwrite, test/dynamo/test_metrics_context.py::TestMetricsContext::test_update_disallow_overwrite 2025-12-04T10:55:31.6272487Z 2025-12-04T10:55:31.6272893Z Finished dynamo/test_metrics_context 1/1 ... [2025-12-04 10:55:31.625978][4970514.122165098], took 0.04min 2025-12-04T10:55:31.6284010Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:55:31.6436552Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:55:31.6443174Z Running export/test_package 1/1 ... [2025-12-04 10:55:31.644045][4970514.140232229] 2025-12-04T10:55:31.6443814Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:55:31.6447635Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_package.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:55:31.644540] 2025-12-04T10:55:34.0647406Z 2025-12-04T10:55:34.0648747Z export/test_package 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_package_1.1_1c6c15ead3bedf43_.log 2025-12-04T10:55:34.0650703Z Running 4 items in this shard: test/export/test_package.py::TestPackage::test_basic, test/export/test_package.py::TestPackage::test_error, test/export/test_package.py::TestPackage::test_more_than_once, test/export/test_package.py::TestPackage::test_overloads 2025-12-04T10:55:34.0651966Z 2025-12-04T10:55:34.0652343Z Finished export/test_package 1/1 ... [2025-12-04 10:55:34.064441][4970516.560625565], took 0.04min 2025-12-04T10:55:34.0667851Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:55:34.0818911Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:55:34.0825380Z Running export/test_export_opinfo 1/1 ... [2025-12-04 10:55:34.082329][4970516.578516338] 2025-12-04T10:55:34.0825998Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:55:34.0830285Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'export/test_export_opinfo.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:55:34.082823] 2025-12-04T10:55:37.1536022Z 2025-12-04T10:55:37.1537647Z export/test_export_opinfo 1/1 was successful, full logs can be found in artifacts with path test/test-reports/export.test_export_opinfo_1.1_70959d166c9cb1d5_.log 2025-12-04T10:55:37.1543774Z Running 9 items in this shard: test/export/test_export_opinfo.py::TestExportOnFakeCudaCUDA::test_fake_export___getitem___cuda_float32, test/export/test_export_opinfo.py::TestExportOnFakeCudaCUDA::test_fake_export_nn_functional_batch_norm_cuda_float32, test/export/test_export_opinfo.py::TestExportOnFakeCudaCUDA::test_fake_export_nn_functional_batch_norm_without_cudnn_cuda_float32, test/export/test_export_opinfo.py::TestExportOnFakeCudaCUDA::test_fake_export_nn_functional_conv2d_cuda_float32, test/export/test_export_opinfo.py::TestExportOnFakeCudaCUDA::test_fake_export_nn_functional_instance_norm_cuda_float32, test/export/test_export_opinfo.py::TestExportOnFakeCudaCUDA::test_fake_export_nn_functional_multi_margin_loss_cuda_float32, test/export/test_export_opinfo.py::TestExportOnFakeCudaCUDA::test_fake_export_nn_functional_scaled_dot_product_attention_cuda_float32, test/export/test_export_opinfo.py::TestExportOnFakeCudaCUDA::test_fake_export_nonzero_cuda_float32, test/export/test_export_opinfo.py::TestExportOnFakeCudaCUDA::test_preserve_original_behavior_cuda 2025-12-04T10:55:37.1549353Z 2025-12-04T10:55:37.1549756Z Finished export/test_export_opinfo 1/1 ... [2025-12-04 10:55:37.153403][4970519.649588042], took 0.05min 2025-12-04T10:55:37.1558793Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:55:37.1714540Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:55:37.1720847Z Running dynamo/test_nops 1/1 ... [2025-12-04 10:55:37.171796][4970519.667982848] 2025-12-04T10:55:37.1721457Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:55:37.1725307Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_nops.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:55:37.172299] 2025-12-04T10:55:39.5919793Z 2025-12-04T10:55:39.5921224Z dynamo/test_nops 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_nops_1.1_b8c30fa3a4bc2f35_.log 2025-12-04T10:55:39.5923146Z Running 4 items in this shard: test/dynamo/test_nops.py::NopTests::test1, test/dynamo/test_nops.py::NopTests::test2, test/dynamo/test_nops.py::NopTests::test3, test/dynamo/test_nops.py::NopTests::test_extended_args 2025-12-04T10:55:39.5924146Z 2025-12-04T10:55:39.5924531Z Finished dynamo/test_nops 1/1 ... [2025-12-04 10:55:39.591698][4970522.087883679], took 0.04min 2025-12-04T10:55:39.5941538Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:55:39.6097146Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:55:39.6102757Z Running inductor/test_graph_transform_observer 1/1 ... [2025-12-04 10:55:39.610037][4970522.106224235] 2025-12-04T10:55:39.6103440Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:55:39.6107650Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_graph_transform_observer.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:55:39.610535] 2025-12-04T10:55:44.9358347Z 2025-12-04T10:55:44.9359692Z inductor/test_graph_transform_observer 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_graph_transform_observer_1.1_1f69168b17fcb3bf_.log 2025-12-04T10:55:44.9361241Z Running 1 items in this shard: test/inductor/test_graph_transform_observer.py::TestGraphTransformObserver::test_sdpa_rewriter 2025-12-04T10:55:44.9361892Z 2025-12-04T10:55:44.9362381Z Finished inductor/test_graph_transform_observer 1/1 ... [2025-12-04 10:55:44.935427][4970527.431611355], took 0.09min 2025-12-04T10:55:44.9380142Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:55:44.9533000Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:55:44.9539910Z Running inductor/test_memory 1/1 ... [2025-12-04 10:55:44.953744][4970527.449930551] 2025-12-04T10:55:44.9544238Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:55:44.9545511Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_memory.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:55:44.954251] 2025-12-04T10:55:59.5485490Z 2025-12-04T10:55:59.5486699Z inductor/test_memory 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_memory_1.1_55671c0f04f25728_.log 2025-12-04T10:55:59.5492436Z Running 8 items in this shard: test/inductor/test_memory.py::TestOperatorReorderForPeakMemory::test_fusing_reductions_increase_peak_memory, test/inductor/test_memory.py::TestOperatorReorderForPeakMemory::test_fusion_acc_large_reads, test/inductor/test_memory.py::TestOperatorReorderForPeakMemory::test_multiple_mutations_of_buf, test/inductor/test_memory.py::TestOperatorReorderForPeakMemory::test_mutation_size_propagation, test/inductor/test_memory.py::TestOperatorReorderForPeakMemory::test_reorder_peak_memory, test/inductor/test_memory.py::TestOperatorReorderForPeakMemory::test_reorder_peak_memory_bfs, test/inductor/test_memory.py::TestOperatorReorderForPeakMemory::test_reorder_peak_memory_dfs, test/inductor/test_memory.py::TestOperatorReorderForPeakMemory::test_reorder_peak_memory_lpmf 2025-12-04T10:55:59.5496247Z 2025-12-04T10:55:59.5496625Z Finished inductor/test_memory 1/1 ... [2025-12-04 10:55:59.548103][4970542.044287892], took 0.24min 2025-12-04T10:55:59.5507270Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T10:55:59.5662016Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T10:55:59.5668303Z Running inductor/test_control_flow 1/4 ... [2025-12-04 10:55:59.566553][4970542.062735886] 2025-12-04T10:55:59.5668964Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T10:55:59.5675590Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_control_flow.py', '--shard-id=1', '--num-shards=4', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 10:55:59.567113] 2025-12-04T11:03:42.7608701Z 2025-12-04T11:03:42.7734018Z inductor/test_control_flow 1/4 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_control_flow_1.4_703f295a3ca0af9f_.log 2025-12-04T11:03:42.7839890Z Running 190 items in this shard: test/inductor/test_control_flow.py::CondTests::test_cond_mismatched_branch_output_size_device_cpu_dynamic_True, test/inductor/test_control_flow.py::CondTests::test_cond_mismatched_branch_output_size_device_cuda_dynamic_True, test/inductor/test_control_flow.py::CondTests::test_cond_multiple_outputs_device_cuda_dynamic_False, test/inductor/test_control_flow.py::CondTests::test_cond_multiple_outputs_device_cuda_dynamic_True, test/inductor/test_control_flow.py::CondTests::test_cond_nested_control_flow_device_cuda_dynamic_False, test/inductor/test_control_flow.py::CondTests::test_cond_non_tensor_predicates_device_cuda_dynamic_False, test/inductor/test_control_flow.py::CondTests::test_cond_non_tensor_predicates_device_cuda_dynamic_True, test/inductor/test_control_flow.py::CondTests::test_cond_outer_code_before_after_device_cuda_dynamic_True, test/inductor/test_control_flow.py::CondTests::test_cond_reintepret_view_inputs_outputs, test/inductor/test_control_flow.py::CondTests::test_cond_select_with_input_idx_device_cuda_dynamic_False, test/inductor/test_control_flow.py::CondTests::test_cond_simple_control_flow_device_cpu_dynamic_True, test/inductor/test_control_flow.py::CondTests::test_cond_simple_with_int_closure_device_cpu, test/inductor/test_control_flow.py::CondTests::test_cond_subgraphs_with_parameters_device_cpu_dynamic_True, test/inductor/test_control_flow.py::CondTests::test_cond_unbacked_symint_closure_device_cpu_dynamic_False, test/inductor/test_control_flow.py::CondTests::test_cond_unbacked_symint_closure_device_cuda_dynamic_False, test/inductor/test_control_flow.py::CondTests::test_cond_unbacked_symint_inner_device_cuda, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_infinite_loop_error, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_simple_control_flow_device_cpu_dynamic_False_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_simple_control_flow_device_cpu_dynamic_True_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_simple_control_flow_device_cuda_dynamic_False_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_stack_output_simple_device_cuda_dynamic_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_conv_device_cpu_dynamic_True_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_conv_device_cpu_dynamic_True_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_conv_device_cuda_dynamic_False_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_conv_device_cuda_dynamic_True_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_data_dependent_in_out_device_cpu_dynamic_False_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_data_dependent_in_out_device_cpu_dynamic_True_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_data_dependent_in_out_device_cuda_dynamic_True_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_data_dependent_ops_device_cpu_dynamic_False_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_outer_buffers_device_cuda_dynamic_False_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_outer_code_device_cuda_dynamic_True_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_parameters_device_cpu_dynamic_True_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_parameters_device_cuda_dynamic_False_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_parameters_device_cuda_dynamic_True_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_sym_expr_cond_device_cpu_dynamic_False_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_sym_expr_cond_device_cpu_dynamic_False_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_sym_expr_cond_device_cpu_dynamic_True_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_sym_expr_cond_device_cpu_dynamic_True_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_sym_expr_cond_device_cuda_dynamic_False_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_sym_expr_cond_device_cuda_dynamic_True_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_unbacked_symint_closure_device_cpu_dynamic_False_autograd_False, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_unbacked_symint_closure_device_cpu_dynamic_False_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_with_unbacked_symint_closure_device_cpu_dynamic_True_autograd_True, test/inductor/test_control_flow.py::WhileLoopTests::test_while_loop_zero_loop_device_cpu_dynamic_True, test/inductor/test_control_flow.py::AssociativeScanTests::test_associative_scan_CUDA_flip_combine_mode_generic_backend_inductor_cpu, test/inductor/test_control_flow.py::AssociativeScanTests::test_associative_scan_CUDA_flip_combine_mode_pointwise_backend_inductor_cpu, test/inductor/test_control_flow.py::AssociativeScanTests::test_associative_scan_CUDA_flip_combine_mode_pointwise_backend_inductor_device_cuda, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_False_reverse_False_dim_1_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_False_reverse_True_dim_0_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_False_reverse_True_dim_1_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_True_reverse_False_dim_0_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_True_reverse_False_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cpu_dynamic_True_reverse_True_dim_1_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_False_reverse_False_dim_0_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_False_reverse_False_dim_0_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_False_reverse_False_dim_1_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_False_reverse_False_dim_3_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_False_reverse_True_dim_1_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_False_reverse_True_dim_1_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_False_reverse_True_dim_3_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_False_dim_0_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_False_dim_1_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_True_dim_0_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_True_dim_1_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_True_dim_3_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_True_dim_3_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_cond_in_scan_device_cuda_dynamic_True_reverse_True_dim_3_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_chunked_ce_device_cpu_dynamic_False_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_compare_chunked_ce_with_no_scan_device_cpu_dynamic_True, test/inductor/test_control_flow.py::ScanTests::test_scan_compare_chunked_ce_with_no_scan_device_cuda_dynamic_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_False_reverse_False_dim_0_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_False_reverse_False_dim_3_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_False_reverse_False_dim_3_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_False_reverse_True_dim_0_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_False_reverse_True_dim_1_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_True_reverse_False_dim_1_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_True_reverse_False_dim_1_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_True_reverse_False_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cpu_dynamic_True_reverse_True_dim_1_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_False_reverse_False_dim_0_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_False_reverse_False_dim_0_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_False_reverse_False_dim_3_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_False_reverse_True_dim_0_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_False_reverse_True_dim_1_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_False_reverse_True_dim_1_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_True_reverse_False_dim_0_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_True_reverse_False_dim_1_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_True_reverse_False_dim_1_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_True_reverse_True_dim_1_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_True_reverse_True_dim_1_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_conv_device_cuda_dynamic_True_reverse_True_dim_3_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_False_dim_0_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_False_dim_0_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_False_dim_1_pred_False_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_False_dim_1_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_False_dim_3_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_True_dim_0_pred_False_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_True_dim_0_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_True_dim_0_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_True_dim_1_pred_False_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_True_dim_3_pred_False_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_False_reverse_True_dim_3_pred_False_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_False_dim_0_pred_False_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_False_dim_0_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_False_dim_1_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_False_dim_1_pred_True_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_False_dim_3_pred_False_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_True_dim_0_pred_False_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_True_dim_0_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_True_dim_1_pred_False_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_True_dim_1_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_True_dim_1_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_True_dim_1_pred_True_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_True_dim_1_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_True_dim_3_pred_False_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_True_dim_3_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cpu_dynamic_True_reverse_True_dim_3_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_0_pred_False_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_0_pred_True_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_1_pred_False_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_1_pred_False_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_3_pred_False_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_False_dim_3_pred_True_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_True_dim_0_pred_False_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_True_dim_1_pred_False_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_True_dim_1_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_True_dim_1_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_False_reverse_True_dim_3_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_0_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_0_pred_True_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_1_pred_False_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_1_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_3_pred_False_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_3_pred_False_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_False_dim_3_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_0_pred_True_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_0_pred_True_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_0_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_1_pred_False_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_1_pred_True_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_1_pred_True_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_3_pred_False_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_in_cond_device_cuda_dynamic_True_reverse_True_dim_3_pred_False_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_False_reverse_False_dim_1_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_False_reverse_True_dim_1_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_False_reverse_True_dim_3_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_False_reverse_True_dim_3_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_False_reverse_True_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_True_reverse_False_dim_1_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_True_reverse_False_dim_1_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_True_reverse_True_dim_0_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_True_reverse_True_dim_0_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cpu_dynamic_True_reverse_True_dim_3_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_False_reverse_False_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_False_reverse_True_dim_0_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_False_reverse_True_dim_3_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_False_reverse_True_dim_3_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_False_dim_0_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_False_dim_1_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_False_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_True_dim_0_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_True_dim_0_scan_length_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_True_dim_0_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_True_dim_1_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_True_dim_3_scan_length_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_True_dim_3_scan_length_5_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_nn_modules_device_cuda_dynamic_True_reverse_True_dim_3_scan_length_5_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cpu_dynamic_False_reverse_False_dim_0_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cpu_dynamic_False_reverse_False_dim_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cpu_dynamic_False_reverse_True_dim_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cpu_dynamic_False_reverse_True_dim_2_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cpu_dynamic_True_reverse_False_dim_0_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cpu_dynamic_True_reverse_False_dim_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cpu_dynamic_True_reverse_False_dim_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_False_reverse_False_dim_0_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_False_reverse_False_dim_1_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_False_reverse_False_dim_2_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_False_reverse_True_dim_2_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_True_reverse_False_dim_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_True_reverse_True_dim_0_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_True_reverse_True_dim_1_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_pytree_in_out_device_cuda_dynamic_True_reverse_True_dim_2_autograd_True, test/inductor/test_control_flow.py::ScanTests::test_scan_with_clamp_device_cpu_dynamic_False_autograd_False, test/inductor/test_control_flow.py::ScanTests::test_scan_with_clamp_device_cpu_dynamic_True_autograd_False, test/inductor/test_control_flow.py::MapTests::test_map_nested_with_cond_device_cpu_dynamic_True_autograd_False, test/inductor/test_control_flow.py::MapTests::test_map_nested_with_cond_device_cuda_dynamic_False_autograd_False, test/inductor/test_control_flow.py::MapTests::test_map_nested_with_cond_device_cuda_dynamic_True_autograd_True, test/inductor/test_control_flow.py::MapTests::test_map_simple_device_cpu_dynamic_True_autograd_False, test/inductor/test_control_flow.py::MapTests::test_map_simple_device_cuda_dynamic_True_autograd_False, test/inductor/test_control_flow.py::MapTests::test_map_simple_device_cuda_dynamic_True_autograd_True 2025-12-04T11:03:42.8041071Z 2025-12-04T11:03:42.8047071Z Finished inductor/test_control_flow 1/4 ... [2025-12-04 11:03:42.804430][4971005.300606271], took 7.72min 2025-12-04T11:03:42.8064247Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:03:42.8144814Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:03:42.8146175Z Running inductor/test_pallas 1/1 ... [2025-12-04 11:03:42.814411][4971005.310598018] 2025-12-04T11:03:42.8149470Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:03:42.8150790Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_pallas.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:03:42.814675] 2025-12-04T11:03:48.4274682Z 2025-12-04T11:03:48.4275861Z inductor/test_pallas 1/1 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_pallas_1.1_7085f88dee8b2634_.log 2025-12-04T11:03:48.4276657Z 2025-12-04T11:03:48.4277088Z Finished inductor/test_pallas 1/1 ... [2025-12-04 11:03:48.426971][4971010.923156479], took 0.09min 2025-12-04T11:03:48.4288507Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:03:48.4343182Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:03:48.4344904Z Running dynamo/test_callback 1/1 ... [2025-12-04 11:03:48.434273][4971010.93046305] 2025-12-04T11:03:48.4345567Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:03:48.4346997Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_callback.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:03:48.434503] 2025-12-04T11:03:54.3091838Z 2025-12-04T11:03:54.3093020Z dynamo/test_callback 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_callback_1.1_75b139f05e44cf1d_.log 2025-12-04T11:03:54.3095245Z Running 4 items in this shard: test/dynamo/test_callback.py::CallbackTests::test_callbacks_with_duplicate_prevention, test/dynamo/test_callback.py::CallbackTests::test_counter, test/dynamo/test_callback.py::CallbackTests::test_counter_assertion, test/dynamo/test_callback.py::CallbackTests::test_triggers 2025-12-04T11:03:54.3096703Z 2025-12-04T11:03:54.3097103Z Finished dynamo/test_callback 1/1 ... [2025-12-04 11:03:54.308718][4971016.804905022], took 0.10min 2025-12-04T11:03:54.3104687Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:03:54.3260001Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:03:54.3265120Z Running dynamo/test_cudagraphs_expandable_segments 1/1 ... [2025-12-04 11:03:54.326208][4971016.822393926] 2025-12-04T11:03:54.3266411Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:03:54.3269245Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_cudagraphs_expandable_segments.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:03:54.326700] 2025-12-04T11:03:58.1497434Z 2025-12-04T11:03:58.1498877Z dynamo/test_cudagraphs_expandable_segments 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_cudagraphs_expandable_segments_1.1_4fde8c73ca50bd60_.log 2025-12-04T11:03:58.1503179Z Running 8 items in this shard: test/dynamo/test_cudagraphs_expandable_segments.py::TestAotCudagraphs::test_basic, test/dynamo/test_cudagraphs_expandable_segments.py::TestAotCudagraphs::test_dead_fill, test/dynamo/test_cudagraphs_expandable_segments.py::TestAotCudagraphs::test_dtoh, test/dynamo/test_cudagraphs_expandable_segments.py::TestAotCudagraphs::test_factory, test/dynamo/test_cudagraphs_expandable_segments.py::TestAotCudagraphs::test_htod, test/dynamo/test_cudagraphs_expandable_segments.py::TestAotCudagraphs::test_mutate_constant, test/dynamo/test_cudagraphs_expandable_segments.py::TestAotCudagraphs::test_mutate_input, test/dynamo/test_cudagraphs_expandable_segments.py::TestAotCudagraphs::test_mutated_metadata 2025-12-04T11:03:58.1507131Z 2025-12-04T11:03:58.1507615Z Finished dynamo/test_cudagraphs_expandable_segments 1/1 ... [2025-12-04 11:03:58.149344][4971020.645528925], took 0.06min 2025-12-04T11:03:58.1519950Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:03:58.1675168Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:03:58.1679447Z Running dynamo/test_aot_autograd_cache 1/1 ... [2025-12-04 11:03:58.167629][4971020.663815837] 2025-12-04T11:03:58.1680110Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:03:58.1683952Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'dynamo/test_aot_autograd_cache.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:03:58.168118] 2025-12-04T11:05:59.9847556Z 2025-12-04T11:05:59.9848893Z dynamo/test_aot_autograd_cache 1/1 was successful, full logs can be found in artifacts with path test/test-reports/dynamo.test_aot_autograd_cache_1.1_ac867890a230dbe7_.log 2025-12-04T11:05:59.9902413Z Running 108 items in this shard: test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_aot_runtime_trace_joint, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_autograd_function, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_autograd_guard_single_entry_device_cuda_bfloat16, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_autograd_guard_single_entry_device_cuda_float16, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_autograd_inductor_guards_device_cuda_bfloat16_requires_grad_False, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_autograd_inductor_guards_device_cuda_bfloat16_requires_grad_True, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_autograd_inductor_guards_device_cuda_float16_requires_grad_False, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_autograd_inductor_guards_device_cuda_float16_requires_grad_True, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_autograd_lazy_backward, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_autograd_no_dynamo_trace_backward, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_basic, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_cache_hot_load_device_cpu_bfloat16_dynamic_False, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_cache_hot_load_device_cpu_bfloat16_dynamic_True, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_cache_hot_load_device_cpu_float32_dynamic_False, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_cache_hot_load_device_cpu_float32_dynamic_True, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_cache_hot_load_device_cuda_bfloat16_dynamic_False, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_cache_hot_load_device_cuda_bfloat16_dynamic_True, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_cache_hot_load_device_cuda_float32_dynamic_False, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_cache_hot_load_device_cuda_float32_dynamic_True, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_cache_lazy_backward_for_compiled_autograd, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_clear_fx_graph_cache, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_compiled_autograd_bypass, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_constant_tensor_device_guards, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_custom_autograd_function, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_custom_autograd_function_miss, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_custom_autograd_function_with_custom_triton_kernel, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_custom_autograd_function_with_custom_triton_kernel_cache_invalidation, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_dynamic_shapes_different_sizes, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_fx_graph_cache_off, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_inference_graph_cache_hit_with_compiled_autograd_enabled, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_invoke_subgraph, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_multi_graph_specialization, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_multiple_compile_triton_kernels, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_nn_module_with_params_global_constant, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_non_bundled_to_bundled_config_change, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_regional_inductor_basic, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_regional_inductor_cache_miss_on_change, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_regional_inductor_with_backward, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_saved_tensors_hooks_autograd_cache, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_saved_tensors_hooks_autograd_cache_symbolic, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_symbol_specialization, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_triton_op_cache_invalidation, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_triton_op_cache_multiple_ops_invalidation, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_unsafe_mark_cacheable_fn_select_allow_in_graph, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_unsafe_mark_cacheable_fn_select_tag_activation_checkpoint, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_view_replay, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheTests::test_vmap, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_aot_runtime_trace_joint, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_autograd_function, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_autograd_guard_single_entry_device_cuda_bfloat16, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_autograd_guard_single_entry_device_cuda_float16, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_autograd_inductor_guards_device_cuda_bfloat16_requires_grad_False, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_autograd_inductor_guards_device_cuda_bfloat16_requires_grad_True, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_autograd_inductor_guards_device_cuda_float16_requires_grad_False, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_autograd_inductor_guards_device_cuda_float16_requires_grad_True, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_autograd_lazy_backward, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_autograd_no_dynamo_trace_backward, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_basic, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_cache_hot_load_device_cpu_bfloat16_dynamic_False, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_cache_hot_load_device_cpu_bfloat16_dynamic_True, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_cache_hot_load_device_cpu_float32_dynamic_False, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_cache_hot_load_device_cpu_float32_dynamic_True, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_cache_hot_load_device_cuda_bfloat16_dynamic_False, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_cache_hot_load_device_cuda_bfloat16_dynamic_True, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_cache_hot_load_device_cuda_float32_dynamic_False, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_cache_hot_load_device_cuda_float32_dynamic_True, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_cache_lazy_backward_for_compiled_autograd, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_clear_fx_graph_cache, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_compiled_autograd_bypass, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_constant_tensor_device_guards, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_custom_autograd_function, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_custom_autograd_function_miss, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_custom_autograd_function_with_custom_triton_kernel, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_custom_autograd_function_with_custom_triton_kernel_cache_invalidation, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_dynamic_shapes_different_sizes, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_fx_graph_cache_off, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_inference_graph_cache_hit_with_compiled_autograd_enabled, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_invoke_subgraph, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_multi_graph_specialization, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_multiple_compile_triton_kernels, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_nn_module_with_params_global_constant, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_non_bundled_to_bundled_config_change, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_regional_inductor_basic, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_regional_inductor_cache_miss_on_change, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_regional_inductor_with_backward, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_saved_tensors_hooks_autograd_cache, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_saved_tensors_hooks_autograd_cache_symbolic, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_symbol_specialization, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_triton_op_cache_invalidation, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_triton_op_cache_multiple_ops_invalidation, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_unsafe_mark_cacheable_fn_select_allow_in_graph, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_unsafe_mark_cacheable_fn_select_tag_activation_checkpoint, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_view_replay, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCacheBundledTests::test_vmap, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_basic_hash_key, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_different_configs, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_different_global_configs, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_different_graphs, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_different_inputs, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_freezing, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_identical_graphs_and_configs, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_incompatible_function, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_nn_module_with_params, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_normal_torch_function, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_private_builtin, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_private_namespace, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_safe_torchfunction, test/dynamo/test_aot_autograd_cache.py::AOTAutogradCachePicklerTests::test_sanitize_gm_for_cache 2025-12-04T11:05:59.9955417Z 2025-12-04T11:05:59.9955841Z Finished dynamo/test_aot_autograd_cache 1/1 ... [2025-12-04 11:05:59.984450][4971142.480634555], took 2.03min 2025-12-04T11:05:59.9957239Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:06:00.0024050Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:06:00.0030454Z Running test_utils_config_module 1/1 ... [2025-12-04 11:06:00.002830][4971142.499016514] 2025-12-04T11:06:00.0031062Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:06:00.0038050Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_utils_config_module.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:06:00.003314] 2025-12-04T11:06:02.1724821Z 2025-12-04T11:06:02.1726623Z test_utils_config_module 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_utils_config_module_1.1_d7941d9c5692e802_.log 2025-12-04T11:06:02.1734979Z Running 22 items in this shard: test/test_utils_config_module.py::TestConfigModule::test_alias, test/test_utils_config_module.py::TestConfigModule::test_bad_jk_type, test/test_utils_config_module.py::TestConfigModule::test_base_value_loading, test/test_utils_config_module.py::TestConfigModule::test_codegen_config, test/test_utils_config_module.py::TestConfigModule::test_codegen_config_function, test/test_utils_config_module.py::TestConfigModule::test_dict_copy_semantics, test/test_utils_config_module.py::TestConfigModule::test_env_name_semantics, test/test_utils_config_module.py::TestConfigModule::test_env_name_string_semantics, test/test_utils_config_module.py::TestConfigModule::test_get_hash, test/test_utils_config_module.py::TestConfigModule::test_invalid_config_float, test/test_utils_config_module.py::TestConfigModule::test_invalid_config_int, test/test_utils_config_module.py::TestConfigModule::test_make_closur_patcher, test/test_utils_config_module.py::TestConfigModule::test_multi_env, test/test_utils_config_module.py::TestConfigModule::test_none_override_semantics, test/test_utils_config_module.py::TestConfigModule::test_overrides, test/test_utils_config_module.py::TestConfigModule::test_patch, test/test_utils_config_module.py::TestConfigModule::test_reference_is_default, test/test_utils_config_module.py::TestConfigModule::test_reference_semantics, test/test_utils_config_module.py::TestConfigModule::test_save_config, test/test_utils_config_module.py::TestConfigModule::test_save_config_portable, test/test_utils_config_module.py::TestConfigModule::test_type_loading, test/test_utils_config_module.py::TestConfigModule::test_unittest_patch 2025-12-04T11:06:02.1743180Z 2025-12-04T11:06:02.1743566Z Finished test_utils_config_module 1/1 ... [2025-12-04 11:06:02.172072][4971144.668255831], took 0.04min 2025-12-04T11:06:02.1748406Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:06:02.1905589Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:06:02.1910113Z Running test_hop_infra 1/1 ... [2025-12-04 11:06:02.190679][4971144.686866336] 2025-12-04T11:06:02.1910733Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:06:02.1914258Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_hop_infra.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:06:02.191184] 2025-12-04T11:06:04.5106577Z 2025-12-04T11:06:04.5107975Z test_hop_infra 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_hop_infra_1.1_4f0965ea1b1910b0_.log 2025-12-04T11:06:04.5109820Z Running 3 items in this shard: test/test_hop_infra.py::TestHOPInfra::test_all_hops_are_imported, test/test_hop_infra.py::TestHOPInfra::test_all_hops_have_opinfo, test/test_hop_infra.py::TestHOPInfra::test_imports_from_all_work 2025-12-04T11:06:04.5111056Z 2025-12-04T11:06:04.5111881Z Finished test_hop_infra 1/1 ... [2025-12-04 11:06:04.510349][4971147.006533208], took 0.04min 2025-12-04T11:06:04.5131912Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:06:04.5283661Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:06:04.5290609Z Running test_transformers 1/1 ... [2025-12-04 11:06:04.528829][4971147.024999736] 2025-12-04T11:06:04.5291196Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:06:04.5296034Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_transformers.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:06:04.529379] 2025-12-04T11:09:22.1075403Z 2025-12-04T11:09:22.1076830Z test_transformers 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_transformers_1.1_280ccc2174af8888_.log 2025-12-04T11:09:22.4978582Z Running 12279 items in this shard: test/test_transformers.py::TestTransformersCUDA::test_bias_is_none_cuda, test/test_transformers.py::TestTransformersCUDA::test_decoder_only_layer_cuda, test/test_transformers.py::TestTransformersCUDA::test_decoder_padding_and_src_mask_bool_cuda, test/test_transformers.py::TestTransformersCUDA::test_disable_fastpath_cuda, test/test_transformers.py::TestTransformersCUDA::test_encoder_is_causal_cuda, test/test_transformers.py::TestTransformersCUDA::test_encoder_padding_and_src_mask_bool_cuda, test/test_transformers.py::TestTransformersCUDA::test_is_causal_gpu_cuda, test/test_transformers.py::TestTransformersCUDA::test_kpm_mask_trailing_column_with_nested_tensor_cuda, test/test_transformers.py::TestTransformersCUDA::test_mask_check_fastpath_cuda, test/test_transformers.py::TestTransformersCUDA::test_math_backend_high_precision_cuda, test/test_transformers.py::TestTransformersCUDA::test_mha_native_args_nb_heads_1_bias_False_cuda, test/test_transformers.py::TestTransformersCUDA::test_mha_native_args_nb_heads_1_bias_True_cuda, test/test_transformers.py::TestTransformersCUDA::test_mha_native_args_nb_heads_8_bias_False_cuda, test/test_transformers.py::TestTransformersCUDA::test_mha_native_args_nb_heads_8_bias_True_cuda, test/test_transformers.py::TestTransformersCUDA::test_multiheadattention_fastpath_attn_mask_attn_mask_dim2_key_padding_mask_dim1_bool_cuda, test/test_transformers.py::TestTransformersCUDA::test_multiheadattention_fastpath_attn_mask_attn_mask_dim2_key_padding_mask_dim1_float32_cuda, test/test_transformers.py::TestTransformersCUDA::test_multiheadattention_fastpath_attn_mask_attn_mask_dim2_key_padding_mask_dim_2_bool_cuda, test/test_transformers.py::TestTransformersCUDA::test_multiheadattention_fastpath_attn_mask_attn_mask_dim2_key_padding_mask_dim_2_float32_cuda, test/test_transformers.py::TestTransformersCUDA::test_multiheadattention_fastpath_attn_mask_attn_mask_dim_2_key_padding_mask_dim1_bool_cuda, test/test_transformers.py::TestTransformersCUDA::test_multiheadattention_fastpath_attn_mask_attn_mask_dim_2_key_padding_mask_dim1_float32_cuda, test/test_transformers.py::TestTransformersCUDA::test_multiheadattention_fastpath_attn_mask_attn_mask_dim_2_key_padding_mask_dim_2_bool_cuda, test/test_transformers.py::TestTransformersCUDA::test_multiheadattention_fastpath_attn_mask_attn_mask_dim_2_key_padding_mask_dim_2_float32_cuda, test/test_transformers.py::TestTransformersCUDA::test_multiheadattention_fastpath_attn_mask_attn_mask_dim_3_key_padding_mask_dim1_bool_cuda, test/test_transformers.py::TestTransformersCUDA::test_multiheadattention_fastpath_attn_mask_attn_mask_dim_3_key_padding_mask_dim1_float32_cuda, test/test_transformers.py::TestTransformersCUDA::test_multiheadattention_fastpath_attn_mask_attn_mask_dim_3_key_padding_mask_dim_2_bool_cuda, test/test_transformers.py::TestTransformersCUDA::test_multiheadattention_fastpath_attn_mask_attn_mask_dim_3_key_padding_mask_dim_2_float32_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_2D_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_2D_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_2D_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_2D_causal_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_2D_causal_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_2D_causal_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_3D_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_3D_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_3D_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_3D_causal_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_3D_causal_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_3D_causal_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_no_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_no_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_3D_input_dim_no_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_2D_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_2D_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_2D_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_2D_causal_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_2D_causal_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_2D_causal_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_4D_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_4D_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_4D_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_4D_causal_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_4D_causal_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_4D_causal_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_no_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_no_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_0_4D_input_dim_no_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_2D_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_2D_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_2D_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_2D_causal_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_2D_causal_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_2D_causal_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_3D_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_3D_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_3D_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_3D_causal_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_3D_causal_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_3D_causal_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_no_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_no_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_3D_input_dim_no_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_2D_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_2D_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_2D_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_2D_causal_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_2D_causal_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_2D_causal_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_4D_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_4D_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_4D_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_4D_causal_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_4D_causal_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_4D_causal_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_no_attn_mask_dropout_p_0_0_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_no_attn_mask_dropout_p_0_2_cuda, test/test_transformers.py::TestTransformersCUDA::test_scaled_dot_product_attention_batch_size_5_4D_input_dim_no_attn_mask_dropout_p_0_5_cuda, test/test_transformers.py::TestTransformersCUDA::test_script_encoder_subclass_cuda, test/test_transformers.py::TestTransformersCUDA::test_script_mha_in_proj_weight_none_cuda, test/test_transformers.py::TestTransformersCUDA::test_self_attn_TxT_attn_mask_cuda, test/test_transformers.py::TestTransformersCUDA::test_train_with_is_causal_cuda, test/test_transformers.py::TestTransformersCUDA::test_train_with_pad_and_catch_error_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformer_bias_is_none_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_batch_first_False_training_False_enable_nested_tensor_False_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_batch_first_False_training_False_enable_nested_tensor_True_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_batch_first_False_training_True_enable_nested_tensor_False_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_batch_first_False_training_True_enable_nested_tensor_True_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_batch_first_True_training_False_enable_nested_tensor_False_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_batch_first_True_training_False_enable_nested_tensor_True_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_batch_first_True_training_True_enable_nested_tensor_False_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_batch_first_True_training_True_enable_nested_tensor_True_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_fastpath_use_torchscript_False_enable_nested_tensor_False_use_autocast_False_d_model_12_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_fastpath_use_torchscript_False_enable_nested_tensor_False_use_autocast_False_d_model_256_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_fastpath_use_torchscript_False_enable_nested_tensor_False_use_autocast_True_d_model_12_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_fastpath_use_torchscript_False_enable_nested_tensor_False_use_autocast_True_d_model_256_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_fastpath_use_torchscript_False_enable_nested_tensor_True_use_autocast_False_d_model_12_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_fastpath_use_torchscript_False_enable_nested_tensor_True_use_autocast_False_d_model_256_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_fastpath_use_torchscript_False_enable_nested_tensor_True_use_autocast_True_d_model_12_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_fastpath_use_torchscript_False_enable_nested_tensor_True_use_autocast_True_d_model_256_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_square_input_with_no_grad_False_training_False_enable_nested_tensor_False_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_square_input_with_no_grad_False_training_True_enable_nested_tensor_False_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_square_input_with_no_grad_True_training_False_enable_nested_tensor_False_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoder_square_input_with_no_grad_True_training_True_enable_nested_tensor_False_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoderlayer_no_fastpath_with_hooks_nhead_3_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoderlayer_no_fastpath_with_hooks_nhead_4_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoderlayer_src_mask_nhead_1_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoderlayer_src_mask_nhead_4_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoderlayer_src_mask_nhead_8_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoderlayer_subclass_cuda, test/test_transformers.py::TestTransformersCUDA::test_transformerencoderlayer_subclass_model_cuda, test/test_transformers.py::TestTransformersCUDA::test_with_nested_tensor_input_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_dispatch_fails_no_backend_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_flash_atteention_large_bf16_nan_values_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_flash_attention_fail_with_non_square_causal_attention_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_flash_autocast_fp32_bfloat16_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_flash_autocast_fp32_float16_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_flash_backward_failure_sm86plus_head_dim_193_dropout_p_0_0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_flash_backward_failure_sm86plus_head_dim_193_dropout_p_0_2_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_flash_backward_failure_sm86plus_head_dim_256_dropout_p_0_0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_flash_backward_failure_sm86plus_head_dim_256_dropout_p_0_2_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_flash_fail_fp32_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_fused_kernels_nested_broadcasting_error_cases_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_fused_kernels_nested_broadcasting_requires_grad_failure_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_fused_kernels_seq_len_0_inputs_fused_kernel0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_fused_kernels_seq_len_0_inputs_fused_kernel1_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_fused_inputs_attn_mask_present_kernel0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_fused_inputs_broadcast_kernel0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_fused_inputs_broadcast_kernel1_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_fused_inputs_dim_3_kernel0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_fused_inputs_dim_3_kernel1_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_fused_inputs_head_dim_kernel0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_fused_inputs_head_dim_kernel1_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_fused_inputs_invalid_dtype_kernel0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_fused_inputs_invalid_dtype_kernel1_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_inputs_1_dimensional_inputs_kernel0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_inputs_1_dimensional_inputs_kernel1_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_inputs_1_dimensional_inputs_kernel2_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_inputs_different_datatypes_kernel0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_inputs_different_datatypes_kernel1_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_inputs_different_datatypes_kernel2_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_inputs_different_devices_kernel0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_inputs_different_devices_kernel1_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_inputs_different_devices_kernel2_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_last_dim_stride_kernel0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_last_dim_stride_kernel1_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_sdpa_kernel_grouped_query_attention_cuda_fused_kernel0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_sequence_lengths_kernel0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_invalid_sequence_lengths_kernel1_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_mask_invalid_last_dim_stride_kernel0_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_mask_invalid_last_dim_stride_kernel1_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_mem_eff_attention_fail_with_batch_size_geq_65536_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_mem_eff_attention_fail_with_batch_size_geq_65536_error_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_mem_eff_attention_large_seq_len_uniform_attention_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_mem_efficient_fail_bfloat16_less_than_sm80_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_nested_fails_on_padding_head_dim_cuda, test/test_transformers.py::TestSDPAFailureModesCUDA::test_unaligned_tensors_cuda, test/test_transformers.py::TestSDPACUDA::test_scaled_dot_product_attention_fp16_overflow_cuda, test/test_transformers.py::TestSDPACUDA::test_scaled_dot_product_attention_math_with_negative_scale_kernel0_cuda, test/test_transformers.py::TestSDPACUDA::test_sdp_math_gradcheck_contiguous_inputs_False_cuda, test/test_transformers.py::TestSDPACUDA::test_sdp_math_gradcheck_contiguous_inputs_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_cudnn_attention_broken_166211_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_cudnn_attention_compiles_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_cudnn_attention_d256_heuristic_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_cudnn_attention_different_dk_dv_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_cudnn_attention_fail_d128_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_cudnn_attention_gqa_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_cudnn_attention_nonmodulo64seqlen_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_cudnn_attention_preserves_query_layout_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_cudnn_attention_seqlen1_dropout_heuristic_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_cudnn_attention_trivial_output_transpose_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_1_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_143_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_127_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_4_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_203_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_256_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_False_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale0_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_False_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_bfloat16_scale_l1_enable_gqa_True_n_heads1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale0_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_False_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_batch_size_8_seq_len_q_4_seq_len_k_579_head_dim_8_is_causal_True_dropout_p_0_48_float16_scale_l1_enable_gqa_True_n_heads1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_32_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_256_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_256_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_64_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_0_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale0_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_False_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_flash_attention_vs_math_ref_grads_nestedtensor_batch_size_8_max_seq_len_q_32_max_seq_len_kv_32_head_dim_8_dropout_p_0_1_float16_scale_l1_is_causal_True_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_different_dk_dv_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_1_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_1024_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_1024_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_32_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_False_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_0_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale0_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_attention_vs_math_ref_grads_cudagraph_batch_size_8_seq_len_q_256_seq_len_k_256_head_dim_64_is_causal_True_dropout_p_0_22_float16_scale_l1_fused_kernel1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_backwards_throws_determinism_warning_fused_kernel0_warn_only_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_backwards_throws_determinism_warning_fused_kernel0_warn_only_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_backwards_throws_determinism_warning_fused_kernel1_warn_only_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_backwards_throws_determinism_warning_fused_kernel1_warn_only_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel0_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_False_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_False_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_False_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_False_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_False_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_kernel1_expand_q_batch_True_expand_k_batch_True_expand_v_batch_True_expand_q_num_heads_True_expand_k_num_heads_True_expand_v_num_heads_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_nested_broadcasting_query_dense_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_seq_len_1_inputs_fused_kernel0_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_kernels_seq_len_1_inputs_fused_kernel1_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_sdp_choice_type_dense_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_sdp_choice_type_nested_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_sdp_priority_order_use_compile_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_fused_sdp_priority_order_use_compile_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_eff_attention_long_sequence_mask_float16_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_eff_attention_long_sequence_mask_float32_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_eff_attention_non_contig_mask_bug_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_eff_attention_non_contiguous_mask_float16_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_eff_attention_non_contiguous_mask_float32_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_eff_backwards_determinism_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_312_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_408_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_attn_mask_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_mask_variants_mask_dim_1_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_mask_variants_mask_dim_2_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_mask_variants_mask_dim_3_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_mask_variants_mask_dim_4_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_1_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_1024_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_103_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_2048_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_1024_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_103_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_2048_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_128_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_16_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_8_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_False_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_0_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale0_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_bfloat16_scale_l1_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale0_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float16_scale_l1_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale0_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_mem_efficient_attention_vs_math_ref_grads_batch_size_8_seq_len_q_8_seq_len_k_8_head_dim_96_is_causal_True_dropout_p_0_22_float32_scale_l1_cuda_float32, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_scaled_dot_product_attention_cudnn_nested_type_nested_is_contiguous_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_scaled_dot_product_attention_cudnn_nested_type_nested_is_contiguous_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_scaled_dot_product_attention_fused_kernels_packed_accuracy_type_dense_fused_kernel0_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_scaled_dot_product_attention_fused_kernels_packed_accuracy_type_dense_fused_kernel1_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_scaled_dot_product_attention_fused_kernels_packed_accuracy_type_nested_fused_kernel0_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_scaled_dot_product_attention_fused_kernels_packed_accuracy_type_nested_fused_kernel1_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_scaled_dot_product_attention_fused_kernels_packed_type_dense_is_contiguous_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_scaled_dot_product_attention_fused_kernels_packed_type_dense_is_contiguous_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_scaled_dot_product_attention_fused_kernels_packed_type_nested_is_contiguous_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_scaled_dot_product_attention_fused_kernels_packed_type_nested_is_contiguous_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_choice_with_determinism_warn_only_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_choice_with_determinism_warn_only_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_flash_attention_grad_against_math_contiguous_inputs_False_is_causal_False_bfloat16_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_flash_attention_grad_against_math_contiguous_inputs_False_is_causal_False_float16_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_flash_attention_grad_against_math_contiguous_inputs_False_is_causal_True_bfloat16_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_flash_attention_grad_against_math_contiguous_inputs_False_is_causal_True_float16_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_flash_attention_grad_against_math_contiguous_inputs_True_is_causal_False_bfloat16_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_flash_attention_grad_against_math_contiguous_inputs_True_is_causal_False_float16_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_flash_attention_grad_against_math_contiguous_inputs_True_is_causal_True_bfloat16_cuda_bfloat16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_flash_attention_grad_against_math_contiguous_inputs_True_is_causal_True_float16_cuda_float16, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_mem_efficient_grad_against_math_contiguous_inputs_False_is_causal_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_mem_efficient_grad_against_math_contiguous_inputs_False_is_causal_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_mem_efficient_grad_against_math_contiguous_inputs_True_is_causal_False_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_sdp_mem_efficient_grad_against_math_contiguous_inputs_True_is_causal_True_cuda, test/test_transformers.py::TestSDPACudaOnlyCUDA::test_singelton_head_dim_stride_ne_1_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_causal_variant_CausalVariant_LOWER_RIGHT_shape0_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_causal_variant_CausalVariant_LOWER_RIGHT_shape1_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_causal_variant_CausalVariant_LOWER_RIGHT_shape2_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_causal_variant_CausalVariant_LOWER_RIGHT_shape3_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_causal_variant_CausalVariant_UPPER_LEFT_shape0_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_causal_variant_CausalVariant_UPPER_LEFT_shape1_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_causal_variant_CausalVariant_UPPER_LEFT_shape2_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_causal_variant_CausalVariant_UPPER_LEFT_shape3_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_compile_causal_variant_CausalVariant_LOWER_RIGHT_shape0_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_compile_causal_variant_CausalVariant_LOWER_RIGHT_shape1_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_compile_causal_variant_CausalVariant_LOWER_RIGHT_shape2_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_compile_causal_variant_CausalVariant_LOWER_RIGHT_shape3_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_compile_causal_variant_CausalVariant_UPPER_LEFT_shape0_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_compile_causal_variant_CausalVariant_UPPER_LEFT_shape1_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_compile_causal_variant_CausalVariant_UPPER_LEFT_shape2_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_causal_variants_compile_causal_variant_CausalVariant_UPPER_LEFT_shape3_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_is_causal_and_mask_fails_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_is_causal_equals_upper_left_shape0_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_is_causal_equals_upper_left_shape1_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_is_causal_equals_upper_left_shape2_cuda, test/test_transformers.py::TestAttnBiasCUDA::test_is_causal_equals_upper_left_shape3_cuda 2025-12-04T11:09:22.8208400Z 2025-12-04T11:09:22.8208566Z Finished test_transformers 1/1 ... [2025-12-04 11:09:22.139185][4971344.635349338], took 3.29min 2025-12-04T11:09:22.8208974Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:09:22.8209342Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:09:22.8209602Z Running torch_np/numpy_tests/core/test_scalarinherit 1/1 ... [2025-12-04 11:09:22.154044][4971344.650231864] 2025-12-04T11:09:22.8209821Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:09:22.8210244Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'torch_np/numpy_tests/core/test_scalarinherit.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:09:22.154410] 2025-12-04T11:09:24.0725527Z 2025-12-04T11:09:24.0727408Z torch_np/numpy_tests/core/test_scalarinherit 1/1 was successful, full logs can be found in artifacts with path test/test-reports/torch_np.numpy_tests.core.test_scalarinherit_1.1_e07d51a32640780c_.log 2025-12-04T11:09:24.0729908Z Running 3 items in this shard: test/torch_np/numpy_tests/core/test_scalarinherit.py::TestInherit::test_gh_15395, test/torch_np/numpy_tests/core/test_scalarinherit.py::TestInherit::test_init, test/torch_np/numpy_tests/core/test_scalarinherit.py::TestInherit::test_init2 2025-12-04T11:09:24.0731196Z 2025-12-04T11:09:24.0731698Z Finished torch_np/numpy_tests/core/test_scalarinherit 1/1 ... [2025-12-04 11:09:24.072206][4971346.568391561], took 0.03min 2025-12-04T11:09:24.0751289Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:09:24.0906246Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:09:24.0912767Z Running test_tensorexpr_pybind 1/1 ... [2025-12-04 11:09:24.091019][4971346.587205432] 2025-12-04T11:09:24.0913381Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:09:24.0917367Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_tensorexpr_pybind.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:09:24.091496] 2025-12-04T11:09:26.3606969Z 2025-12-04T11:09:26.3609094Z test_tensorexpr_pybind 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_tensorexpr_pybind_1.1_75f3196cf8e42f5d_.log 2025-12-04T11:09:26.3616762Z Running 17 items in this shard: test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_alloc_in_loop, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_call_raw, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_dtype_error, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_dynamic_shape, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_dynamic_shape_2d, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_external_calls, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_kernel_shape_prop, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_kernel_shape_prop_module, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_kernel_with_custom_lowering, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_kernel_with_expand, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_kernel_with_permute, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_kernel_with_scalar_inputs, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_kernel_with_t, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_kernel_with_tensor_inputs, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_kernel_with_transpose, test/test_tensorexpr_pybind.py::TestTensorExprPyBind::test_simple_sum, test/test_tensorexpr_pybind.py::TestExprHandlePyBind::test_unary_ops 2025-12-04T11:09:26.3623234Z 2025-12-04T11:09:26.3623625Z Finished test_tensorexpr_pybind 1/1 ... [2025-12-04 11:09:26.360319][4971348.85650412], took 0.04min 2025-12-04T11:09:26.3632531Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:09:26.3786612Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:09:26.3793298Z Running test_fx_experimental 1/1 ... [2025-12-04 11:09:26.379080][4971348.875266342] 2025-12-04T11:09:26.3793958Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:09:26.3797680Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_fx_experimental.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:09:26.379559] 2025-12-04T11:11:05.9436431Z 2025-12-04T11:11:05.9439179Z test_fx_experimental 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_fx_experimental_1.1_bc394ceadaa9b939_.log 2025-12-04T11:11:05.9829787Z Running 724 items in this shard: test/test_fx_experimental.py::TestFXExperimental::test_annotate_getitem_node, test/test_fx_experimental.py::TestFXExperimental::test_annotate_returns_with_schema, test/test_fx_experimental.py::TestFXExperimental::test_aot_based_partition, test/test_fx_experimental.py::TestFXExperimental::test_call_to_assert_no_msg, test/test_fx_experimental.py::TestFXExperimental::test_call_to_assert_with_empty_msg, test/test_fx_experimental.py::TestFXExperimental::test_call_to_assert_with_msg, test/test_fx_experimental.py::TestFXExperimental::test_call_to_assert_with_multiline_message, test/test_fx_experimental.py::TestFXExperimental::test_conv_bn_fusion, test/test_fx_experimental.py::TestFXExperimental::test_conv_bn_fusion_mixed_dtype, test/test_fx_experimental.py::TestFXExperimental::test_conv_bn_fusion_not_running_state, test/test_fx_experimental.py::TestFXExperimental::test_cost_aware_partition, test/test_fx_experimental.py::TestFXExperimental::test_fetch, test/test_fx_experimental.py::TestFXExperimental::test_find_single_partition, test/test_fx_experimental.py::TestFXExperimental::test_lack_of_devices, test/test_fx_experimental.py::TestFXExperimental::test_large_node_error, test/test_fx_experimental.py::TestFXExperimental::test_merge_matmuls, test/test_fx_experimental.py::TestFXExperimental::test_meta_tracer, test/test_fx_experimental.py::TestFXExperimental::test_normalize_args, test/test_fx_experimental.py::TestFXExperimental::test_normalize_args_perserve_type, test/test_fx_experimental.py::TestFXExperimental::test_normalize_args_preserve_meta, test/test_fx_experimental.py::TestFXExperimental::test_normalize_binary_operators, test/test_fx_experimental.py::TestFXExperimental::test_normalize_modules_exhaustive, test/test_fx_experimental.py::TestFXExperimental::test_optimize_for_inference_cpu, test/test_fx_experimental.py::TestFXExperimental::test_optimize_for_inference_cpu_torchvision, test/test_fx_experimental.py::TestFXExperimental::test_partition_device_mapping, test/test_fx_experimental.py::TestFXExperimental::test_partition_latency, test/test_fx_experimental.py::TestFXExperimental::test_partition_node_manipulation, test/test_fx_experimental.py::TestFXExperimental::test_replace_target_nodes_with, test/test_fx_experimental.py::TestFXExperimental::test_saturate_host, test/test_fx_experimental.py::TestFXExperimental::test_size_based_partition, test/test_fx_experimental.py::TestFXExperimental::test_sparse_nn_partition, test/test_fx_experimental.py::TestFXExperimental::test_split_module_dead_code, test/test_fx_experimental.py::TestFXExperimental::test_split_module_default_arg, test/test_fx_experimental.py::TestFXExperimental::test_split_module_input_names, test/test_fx_experimental.py::TestFXExperimental::test_split_module_keep_original_order_and_noop_graph, test/test_fx_experimental.py::TestFXExperimental::test_split_module_kwargs_expansion, test/test_fx_experimental.py::TestFXExperimental::test_split_module_return_node, test/test_fx_experimental.py::TestFXExperimental::test_split_module_symint_dependency_handling, test/test_fx_experimental.py::TestFXExperimental::test_split_qualname_mapping, test/test_fx_experimental.py::TestFXExperimental::test_subgraph_creation, test/test_fx_experimental.py::TestFXExperimental::test_subgraph_trivial_resnet, test/test_fx_experimental.py::TestFXExperimental::test_subgraph_uniquename, test/test_fx_experimental.py::TestFXExperimental::test_to_folder, test/test_fx_experimental.py::TestFXExperimental::test_traceable_function_with_nonstandard_name, test/test_fx_experimental.py::TestFXExperimental::test_type_matches, test/test_fx_experimental.py::TestTranslationValidation::test_sat, test/test_fx_experimental.py::TestTranslationValidation::test_sat_bitwise, test/test_fx_experimental.py::TestTranslationValidation::test_sympy_to_z3, test/test_fx_experimental.py::TestTranslationValidation::test_unsat, test/test_fx_experimental.py::TestTranslationValidation::test_z3str, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_args_op_overload_cuda, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_H_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_T_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive___getitem___cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive___radd___cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive___rdiv___cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive___rmatmul___cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive___rmod___cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive___rmul___cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive___rpow___cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive___rsub___cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive__batch_norm_with_update_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive__chunk_cat_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive__native_batch_norm_legit_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive__segment_reduce_lengths_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive__segment_reduce_offsets_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive__softmax_backward_data_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive__unsafe_masked_index_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive__unsafe_masked_index_put_accumulate_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive__upsample_bilinear2d_aa_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_abs_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_acos_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_acosh_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_add_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_addbmm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_addcdiv_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_addcmul_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_addmm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_addmm_decomposed_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_addmv_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_addr_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_alias_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_all_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_allclose_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_amax_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_amin_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_aminmax_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_angle_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_any_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_arange_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_argmax_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_argmin_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_argsort_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_argwhere_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_as_strided_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_as_strided_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_as_strided_partial_views_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_as_strided_scatter_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_asin_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_asinh_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_atan2_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_atan_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_atanh_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_atleast_1d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_atleast_2d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_atleast_3d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_baddbmm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_bernoulli_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_bfloat16_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_block_diag_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_bmm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_bool_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_broadcast_shapes_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_broadcast_tensors_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_broadcast_to_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_bucketize_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_byte_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cartesian_prod_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cat_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cauchy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cdist_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cdouble_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_ceil_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cfloat_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_chalf_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_char_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cholesky_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cholesky_inverse_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cholesky_solve_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_chunk_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_clamp_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_clamp_max_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_clamp_min_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_clone_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_column_stack_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_combinations_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_complex_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_conj_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_conj_physical_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_constant_pad_nd_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_contiguous_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_copysign_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_corrcoef_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cos_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cosh_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_count_nonzero_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cov_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cross_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cummax_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cummin_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cumprod_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cumsum_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_cumulative_trapezoid_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_deg2rad_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_diag_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_diag_embed_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_diagflat_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_diagonal_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_diagonal_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_diagonal_scatter_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_diff_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_digamma_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_dist_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_div_floor_rounding_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_div_no_rounding_mode_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_div_trunc_rounding_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_dot_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_double_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_dsplit_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_dstack_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_einsum_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_empty_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_empty_like_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_empty_permuted_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_empty_strided_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_eq_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_equal_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_erf_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_erfc_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_erfinv_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_exp2_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_exp_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_expand_as_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_expand_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_expand_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_expm1_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_exponential_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_eye_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_fft2_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_fft_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_fftn_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_fftshift_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_hfft2_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_hfft_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_hfftn_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_ifft2_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_ifft_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_ifftn_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_ifftshift_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_ihfft2_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_ihfft_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_ihfftn_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_irfft2_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_irfft_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_irfftn_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_rfft2_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_rfft_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fft_rfftn_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fill_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_flatten_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_flip_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fliplr_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_flipud_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_float_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_float_power_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_floor_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_floor_divide_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fmax_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fmin_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_fmod_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_frac_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_frexp_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_full_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_full_like_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_gather_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_ge_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_geometric_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_geqrf_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_gradient_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_grid_sampler_2d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_grid_sampler_3d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_gt_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_half_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_hash_tensor_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_heaviside_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_histc_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_hsplit_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_hstack_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_hypot_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_i0_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_igamma_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_igammac_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_index_add_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_index_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_index_fill_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_index_put_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_index_reduce_amax_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_index_reduce_amin_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_index_reduce_mean_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_index_reduce_prod_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_index_select_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_inner_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_int_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_isclose_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_isfinite_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_isin_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_isinf_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_isnan_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_isneginf_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_isposinf_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_isreal_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_item_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_jiterator_2inputs_2outputs_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_jiterator_4inputs_with_extra_args_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_jiterator_binary_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_jiterator_binary_return_by_ref_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_jiterator_unary_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_kron_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_kthvalue_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_ldexp_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_le_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_lerp_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_lgamma_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_cholesky_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_cholesky_ex_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_cond_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_cross_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_det_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_diagonal_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_eig_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_eigh_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_eigvals_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_eigvalsh_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_householder_product_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_inv_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_inv_ex_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_ldl_factor_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_ldl_factor_ex_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_ldl_solve_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_lstsq_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_lstsq_grad_oriented_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_lu_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_lu_factor_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_lu_factor_ex_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_lu_solve_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_matrix_norm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_matrix_power_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_matrix_rank_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_matrix_rank_hermitian_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_multi_dot_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_norm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_norm_subgradients_at_zero_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_pinv_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_pinv_hermitian_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_pinv_singular_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_qr_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_slogdet_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_solve_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_solve_ex_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_solve_triangular_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_svd_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_svdvals_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_tensorinv_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_tensorsolve_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_vander_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_vecdot_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linalg_vector_norm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linspace_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_linspace_tensor_overload_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_log10_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_log1p_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_log2_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_log_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_log_normal_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_log_softmax_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_log_softmax_with_dtype_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_logaddexp2_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_logaddexp_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_logcumsumexp_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_logdet_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_logical_and_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_logical_not_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_logical_or_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_logical_xor_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_logit_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_logspace_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_logspace_tensor_overload_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_logsumexp_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_long_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_lt_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_lu_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_lu_solve_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_lu_unpack_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_mH_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_mT_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_amax_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_amin_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_argmax_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_argmin_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_cumprod_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_cumsum_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_fill_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_log_softmax_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_logaddexp_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_logsumexp_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_mean_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_median_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_norm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_normalize_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_prod_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_scatter_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_select_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_softmax_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_softmin_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_std_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_sum_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_masked_var_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_matmul_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_matrix_exp_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_max_binary_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_max_pool2d_with_indices_backward_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_max_reduction_no_dim_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_max_reduction_with_dim_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_maximum_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_mean_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_median_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_meshgrid_list_of_tensors_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_meshgrid_variadic_tensors_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_min_binary_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_min_reduction_no_dim_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_min_reduction_with_dim_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_minimum_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_mm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_mode_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_movedim_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_msort_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_mul_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_multinomial_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_mv_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nan_to_num_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nanmean_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nanmedian_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nanquantile_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nansum_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_narrow_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_narrow_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_native_batch_norm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_native_dropout_backward_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_native_layer_norm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_ne_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_neg_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_new_empty_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_new_empty_strided_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_new_full_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_new_ones_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_new_zeros_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nextafter_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_alpha_dropout_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_avg_pool1d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_avg_pool2d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_avg_pool3d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_batch_norm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_bilinear_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_binary_cross_entropy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_celu_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_channel_shuffle_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_conv1d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_conv2d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_conv3d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_conv_transpose1d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_conv_transpose2d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_conv_transpose3d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_cosine_embedding_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_cosine_similarity_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_cross_entropy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_ctc_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_dropout2d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_dropout3d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_dropout_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_elu_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_embedding_bag_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_embedding_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_fractional_max_pool2d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_fractional_max_pool3d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_gaussian_nll_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_gelu_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_glu_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_grid_sample_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_group_norm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_hardshrink_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_hardsigmoid_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_hardswish_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_hardtanh_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_hinge_embedding_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_huber_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_instance_norm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_interpolate_area_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_interpolate_bicubic_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_interpolate_bilinear_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_interpolate_linear_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_interpolate_nearest_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_interpolate_trilinear_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_kl_div_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_l1_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_layer_norm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_leaky_relu_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_linear_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_local_response_norm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_logsigmoid_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_margin_ranking_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_max_pool1d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_max_pool2d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_max_pool3d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_max_unpool1d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_max_unpool1d_grad_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_max_unpool2d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_max_unpool2d_grad_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_max_unpool3d_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_max_unpool3d_grad_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_mish_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_mse_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_multi_head_attention_forward_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_multi_margin_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_multilabel_margin_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_nll_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_normalize_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_pad_circular_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_pad_constant_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_pad_reflect_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_pad_replicate_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_pad_replicate_negative_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_pairwise_distance_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_pdist_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_pixel_shuffle_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_pixel_unshuffle_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_poisson_nll_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_prelu_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_relu6_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_relu_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_rms_norm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_rrelu_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_selu_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_silu_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_smooth_l1_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_soft_margin_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_softmin_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_softmin_with_dtype_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_softplus_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_softshrink_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_softsign_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_tanhshrink_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_threshold_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_triplet_margin_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_unfold_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_upsample_bilinear_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nn_functional_upsample_nearest_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nonzero_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_nonzero_static_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_norm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_norm_fro_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_norm_inf_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_norm_nuc_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_normal_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_normal_in_place_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_normal_number_mean_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_ones_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_ones_like_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_ormqr_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_outer_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_pca_lowrank_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_permute_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_permute_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_pinverse_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_polar_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_polygamma_polygamma_n_0_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_polygamma_polygamma_n_1_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_polygamma_polygamma_n_2_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_polygamma_polygamma_n_3_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_polygamma_polygamma_n_4_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_positive_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_pow_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_prod_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_put_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_qr_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_quantile_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_rad2deg_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_rand_like_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_randint_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_randint_like_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_randn_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_randn_like_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_ravel_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_real_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_reciprocal_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_remainder_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_renorm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_repeat_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_repeat_interleave_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_reshape_as_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_reshape_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_resize__cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_resize_as__cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_resolve_conj_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_resolve_neg_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_roll_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_rot90_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_round_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_round_decimals_0_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_round_decimals_3_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_round_decimals_neg_3_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_rsqrt_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_rsub_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_scalar_tensor_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_scatter_add_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_scatter_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_scatter_reduce_amax_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_scatter_reduce_amin_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_scatter_reduce_mean_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_scatter_reduce_prod_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_scatter_reduce_sum_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_searchsorted_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_select_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_select_scatter_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_sgn_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_short_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_sigmoid_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_sign_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_signal_windows_bartlett_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_signal_windows_blackman_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_signal_windows_cosine_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_signal_windows_exponential_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_signal_windows_gaussian_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_signal_windows_general_cosine_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_signal_windows_general_hamming_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_signal_windows_hamming_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_signal_windows_hann_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_signal_windows_kaiser_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_signal_windows_nuttall_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_signbit_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_sin_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_sinc_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_sinh_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_slice_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_slice_scatter_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_softmax_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_softmax_with_dtype_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_sort_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_sparse_mm_reduce_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_sparse_sampled_addmm_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_airy_ai_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_bessel_j0_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_bessel_j1_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_bessel_y0_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_bessel_y1_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_chebyshev_polynomial_t_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_chebyshev_polynomial_u_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_chebyshev_polynomial_v_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_chebyshev_polynomial_w_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_entr_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_erfcx_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_hermite_polynomial_h_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_hermite_polynomial_he_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_i0e_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_i1_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_i1e_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_laguerre_polynomial_l_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_legendre_polynomial_p_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_log_ndtr_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_modified_bessel_i0_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_modified_bessel_i1_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_modified_bessel_k0_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_modified_bessel_k1_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_ndtr_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_ndtri_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_scaled_modified_bessel_k0_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_scaled_modified_bessel_k1_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_spherical_bessel_j0_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_xlog1py_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_special_zeta_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_split_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_split_list_args_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_split_with_sizes_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_split_with_sizes_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_sqrt_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_square_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_squeeze_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_squeeze_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_squeeze_multiple_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_stack_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_std_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_std_mean_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_std_mean_unbiased_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_std_unbiased_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_stft_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_sub_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_sum_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_sum_to_size_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_svd_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_svd_lowrank_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_t_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_t_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_take_along_dim_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_take_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_tan_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_tanh_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_tensor_split_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_tensordot_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_tile_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_to_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_to_sparse_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_topk_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_trace_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_transpose_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_transpose_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_trapezoid_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_trapz_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_triangular_solve_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_tril_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_triu_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_true_divide_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_trunc_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_unbind_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_unbind_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_unflatten_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_unfold_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_unfold_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_uniform_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_unique_consecutive_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_unique_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_unsafe_chunk_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_unsafe_split_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_unsqueeze_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_unsqueeze_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_var_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_var_mean_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_var_mean_unbiased_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_var_unbiased_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_vdot_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_view_as_complex_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_view_as_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_view_copy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_view_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_vsplit_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_vstack_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_where_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_xlogy_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_zero__cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_zeros_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_operator_exhaustive_zeros_like_cuda_float32, test/test_fx_experimental.py::TestNormalizeOperatorsCUDA::test_normalize_quantized_eb_cuda 2025-12-04T11:11:06.0030478Z 2025-12-04T11:11:06.0030602Z Finished test_fx_experimental 1/1 ... [2025-12-04 11:11:05.944557][4971448.440741318], took 1.66min 2025-12-04T11:11:06.0030999Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:11:06.0031362Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:11:06.0031585Z Running test_namedtensor 1/1 ... [2025-12-04 11:11:05.962980][4971448.459167335] 2025-12-04T11:11:06.0031768Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:11:06.0032160Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_namedtensor.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:11:05.963444] 2025-12-04T11:11:09.5927723Z 2025-12-04T11:11:09.5929191Z test_namedtensor 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_namedtensor_1.1_e96031c9659e9b25_.log 2025-12-04T11:11:09.5955812Z Running 89 items in this shard: test/test_namedtensor.py::TestNamedTensor::test_aaa_must_run_first_check_experimental_warning, test/test_namedtensor.py::TestNamedTensor::test_addcmul_addcdiv, test/test_namedtensor.py::TestNamedTensor::test_addmm, test/test_namedtensor.py::TestNamedTensor::test_addmv, test/test_namedtensor.py::TestNamedTensor::test_align_as, test/test_namedtensor.py::TestNamedTensor::test_align_tensors, test/test_namedtensor.py::TestNamedTensor::test_align_tensors_two_inputs, test/test_namedtensor.py::TestNamedTensor::test_align_to, test/test_namedtensor.py::TestNamedTensor::test_align_to_ellipsis, test/test_namedtensor.py::TestNamedTensor::test_any_all, test/test_namedtensor.py::TestNamedTensor::test_as_strided, test/test_namedtensor.py::TestNamedTensor::test_as_strided_cuda, test/test_namedtensor.py::TestNamedTensor::test_autograd_ignores_names, test/test_namedtensor.py::TestNamedTensor::test_autograd_smoke, test/test_namedtensor.py::TestNamedTensor::test_autograd_warns_named_grad, test/test_namedtensor.py::TestNamedTensor::test_bernoulli, test/test_namedtensor.py::TestNamedTensor::test_big_tensor_repr_has_names, test/test_namedtensor.py::TestNamedTensor::test_binary_ops, test/test_namedtensor.py::TestNamedTensor::test_bitwise_not, test/test_namedtensor.py::TestNamedTensor::test_bmm, test/test_namedtensor.py::TestNamedTensor::test_cat, test/test_namedtensor.py::TestNamedTensor::test_cdist, test/test_namedtensor.py::TestNamedTensor::test_comparison_ops, test/test_namedtensor.py::TestNamedTensor::test_copy_transpose, test/test_namedtensor.py::TestNamedTensor::test_cummax_cummin, test/test_namedtensor.py::TestNamedTensor::test_detach, test/test_namedtensor.py::TestNamedTensor::test_diagonal, test/test_namedtensor.py::TestNamedTensor::test_dot, test/test_namedtensor.py::TestNamedTensor::test_empty_names, test/test_namedtensor.py::TestNamedTensor::test_equal, test/test_namedtensor.py::TestNamedTensor::test_expand, test/test_namedtensor.py::TestNamedTensor::test_factory_coverage, test/test_namedtensor.py::TestNamedTensor::test_factory_edge_cases, test/test_namedtensor.py::TestNamedTensor::test_flatten, test/test_namedtensor.py::TestNamedTensor::test_flatten_index_error, test/test_namedtensor.py::TestNamedTensor::test_flatten_nodims, test/test_namedtensor.py::TestNamedTensor::test_has_names, test/test_namedtensor.py::TestNamedTensor::test_index_fill, test/test_namedtensor.py::TestNamedTensor::test_info_smoke, test/test_namedtensor.py::TestNamedTensor::test_logcumsumexp, test/test_namedtensor.py::TestNamedTensor::test_logical_not, test/test_namedtensor.py::TestNamedTensor::test_logical_ops, test/test_namedtensor.py::TestNamedTensor::test_masked_fill, test/test_namedtensor.py::TestNamedTensor::test_masked_select, test/test_namedtensor.py::TestNamedTensor::test_matmul, test/test_namedtensor.py::TestNamedTensor::test_max_pooling, test/test_namedtensor.py::TestNamedTensor::test_max_pooling_without_names_does_not_warn, test/test_namedtensor.py::TestNamedTensor::test_mm, test/test_namedtensor.py::TestNamedTensor::test_mv, test/test_namedtensor.py::TestNamedTensor::test_no_jit_script_support, test/test_namedtensor.py::TestNamedTensor::test_no_jit_tracer_support, test/test_namedtensor.py::TestNamedTensor::test_no_multiprocessing_support, test/test_namedtensor.py::TestNamedTensor::test_no_pickle_support, test/test_namedtensor.py::TestNamedTensor::test_no_save_support, test/test_namedtensor.py::TestNamedTensor::test_noncontig_contiguous, test/test_namedtensor.py::TestNamedTensor::test_none_names_refcount, test/test_namedtensor.py::TestNamedTensor::test_nyi_dimname_overload_msg, test/test_namedtensor.py::TestNamedTensor::test_out_fn_semantics, test/test_namedtensor.py::TestNamedTensor::test_pow_special, test/test_namedtensor.py::TestNamedTensor::test_py3_ellipsis, test/test_namedtensor.py::TestNamedTensor::test_reduction_fns, test/test_namedtensor.py::TestNamedTensor::test_refine_names, test/test_namedtensor.py::TestNamedTensor::test_rename, test/test_namedtensor.py::TestNamedTensor::test_rename_, test/test_namedtensor.py::TestNamedTensor::test_rename_globber, test/test_namedtensor.py::TestNamedTensor::test_rename_rename_map, test/test_namedtensor.py::TestNamedTensor::test_repr, test/test_namedtensor.py::TestNamedTensor::test_resize, test/test_namedtensor.py::TestNamedTensor::test_select, test/test_namedtensor.py::TestNamedTensor::test_select_cuda, test/test_namedtensor.py::TestNamedTensor::test_set_names_property, test/test_namedtensor.py::TestNamedTensor::test_size, test/test_namedtensor.py::TestNamedTensor::test_split_fns_propagates_names, test/test_namedtensor.py::TestNamedTensor::test_squeeze, test/test_namedtensor.py::TestNamedTensor::test_stride, test/test_namedtensor.py::TestNamedTensor::test_support_device_named_grad, test/test_namedtensor.py::TestNamedTensor::test_tensor_from_lists, test/test_namedtensor.py::TestNamedTensor::test_tensor_from_named_tensor, test/test_namedtensor.py::TestNamedTensor::test_tensor_from_numpy, test/test_namedtensor.py::TestNamedTensor::test_tensor_from_tensor, test/test_namedtensor.py::TestNamedTensor::test_tensor_grad_is_unnamed, test/test_namedtensor.py::TestNamedTensor::test_transpose_variants, test/test_namedtensor.py::TestNamedTensor::test_trivial, test/test_namedtensor.py::TestNamedTensor::test_unary_propagate_names_fns, test/test_namedtensor.py::TestNamedTensor::test_unflatten, test/test_namedtensor.py::TestNamedTensor::test_unsupported_op_error_msg, test/test_namedtensor.py::TestNamedTensor::test_using_seen_interned_string_doesnt_bump_refcount, test/test_namedtensor.py::TestNamedTensor::test_using_unseen_interned_string_bumps_refcount_permanently, test/test_namedtensor.py::TestNamedTensor::test_using_unseen_uninterned_string_refcounts 2025-12-04T11:11:09.5981383Z 2025-12-04T11:11:09.5981763Z Finished test_namedtensor 1/1 ... [2025-12-04 11:11:09.592494][4971452.088679558], took 0.06min 2025-12-04T11:11:09.5983056Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:11:09.6046268Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:11:09.6052721Z Running test_dataloader 1/1 ... [2025-12-04 11:11:09.604943][4971452.101128844] 2025-12-04T11:11:09.6053285Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:11:09.6055711Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_dataloader.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:11:09.605337] 2025-12-04T11:17:39.5631414Z 2025-12-04T11:17:39.5633541Z PRINTING LOG FILE of test_dataloader 1/1 (test/test-reports/test_dataloader_1.1_cbec5e33ff3c884a_.log) 2025-12-04T11:17:39.5634672Z Test results will be stored in test-reports/python-pytest/test_dataloader/test_dataloader-3d822d8017a86837.xml 2025-12-04T11:17:39.5635489Z ============================= test session starts ============================== 2025-12-04T11:17:39.5636226Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:17:39.5636870Z cachedir: .pytest_cache 2025-12-04T11:17:39.5637789Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:17:39.5638723Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T11:17:39.5639126Z configfile: pytest.ini 2025-12-04T11:17:39.5639919Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:17:39.5641608Z collecting ... /var/lib/jenkins/pytorch/test/test_dataloader.py:900: PytestCollectionWarning: cannot collect test class 'TestProperExitDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.5642841Z class TestProperExitDataset(Dataset): 2025-12-04T11:17:39.5644050Z /var/lib/jenkins/pytorch/test/test_dataloader.py:920: PytestCollectionWarning: cannot collect test class 'TestProperExitIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.5645324Z class TestProperExitIterableDataset(IterableDataset): 2025-12-04T11:17:39.5646557Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1038: PytestCollectionWarning: cannot collect test class 'TestWorkerInfoDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.5647944Z class TestWorkerInfoDataset(SynchronizedDataset): 2025-12-04T11:17:39.5649227Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1150: PytestCollectionWarning: cannot collect test class 'TestMultiEpochDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.5650413Z class TestMultiEpochDataset(IterableDataset): 2025-12-04T11:17:39.5651613Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3452: PytestCollectionWarning: cannot collect test class 'TestWorkerQueueDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.5652819Z class TestWorkerQueueDataset(Dataset): 2025-12-04T11:17:39.5654228Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3574: PytestCollectionWarning: cannot collect test class 'TestSlowIndexDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.5655362Z class TestSlowIndexDataset(Dataset): 2025-12-04T11:17:39.5656506Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3592: PytestCollectionWarning: cannot collect test class 'TestSlowIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.5657831Z class TestSlowIterableDataset(IterableDataset): 2025-12-04T11:17:39.5658432Z collected 188 items 2025-12-04T11:17:39.5658835Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T11:17:39.5725471Z Running 188 items in this shard: test/test_dataloader.py::TestDatasetRandomSplit::test_incomplete_fractional_splits, test/test_dataloader.py::TestDatasetRandomSplit::test_lengths_must_equal_dataset_size, test/test_dataloader.py::TestDatasetRandomSplit::test_slicing_of_subset_of_dataset, test/test_dataloader.py::TestDatasetRandomSplit::test_slicing_of_subset_of_subset, test/test_dataloader.py::TestDatasetRandomSplit::test_splits_are_mutually_exclusive, test/test_dataloader.py::TestDatasetRandomSplit::test_splits_generator, test/test_dataloader.py::TestDatasetRandomSplit::test_splits_have_correct_size, test/test_dataloader.py::TestDatasetRandomSplit::test_splits_indexing_type, test/test_dataloader.py::TestDatasetRandomSplit::test_splits_reproducibility, test/test_dataloader.py::TestTensorDataset::test_getitem, test/test_dataloader.py::TestTensorDataset::test_getitem_1d, test/test_dataloader.py::TestTensorDataset::test_len, test/test_dataloader.py::TestTensorDataset::test_many_tensors, test/test_dataloader.py::TestTensorDataset::test_single_tensor, test/test_dataloader.py::TestStackDataset::test_empty, test/test_dataloader.py::TestStackDataset::test_getitem, test/test_dataloader.py::TestStackDataset::test_getitems, test/test_dataloader.py::TestStackDataset::test_getitems_raises_index_error, test/test_dataloader.py::TestStackDataset::test_getitems_value_error, test/test_dataloader.py::TestStackDataset::test_len, test/test_dataloader.py::TestStackDataset::test_mixed, test/test_dataloader.py::TestStackDataset::test_single, test/test_dataloader.py::TestStackDataset::test_size_mismatch, test/test_dataloader.py::TestConcatDataset::test_add_dataset, test/test_dataloader.py::TestConcatDataset::test_concat_raises_index_error, test/test_dataloader.py::TestConcatDataset::test_concat_two_non_singletons, test/test_dataloader.py::TestConcatDataset::test_concat_two_non_singletons_with_empty, test/test_dataloader.py::TestConcatDataset::test_concat_two_singletons, test/test_dataloader.py::TestConcatDataset::test_iterable_dataset_err, test/test_dataloader.py::TestDataLoader::test_batch_sampler, test/test_dataloader.py::TestDataLoader::test_builtin_collection_conversion, test/test_dataloader.py::TestDataLoader::test_bulk_loading_nobatch, test/test_dataloader.py::TestDataLoader::test_chain_iterable_style_dataset, test/test_dataloader.py::TestDataLoader::test_default_collate_bad_numpy_types, test/test_dataloader.py::TestDataLoader::test_default_collate_bad_sequence_type, test/test_dataloader.py::TestDataLoader::test_default_collate_dtype, test/test_dataloader.py::TestDataLoader::test_default_collate_mapping_keep_type, test/test_dataloader.py::TestDataLoader::test_default_collate_numpy_memmap, test/test_dataloader.py::TestDataLoader::test_default_collate_sequence_dont_keep_type, test/test_dataloader.py::TestDataLoader::test_default_collate_sequence_keep_type, test/test_dataloader.py::TestDataLoader::test_default_collate_shared_tensor, test/test_dataloader.py::TestDataLoader::test_default_convert_mapping_keep_type, test/test_dataloader.py::TestDataLoader::test_default_convert_sequence_dont_keep_type, test/test_dataloader.py::TestDataLoader::test_default_convert_sequence_keep_type, test/test_dataloader.py::TestDataLoader::test_distributed_sampler_invalid_rank, test/test_dataloader.py::TestDataLoader::test_duplicating_data_with_drop_last, test/test_dataloader.py::TestDataLoader::test_error, test/test_dataloader.py::TestDataLoader::test_error_in_init, test/test_dataloader.py::TestDataLoader::test_error_workers, test/test_dataloader.py::TestDataLoader::test_excessive_thread_creation_warning, test/test_dataloader.py::TestDataLoader::test_fd_limit_exceeded, test/test_dataloader.py::TestDataLoader::test_get_worker_info, test/test_dataloader.py::TestDataLoader::test_growing_dataset, test/test_dataloader.py::TestDataLoader::test_invalid_assign_after_init, test/test_dataloader.py::TestDataLoader::test_invalid_ctor_args_combinations, test/test_dataloader.py::TestDataLoader::test_iterable_style_dataset, test/test_dataloader.py::TestDataLoader::test_iterabledataset_len, test/test_dataloader.py::TestDataLoader::test_large_sampler_indices, test/test_dataloader.py::TestDataLoader::test_len, test/test_dataloader.py::TestDataLoader::test_multi_epochs_reproducibility, test/test_dataloader.py::TestDataLoader::test_multiple_dataloaders, test/test_dataloader.py::TestDataLoader::test_multiprocessing_contexts, test/test_dataloader.py::TestDataLoader::test_multiprocessing_iterdatapipe, test/test_dataloader.py::TestDataLoader::test_multiprocessing_iterdatapipe_with_dill, test/test_dataloader.py::TestDataLoader::test_no_segfault, test/test_dataloader.py::TestDataLoader::test_numpy, test/test_dataloader.py::TestDataLoader::test_numpy_gen_state, test/test_dataloader.py::TestDataLoader::test_numpy_scalars, test/test_dataloader.py::TestDataLoader::test_partial_workers, test/test_dataloader.py::TestDataLoader::test_proper_exit, test/test_dataloader.py::TestDataLoader::test_random_sampler, test/test_dataloader.py::TestDataLoader::test_random_sampler_len_with_replacement, test/test_dataloader.py::TestDataLoader::test_random_sampler_len_without_replacement, test/test_dataloader.py::TestDataLoader::test_sampler, test/test_dataloader.py::TestDataLoader::test_sampler_reproducibility, test/test_dataloader.py::TestDataLoader::test_segfault, test/test_dataloader.py::TestDataLoader::test_seqential_batch_workers, test/test_dataloader.py::TestDataLoader::test_seqential_batch_workers_prefetch, test/test_dataloader.py::TestDataLoader::test_sequential_batch, test/test_dataloader.py::TestDataLoader::test_sequential_nonbatch, test/test_dataloader.py::TestDataLoader::test_sequential_pin_memory, test/test_dataloader.py::TestDataLoader::test_sequential_workers, test/test_dataloader.py::TestDataLoader::test_shuffle, test/test_dataloader.py::TestDataLoader::test_shuffle_batch, test/test_dataloader.py::TestDataLoader::test_shuffle_batch_none, test/test_dataloader.py::TestDataLoader::test_shuffle_batch_workers, test/test_dataloader.py::TestDataLoader::test_shuffle_batch_workers_prefetch, test/test_dataloader.py::TestDataLoader::test_shuffle_pin_memory, test/test_dataloader.py::TestDataLoader::test_shuffle_reproducibility, test/test_dataloader.py::TestDataLoader::test_shuffle_workers, test/test_dataloader.py::TestDataLoader::test_timeout, test/test_dataloader.py::TestDataLoader::test_typing, test/test_dataloader.py::TestDataLoader::test_worker_init_fn, test/test_dataloader.py::TestDataLoader::test_worker_init_fn_forkserver, test/test_dataloader.py::TestDataLoader::test_worker_seed, test/test_dataloader.py::TestDataLoader::test_worker_seed_reproducibility, test/test_dataloader.py::IntegrationTestDataLoaderDataPipe::test_shuffler_iterdatapipe, test/test_dataloader.py::TestStringDataLoader::test_shuffle_pin_memory, test/test_dataloader.py::TestDictDataLoader::test_pin_memory, test/test_dataloader.py::TestDictDataLoader::test_pin_memory_device, test/test_dataloader.py::TestDictDataLoader::test_pin_memory_no_cuda, test/test_dataloader.py::TestDictDataLoader::test_pin_memory_with_only_device, test/test_dataloader.py::TestDictDataLoader::test_sequential_batch, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_batch_sampler, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_builtin_collection_conversion, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_bulk_loading_nobatch, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_chain_iterable_style_dataset, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_dataset_not_reset, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_bad_numpy_types, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_bad_sequence_type, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_dtype, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_mapping_keep_type, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_numpy_memmap, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_sequence_dont_keep_type, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_sequence_keep_type, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_shared_tensor, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_convert_mapping_keep_type, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_convert_sequence_dont_keep_type, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_convert_sequence_keep_type, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_distributed_sampler_invalid_rank, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_duplicating_data_with_drop_last, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_early_exit, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_error, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_error_in_init, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_error_workers, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_excessive_thread_creation_warning, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_fd_limit_exceeded, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_get_worker_info, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_growing_dataset, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_invalid_assign_after_init, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_invalid_ctor_args_combinations, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_iterable_style_dataset, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_iterabledataset_len, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_large_sampler_indices, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_len, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_multi_epochs_reproducibility, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_multiple_dataloaders, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_multiprocessing_contexts, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_multiprocessing_iterdatapipe, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_multiprocessing_iterdatapipe_with_dill, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_no_segfault, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_numpy, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_numpy_gen_state, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_numpy_scalars, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_partial_workers, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_proper_exit, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_random_sampler, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_random_sampler_len_with_replacement, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_random_sampler_len_without_replacement, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_sampler, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_sampler_reproducibility, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_segfault, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_seqential_batch_workers, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_seqential_batch_workers_prefetch, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_sequential_batch, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_sequential_nonbatch, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_sequential_pin_memory, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_sequential_workers, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_batch, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_batch_none, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_batch_workers, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_batch_workers_prefetch, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_pin_memory, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_reproducibility, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_workers, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_timeout, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_typing, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_worker_init_fn, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_worker_init_fn_forkserver, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_worker_seed, test/test_dataloader.py::TestDataLoaderPersistentWorkers::test_worker_seed_reproducibility, test/test_dataloader.py::TestNamedTupleDataLoader::test_dataloader_with_namedtuple, test/test_dataloader.py::TestCustomPinFn::test_custom_batch_pin, test/test_dataloader.py::TestCustomPinFn::test_custom_batch_pin_worker, test/test_dataloader.py::TestIndividualWorkerQueue::test_ind_worker_queue, test/test_dataloader.py::TestSetAffinity::test_set_affinity_in_worker_init, test/test_dataloader.py::TestConvAfterFork::test_conv_after_fork, test/test_dataloader.py::TestOutOfOrderDataLoader::test_in_order_index_ds, test/test_dataloader.py::TestOutOfOrderDataLoader::test_in_order_iterable_ds, test/test_dataloader.py::TestOutOfOrderDataLoader::test_out_of_order_index_ds, test/test_dataloader.py::TestOutOfOrderDataLoader::test_out_of_order_iterable_ds, test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_nested_tensor_multiprocessing_context_fork_cuda, test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_nested_tensor_multiprocessing_context_forkserver_cuda, test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_nested_tensor_multiprocessing_context_spawn_cuda, test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_fork_cuda, test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda, test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.5791403Z 2025-12-04T11:17:39.5791844Z test_dataloader.py::TestDatasetRandomSplit::test_incomplete_fractional_splits PASSED [0.0009s] [ 0%] 2025-12-04T11:17:39.5792815Z test_dataloader.py::TestDatasetRandomSplit::test_lengths_must_equal_dataset_size PASSED [0.0004s] [ 1%] 2025-12-04T11:17:39.5793756Z test_dataloader.py::TestDatasetRandomSplit::test_slicing_of_subset_of_dataset PASSED [0.0020s] [ 1%] 2025-12-04T11:17:39.5794738Z test_dataloader.py::TestDatasetRandomSplit::test_slicing_of_subset_of_subset PASSED [0.0006s] [ 2%] 2025-12-04T11:17:39.5795680Z test_dataloader.py::TestDatasetRandomSplit::test_splits_are_mutually_exclusive PASSED [0.0004s] [ 2%] 2025-12-04T11:17:39.5796582Z test_dataloader.py::TestDatasetRandomSplit::test_splits_generator PASSED [0.0010s] [ 3%] 2025-12-04T11:17:39.5797460Z test_dataloader.py::TestDatasetRandomSplit::test_splits_have_correct_size PASSED [0.0006s] [ 3%] 2025-12-04T11:17:39.5798401Z test_dataloader.py::TestDatasetRandomSplit::test_splits_indexing_type PASSED [0.0011s] [ 4%] 2025-12-04T11:17:39.5799288Z test_dataloader.py::TestDatasetRandomSplit::test_splits_reproducibility PASSED [0.0029s] [ 4%] 2025-12-04T11:17:39.5800111Z test_dataloader.py::TestTensorDataset::test_getitem PASSED [0.0016s] [ 5%] 2025-12-04T11:17:39.5800847Z test_dataloader.py::TestTensorDataset::test_getitem_1d PASSED [0.0013s] [ 5%] 2025-12-04T11:17:39.5801576Z test_dataloader.py::TestTensorDataset::test_len PASSED [0.0005s] [ 6%] 2025-12-04T11:17:39.5802352Z test_dataloader.py::TestTensorDataset::test_many_tensors PASSED [0.0013s] [ 6%] 2025-12-04T11:17:39.5803135Z test_dataloader.py::TestTensorDataset::test_single_tensor PASSED [0.0006s] [ 7%] 2025-12-04T11:17:39.5803877Z test_dataloader.py::TestStackDataset::test_empty PASSED [0.0005s] [ 7%] 2025-12-04T11:17:39.5804597Z test_dataloader.py::TestStackDataset::test_getitem PASSED [0.0025s] [ 8%] 2025-12-04T11:17:39.5805314Z test_dataloader.py::TestStackDataset::test_getitems PASSED [0.0010s] [ 9%] 2025-12-04T11:17:39.5806101Z test_dataloader.py::TestStackDataset::test_getitems_raises_index_error PASSED [0.0007s] [ 9%] 2025-12-04T11:17:39.5806936Z test_dataloader.py::TestStackDataset::test_getitems_value_error PASSED [0.0005s] [ 10%] 2025-12-04T11:17:39.5807703Z test_dataloader.py::TestStackDataset::test_len PASSED [0.0005s] [ 10%] 2025-12-04T11:17:39.5808475Z test_dataloader.py::TestStackDataset::test_mixed PASSED [0.0004s] [ 11%] 2025-12-04T11:17:39.5809189Z test_dataloader.py::TestStackDataset::test_single PASSED [0.0014s] [ 11%] 2025-12-04T11:17:39.5809976Z test_dataloader.py::TestStackDataset::test_size_mismatch PASSED [0.0004s] [ 12%] 2025-12-04T11:17:39.5810704Z test_dataloader.py::TestConcatDataset::test_add_dataset PASSED [0.0008s] [ 12%] 2025-12-04T11:17:39.5811492Z test_dataloader.py::TestConcatDataset::test_concat_raises_index_error PASSED [0.0004s] [ 13%] 2025-12-04T11:17:39.5812345Z test_dataloader.py::TestConcatDataset::test_concat_two_non_singletons PASSED [0.0005s] [ 13%] 2025-12-04T11:17:39.5813243Z test_dataloader.py::TestConcatDataset::test_concat_two_non_singletons_with_empty PASSED [0.0004s] [ 14%] 2025-12-04T11:17:39.5814121Z test_dataloader.py::TestConcatDataset::test_concat_two_singletons PASSED [0.0004s] [ 14%] 2025-12-04T11:17:39.5814926Z test_dataloader.py::TestConcatDataset::test_iterable_dataset_err PASSED [0.0005s] [ 15%] 2025-12-04T11:17:39.5815702Z test_dataloader.py::TestDataLoader::test_batch_sampler PASSED [2.4672s] [ 15%] 2025-12-04T11:17:39.5816495Z test_dataloader.py::TestDataLoader::test_builtin_collection_conversion PASSED [0.2540s] [ 16%] 2025-12-04T11:17:39.5817316Z test_dataloader.py::TestDataLoader::test_bulk_loading_nobatch PASSED [0.0577s] [ 17%] 2025-12-04T11:17:39.5818184Z test_dataloader.py::TestDataLoader::test_chain_iterable_style_dataset PASSED [0.1146s] [ 17%] 2025-12-04T11:17:39.5819036Z test_dataloader.py::TestDataLoader::test_default_collate_bad_numpy_types PASSED [0.0020s] [ 18%] 2025-12-04T11:17:39.5819929Z test_dataloader.py::TestDataLoader::test_default_collate_bad_sequence_type PASSED [0.0012s] [ 18%] 2025-12-04T11:17:39.5820763Z test_dataloader.py::TestDataLoader::test_default_collate_dtype PASSED [0.0009s] [ 19%] 2025-12-04T11:17:39.5821596Z test_dataloader.py::TestDataLoader::test_default_collate_mapping_keep_type PASSED [0.0006s] [ 19%] 2025-12-04T11:17:39.5822528Z test_dataloader.py::TestDataLoader::test_default_collate_numpy_memmap PASSED [0.0010s] [ 20%] 2025-12-04T11:17:39.5823418Z test_dataloader.py::TestDataLoader::test_default_collate_sequence_dont_keep_type PASSED [0.0006s] [ 20%] 2025-12-04T11:17:39.5824346Z test_dataloader.py::TestDataLoader::test_default_collate_sequence_keep_type PASSED [0.0006s] [ 21%] 2025-12-04T11:17:39.5825221Z test_dataloader.py::TestDataLoader::test_default_collate_shared_tensor PASSED [0.0007s] [ 21%] 2025-12-04T11:17:39.5826116Z test_dataloader.py::TestDataLoader::test_default_convert_mapping_keep_type PASSED [0.0005s] [ 22%] 2025-12-04T11:17:39.5827027Z test_dataloader.py::TestDataLoader::test_default_convert_sequence_dont_keep_type PASSED [0.0005s] [ 22%] 2025-12-04T11:17:39.5827944Z test_dataloader.py::TestDataLoader::test_default_convert_sequence_keep_type PASSED [0.0005s] [ 23%] 2025-12-04T11:17:39.5828902Z test_dataloader.py::TestDataLoader::test_distributed_sampler_invalid_rank PASSED [0.0005s] [ 23%] 2025-12-04T11:17:39.5829783Z test_dataloader.py::TestDataLoader::test_duplicating_data_with_drop_last PASSED [0.0010s] [ 24%] 2025-12-04T11:17:39.5830622Z test_dataloader.py::TestDataLoader::test_error PASSED [0.0010s] [ 25%] 2025-12-04T11:17:39.5831381Z test_dataloader.py::TestDataLoader::test_error_in_init PASSED [0.0621s] [ 25%] 2025-12-04T11:17:39.5832096Z test_dataloader.py::TestDataLoader::test_error_workers PASSED [0.0396s] [ 26%] 2025-12-04T11:17:39.5832897Z test_dataloader.py::TestDataLoader::test_excessive_thread_creation_warning PASSED [0.0087s] [ 26%] 2025-12-04T11:17:39.5833721Z test_dataloader.py::TestDataLoader::test_fd_limit_exceeded PASSED [1.6441s] [ 27%] 2025-12-04T11:17:39.5834464Z test_dataloader.py::TestDataLoader::test_get_worker_info PASSED [4.8421s] [ 27%] 2025-12-04T11:17:39.5835192Z test_dataloader.py::TestDataLoader::test_growing_dataset PASSED [0.0018s] [ 28%] 2025-12-04T11:17:39.5835961Z test_dataloader.py::TestDataLoader::test_invalid_assign_after_init PASSED [0.0009s] [ 28%] 2025-12-04T11:17:39.5836802Z test_dataloader.py::TestDataLoader::test_invalid_ctor_args_combinations PASSED [0.0020s] [ 29%] 2025-12-04T11:17:39.5837631Z test_dataloader.py::TestDataLoader::test_iterable_style_dataset PASSED [0.7530s] [ 29%] 2025-12-04T11:17:39.5838510Z test_dataloader.py::TestDataLoader::test_iterabledataset_len PASSED [0.0013s] [ 30%] 2025-12-04T11:17:39.5839293Z test_dataloader.py::TestDataLoader::test_large_sampler_indices PASSED [3.9916s] [ 30%] 2025-12-04T11:17:39.5840042Z test_dataloader.py::TestDataLoader::test_len PASSED [0.0055s] [ 31%] 2025-12-04T11:17:39.5840821Z test_dataloader.py::TestDataLoader::test_multi_epochs_reproducibility PASSED [0.1136s] [ 31%] 2025-12-04T11:17:39.5841625Z test_dataloader.py::TestDataLoader::test_multiple_dataloaders PASSED [4.5158s] [ 32%] 2025-12-04T11:17:39.5843017Z test_dataloader.py::TestDataLoader::test_multiprocessing_contexts [W1204 11:11:33.529606520 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.5844688Z [W1204 11:11:33.550711273 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.5846048Z [W1204 11:11:33.562558498 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.5847391Z [W1204 11:11:35.916220599 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.5848797Z [W1204 11:11:35.253671769 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.5850134Z [W1204 11:11:35.265110481 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.5851004Z PASSED [9.1761s] [ 32%] 2025-12-04T11:17:39.5851652Z test_dataloader.py::TestDataLoader::test_multiprocessing_iterdatapipe PASSED [9.6126s] [ 33%] 2025-12-04T11:17:39.5852558Z test_dataloader.py::TestDataLoader::test_multiprocessing_iterdatapipe_with_dill XFAIL [2.1301s] [ 34%] 2025-12-04T11:17:39.5853398Z test_dataloader.py::TestDataLoader::test_no_segfault PASSED [12.3223s] [ 34%] 2025-12-04T11:17:39.5854114Z test_dataloader.py::TestDataLoader::test_numpy PASSED [0.0034s] [ 35%] 2025-12-04T11:17:39.5854839Z test_dataloader.py::TestDataLoader::test_numpy_gen_state PASSED [0.0037s] [ 35%] 2025-12-04T11:17:39.5855567Z test_dataloader.py::TestDataLoader::test_numpy_scalars PASSED [0.0025s] [ 36%] 2025-12-04T11:17:39.5856290Z test_dataloader.py::TestDataLoader::test_partial_workers PASSED [0.1274s] [ 36%] 2025-12-04T11:17:39.5857263Z test_dataloader.py::TestDataLoader::test_proper_exit SKIPPED [0.0004s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 37%] 2025-12-04T11:17:39.5858299Z test_dataloader.py::TestDataLoader::test_random_sampler PASSED [0.0013s] [ 37%] 2025-12-04T11:17:39.5859115Z test_dataloader.py::TestDataLoader::test_random_sampler_len_with_replacement PASSED [0.0007s] [ 38%] 2025-12-04T11:17:39.5860081Z test_dataloader.py::TestDataLoader::test_random_sampler_len_without_replacement PASSED [0.0006s] [ 38%] 2025-12-04T11:17:39.5860936Z test_dataloader.py::TestDataLoader::test_sampler PASSED [2.6125s] [ 39%] 2025-12-04T11:17:39.5861695Z test_dataloader.py::TestDataLoader::test_sampler_reproducibility PASSED [0.0227s] [ 39%] 2025-12-04T11:17:39.5862456Z test_dataloader.py::TestDataLoader::test_segfault PASSED [3.8365s] [ 40%] 2025-12-04T11:17:39.5863208Z test_dataloader.py::TestDataLoader::test_seqential_batch_workers PASSED [0.1314s] [ 40%] 2025-12-04T11:17:39.5864048Z test_dataloader.py::TestDataLoader::test_seqential_batch_workers_prefetch PASSED [0.0881s] [ 41%] 2025-12-04T11:17:39.5864863Z test_dataloader.py::TestDataLoader::test_sequential_batch PASSED [0.0206s] [ 42%] 2025-12-04T11:17:39.5865621Z test_dataloader.py::TestDataLoader::test_sequential_nonbatch PASSED [0.0064s] [ 42%] 2025-12-04T11:17:39.5866402Z test_dataloader.py::TestDataLoader::test_sequential_pin_memory PASSED [0.0020s] [ 43%] 2025-12-04T11:17:39.5867221Z test_dataloader.py::TestDataLoader::test_sequential_workers PASSED [0.0800s] [ 43%] 2025-12-04T11:17:39.5867962Z test_dataloader.py::TestDataLoader::test_shuffle PASSED [0.0217s] [ 44%] 2025-12-04T11:17:39.5868735Z test_dataloader.py::TestDataLoader::test_shuffle_batch PASSED [0.0183s] [ 44%] 2025-12-04T11:17:39.5869471Z test_dataloader.py::TestDataLoader::test_shuffle_batch_none PASSED [0.0190s] [ 45%] 2025-12-04T11:17:39.5870238Z test_dataloader.py::TestDataLoader::test_shuffle_batch_workers PASSED [0.0850s] [ 45%] 2025-12-04T11:17:39.5871061Z test_dataloader.py::TestDataLoader::test_shuffle_batch_workers_prefetch PASSED [0.0758s] [ 46%] 2025-12-04T11:17:39.5871872Z test_dataloader.py::TestDataLoader::test_shuffle_pin_memory PASSED [0.0707s] [ 46%] 2025-12-04T11:17:39.5872656Z test_dataloader.py::TestDataLoader::test_shuffle_reproducibility PASSED [0.1953s] [ 47%] 2025-12-04T11:17:39.5873424Z test_dataloader.py::TestDataLoader::test_shuffle_workers PASSED [0.1097s] [ 47%] 2025-12-04T11:17:39.5874146Z test_dataloader.py::TestDataLoader::test_timeout PASSED [6.3433s] [ 48%] 2025-12-04T11:17:39.5874854Z test_dataloader.py::TestDataLoader::test_typing PASSED [0.0017s] [ 48%] 2025-12-04T11:17:39.5875563Z test_dataloader.py::TestDataLoader::test_worker_init_fn PASSED [0.0507s] [ 49%] 2025-12-04T11:17:39.5876331Z test_dataloader.py::TestDataLoader::test_worker_init_fn_forkserver PASSED [0.0145s] [ 50%] 2025-12-04T11:17:39.5877093Z test_dataloader.py::TestDataLoader::test_worker_seed PASSED [2.1310s] [ 50%] 2025-12-04T11:17:39.5877866Z test_dataloader.py::TestDataLoader::test_worker_seed_reproducibility PASSED [4.1936s] [ 51%] 2025-12-04T11:17:39.5879128Z test_dataloader.py::IntegrationTestDataLoaderDataPipe::test_shuffler_iterdatapipe SKIPPED [0.0007s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 51%] 2025-12-04T11:17:39.5880364Z test_dataloader.py::TestStringDataLoader::test_shuffle_pin_memory PASSED [1.9883s] [ 52%] 2025-12-04T11:17:39.5881152Z test_dataloader.py::TestDictDataLoader::test_pin_memory PASSED [0.0031s] [ 52%] 2025-12-04T11:17:39.5881921Z test_dataloader.py::TestDictDataLoader::test_pin_memory_device PASSED [0.0019s] [ 53%] 2025-12-04T11:17:39.5882881Z test_dataloader.py::TestDictDataLoader::test_pin_memory_no_cuda SKIPPED [0.0003s] (Test for when CUDA is not available) [ 53%] 2025-12-04T11:17:39.5883881Z test_dataloader.py::TestDictDataLoader::test_pin_memory_with_only_device PASSED [0.0016s] [ 54%] 2025-12-04T11:17:39.5884724Z test_dataloader.py::TestDictDataLoader::test_sequential_batch PASSED [1.7950s] [ 54%] 2025-12-04T11:17:39.5885586Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_batch_sampler PASSED [4.5928s] [ 55%] 2025-12-04T11:17:39.5886576Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_builtin_collection_conversion PASSED [14.3151s] [ 55%] 2025-12-04T11:17:39.5887602Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_bulk_loading_nobatch PASSED [2.0083s] [ 56%] 2025-12-04T11:17:39.5888755Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_chain_iterable_style_dataset PASSED [3.6285s] [ 56%] 2025-12-04T11:17:39.5889799Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_dataset_not_reset PASSED [3.8920s] [ 57%] 2025-12-04T11:17:39.5890807Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_bad_numpy_types PASSED [0.0024s] [ 57%] 2025-12-04T11:17:39.5891889Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_bad_sequence_type PASSED [0.0014s] [ 58%] 2025-12-04T11:17:39.5892920Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_dtype PASSED [0.0022s] [ 59%] 2025-12-04T11:17:39.5893954Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_mapping_keep_type PASSED [0.0017s] [ 59%] 2025-12-04T11:17:39.5895027Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_numpy_memmap PASSED [0.0020s] [ 60%] 2025-12-04T11:17:39.5896164Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_sequence_dont_keep_type PASSED [0.0017s] [ 60%] 2025-12-04T11:17:39.5897289Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_sequence_keep_type PASSED [0.0016s] [ 61%] 2025-12-04T11:17:39.5898423Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_collate_shared_tensor PASSED [0.0020s] [ 61%] 2025-12-04T11:17:39.5899492Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_convert_mapping_keep_type PASSED [0.0015s] [ 62%] 2025-12-04T11:17:39.5900602Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_convert_sequence_dont_keep_type PASSED [0.0014s] [ 62%] 2025-12-04T11:17:39.5901715Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_default_convert_sequence_keep_type PASSED [0.0009s] [ 63%] 2025-12-04T11:17:39.5902805Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_distributed_sampler_invalid_rank PASSED [0.0009s] [ 63%] 2025-12-04T11:17:39.5903880Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_duplicating_data_with_drop_last PASSED [0.0016s] [ 64%] 2025-12-04T11:17:39.5904865Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_early_exit PASSED [10.9656s] [ 64%] 2025-12-04T11:17:39.5905734Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_error PASSED [0.0025s] [ 65%] 2025-12-04T11:17:39.5906614Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_error_in_init PASSED [3.8048s] [ 65%] 2025-12-04T11:17:39.5907532Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_error_workers PASSED [1.9521s] [ 66%] 2025-12-04T11:17:39.5908610Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_excessive_thread_creation_warning PASSED [0.0070s] [ 67%] 2025-12-04T11:17:39.5909633Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_fd_limit_exceeded PASSED [1.6103s] [ 67%] 2025-12-04T11:17:39.5910624Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_get_worker_info PASSED [4.3688s] [ 68%] 2025-12-04T11:17:39.5911564Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_growing_dataset PASSED [0.0023s] [ 68%] 2025-12-04T11:17:39.5912544Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_invalid_assign_after_init PASSED [0.0015s] [ 69%] 2025-12-04T11:17:39.5913578Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_invalid_ctor_args_combinations PASSED [0.0018s] [ 69%] 2025-12-04T11:17:39.5914606Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_iterable_style_dataset PASSED [23.3446s] [ 70%] 2025-12-04T11:17:39.5915591Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_iterabledataset_len PASSED [0.0013s] [ 70%] 2025-12-04T11:17:39.5916562Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_large_sampler_indices PASSED [3.9361s] [ 71%] 2025-12-04T11:17:39.5917465Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_len PASSED [0.0079s] [ 71%] 2025-12-04T11:17:39.5918445Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_multi_epochs_reproducibility PASSED [1.8959s] [ 72%] 2025-12-04T11:17:39.5919522Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_multiple_dataloaders PASSED [8.4979s] [ 72%] 2025-12-04T11:17:39.5921144Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_multiprocessing_contexts [W1204 11:14:01.819914068 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.5922909Z [W1204 11:14:01.947626275 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.5924251Z [W1204 11:14:01.110527042 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.5925579Z [W1204 11:14:04.503638948 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.5926973Z [W1204 11:14:04.607192623 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.5928392Z [W1204 11:14:04.622316074 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.5929263Z PASSED [11.1531s] [ 73%] 2025-12-04T11:17:39.5929967Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_multiprocessing_iterdatapipe PASSED [11.5939s] [ 73%] 2025-12-04T11:17:39.5931082Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_multiprocessing_iterdatapipe_with_dill XFAIL [1.7624s] [ 74%] 2025-12-04T11:17:39.5932159Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_no_segfault Traceback (most recent call last): 2025-12-04T11:17:39.5933121Z File "/opt/conda/envs/py_3.10/lib/python3.10/multiprocessing/forkserver.py", line 274, in main 2025-12-04T11:17:39.5933777Z code = _serve_one(child_r, fds, 2025-12-04T11:17:39.5934447Z File "/opt/conda/envs/py_3.10/lib/python3.10/multiprocessing/forkserver.py", line 313, in _serve_one 2025-12-04T11:17:39.5935157Z code = spawn._main(child_r, parent_sentinel) 2025-12-04T11:17:39.5935809Z File "/opt/conda/envs/py_3.10/lib/python3.10/multiprocessing/spawn.py", line 126, in _main 2025-12-04T11:17:39.5936467Z self = reduction.pickle.load(from_parent) 2025-12-04T11:17:39.5937178Z File "/opt/conda/envs/py_3.10/lib/python3.10/multiprocessing/synchronize.py", line 110, in __setstate__ 2025-12-04T11:17:39.5937951Z self._semlock = _multiprocessing.SemLock._rebuild(*state) 2025-12-04T11:17:39.5938584Z FileNotFoundError: [Errno 2] No such file or directory 2025-12-04T11:17:39.5939057Z PASSED [7.5134s] [ 75%] 2025-12-04T11:17:39.5939637Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_numpy PASSED [0.0037s] [ 75%] 2025-12-04T11:17:39.5940594Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_numpy_gen_state PASSED [0.0037s] [ 76%] 2025-12-04T11:17:39.5941540Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_numpy_scalars PASSED [0.0024s] [ 76%] 2025-12-04T11:17:39.5942477Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_partial_workers PASSED [3.9734s] [ 77%] 2025-12-04T11:17:39.5943668Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_proper_exit SKIPPED [0.0008s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 77%] 2025-12-04T11:17:39.5944839Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_random_sampler PASSED [0.0024s] [ 78%] 2025-12-04T11:17:39.5945865Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_random_sampler_len_with_replacement PASSED [0.0019s] [ 78%] 2025-12-04T11:17:39.5946996Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_random_sampler_len_without_replacement PASSED [0.0016s] [ 79%] 2025-12-04T11:17:39.5948004Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_sampler PASSED [4.4779s] [ 79%] 2025-12-04T11:17:39.5949026Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_sampler_reproducibility PASSED [0.0220s] [ 80%] 2025-12-04T11:17:39.5950013Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_segfault PASSED [3.8185s] [ 80%] 2025-12-04T11:17:39.5950947Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_seqential_batch_workers PASSED [2.1038s] [ 81%] 2025-12-04T11:17:39.5951992Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_seqential_batch_workers_prefetch PASSED [2.1215s] [ 81%] 2025-12-04T11:17:39.5953009Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_sequential_batch PASSED [0.0332s] [ 82%] 2025-12-04T11:17:39.5953965Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_sequential_nonbatch PASSED [0.0125s] [ 82%] 2025-12-04T11:17:39.5954931Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_sequential_pin_memory PASSED [0.0020s] [ 83%] 2025-12-04T11:17:39.5955903Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_sequential_workers PASSED [2.1179s] [ 84%] 2025-12-04T11:17:39.5956819Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle PASSED [0.0385s] [ 84%] 2025-12-04T11:17:39.5957753Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_batch PASSED [0.0181s] [ 85%] 2025-12-04T11:17:39.5958740Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_batch_none PASSED [0.0188s] [ 85%] 2025-12-04T11:17:39.5959706Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_batch_workers PASSED [2.0984s] [ 86%] 2025-12-04T11:17:39.5960720Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_batch_workers_prefetch PASSED [2.0692s] [ 86%] 2025-12-04T11:17:39.5961722Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_pin_memory PASSED [2.1081s] [ 87%] 2025-12-04T11:17:39.5962696Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_reproducibility PASSED [4.0137s] [ 87%] 2025-12-04T11:17:39.5963668Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_shuffle_workers PASSED [2.0627s] [ 88%] 2025-12-04T11:17:39.5964565Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_timeout PASSED [6.3488s] [ 88%] 2025-12-04T11:17:39.5965422Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_typing PASSED [0.0018s] [ 89%] 2025-12-04T11:17:39.5966309Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_worker_init_fn PASSED [1.8647s] [ 89%] 2025-12-04T11:17:39.5967278Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_worker_init_fn_forkserver PASSED [0.0137s] [ 90%] 2025-12-04T11:17:39.5968305Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_worker_seed PASSED [2.0512s] [ 90%] 2025-12-04T11:17:39.5969275Z test_dataloader.py::TestDataLoaderPersistentWorkers::test_worker_seed_reproducibility PASSED [4.1438s] [ 91%] 2025-12-04T11:17:39.5970263Z test_dataloader.py::TestNamedTupleDataLoader::test_dataloader_with_namedtuple PASSED [0.0037s] [ 92%] 2025-12-04T11:17:39.5971167Z test_dataloader.py::TestCustomPinFn::test_custom_batch_pin PASSED [0.0053s] [ 92%] 2025-12-04T11:17:39.5971962Z test_dataloader.py::TestCustomPinFn::test_custom_batch_pin_worker PASSED [5.4735s] [ 93%] 2025-12-04T11:17:39.5973005Z test_dataloader.py::TestIndividualWorkerQueue::test_ind_worker_queue PASSED [29.2898s] [ 93%] 2025-12-04T11:17:39.5973894Z test_dataloader.py::TestSetAffinity::test_set_affinity_in_worker_init PASSED [2.0000s] [ 94%] 2025-12-04T11:17:39.5974702Z test_dataloader.py::TestConvAfterFork::test_conv_after_fork PASSED [1.8848s] [ 94%] 2025-12-04T11:17:39.5975525Z test_dataloader.py::TestOutOfOrderDataLoader::test_in_order_index_ds PASSED [2.9022s] [ 95%] 2025-12-04T11:17:39.5976403Z test_dataloader.py::TestOutOfOrderDataLoader::test_in_order_iterable_ds PASSED [2.8908s] [ 95%] 2025-12-04T11:17:39.5977290Z test_dataloader.py::TestOutOfOrderDataLoader::test_out_of_order_index_ds PASSED [2.9190s] [ 96%] 2025-12-04T11:17:39.5978258Z test_dataloader.py::TestOutOfOrderDataLoader::test_out_of_order_iterable_ds PASSED [2.9610s] [ 96%] 2025-12-04T11:17:39.5979625Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_nested_tensor_multiprocessing_context_fork_cuda SKIPPED [0.0529s] (fork multiprocessing context not supported for cuda:0) [ 97%] 2025-12-04T11:17:39.5983232Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_nested_tensor_multiprocessing_context_forkserver_cuda /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.5986249Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.5989034Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.5991097Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.5992675Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.5994257Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.5995819Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.5997382Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.5998991Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.6000605Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.6002168Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.6003715Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.6005313Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.6006898Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.6008504Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.6010056Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.6010398Z PASSED [4.1828s] [ 97%] 2025-12-04T11:17:39.6012174Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_nested_tensor_multiprocessing_context_spawn_cuda /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.6014042Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.6015624Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.6017193Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.6018791Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.6020392Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.6022419Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.6024090Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.6025660Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.6027250Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.6028891Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.6030454Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.6032058Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.6033619Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.6035177Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/multiprocessing/reductions.py:417: UserWarning: The PyTorch API of nested tensors is in prototype stage and will change in the near future. We recommend specifying layout=torch.jagged when constructing a nested tensor, as this layout receives active development, has better operator coverage, and works with torch.compile. (Triggered internally at /var/lib/jenkins/workspace/aten/src/ATen/NestedTensorImpl.cpp:178.) 2025-12-04T11:17:39.6036717Z return torch._nested_view_from_buffer_copy(buffer, sizes, strides, offsets) 2025-12-04T11:17:39.6037057Z PASSED [5.2907s] [ 98%] 2025-12-04T11:17:39.6037692Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_fork_cuda SKIPPED [0.0022s] (fork multiprocessing context not supported for cuda:0) [ 98%] 2025-12-04T11:17:39.6038732Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda ('RERUN', {'yellow': True}) [2.3508s] [ 99%] 2025-12-04T11:17:39.6039625Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda ('RERUN', {'yellow': True}) [2.4724s] [ 99%] 2025-12-04T11:17:39.6040468Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda FAILED [2.5095s] [ 99%] 2025-12-04T11:17:39.6040906Z 2025-12-04T11:17:39.6041024Z ==================================== RERUNS ==================================== 2025-12-04T11:17:39.6041516Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda _ 2025-12-04T11:17:39.6041953Z Traceback (most recent call last): 2025-12-04T11:17:39.6042458Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6042952Z method(*args, **kwargs) 2025-12-04T11:17:39.6043423Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6043907Z method(*args, **kwargs) 2025-12-04T11:17:39.6044365Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6044839Z with policy(): 2025-12-04T11:17:39.6045280Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6045765Z raise RuntimeError(msg) 2025-12-04T11:17:39.6046758Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda! Caching allocator allocated memory was 0 and is now reported as 10240 on device 0. CUDA driver allocated memory was 870318080 and is now 880803840. 2025-12-04T11:17:39.6047647Z 2025-12-04T11:17:39.6047808Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6048575Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6049146Z 2025-12-04T11:17:39.6049336Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6049854Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda _ 2025-12-04T11:17:39.6050283Z Traceback (most recent call last): 2025-12-04T11:17:39.6050776Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6051264Z method(*args, **kwargs) 2025-12-04T11:17:39.6051767Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6052253Z method(*args, **kwargs) 2025-12-04T11:17:39.6052709Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6053184Z with policy(): 2025-12-04T11:17:39.6053621Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6054107Z raise RuntimeError(msg) 2025-12-04T11:17:39.6055042Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda! Caching allocator allocated memory was 10240 and is now reported as 20480 on device 0. CUDA driver allocated memory was 880803840 and is now 884998144. 2025-12-04T11:17:39.6055913Z 2025-12-04T11:17:39.6056069Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6056783Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6057343Z 2025-12-04T11:17:39.6057532Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6057912Z =================================== FAILURES =================================== 2025-12-04T11:17:39.6058409Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda _ 2025-12-04T11:17:39.6058836Z Traceback (most recent call last): 2025-12-04T11:17:39.6059325Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6059840Z method(*args, **kwargs) 2025-12-04T11:17:39.6060297Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6060775Z method(*args, **kwargs) 2025-12-04T11:17:39.6061231Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6061703Z with policy(): 2025-12-04T11:17:39.6062142Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6062628Z raise RuntimeError(msg) 2025-12-04T11:17:39.6063564Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda! Caching allocator allocated memory was 20480 and is now reported as 30720 on device 0. CUDA driver allocated memory was 884998144 and is now 889192448. 2025-12-04T11:17:39.6064435Z 2025-12-04T11:17:39.6064591Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6065330Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6065918Z 2025-12-04T11:17:39.6066103Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6066746Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_dataloader/test_dataloader-3d822d8017a86837.xml - 2025-12-04T11:17:39.6067333Z =========================== short test summary info ============================ 2025-12-04T11:17:39.6068753Z FAILED [2.5095s] test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda! Caching allocator allocated memory was 20480 and is now reported as 30720 on device 0. CUDA driver allocated memory was 884998144 and is now 889192448. 2025-12-04T11:17:39.6069976Z 2025-12-04T11:17:39.6070137Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6070889Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6071440Z 2025-12-04T11:17:39.6071624Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6072019Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:17:39.6072405Z === 1 failed, 178 passed, 6 skipped, 2 xfailed, 2 rerun in 310.49s (0:05:10) === 2025-12-04T11:17:39.6073050Z [W1204 11:16:22.766091689 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6073601Z Got exit code 1 2025-12-04T11:17:39.6073808Z Retrying single test... 2025-12-04T11:17:39.6074239Z Test results will be stored in test-reports/python-pytest/test_dataloader/test_dataloader-864685fc9f3ac539.xml 2025-12-04T11:17:39.6074741Z ============================= test session starts ============================== 2025-12-04T11:17:39.6075183Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:17:39.6075579Z cachedir: .pytest_cache 2025-12-04T11:17:39.6076043Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:17:39.6076546Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T11:17:39.6076797Z configfile: pytest.ini 2025-12-04T11:17:39.6077268Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:17:39.6078346Z collecting ... /var/lib/jenkins/pytorch/test/test_dataloader.py:900: PytestCollectionWarning: cannot collect test class 'TestProperExitDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6079149Z class TestProperExitDataset(Dataset): 2025-12-04T11:17:39.6079912Z /var/lib/jenkins/pytorch/test/test_dataloader.py:920: PytestCollectionWarning: cannot collect test class 'TestProperExitIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6080718Z class TestProperExitIterableDataset(IterableDataset): 2025-12-04T11:17:39.6081495Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1038: PytestCollectionWarning: cannot collect test class 'TestWorkerInfoDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6082252Z class TestWorkerInfoDataset(SynchronizedDataset): 2025-12-04T11:17:39.6083006Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1150: PytestCollectionWarning: cannot collect test class 'TestMultiEpochDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6083755Z class TestMultiEpochDataset(IterableDataset): 2025-12-04T11:17:39.6084542Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3452: PytestCollectionWarning: cannot collect test class 'TestWorkerQueueDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6085300Z class TestWorkerQueueDataset(Dataset): 2025-12-04T11:17:39.6086017Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3574: PytestCollectionWarning: cannot collect test class 'TestSlowIndexDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6086733Z class TestSlowIndexDataset(Dataset): 2025-12-04T11:17:39.6087454Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3592: PytestCollectionWarning: cannot collect test class 'TestSlowIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6088259Z class TestSlowIterableDataset(IterableDataset): 2025-12-04T11:17:39.6088578Z collected 188 items / 187 deselected / 1 selected 2025-12-04T11:17:39.6089278Z stepcurrent: skipping 186 already run items. Running only test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6089918Z Running 1 items in this shard 2025-12-04T11:17:39.6090071Z 2025-12-04T11:17:39.6090491Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda ('RERUN', {'yellow': True}) [2.5615s] [100%] 2025-12-04T11:17:39.6091381Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda ('RERUN', {'yellow': True}) [2.5240s] [100%] 2025-12-04T11:17:39.6092222Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda FAILED [2.6221s] [100%] 2025-12-04T11:17:39.6092660Z 2025-12-04T11:17:39.6092773Z ==================================== RERUNS ==================================== 2025-12-04T11:17:39.6093230Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda _ 2025-12-04T11:17:39.6093666Z Traceback (most recent call last): 2025-12-04T11:17:39.6094171Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6094662Z method(*args, **kwargs) 2025-12-04T11:17:39.6095124Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6095604Z method(*args, **kwargs) 2025-12-04T11:17:39.6096063Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6096540Z with policy(): 2025-12-04T11:17:39.6096982Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6097502Z raise RuntimeError(msg) 2025-12-04T11:17:39.6098496Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda! Caching allocator allocated memory was 0 and is now reported as 10240 on device 0. CUDA driver allocated memory was 807403520 and is now 853540864. 2025-12-04T11:17:39.6099358Z 2025-12-04T11:17:39.6099515Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6100236Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6100799Z 2025-12-04T11:17:39.6100981Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6101488Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda _ 2025-12-04T11:17:39.6101919Z Traceback (most recent call last): 2025-12-04T11:17:39.6102415Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6102934Z method(*args, **kwargs) 2025-12-04T11:17:39.6103425Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6103906Z method(*args, **kwargs) 2025-12-04T11:17:39.6104362Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6104832Z with policy(): 2025-12-04T11:17:39.6105270Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6105757Z raise RuntimeError(msg) 2025-12-04T11:17:39.6106686Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda! Caching allocator allocated memory was 10240 and is now reported as 20480 on device 0. CUDA driver allocated memory was 853540864 and is now 868220928. 2025-12-04T11:17:39.6107561Z 2025-12-04T11:17:39.6107720Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6108504Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6109066Z 2025-12-04T11:17:39.6109245Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6109620Z =================================== FAILURES =================================== 2025-12-04T11:17:39.6110065Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda _ 2025-12-04T11:17:39.6110492Z Traceback (most recent call last): 2025-12-04T11:17:39.6110986Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6111494Z method(*args, **kwargs) 2025-12-04T11:17:39.6111973Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6112467Z method(*args, **kwargs) 2025-12-04T11:17:39.6112921Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6113466Z with policy(): 2025-12-04T11:17:39.6113901Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6114383Z raise RuntimeError(msg) 2025-12-04T11:17:39.6115323Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda! Caching allocator allocated memory was 20480 and is now reported as 30720 on device 0. CUDA driver allocated memory was 868220928 and is now 882900992. 2025-12-04T11:17:39.6116230Z 2025-12-04T11:17:39.6116384Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6117090Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6117648Z 2025-12-04T11:17:39.6117828Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6118502Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_dataloader/test_dataloader-864685fc9f3ac539.xml - 2025-12-04T11:17:39.6119080Z =========================== short test summary info ============================ 2025-12-04T11:17:39.6120463Z FAILED [2.6221s] test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda! Caching allocator allocated memory was 20480 and is now reported as 30720 on device 0. CUDA driver allocated memory was 868220928 and is now 882900992. 2025-12-04T11:17:39.6121721Z 2025-12-04T11:17:39.6121881Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6122587Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6123142Z 2025-12-04T11:17:39.6123329Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6123723Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:17:39.6124083Z ================== 1 failed, 187 deselected, 2 rerun in 8.06s ================== 2025-12-04T11:17:39.6139701Z [W1204 11:16:32.486203124 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6140359Z Got exit code 1 2025-12-04T11:17:39.6140576Z Retrying single test... 2025-12-04T11:17:39.6141210Z Test results will be stored in test-reports/python-pytest/test_dataloader/test_dataloader-41dc6d4e54ec1afa.xml 2025-12-04T11:17:39.6141714Z ============================= test session starts ============================== 2025-12-04T11:17:39.6142164Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:17:39.6142559Z cachedir: .pytest_cache 2025-12-04T11:17:39.6143020Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:17:39.6143520Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T11:17:39.6143770Z configfile: pytest.ini 2025-12-04T11:17:39.6144244Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:17:39.6145260Z collecting ... /var/lib/jenkins/pytorch/test/test_dataloader.py:900: PytestCollectionWarning: cannot collect test class 'TestProperExitDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6146025Z class TestProperExitDataset(Dataset): 2025-12-04T11:17:39.6146777Z /var/lib/jenkins/pytorch/test/test_dataloader.py:920: PytestCollectionWarning: cannot collect test class 'TestProperExitIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6147579Z class TestProperExitIterableDataset(IterableDataset): 2025-12-04T11:17:39.6148423Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1038: PytestCollectionWarning: cannot collect test class 'TestWorkerInfoDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6149171Z class TestWorkerInfoDataset(SynchronizedDataset): 2025-12-04T11:17:39.6149916Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1150: PytestCollectionWarning: cannot collect test class 'TestMultiEpochDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6150689Z class TestMultiEpochDataset(IterableDataset): 2025-12-04T11:17:39.6151432Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3452: PytestCollectionWarning: cannot collect test class 'TestWorkerQueueDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6152153Z class TestWorkerQueueDataset(Dataset): 2025-12-04T11:17:39.6152860Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3574: PytestCollectionWarning: cannot collect test class 'TestSlowIndexDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6153566Z class TestSlowIndexDataset(Dataset): 2025-12-04T11:17:39.6154275Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3592: PytestCollectionWarning: cannot collect test class 'TestSlowIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6155022Z class TestSlowIterableDataset(IterableDataset): 2025-12-04T11:17:39.6155384Z collected 188 items / 187 deselected / 1 selected 2025-12-04T11:17:39.6156256Z stepcurrent: skipping 186 already run items. Running only test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6156884Z Running 1 items in this shard 2025-12-04T11:17:39.6157037Z 2025-12-04T11:17:39.6157459Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda ('RERUN', {'yellow': True}) [2.6126s] [100%] 2025-12-04T11:17:39.6158382Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda ('RERUN', {'yellow': True}) [2.4641s] [100%] 2025-12-04T11:17:39.6159219Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda FAILED [2.5332s] [100%] 2025-12-04T11:17:39.6159654Z 2025-12-04T11:17:39.6159787Z ==================================== RERUNS ==================================== 2025-12-04T11:17:39.6160253Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda _ 2025-12-04T11:17:39.6160723Z Traceback (most recent call last): 2025-12-04T11:17:39.6161229Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6161715Z method(*args, **kwargs) 2025-12-04T11:17:39.6162173Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6162652Z method(*args, **kwargs) 2025-12-04T11:17:39.6163109Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6163576Z with policy(): 2025-12-04T11:17:39.6164015Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6164500Z raise RuntimeError(msg) 2025-12-04T11:17:39.6165433Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda! Caching allocator allocated memory was 0 and is now reported as 10240 on device 0. CUDA driver allocated memory was 807403520 and is now 853540864. 2025-12-04T11:17:39.6166287Z 2025-12-04T11:17:39.6166452Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6167168Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6167723Z 2025-12-04T11:17:39.6167909Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6168463Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda _ 2025-12-04T11:17:39.6168926Z Traceback (most recent call last): 2025-12-04T11:17:39.6169414Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6169902Z method(*args, **kwargs) 2025-12-04T11:17:39.6170320Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6170559Z method(*args, **kwargs) 2025-12-04T11:17:39.6170785Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6171017Z with policy(): 2025-12-04T11:17:39.6171227Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6171455Z raise RuntimeError(msg) 2025-12-04T11:17:39.6171925Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda! Caching allocator allocated memory was 10240 and is now reported as 20480 on device 0. CUDA driver allocated memory was 853540864 and is now 868220928. 2025-12-04T11:17:39.6172357Z 2025-12-04T11:17:39.6172443Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6172793Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6173055Z 2025-12-04T11:17:39.6173144Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6173318Z =================================== FAILURES =================================== 2025-12-04T11:17:39.6173528Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda _ 2025-12-04T11:17:39.6173727Z Traceback (most recent call last): 2025-12-04T11:17:39.6173957Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6174181Z method(*args, **kwargs) 2025-12-04T11:17:39.6174412Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6174648Z method(*args, **kwargs) 2025-12-04T11:17:39.6174868Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6175102Z with policy(): 2025-12-04T11:17:39.6175324Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6175562Z raise RuntimeError(msg) 2025-12-04T11:17:39.6176014Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda! Caching allocator allocated memory was 20480 and is now reported as 30720 on device 0. CUDA driver allocated memory was 868220928 and is now 882900992. 2025-12-04T11:17:39.6176428Z 2025-12-04T11:17:39.6176515Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6176867Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6177140Z 2025-12-04T11:17:39.6177231Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6177540Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_dataloader/test_dataloader-41dc6d4e54ec1afa.xml - 2025-12-04T11:17:39.6177817Z =========================== short test summary info ============================ 2025-12-04T11:17:39.6178507Z FAILED [2.5332s] test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda! Caching allocator allocated memory was 20480 and is now reported as 30720 on device 0. CUDA driver allocated memory was 868220928 and is now 882900992. 2025-12-04T11:17:39.6179114Z 2025-12-04T11:17:39.6179190Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6179533Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6179804Z 2025-12-04T11:17:39.6179895Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6180086Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:17:39.6180265Z ================== 1 failed, 187 deselected, 2 rerun in 7.97s ================== 2025-12-04T11:17:39.6180561Z [W1204 11:16:43.898406153 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6180821Z Got exit code 1 2025-12-04T11:17:39.6181092Z FAILED CONSISTENTLY: test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda 2025-12-04T11:17:39.6181451Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:17:39.6181766Z Test results will be stored in test-reports/python-pytest/test_dataloader/test_dataloader-51fff536cd2370fd.xml 2025-12-04T11:17:39.6182007Z ============================= test session starts ============================== 2025-12-04T11:17:39.6182226Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:17:39.6182425Z cachedir: .pytest_cache 2025-12-04T11:17:39.6182662Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:17:39.6182908Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T11:17:39.6183035Z configfile: pytest.ini 2025-12-04T11:17:39.6183289Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:17:39.6183777Z collecting ... /var/lib/jenkins/pytorch/test/test_dataloader.py:900: PytestCollectionWarning: cannot collect test class 'TestProperExitDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6184154Z class TestProperExitDataset(Dataset): 2025-12-04T11:17:39.6184525Z /var/lib/jenkins/pytorch/test/test_dataloader.py:920: PytestCollectionWarning: cannot collect test class 'TestProperExitIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6184916Z class TestProperExitIterableDataset(IterableDataset): 2025-12-04T11:17:39.6185293Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1038: PytestCollectionWarning: cannot collect test class 'TestWorkerInfoDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6185662Z class TestWorkerInfoDataset(SynchronizedDataset): 2025-12-04T11:17:39.6186031Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1150: PytestCollectionWarning: cannot collect test class 'TestMultiEpochDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6186393Z class TestMultiEpochDataset(IterableDataset): 2025-12-04T11:17:39.6186752Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3452: PytestCollectionWarning: cannot collect test class 'TestWorkerQueueDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6187105Z class TestWorkerQueueDataset(Dataset): 2025-12-04T11:17:39.6187453Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3574: PytestCollectionWarning: cannot collect test class 'TestSlowIndexDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6187820Z class TestSlowIndexDataset(Dataset): 2025-12-04T11:17:39.6188214Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3592: PytestCollectionWarning: cannot collect test class 'TestSlowIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6188581Z class TestSlowIterableDataset(IterableDataset): 2025-12-04T11:17:39.6188742Z collected 188 items / 187 deselected / 1 selected 2025-12-04T11:17:39.6188894Z stepcurrent: skipping 187 already run items. 2025-12-04T11:17:39.6189036Z Running 1 items in this shard 2025-12-04T11:17:39.6189118Z 2025-12-04T11:17:39.6189458Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda [W1204 11:16:48.979176065 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6189880Z ('RERUN', {'yellow': True}) [3.5192s] [100%] 2025-12-04T11:17:39.6190320Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda [W1204 11:16:52.666783627 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6190858Z ('RERUN', {'yellow': True}) [3.1338s] [100%] 2025-12-04T11:17:39.6191275Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda [W1204 11:16:55.966763417 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6191684Z FAILED [3.3633s] [100%] 2025-12-04T11:17:39.6191749Z 2025-12-04T11:17:39.6191812Z ==================================== RERUNS ==================================== 2025-12-04T11:17:39.6192031Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda _ 2025-12-04T11:17:39.6192237Z Traceback (most recent call last): 2025-12-04T11:17:39.6192486Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6192731Z method(*args, **kwargs) 2025-12-04T11:17:39.6192986Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6193226Z method(*args, **kwargs) 2025-12-04T11:17:39.6193450Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6193682Z with policy(): 2025-12-04T11:17:39.6193904Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6194144Z raise RuntimeError(msg) 2025-12-04T11:17:39.6194589Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda! Caching allocator allocated memory was 0 and is now reported as 10240 on device 0. CUDA driver allocated memory was 807403520 and is now 853540864. 2025-12-04T11:17:39.6194989Z 2025-12-04T11:17:39.6195074Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6195426Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6195687Z 2025-12-04T11:17:39.6195785Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6196029Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda _ 2025-12-04T11:17:39.6196233Z Traceback (most recent call last): 2025-12-04T11:17:39.6196473Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6196711Z method(*args, **kwargs) 2025-12-04T11:17:39.6196965Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6197202Z method(*args, **kwargs) 2025-12-04T11:17:39.6197430Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6197663Z with policy(): 2025-12-04T11:17:39.6197880Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6198183Z raise RuntimeError(msg) 2025-12-04T11:17:39.6198623Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda! Caching allocator allocated memory was 10240 and is now reported as 20480 on device 0. CUDA driver allocated memory was 853540864 and is now 868220928. 2025-12-04T11:17:39.6199026Z 2025-12-04T11:17:39.6199107Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6199470Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6199746Z 2025-12-04T11:17:39.6199842Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6200025Z =================================== FAILURES =================================== 2025-12-04T11:17:39.6200233Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda _ 2025-12-04T11:17:39.6200432Z Traceback (most recent call last): 2025-12-04T11:17:39.6200667Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6200901Z method(*args, **kwargs) 2025-12-04T11:17:39.6201123Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6201357Z method(*args, **kwargs) 2025-12-04T11:17:39.6201580Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6201810Z with policy(): 2025-12-04T11:17:39.6202039Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6202274Z raise RuntimeError(msg) 2025-12-04T11:17:39.6202713Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda! Caching allocator allocated memory was 20480 and is now reported as 30720 on device 0. CUDA driver allocated memory was 868220928 and is now 882900992. 2025-12-04T11:17:39.6203114Z 2025-12-04T11:17:39.6203193Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6203527Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6203787Z 2025-12-04T11:17:39.6203881Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6204195Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_dataloader/test_dataloader-51fff536cd2370fd.xml - 2025-12-04T11:17:39.6204479Z =========================== short test summary info ============================ 2025-12-04T11:17:39.6205122Z FAILED [3.3633s] test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda! Caching allocator allocated memory was 20480 and is now reported as 30720 on device 0. CUDA driver allocated memory was 868220928 and is now 882900992. 2025-12-04T11:17:39.6205695Z 2025-12-04T11:17:39.6205788Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6206124Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6206388Z 2025-12-04T11:17:39.6206477Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6206668Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:17:39.6206848Z ================= 1 failed, 187 deselected, 2 rerun in 10.36s ================== 2025-12-04T11:17:39.6207138Z [W1204 11:16:56.888194050 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6207393Z Got exit code 1 2025-12-04T11:17:39.6207490Z Retrying single test... 2025-12-04T11:17:39.6207695Z Test results will be stored in test-reports/python-pytest/test_dataloader/test_dataloader-1845db6bdecae254.xml 2025-12-04T11:17:39.6207936Z ============================= test session starts ============================== 2025-12-04T11:17:39.6208202Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:17:39.6208410Z cachedir: .pytest_cache 2025-12-04T11:17:39.6208634Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:17:39.6208871Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T11:17:39.6208988Z configfile: pytest.ini 2025-12-04T11:17:39.6209211Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:17:39.6209686Z collecting ... /var/lib/jenkins/pytorch/test/test_dataloader.py:900: PytestCollectionWarning: cannot collect test class 'TestProperExitDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6210049Z class TestProperExitDataset(Dataset): 2025-12-04T11:17:39.6210416Z /var/lib/jenkins/pytorch/test/test_dataloader.py:920: PytestCollectionWarning: cannot collect test class 'TestProperExitIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6210816Z class TestProperExitIterableDataset(IterableDataset): 2025-12-04T11:17:39.6211181Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1038: PytestCollectionWarning: cannot collect test class 'TestWorkerInfoDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6211538Z class TestWorkerInfoDataset(SynchronizedDataset): 2025-12-04T11:17:39.6211891Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1150: PytestCollectionWarning: cannot collect test class 'TestMultiEpochDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6212239Z class TestMultiEpochDataset(IterableDataset): 2025-12-04T11:17:39.6212590Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3452: PytestCollectionWarning: cannot collect test class 'TestWorkerQueueDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6212941Z class TestWorkerQueueDataset(Dataset): 2025-12-04T11:17:39.6213283Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3574: PytestCollectionWarning: cannot collect test class 'TestSlowIndexDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6213617Z class TestSlowIndexDataset(Dataset): 2025-12-04T11:17:39.6213956Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3592: PytestCollectionWarning: cannot collect test class 'TestSlowIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6214309Z class TestSlowIterableDataset(IterableDataset): 2025-12-04T11:17:39.6214459Z collected 188 items / 187 deselected / 1 selected 2025-12-04T11:17:39.6214763Z stepcurrent: skipping 187 already run items. Running only test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6215073Z Running 1 items in this shard 2025-12-04T11:17:39.6215148Z 2025-12-04T11:17:39.6215485Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda [W1204 11:17:01.991613813 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6215991Z [W1204 11:17:02.603110114 unwind.cpp:219] Warning: Unsupported unwinding pattern: Address not in range (function unwinderFor) 2025-12-04T11:17:39.6216321Z [W1204 11:17:02.603184423 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:17:39.6216511Z 2025-12-04T11:17:39.6216662Z [W1204 11:17:09.985702898 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:17:39.6216851Z 2025-12-04T11:17:39.6216905Z ('RERUN', {'yellow': True}) [10.9236s] [100%] 2025-12-04T11:17:39.6217358Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda [W1204 11:17:12.157523045 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6217883Z [W1204 11:17:12.359575805 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:17:39.6218122Z 2025-12-04T11:17:39.6218272Z [W1204 11:17:12.360073227 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:17:39.6218459Z 2025-12-04T11:17:39.6218511Z ('RERUN', {'yellow': True}) [3.2852s] [100%] 2025-12-04T11:17:39.6218920Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda [W1204 11:17:15.139036859 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6219457Z [W1204 11:17:15.369754726 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:17:39.6219648Z 2025-12-04T11:17:39.6219799Z [W1204 11:17:15.370281948 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:17:39.6219989Z 2025-12-04T11:17:39.6220028Z FAILED [3.0110s] [100%] 2025-12-04T11:17:39.6220092Z 2025-12-04T11:17:39.6220149Z ==================================== RERUNS ==================================== 2025-12-04T11:17:39.6220354Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda _ 2025-12-04T11:17:39.6220550Z Traceback (most recent call last): 2025-12-04T11:17:39.6220789Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6221023Z method(*args, **kwargs) 2025-12-04T11:17:39.6221243Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6221479Z method(*args, **kwargs) 2025-12-04T11:17:39.6221694Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6221922Z with policy(): 2025-12-04T11:17:39.6222132Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6222364Z raise RuntimeError(msg) 2025-12-04T11:17:39.6222796Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda! Caching allocator allocated memory was 0 and is now reported as 10240 on device 0. CUDA driver allocated memory was 807403520 and is now 853540864. 2025-12-04T11:17:39.6223211Z 2025-12-04T11:17:39.6223288Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6223623Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6223887Z 2025-12-04T11:17:39.6223974Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6224208Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda _ 2025-12-04T11:17:39.6224402Z Traceback (most recent call last): 2025-12-04T11:17:39.6224631Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6224861Z method(*args, **kwargs) 2025-12-04T11:17:39.6225076Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6225305Z method(*args, **kwargs) 2025-12-04T11:17:39.6225520Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6225756Z with policy(): 2025-12-04T11:17:39.6225981Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6226207Z raise RuntimeError(msg) 2025-12-04T11:17:39.6226638Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda! Caching allocator allocated memory was 10240 and is now reported as 20480 on device 0. CUDA driver allocated memory was 853540864 and is now 868220928. 2025-12-04T11:17:39.6227043Z 2025-12-04T11:17:39.6227116Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6227449Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6227711Z 2025-12-04T11:17:39.6227801Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6227999Z =================================== FAILURES =================================== 2025-12-04T11:17:39.6228235Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda _ 2025-12-04T11:17:39.6228428Z Traceback (most recent call last): 2025-12-04T11:17:39.6228657Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6228883Z method(*args, **kwargs) 2025-12-04T11:17:39.6229097Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6229323Z method(*args, **kwargs) 2025-12-04T11:17:39.6229538Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6229762Z with policy(): 2025-12-04T11:17:39.6229972Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6230207Z raise RuntimeError(msg) 2025-12-04T11:17:39.6230645Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda! Caching allocator allocated memory was 20480 and is now reported as 30720 on device 0. CUDA driver allocated memory was 868220928 and is now 882900992. 2025-12-04T11:17:39.6231050Z 2025-12-04T11:17:39.6231124Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6231455Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6231717Z 2025-12-04T11:17:39.6231828Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6232135Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_dataloader/test_dataloader-1845db6bdecae254.xml - 2025-12-04T11:17:39.6232416Z =========================== short test summary info ============================ 2025-12-04T11:17:39.6233047Z FAILED [3.0110s] test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda! Caching allocator allocated memory was 20480 and is now reported as 30720 on device 0. CUDA driver allocated memory was 868220928 and is now 882900992. 2025-12-04T11:17:39.6233610Z 2025-12-04T11:17:39.6233683Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6234010Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6234272Z 2025-12-04T11:17:39.6234381Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6234582Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:17:39.6234752Z ================= 1 failed, 187 deselected, 2 rerun in 17.59s ================== 2025-12-04T11:17:39.6235040Z [W1204 11:17:16.917890651 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6235295Z Got exit code 1 2025-12-04T11:17:39.6235392Z Retrying single test... 2025-12-04T11:17:39.6235594Z Test results will be stored in test-reports/python-pytest/test_dataloader/test_dataloader-9e8b46e84e6885f3.xml 2025-12-04T11:17:39.6235827Z ============================= test session starts ============================== 2025-12-04T11:17:39.6236035Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:17:39.6236223Z cachedir: .pytest_cache 2025-12-04T11:17:39.6236473Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:17:39.6236710Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T11:17:39.6236828Z configfile: pytest.ini 2025-12-04T11:17:39.6237050Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:17:39.6237527Z collecting ... /var/lib/jenkins/pytorch/test/test_dataloader.py:900: PytestCollectionWarning: cannot collect test class 'TestProperExitDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6237887Z class TestProperExitDataset(Dataset): 2025-12-04T11:17:39.6238285Z /var/lib/jenkins/pytorch/test/test_dataloader.py:920: PytestCollectionWarning: cannot collect test class 'TestProperExitIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6238664Z class TestProperExitIterableDataset(IterableDataset): 2025-12-04T11:17:39.6239032Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1038: PytestCollectionWarning: cannot collect test class 'TestWorkerInfoDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6239387Z class TestWorkerInfoDataset(SynchronizedDataset): 2025-12-04T11:17:39.6239740Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1150: PytestCollectionWarning: cannot collect test class 'TestMultiEpochDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6240090Z class TestMultiEpochDataset(IterableDataset): 2025-12-04T11:17:39.6240441Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3452: PytestCollectionWarning: cannot collect test class 'TestWorkerQueueDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6240809Z class TestWorkerQueueDataset(Dataset): 2025-12-04T11:17:39.6241154Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3574: PytestCollectionWarning: cannot collect test class 'TestSlowIndexDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6241490Z class TestSlowIndexDataset(Dataset): 2025-12-04T11:17:39.6241829Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3592: PytestCollectionWarning: cannot collect test class 'TestSlowIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6242183Z class TestSlowIterableDataset(IterableDataset): 2025-12-04T11:17:39.6242333Z collected 188 items / 187 deselected / 1 selected 2025-12-04T11:17:39.6242636Z stepcurrent: skipping 187 already run items. Running only test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6242931Z Running 1 items in this shard 2025-12-04T11:17:39.6243004Z 2025-12-04T11:17:39.6243363Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda [W1204 11:17:21.158761771 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6243885Z [W1204 11:17:22.629427052 unwind.cpp:219] Warning: Unsupported unwinding pattern: Address not in range (function unwinderFor) 2025-12-04T11:17:39.6244216Z [W1204 11:17:22.629464192 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:17:39.6244404Z 2025-12-04T11:17:39.6244554Z [W1204 11:17:29.066797051 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:17:39.6244739Z 2025-12-04T11:17:39.6244791Z ('RERUN', {'yellow': True}) [10.8806s] [100%] 2025-12-04T11:17:39.6245205Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda [W1204 11:17:32.965641764 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6245747Z [W1204 11:17:32.200566921 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:17:39.6245935Z 2025-12-04T11:17:39.6246085Z [W1204 11:17:32.201108612 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:17:39.6246274Z 2025-12-04T11:17:39.6246322Z ('RERUN', {'yellow': True}) [3.0641s] [100%] 2025-12-04T11:17:39.6246731Z test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda [W1204 11:17:35.960441164 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6247242Z [W1204 11:17:35.463880955 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:17:39.6247430Z 2025-12-04T11:17:39.6247581Z [W1204 11:17:35.464377897 Module.cpp:201] symbolizing C++ stack trace for exception; if this hangs, rerun with TORCH_DISABLE_ADDR2LINE=1... 2025-12-04T11:17:39.6247773Z 2025-12-04T11:17:39.6247812Z FAILED [3.2574s] [100%] 2025-12-04T11:17:39.6247877Z 2025-12-04T11:17:39.6247930Z ==================================== RERUNS ==================================== 2025-12-04T11:17:39.6248175Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda _ 2025-12-04T11:17:39.6248371Z Traceback (most recent call last): 2025-12-04T11:17:39.6248608Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6248841Z method(*args, **kwargs) 2025-12-04T11:17:39.6249058Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6249307Z method(*args, **kwargs) 2025-12-04T11:17:39.6249529Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6249758Z with policy(): 2025-12-04T11:17:39.6249966Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6250195Z raise RuntimeError(msg) 2025-12-04T11:17:39.6250628Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda! Caching allocator allocated memory was 0 and is now reported as 10240 on device 0. CUDA driver allocated memory was 807403520 and is now 853540864. 2025-12-04T11:17:39.6251027Z 2025-12-04T11:17:39.6251100Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6251437Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6251710Z 2025-12-04T11:17:39.6251816Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6252050Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda _ 2025-12-04T11:17:39.6252243Z Traceback (most recent call last): 2025-12-04T11:17:39.6252472Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6252700Z method(*args, **kwargs) 2025-12-04T11:17:39.6252916Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6253142Z method(*args, **kwargs) 2025-12-04T11:17:39.6253358Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6253580Z with policy(): 2025-12-04T11:17:39.6253790Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6254039Z raise RuntimeError(msg) 2025-12-04T11:17:39.6254472Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda! Caching allocator allocated memory was 10240 and is now reported as 20480 on device 0. CUDA driver allocated memory was 853540864 and is now 868220928. 2025-12-04T11:17:39.6254872Z 2025-12-04T11:17:39.6254945Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6255272Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6255530Z 2025-12-04T11:17:39.6255617Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6255791Z =================================== FAILURES =================================== 2025-12-04T11:17:39.6255992Z _ TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda _ 2025-12-04T11:17:39.6256189Z Traceback (most recent call last): 2025-12-04T11:17:39.6256417Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6256643Z method(*args, **kwargs) 2025-12-04T11:17:39.6256857Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3329, in wrapper 2025-12-04T11:17:39.6257087Z method(*args, **kwargs) 2025-12-04T11:17:39.6257305Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 3328, in wrapper 2025-12-04T11:17:39.6257529Z with policy(): 2025-12-04T11:17:39.6257739Z File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py", line 2705, in __exit__ 2025-12-04T11:17:39.6257983Z raise RuntimeError(msg) 2025-12-04T11:17:39.6258466Z RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda! Caching allocator allocated memory was 20480 and is now reported as 30720 on device 0. CUDA driver allocated memory was 868220928 and is now 882900992. 2025-12-04T11:17:39.6258871Z 2025-12-04T11:17:39.6258945Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6259276Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6259542Z 2025-12-04T11:17:39.6259629Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6259937Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_dataloader/test_dataloader-9e8b46e84e6885f3.xml - 2025-12-04T11:17:39.6260242Z =========================== short test summary info ============================ 2025-12-04T11:17:39.6260902Z FAILED [3.2574s] test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda - RuntimeError: CUDA driver API confirmed a leak in __main__.TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda! Caching allocator allocated memory was 20480 and is now reported as 30720 on device 0. CUDA driver allocated memory was 868220928 and is now 882900992. 2025-12-04T11:17:39.6261468Z 2025-12-04T11:17:39.6261545Z To execute this test, run the following from the base repo dir: 2025-12-04T11:17:39.6261876Z PYTORCH_TEST_WITH_ROCM=1 PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=1 python test/test_dataloader.py TestDataLoaderDeviceTypeCUDA.test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6262133Z 2025-12-04T11:17:39.6262224Z This message can be suppressed by setting PYTORCH_PRINT_REPRO_ON_FAILURE=0 2025-12-04T11:17:39.6262410Z !!!!!!!!!!!!!!!!!!!!!!!!!! stopping after 1 failures !!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T11:17:39.6262599Z ================= 1 failed, 187 deselected, 2 rerun in 17.57s ================== 2025-12-04T11:17:39.6262891Z [W1204 11:17:36.977126964 CudaIPCTypes.cpp:16] Producer process has been terminated before all shared CUDA tensors released. See Note [Sharing CUDA tensors] 2025-12-04T11:17:39.6263153Z Got exit code 1 2025-12-04T11:17:39.6263382Z FAILED CONSISTENTLY: test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda 2025-12-04T11:17:39.6263718Z Test failed consistently, continuing with the rest of the tests due to continue-through-error being set 2025-12-04T11:17:39.6264018Z Test results will be stored in test-reports/python-pytest/test_dataloader/test_dataloader-744538f36b405c9d.xml 2025-12-04T11:17:39.6264255Z ============================= test session starts ============================== 2025-12-04T11:17:39.6264465Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T11:17:39.6264657Z cachedir: .pytest_cache 2025-12-04T11:17:39.6264884Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T11:17:39.6265123Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T11:17:39.6265242Z configfile: pytest.ini 2025-12-04T11:17:39.6265469Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T11:17:39.6265952Z collecting ... /var/lib/jenkins/pytorch/test/test_dataloader.py:900: PytestCollectionWarning: cannot collect test class 'TestProperExitDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6266317Z class TestProperExitDataset(Dataset): 2025-12-04T11:17:39.6266698Z /var/lib/jenkins/pytorch/test/test_dataloader.py:920: PytestCollectionWarning: cannot collect test class 'TestProperExitIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6267087Z class TestProperExitIterableDataset(IterableDataset): 2025-12-04T11:17:39.6267454Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1038: PytestCollectionWarning: cannot collect test class 'TestWorkerInfoDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6267809Z class TestWorkerInfoDataset(SynchronizedDataset): 2025-12-04T11:17:39.6268194Z /var/lib/jenkins/pytorch/test/test_dataloader.py:1150: PytestCollectionWarning: cannot collect test class 'TestMultiEpochDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6268545Z class TestMultiEpochDataset(IterableDataset): 2025-12-04T11:17:39.6268893Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3452: PytestCollectionWarning: cannot collect test class 'TestWorkerQueueDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6269240Z class TestWorkerQueueDataset(Dataset): 2025-12-04T11:17:39.6269603Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3574: PytestCollectionWarning: cannot collect test class 'TestSlowIndexDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6269955Z class TestSlowIndexDataset(Dataset): 2025-12-04T11:17:39.6270297Z /var/lib/jenkins/pytorch/test/test_dataloader.py:3592: PytestCollectionWarning: cannot collect test class 'TestSlowIterableDataset' because it has a __init__ constructor (from: test/test_dataloader.py) 2025-12-04T11:17:39.6270655Z class TestSlowIterableDataset(IterableDataset): 2025-12-04T11:17:39.6270807Z collected 188 items / 188 deselected / 0 selected 2025-12-04T11:17:39.6270951Z stepcurrent: skipping 188 already run items. 2025-12-04T11:17:39.6271084Z Running 0 items in this shard 2025-12-04T11:17:39.6271159Z 2025-12-04T11:17:39.6271343Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_dataloader/test_dataloader-744538f36b405c9d.xml - 2025-12-04T11:17:39.6271618Z =========================== 188 deselected in 0.35s ============================ 2025-12-04T11:17:39.6272097Z The following tests failed consistently: ['test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_forkserver_cuda', 'test/test_dataloader.py::TestDataLoaderDeviceTypeCUDA::test_sparse_tensor_multiprocessing_context_spawn_cuda'] 2025-12-04T11:17:39.6272490Z 2025-12-04T11:17:39.6272628Z FINISHED PRINTING LOG FILE of test_dataloader 1/1 (test/test-reports/test_dataloader_1.1_cbec5e33ff3c884a_.log) 2025-12-04T11:17:39.6272802Z 2025-12-04T11:17:39.6272900Z Finished test_dataloader 1/1 ... [2025-12-04 11:17:39.563920][4971842.0600965], took 6.50min 2025-12-04T11:17:39.6273257Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:17:39.6273608Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:17:39.6273831Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T11:17:39.6274011Z Uploading artifacts took 0.00 seconds 2025-12-04T11:17:39.6274137Z test_dataloader 1/1 failed! 2025-12-04T11:17:39.6274286Z Running test_decomp 6/12 ... [2025-12-04 11:17:39.581395][4971842.077580312] 2025-12-04T11:17:39.6274448Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:17:39.6274810Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_decomp.py', '--shard-id=6', '--num-shards=12', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:17:39.581751] 2025-12-04T11:30:51.9843873Z 2025-12-04T11:30:51.9844701Z test_decomp 6/12 was successful, full logs can be found in artifacts with path test/test-reports/test_decomp_6.12_7bc04bedbbce958a_.log 2025-12-04T11:30:52.0106618Z Running 763 items in this shard: test/test_decomp.py::TestDecompCUDA::test_comprehensive_T_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive___getitem___cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive___radd___cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rdiv___cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmatmul___cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmod___cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmul___cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rsub___cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rxor___cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive__batch_norm_with_update_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive__batch_norm_with_update_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive__chunk_cat_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive__segment_reduce_lengths_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive__segment_reduce_offsets_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive__segment_reduce_offsets_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive__unsafe_masked_index_put_accumulate_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_abs_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acos_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acosh_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_add_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_add_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addmm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addmm_decomposed_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addmv_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addr_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addr_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_alias_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_allclose_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_amax_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_amin_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_aminmax_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_angle_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_angle_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_any_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_any_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_any_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_arange_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_argmin_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_argsort_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_argsort_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_argwhere_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_argwhere_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_scatter_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asinh_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asinh_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atan2_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_1d_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_1d_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bernoulli_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bfloat16_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bfloat16_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bfloat16_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bincount_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_and_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_not_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_or_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_block_diag_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_broadcast_tensors_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_broadcast_tensors_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_broadcast_tensors_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cartesian_prod_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cartesian_prod_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cat_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cfloat_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cfloat_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chalf_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chalf_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_char_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chunk_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chunk_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_chunk_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clamp_max_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clamp_max_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clamp_max_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clamp_min_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clamp_min_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_column_stack_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_constant_pad_nd_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_constant_pad_nd_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_constant_pad_nd_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_constant_pad_nd_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_contiguous_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_copysign_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_copysign_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_corrcoef_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cos_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cosh_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_count_nonzero_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cov_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cross_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cummax_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cummin_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumulative_trapezoid_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_deg2rad_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_embed_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagflat_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagflat_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_copy_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_scatter_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_digamma_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dist_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_no_rounding_mode_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_trunc_rounding_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_div_trunc_rounding_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dot_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dsplit_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dsplit_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dstack_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dstack_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dstack_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dstack_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_like_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_like_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_like_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_like_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_permuted_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_permuted_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_strided_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eq_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_equal_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_erf_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_erfc_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_erfinv_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_erfinv_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_exp_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_as_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_exponential_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_eye_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fft2_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fft_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftn_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftshift_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fftshift_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft2_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft2_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft2_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfftn_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfftn_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifft2_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifft2_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifft_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifftshift_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ihfftn_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft2_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft2_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfftn_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_rfft_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flatten_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flatten_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flip_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flip_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flip_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flipud_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flipud_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmin_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmin_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmod_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmod_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_frac_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_frexp_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_full_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_full_like_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gather_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gather_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gcd_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_geometric_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_geqrf_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_grid_sampler_3d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gt_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gt_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_heaviside_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hsplit_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hstack_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hstack_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hstack_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hstack_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hstack_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_add_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_fill_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_fill_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_put_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_put_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_amax_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_amin_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_amin_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_inner_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_int_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isclose_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isin_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isin_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isin_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isinf_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isnan_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isneginf_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isreal_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isreal_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_item_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_item_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_4inputs_with_extra_args_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_binary_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_binary_return_by_ref_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_binary_return_by_ref_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_unary_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_kthvalue_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lcm_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_diagonal_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_inv_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_inv_ex_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_inv_ex_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_inv_ex_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lstsq_grad_oriented_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lu_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_lu_factor_ex_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_pinv_hermitian_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_tensorinv_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_tensor_overload_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_tensor_overload_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log1p_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log2_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log2_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_softmax_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_softmax_with_dtype_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_softmax_with_dtype_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logaddexp2_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logcumsumexp_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logcumsumexp_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_or_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logit_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_tensor_overload_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_tensor_overload_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logsumexp_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_long_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_long_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lt_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lu_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lu_solve_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lu_solve_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mT_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_amax_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_amin_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_amin_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_argmin_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_argmin_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_cumsum_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_logsumexp_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_logsumexp_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_logsumexp_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_mean_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_normalize_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_prod_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_select_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_select_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_std_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_std_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_sum_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_max_binary_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_maximum_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_meshgrid_list_of_tensors_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_binary_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_reduction_no_dim_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_reduction_no_dim_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_reduction_with_dim_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_minimum_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_msort_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mul_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mul_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nan_to_num_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nan_to_num_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nanmedian_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nansum_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nansum_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_native_layer_norm_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_neg_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_strided_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_ones_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_zeros_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nextafter_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_adaptive_avg_pool1d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_adaptive_avg_pool3d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_avg_pool2d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_avg_pool3d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv1d_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv3d_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv_transpose1d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv_transpose3d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_cosine_embedding_loss_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_cosine_similarity_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_dropout_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_dropout_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_elu_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_fractional_max_pool2d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_fractional_max_pool3d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_gelu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_grid_sample_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_hardtanh_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_huber_loss_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_trilinear_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_kl_div_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_l1_loss_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_margin_ranking_loss_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_mse_loss_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_circular_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_constant_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_reflect_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_reflect_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_reflect_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_replicate_negative_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_relu6_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_relu6_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_relu6_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_rrelu_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_softsign_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_softsign_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_unfold_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_upsample_nearest_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_norm_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_norm_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_norm_fro_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_normal_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_like_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_like_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ormqr_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_outer_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_outer_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_pca_lowrank_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_3_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_3_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_3_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_4_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_positive_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_positive_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_pow_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_pow_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_prod_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_prod_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_prod_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_prod_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_prod_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_put_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_put_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_put_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_quantile_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randint_like_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randn_like_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_real_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_renorm_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_renorm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_repeat_interleave_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_repeat_interleave_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_repeat_interleave_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reshape_as_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reshape_as_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reshape_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resize__cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_round_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rsqrt_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rsqrt_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rsub_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scalar_tensor_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scalar_tensor_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_add_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_add_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_amax_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_prod_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_searchsorted_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_select_scatter_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_select_scatter_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sgn_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sgn_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sgn_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sgn_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sign_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_signal_windows_hamming_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_signbit_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sin_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sin_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sinh_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sinh_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sinh_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_slice_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_slice_scatter_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_softmax_with_dtype_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sort_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sort_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_airy_ai_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_bessel_j1_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_t_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_t_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_u_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_u_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_entr_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_erfcx_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_hermite_polynomial_he_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i1_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_legendre_polynomial_p_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_i1_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_scaled_modified_bessel_k0_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_zeta_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_zeta_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_zeta_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_with_sizes_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sqrt_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sqrt_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sqrt_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_multiple_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_mean_unbiased_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sub_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sub_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sub_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tan_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tensor_split_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tensor_split_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tensordot_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_topk_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_torch_ops_aten__efficient_attention_forward_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_torch_ops_aten__safe_softmax_default_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_torch_ops_aten__safe_softmax_default_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trace_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trace_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trapezoid_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trapezoid_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trapz_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tril_indices_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_triu_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_triu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_triu_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_true_divide_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trunc_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unbind_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unbind_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unbind_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_copy_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_uniform_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unique_consecutive_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unique_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_chunk_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_chunk_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_chunk_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_split_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_split_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsqueeze_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_var_unbiased_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_var_unbiased_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_var_unbiased_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vdot_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_as_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_as_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_as_real_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vstack_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vstack_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_xlogy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zero__cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zeros_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zeros_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zeros_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick__batch_norm_with_update_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick__batch_norm_with_update_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick__chunk_cat_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick__chunk_cat_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick__unsafe_masked_index_put_accumulate_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_abs_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_acos_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_acos_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_add_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_add_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_addcmul_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_addmm_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_addmm_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_addmm_decomposed_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_addr_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_alias_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_alias_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_all_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_all_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_all_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_amax_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_amin_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_aminmax_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_any_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_as_strided_copy_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_as_strided_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_as_strided_scatter_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_as_strided_scatter_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_as_strided_scatter_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_asinh_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_asinh_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_atan_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_atan_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_atanh_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_baddbmm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_not_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_block_diag_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_clamp_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_clone_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_copysign_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_addr_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_nn_functional_unfold_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_sinc_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_split_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_std_unbiased_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_take_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_unsafe_split_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_xlogy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_cos_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_cos_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_cosh_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_cosh_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_count_nonzero_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_count_nonzero_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_count_nonzero_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_cumprod_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_cumsum_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_deg2rad_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_deg2rad_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_diag_embed_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_diag_embed_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_diag_embed_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_copy_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_scatter_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_scatter_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_div_floor_rounding_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_div_floor_rounding_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_div_trunc_rounding_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_dot_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_dot_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_empty_like_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_eq_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_eq_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_erf_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_erf_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_erfc_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_erfinv_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_exp2_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_exp2_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_exp_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_eye_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_eye_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft2_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fftn_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fftn_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfft_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfftn_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft2_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft2_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ifft_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfft2_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfftn_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfft2_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfft_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfft_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfftn_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_rfft2_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_fill_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_floor_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_floor_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_floor_divide_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_floor_divide_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_floor_divide_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_fmax_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_fmax_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_fmod_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_fmod_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_full_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_geometric_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_grid_sampler_2d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_gt_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_gt_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_heaviside_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_i0_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_igamma_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_index_add_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_index_copy_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_index_fill_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_isin_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_isin_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_isin_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_isin_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_isinf_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_isposinf_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_isposinf_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_item_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_lcm_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_le_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_le_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_le_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_lgamma_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_lgamma_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_diagonal_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_tensor_overload_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_tensor_overload_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_log10_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_log1p_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_log2_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_log_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_log_normal_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_log_softmax_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_logaddexp_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_logaddexp_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_logical_and_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_logical_not_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_logical_xor_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_tensor_overload_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_logsumexp_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_logsumexp_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_masked_fill_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_mean_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_meshgrid_list_of_tensors_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_minimum_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_mv_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_nansum_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_narrow_copy_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_narrow_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_narrow_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_native_batch_norm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_native_dropout_backward_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_ne_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_neg_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_new_empty_strided_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_new_empty_strided_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_new_full_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_new_ones_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_new_ones_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_new_ones_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_new_zeros_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_gelu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_glu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_max_unpool2d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_max_unpool3d_grad_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_pad_constant_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_relu6_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_relu_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_relu_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_rrelu_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_rrelu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_softplus_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_softplus_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_softshrink_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_norm_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_normal_in_place_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_ones_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_ones_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_ones_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_ones_like_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_permute_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_permute_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_permute_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_permute_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_permute_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_polar_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_pow_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_pow_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_prod_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_prod_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_rad2deg_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_rad2deg_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_remainder_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_remainder_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_round_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_round_decimals_0_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_round_decimals_3_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_round_decimals_3_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_rsqrt_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_select_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_sgn_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_sgn_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_sign_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_signbit_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_sin_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_sin_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_slice_scatter_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_softmax_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_special_erfcx_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_special_i1_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_special_i1e_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_special_i1e_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_special_ndtr_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_split_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_split_with_sizes_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_split_with_sizes_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_multiple_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_std_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_std_unbiased_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_sub_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_sum_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_t_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_take_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_tan_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_tan_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_tan_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_tanh_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_tanh_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_tanh_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_transpose_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_tril_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_trunc_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_trunc_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_unfold_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_uniform_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_uniform_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_unsafe_split_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_view_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_view_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_view_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_xlogy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_like_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_like_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_rnn_decomp_module_nn_GRU_eval_mode_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_rnn_decomp_module_nn_RNN_eval_mode_cuda_float32, test/test_decomp.py::DecompOneOffTestsCUDA::test_exponential_non_inf_cuda, test/test_decomp.py::DecompOneOffTestsCUDA::test_threshold_backward_dtype_cuda, test/test_decomp.py::HasDecompTest::test_mm_decompose_mm_dde 2025-12-04T11:30:52.0306532Z 2025-12-04T11:30:52.0306762Z Finished test_decomp 6/12 ... [2025-12-04 11:30:51.985066][4972634.481249746], took 13.21min 2025-12-04T11:30:52.0307537Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:30:52.0308314Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:30:52.0308743Z Running test_decomp 12/12 ... [2025-12-04 11:30:52.003741][4972634.499928036] 2025-12-04T11:30:52.0309089Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:30:52.0309853Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_decomp.py', '--shard-id=12', '--num-shards=12', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:30:52.004227] 2025-12-04T11:41:46.8352455Z 2025-12-04T11:41:46.8353991Z test_decomp 12/12 was successful, full logs can be found in artifacts with path test/test-reports/test_decomp_12.12_fbe5e4adbe2542e0_.log 2025-12-04T11:41:46.8462848Z Running 725 items in this shard: test/test_decomp.py::TestDecompCUDA::test_comprehensive_T_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_T_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive___getitem___cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive___getitem___cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive___radd___cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive___radd___cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rdiv___cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmatmul___cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmatmul___cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmatmul___cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmod___cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rmul___cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive___rsub___cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive__chunk_cat_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive__chunk_cat_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive__softmax_backward_data_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive__unsafe_masked_index_put_accumulate_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive__unsafe_masked_index_put_accumulate_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive__upsample_bilinear2d_aa_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_abs_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acos_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acos_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_acos_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_add_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addcmul_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_addcmul_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_alias_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_alias_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_all_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_all_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_allclose_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_amax_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_amin_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_aminmax_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_angle_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_argmax_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_as_strided_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asin_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asin_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asinh_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asinh_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_asinh_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atanh_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_1d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_2d_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_3d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_3d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_atleast_3d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_baddbmm_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bfloat16_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_left_shift_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_left_shift_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bitwise_left_shift_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_block_diag_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bmm_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bool_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_broadcast_to_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bucketize_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bucketize_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_bucketize_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cat_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cat_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cat_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cdouble_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cdouble_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cfloat_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_char_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_char_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cholesky_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clamp_max_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_clamp_min_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_column_stack_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_conj_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_conj_physical_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_constant_pad_nd_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_copysign_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_copysign_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_corrcoef_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_corrcoef_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cos_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cosh_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cosh_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cosh_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cov_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cov_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cov_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cummax_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cummax_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cummin_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumprod_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumprod_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_cumprod_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_deg2rad_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_deg2rad_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diag_embed_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagflat_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagflat_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_diagonal_scatter_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_digamma_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_double_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_dstack_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_permuted_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_empty_permuted_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_equal_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_equal_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_erf_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_erfc_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_erfinv_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_exp2_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_as_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expand_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expm1_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expm1_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_expm1_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_exponential_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fft2_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_fft2_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_hfft_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifft2_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ifft_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ihfftn_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ihfftn_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_ihfftn_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft2_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_irfft_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_rfft2_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fft_rfft_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fill_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flatten_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flip_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_flip_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fliplr_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fliplr_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_float_power_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_floor_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmax_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_fmod_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_full_like_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gather_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_geometric_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_geometric_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_gt_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_histc_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hsplit_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hstack_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_hstack_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_i0_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_add_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_fill_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_amax_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_reduce_prod_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_index_select_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_int_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_int_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isfinite_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isnan_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isnan_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_isneginf_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_2inputs_2outputs_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_4inputs_with_extra_args_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_jiterator_unary_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_kron_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lcm_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lcm_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ldexp_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ldexp_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lgamma_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lgamma_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_cholesky_ex_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_det_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_diagonal_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_diagonal_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_householder_product_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_inv_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_ldl_solve_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_ldl_solve_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_matrix_norm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_matrix_power_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_pinv_hermitian_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_pinv_singular_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_solve_ex_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linalg_solve_triangular_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_tensor_overload_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_tensor_overload_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_linspace_tensor_overload_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log10_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log10_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log1p_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log1p_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log2_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log2_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_log_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logdet_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_and_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_and_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_and_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logical_or_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_tensor_overload_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logspace_tensor_overload_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logsumexp_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_logsumexp_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lu_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lu_solve_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_lu_unpack_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mH_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mT_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_amin_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_amin_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_argmax_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_argmax_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_argmin_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_log_softmax_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_log_softmax_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_log_softmax_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_median_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_norm_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_normalize_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_normalize_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_scatter_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_select_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_select_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_masked_softmin_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_matrix_exp_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_maximum_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mean_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_meshgrid_list_of_tensors_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_meshgrid_variadic_tensors_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_min_reduction_with_dim_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_minimum_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mm_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mm_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mode_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_msort_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_msort_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nan_to_num_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nansum_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_copy_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_narrow_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_native_batch_norm_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ne_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_neg_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_neg_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_neg_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_empty_strided_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_full_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_full_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_ones_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_ones_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_new_zeros_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nextafter_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_batch_norm_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_celu_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_channel_shuffle_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_channel_shuffle_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_conv_transpose2d_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_cosine_similarity_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_cross_entropy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_dropout2d_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_dropout2d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_embedding_bag_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_without_train_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_feature_alpha_dropout_without_train_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_fractional_max_pool3d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_gelu_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_gelu_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_group_norm_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_group_norm_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_hardtanh_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_huber_loss_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_area_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_bicubic_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_l1_loss_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_linear_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_margin_ranking_loss_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_pool2d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_unpool1d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_max_unpool2d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_multi_head_attention_forward_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_multi_head_attention_forward_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_nll_loss_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_normalize_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_circular_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_reflect_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pad_replicate_negative_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pixel_shuffle_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pixel_shuffle_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pixel_unshuffle_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_pixel_unshuffle_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_poisson_nll_loss_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_poisson_nll_loss_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_prelu_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_relu_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_scaled_dot_product_attention_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_silu_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_tanhshrink_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_tanhshrink_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_triplet_margin_with_distance_loss_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nn_functional_unfold_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_nonzero_static_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ones_like_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_outer_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_pca_lowrank_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_permute_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_1_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_2_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_2_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_3_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_3_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_polygamma_polygamma_n_3_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_pow_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_put_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_put_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rad2deg_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rad2deg_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_randint_like_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_ravel_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reciprocal_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reciprocal_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reciprocal_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reciprocal_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_remainder_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_repeat_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_repeat_interleave_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_repeat_interleave_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reshape_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_reshape_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resize__cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resize_as__cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resize_as__cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resize_as__cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_resolve_conj_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_roll_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rot90_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_round_decimals_0_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rsqrt_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_rsqrt_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_add_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_mean_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_scatter_reduce_sum_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_searchsorted_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_short_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sigmoid_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sign_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sign_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_signal_windows_cosine_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_signal_windows_hamming_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sinc_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sinh_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_slice_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_slice_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_slice_scatter_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_softmax_with_dtype_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sort_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sort_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_airy_ai_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_bessel_y0_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_chebyshev_polynomial_v_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_entr_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_entr_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_erfcx_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_hermite_polynomial_h_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_hermite_polynomial_h_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_i1_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_laguerre_polynomial_l_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_legendre_polynomial_p_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_log_ndtr_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_log_ndtr_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_i1_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_i1_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_k0_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_k0_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_modified_bessel_k1_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_scaled_modified_bessel_k1_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_xlog1py_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_comprehensive_special_xlog1py_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_split_with_sizes_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_square_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_copy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_squeeze_multiple_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_mean_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_std_unbiased_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_stft_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_sum_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_svd_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_copy_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_t_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tan_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tan_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tanh_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_to_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_topk_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_topk_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_transpose_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trapezoid_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trapz_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trapz_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_triangular_solve_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tril_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_tril_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_triu_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_triu_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_true_divide_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_trunc_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unbind_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unbind_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unflatten_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unfold_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_uniform_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unique_consecutive_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unique_consecutive_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unique_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unique_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unravel_index_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_split_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsafe_split_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_unsqueeze_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_var_mean_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_var_mean_unbiased_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vdot_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_view_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vstack_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_comprehensive_vstack_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_where_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_xlogy_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zero__cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zero__cuda_int64, test/test_decomp.py::TestDecompCUDA::test_comprehensive_zeros_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick__native_batch_norm_legit_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick__unsafe_masked_index_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick__unsafe_masked_index_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick__unsafe_masked_index_put_accumulate_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_abs_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_acos_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_acosh_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_add_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_addcmul_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_addcmul_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_addr_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_addr_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_alias_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_all_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_any_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_any_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_any_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_as_strided_scatter_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_as_strided_scatter_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_as_strided_scatter_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_asinh_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_atanh_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_bernoulli_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_and_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_bitwise_left_shift_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_bucketize_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_ceil_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_ceil_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_clamp_min_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_clone_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_clone_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_addcmul_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_diagonal_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_fill_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_norm_inf_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_rad2deg_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_sgn_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_core_backward_std_mean_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_cos_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_cos_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_cosh_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_cosh_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_count_nonzero_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_cumprod_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_cumsum_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_diag_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_diagonal_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_div_no_rounding_mode_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_empty_like_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_empty_like_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_empty_strided_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_empty_strided_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_eq_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_eq_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_erfc_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_erfinv_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_exp_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_expand_copy_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_expand_copy_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_expand_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_expand_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_expm1_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_fft_fft_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfft2_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fft_hfftn_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfft2_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfft_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfft_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_fft_ihfftn_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_irfftn_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_fft_rfft2_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_fft_rfftn_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_fill_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_flip_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_flip_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_floor_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_fmod_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_full_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_gcd_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_ge_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_geometric_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_grid_sampler_2d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_gt_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_gt_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_heaviside_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_index_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_index_fill_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_isin_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_isinf_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_isinf_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_isneginf_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_isneginf_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_item_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_cross_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_cross_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_cross_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_cross_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_cross_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_cross_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_linalg_diagonal_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_tensor_overload_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_linspace_tensor_overload_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_log10_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_log10_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_log10_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_logaddexp2_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_logical_not_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_logical_or_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_logical_xor_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_logical_xor_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_logit_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_tensor_overload_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_tensor_overload_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_logspace_tensor_overload_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_logsumexp_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_lt_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_masked_fill_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_mean_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_meshgrid_list_of_tensors_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_meshgrid_list_of_tensors_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_meshgrid_variadic_tensors_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_meshgrid_variadic_tensors_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_meshgrid_variadic_tensors_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_mul_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_nan_to_num_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_nansum_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_ne_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_ne_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_new_empty_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_new_empty_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_new_empty_strided_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_new_full_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_new_ones_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_new_ones_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_new_ones_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_elu_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_embedding_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_hardswish_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_hardtanh_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_max_unpool2d_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_max_unpool3d_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_max_unpool3d_grad_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_mish_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_relu_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_nn_functional_silu_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_norm_fro_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_normal_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_normal_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_normal_number_mean_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_ones_like_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_ones_like_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_permute_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_pow_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_pow_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_prod_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_prod_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_rad2deg_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_randn_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_reciprocal_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_remainder_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_renorm_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_renorm_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_repeat_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_repeat_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_roll_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_rot90_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_round_decimals_3_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_rsqrt_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_rsub_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_rsub_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_rsub_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_select_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_select_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_sigmoid_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_sign_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_sign_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_slice_scatter_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_special_entr_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_special_erfcx_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_special_log_ndtr_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_special_ndtri_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_split_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_split_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_split_list_args_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_split_with_sizes_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_sqrt_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_squeeze_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_std_cuda_complex128, test/test_decomp.py::TestDecompCUDA::test_quick_std_mean_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_sub_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_sub_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_sub_cuda_int32, test/test_decomp.py::TestDecompCUDA::test_quick_sum_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_t_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_t_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_t_copy_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_t_copy_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_t_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_t_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_take_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_take_cuda_int64, test/test_decomp.py::TestDecompCUDA::test_quick_tanh_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_trace_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_trace_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_transpose_copy_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_transpose_copy_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_transpose_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_tril_cuda_int16, test/test_decomp.py::TestDecompCUDA::test_quick_triu_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_triu_cuda_float16, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_copy_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_unbind_copy_cuda_complex32, test/test_decomp.py::TestDecompCUDA::test_quick_unfold_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_unfold_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_unfold_cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_unfold_cuda_int8, test/test_decomp.py::TestDecompCUDA::test_quick_uniform_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_unsafe_split_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_unsqueeze_copy_cuda_complex64, test/test_decomp.py::TestDecompCUDA::test_quick_unsqueeze_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_unsqueeze_cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_var_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_quick_var_mean_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_var_unbiased_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_view_copy_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_view_cuda_float64, test/test_decomp.py::TestDecompCUDA::test_quick_view_cuda_uint8, test/test_decomp.py::TestDecompCUDA::test_quick_zero__cuda_bfloat16, test/test_decomp.py::TestDecompCUDA::test_quick_zero__cuda_bool, test/test_decomp.py::TestDecompCUDA::test_quick_zeros_like_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_rnn_decomp_module_nn_GRU_train_mode_cuda_float32, test/test_decomp.py::TestDecompCUDA::test_rnn_decomp_module_nn_RNN_eval_mode_cuda_float64, test/test_decomp.py::DecompOneOffTestsCUDA::test_contiguous_softmax_cuda 2025-12-04T11:41:46.8535841Z 2025-12-04T11:41:46.8535955Z Finished test_decomp 12/12 ... [2025-12-04 11:41:46.838413][4973289.334601759], took 10.91min 2025-12-04T11:41:46.8536555Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:41:46.8536918Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:41:46.8537147Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T11:41:46.8537335Z Uploading artifacts took 0.00 seconds 2025-12-04T11:41:46.8537518Z Running test_ci_sanity_check_fail 1/1 ... [2025-12-04 11:41:46.845291][4973289.341482106] 2025-12-04T11:41:46.8537704Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:41:46.8538145Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ci_sanity_check_fail.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:41:46.845497] 2025-12-04T11:41:56.6692265Z Finished test_ci_sanity_check_fail 1/1 ... [2025-12-04 11:41:56.667188][4973299.163367635], took 0.16min 2025-12-04T11:41:56.6694030Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T11:41:56.6743810Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T11:41:56.6746418Z Running test_meta 3/3 ... [2025-12-04 11:41:56.674560][4973299.170744874] 2025-12-04T11:41:56.6747983Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T11:41:56.6751681Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_meta.py', '--shard-id=3', '--num-shards=3', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 11:41:56.675054] 2025-12-04T12:25:42.7807076Z 2025-12-04T12:25:42.7807741Z PRINTING LOG FILE of test_meta 3/3 (test/test-reports/test_meta_3.3_fff0f714bc69ad37_.log) 2025-12-04T12:25:42.7808176Z Test results will be stored in test-reports/python-pytest/test_meta/test_meta-aadb16c8e7d2ceb7.xml 2025-12-04T12:25:42.7808492Z ============================= test session starts ============================== 2025-12-04T12:25:42.7809081Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:25:42.7809329Z cachedir: .pytest_cache 2025-12-04T12:25:42.7809595Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:25:42.7809901Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:25:42.7810113Z configfile: pytest.ini 2025-12-04T12:25:42.7810641Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:25:42.7813415Z collecting ... /var/lib/jenkins/pytorch/test/test_meta.py:0: PytestCollectionWarning: cannot collect test class 'TestExpect' because it has a __new__ constructor (from: test/test_meta.py) 2025-12-04T12:25:42.7815306Z collected 40725 items 2025-12-04T12:25:42.7815448Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T12:25:42.9514865Z Running 13711 items in this shard: test/test_meta.py::TestMetaConverter::test_complex_noncontiguous_bug, test/test_meta.py::TestMetaConverter::test_inplace_set_storage, test/test_meta.py::TestMetaConverter::test_leaf, test/test_meta.py::TestMetaConverter::test_non_leaf_torture, test/test_meta.py::TestMetaConverter::test_requires_grad_false, test/test_meta.py::TestMetaConverter::test_view_of_leaf, test/test_meta.py::TestMetaConverter::test_view_of_view_of_leaf, test/test_meta.py::TestMetaCUDA::test_batch_norm_backward_output_mask0_cuda, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype___radd___cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype___rdiv___cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype___rmod___cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype___rmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype___rpow___cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs__conversions_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs__conversions_polar_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_eq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_fmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_hypot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_igammac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_le_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_logical_or_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_minimum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_mul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_ne_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_true_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_div_no_rounding_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_div_trunc_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_eq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_hypot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_jiterator_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_logical_or_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_maximum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_chebyshev_polynomial_w_cuda_float32, test/test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rand___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rand___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rand___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmatmul___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmatmul___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___ror___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___ror___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rsub___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rsub___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rsub___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rxor___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rxor___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rxor___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__batch_norm_with_update_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_frac_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_frac_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_mul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_mul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_mul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sigmoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sigmoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_lengths_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_lengths_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_lengths_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__softmax_backward_data_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__softmax_backward_data_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__softmax_backward_data_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__softmax_backward_data_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addbmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addbmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addbmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcdiv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmm_decomposed_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argsort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_baddbmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_baddbmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_baddbmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bernoulli_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bernoulli_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bincount_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bincount_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_and_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_and_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_left_shift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_left_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_not_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_not_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_or_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_or_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_or_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_or_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_right_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_xor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_xor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_xor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_shapes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cauchy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_inverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_min_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumsum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumsum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dist_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_einsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_einsum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_einsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfinv_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exponential_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exponential_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_float8_e5m2fnuz, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flipud_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flipud_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flipud_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_divide_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_divide_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_frexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_frexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_uint16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_uint32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gather_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gather_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gather_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gather_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gather_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gcd_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gcd_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ge_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ge_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ge_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ge_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ge_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_grid_sampler_2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_grid_sampler_2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_grid_sampler_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_heaviside_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_heaviside_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_histc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_igamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_igamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_igammac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_fill_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_fill_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_fill_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_inner_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_istft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kron_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kron_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kron_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lcm_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cholesky_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cond_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cond_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_det_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_diagonal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_diagonal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_diagonal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigvals_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigvals_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigvalsh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigvalsh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_householder_product_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_householder_product_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_inv_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_inv_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_inv_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_factor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_factor_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lstsq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_power_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_hermitian_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_hermitian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_multi_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_multi_dot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_multi_dot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_multi_dot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_hermitian_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_hermitian_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_hermitian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_hermitian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_qr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_slogdet_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_slogdet_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_triangular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_triangular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_svd_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_svdvals_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_svdvals_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_tensorinv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_tensorinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_tensorsolve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vecdot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vecdot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vecdot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log10_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_normal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_normal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_with_dtype_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_xor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_xor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_xor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumprod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logaddexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_median_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_normalize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_normalize_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_normalize_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_softmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_softmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matmul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matrix_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_pool2d_with_indices_backward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_no_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_with_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_with_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_maximum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_median_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_no_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_no_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_no_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_no_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mode_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mode_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmedian_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmedian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmedian_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_batch_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_dropout_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_dropout_backward_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nextafter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nextafter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_batch_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_binary_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_celu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_celu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_similarity_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_similarity_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_elu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_elu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_elu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_elu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_embedding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_embedding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_embedding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_fractional_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_gelu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_glu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_grid_sample_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_grid_sample_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_group_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_group_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardsigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_huber_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_instance_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_instance_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_area_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_linear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_linear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_linear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_kl_div_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_l1_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_l1_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_layer_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_leaky_relu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_linear_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_local_response_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_logsigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool2d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool3d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mish_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mish_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mish_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mse_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mse_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multi_head_attention_forward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multi_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multi_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_normalize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_normalize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_normalize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_circular_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_prelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_prelu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu6_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu6_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu6_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_rms_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_rms_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_rrelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_silu_complex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_silu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_smooth_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_soft_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_soft_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softplus_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_threshold_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_threshold_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_unfold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_upsample_nearest_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_upsample_nearest_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_upsample_nearest_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_fro_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_inf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_inf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_inf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_nuc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_nuc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_in_place_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_in_place_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_in_place_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_in_place_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_number_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ormqr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pca_lowrank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pca_lowrank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pinverse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pinverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pinverse_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polar_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_quantile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_renorm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_renorm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_renorm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_neg_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsqrt_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_searchsorted_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_searchsorted_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_blackman_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_blackman_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_cosine_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_gaussian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_gaussian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_general_cosine_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_hamming_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_nuttall_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_with_dtype_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sparse_mm_reduce_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sparse_mm_reduce_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sparse_sampled_addmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_airy_ai_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_airy_ai_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_t_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_t_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_w_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_w_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_w_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_h_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_h_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_h_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_legendre_polynomial_p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_log_ndtr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_log_ndtr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_spherical_bessel_j0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_spherical_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_spherical_bessel_j0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_svd_lowrank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_svd_lowrank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_svd_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_svd_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_along_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_along_dim_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_along_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensordot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tile_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_topk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_topk_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triangular_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_indices_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_uniform_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_uniform_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_uniform_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_uint16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_uint32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_uint64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unravel_index_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vdot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vdot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_complex_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zero__cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zero__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rand___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rand___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rand___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmatmul___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmatmul___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmatmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmod___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmod___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmod___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___ror___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___ror___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rpow___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rsub___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rsub___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rxor___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_acos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_acos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_acos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_min_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_min_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_floor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_floor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_floor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_floor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_minimum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_minimum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_trunc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_trunc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_trunc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__native_batch_norm_legit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__native_batch_norm_legit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__segment_reduce_lengths_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__segment_reduce_lengths_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__segment_reduce_offsets_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__softmax_backward_data_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__upsample_bilinear2d_aa_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__upsample_bilinear2d_aa_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addbmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addbmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcdiv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_decomposed_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_allclose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_allclose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_allclose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_any_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_any_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_any_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argsort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_3d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_baddbmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_baddbmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bernoulli_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bernoulli_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bincount_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bincount_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bincount_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_and_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_and_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_left_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_left_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_not_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_not_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_not_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_or_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_or_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_right_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_right_shift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_right_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_right_shift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_xor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_xor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_shapes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bucketize_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bucketize_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cauchy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdist_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chalf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chalf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_inverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_min_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_min_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_min_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_contiguous_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_contiguous_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_contiguous_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_copysign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_copysign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_copysign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_corrcoef_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_corrcoef_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_corrcoef_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_corrcoef_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_corrcoef_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumulative_trapezoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumulative_trapezoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumulative_trapezoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumulative_trapezoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_deg2rad_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dist_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dist_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dist_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_floor_rounding_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_floor_rounding_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_floor_rounding_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_einsum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_einsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfinv_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfinv_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exponential_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_float8_e5m2fnuz, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_divide_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frac_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_uint16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gcd_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gcd_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gcd_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geometric_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geometric_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geometric_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geometric_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geqrf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_grid_sampler_2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_grid_sampler_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_grid_sampler_3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_heaviside_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_heaviside_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_heaviside_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hypot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_i0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_i0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_i0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_igamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_imag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_imag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_inner_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_inner_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_inner_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isinf_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isinf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isinf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isneginf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isneginf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isneginf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_istft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_istft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_return_by_ref_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_return_by_ref_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lcm_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lcm_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ldexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ldexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ldexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ldexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lerp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lerp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lgamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lgamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cond_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cond_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_det_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_det_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eig_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigvals_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigvalsh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigvalsh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigvalsh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigvalsh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_householder_product_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_householder_product_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_inv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_inv_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_inv_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_factor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_factor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lstsq_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lstsq_grad_oriented_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_factor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_factor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_factor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_factor_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_factor_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_power_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_rank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_rank_hermitian_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_rank_hermitian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_rank_hermitian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_multi_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_hermitian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_singular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_qr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_qr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_triangular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_svdvals_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_tensorinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vecdot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vector_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vector_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vector_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vector_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_tensor_overload_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_tensor_overload_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_normal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_normal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_normal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logaddexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logcumsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logcumsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logcumsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logcumsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logdet_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logaddexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logaddexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_median_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_normalize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_normalize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_softmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matrix_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matrix_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_pool2d_with_indices_backward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_maximum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_maximum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_maximum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_maximum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_median_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_median_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_binary_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_multinomial_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nan_to_num_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nan_to_num_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanquantile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_batch_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_batch_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_dropout_backward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_dropout_backward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nextafter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_alpha_dropout_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_batch_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_celu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_celu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_celu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_celu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv2d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_embedding_loss_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_similarity_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_ctc_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_dropout2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_dropout2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_dropout_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_elu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_elu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_embedding_bag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_embedding_bag_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_embedding_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_fractional_max_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gaussian_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gelu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_glu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_group_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_group_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_group_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardsigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardswish_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardtanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardtanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_huber_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_huber_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_instance_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_area_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_area_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_area_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_area_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_bicubic_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_bicubic_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_bicubic_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_linear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_linear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_trilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_kl_div_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_l1_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_l1_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_layer_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_layer_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_leaky_relu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_leaky_relu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_linear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_logsigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_margin_ranking_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool3d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mse_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mse_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_head_attention_forward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_normalize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_normalize_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_normalize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_circular_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_circular_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_constant_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_poisson_nll_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_poisson_nll_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_poisson_nll_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_prelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rms_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rms_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rrelu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_selu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_selu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_silu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_smooth_l1_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softplus_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_unfold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_unfold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_bilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_fro_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_fro_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_fro_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_inf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_nuc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_nuc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_in_place_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_number_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ormqr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pca_lowrank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pca_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pinverse_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_quantile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rand_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_interleave_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_interleave_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_neg_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scalar_tensor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scalar_tensor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_bartlett_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_gaussian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_with_dtype_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sparse_mm_reduce_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sparse_mm_reduce_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sparse_mm_reduce_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sparse_sampled_addmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sparse_sampled_addmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_entr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_entr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_he_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_he_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i0e_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i0e_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i0e_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i0e_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_laguerre_polynomial_l_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_laguerre_polynomial_l_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_laguerre_polynomial_l_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_laguerre_polynomial_l_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_legendre_polynomial_p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_legendre_polynomial_p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_log_ndtr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtri_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtri_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtri_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_spherical_bessel_j0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_spherical_bessel_j0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_spherical_bessel_j0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_svd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_along_dim_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_along_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_along_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_along_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_topk_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_topk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_topk_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triangular_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triangular_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_indices_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_indices_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trunc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_uniform_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_uniform_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unravel_index_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unravel_index_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unravel_index_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vdot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vdot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_complex_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_real_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rand___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rand___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmatmul___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___ror___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rsub___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rsub___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rsub___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rsub___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rsub___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rxor___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rxor___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__batch_norm_with_update_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__batch_norm_with_update_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__batch_norm_with_update_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcdiv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcdiv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcdiv_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_expm1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_max_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_zero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__native_batch_norm_legit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__segment_reduce_lengths_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__segment_reduce_lengths_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__segment_reduce_offsets_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__upsample_bilinear2d_aa_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__upsample_bilinear2d_aa_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addbmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addbmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addbmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcdiv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_decomposed_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_decomposed_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_T_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___rdiv___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___rpow___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___rxor___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__batch_norm_with_update_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__chunk_cat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_abs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_addcdiv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_minimum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_neg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_sqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_tan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__softmax_backward_data_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__unsafe_masked_index_put_accumulate_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_acosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_addmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_all_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_allclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_aminmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_angle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_as_strided_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_atan2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bitwise_right_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bitwise_xor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_block_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bool_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_broadcast_shapes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_broadcast_to_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bucketize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_byte_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cauchy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_char_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cholesky_inverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_clamp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_clone_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_constant_pad_nd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cummin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cumsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_diagflat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_div_no_rounding_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_double_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_dsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_equal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_eye_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_fft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_fftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_fftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_hfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_hfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_hfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_ifftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_ihfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_ihfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_irfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_irfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_rfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_rfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_rfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_flatten_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_floor_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fmod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_frexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_full_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_gather_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_geometric_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_geqrf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_grid_sampler_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_histc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_hstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_igamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_reduce_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_inner_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_item_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_jiterator_2inputs_2outputs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_jiterator_binary_return_by_ref_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_kron_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_kthvalue_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_ldexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_le_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_eigh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_eigvals_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_inv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_inv_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_ldl_factor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_lu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_lu_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_matrix_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_matrix_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_matrix_rank_hermitian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_pinv_singular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_solve_triangular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_vander_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_vector_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_log_normal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logdet_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_long_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_lu_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mH_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mT_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_cumsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_normalize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_max_pool2d_with_indices_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_min_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_min_reduction_no_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nanmean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nanquantile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_narrow_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_native_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_ne_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_new_ones_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_avg_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_conv2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_conv3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_conv_transpose1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_conv_transpose2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_ctc_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_dropout2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_dropout_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_elu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_embedding_bag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_fractional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_grid_sample_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_hardshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_hardswish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_huber_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_interpolate_linear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_interpolate_trilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_kl_div_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_leaky_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_local_response_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_logsigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_margin_ranking_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_unpool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_unpool1d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_unpool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_unpool3d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_mish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_mse_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_multi_head_attention_forward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pad_replicate_negative_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pairwise_distance_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_prelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_rrelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_silu_complex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_smooth_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_softmin_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_softsign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_threshold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_norm_fro_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_ones_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_ones_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_outer_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_pca_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_permute_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_pinverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_polygamma_polygamma_n_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_positive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_pow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_randint_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_randint_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_randn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_remainder_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_reshape_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_resolve_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_resolve_neg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_rsub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_reduce_sum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_searchsorted_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_select_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sgn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_general_hamming_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_hann_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signbit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_bessel_y0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_chebyshev_polynomial_w_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_entr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_hermite_polynomial_he_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_i0e_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_log_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_modified_bessel_i1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_scaled_modified_bessel_k0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_spherical_bessel_j0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_xlog1py_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_split_with_sizes_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_split_with_sizes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_squeeze_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_stack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_std_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_std_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_stft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sum_to_size_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_svd_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_take_along_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_to_sparse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_transpose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_trapz_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_triangular_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_tril_indices_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_triu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unique_consecutive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unsqueeze_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_var_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_view_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_vsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_vstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_where_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_zero__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_allclose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_allclose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_allclose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argsort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argsort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argsort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argsort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_baddbmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_baddbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bernoulli_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bincount_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bincount_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_and_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_and_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_left_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_left_shift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_left_shift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_or_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_or_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_xor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cauchy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cauchy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ceil_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ceil_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ceil_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chunk_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chunk_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_max_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_min_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_min_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_contiguous_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_copysign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_copysign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cross_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cross_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cross_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diff_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diff_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diff_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diff_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_einsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_equal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_equal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_equal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfinv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfinv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfinv_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exponential_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_float8_e5m2, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_float8_e5m2fnuz, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frac_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_uint16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gcd_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geqrf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hypot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hypot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hypot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hypot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_igamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_igammac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_imag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_fill_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_inner_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_inner_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isreal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isreal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isreal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_istft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lcm_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cond_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eig_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigvalsh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_householder_product_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_householder_product_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_factor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_factor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_factor_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lstsq_grad_oriented_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_factor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_factor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_factor_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_power_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_power_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_power_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_hermitian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_multi_dot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_multi_dot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_multi_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_subgradients_at_zero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_singular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_qr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_slogdet_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_triangular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_triangular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_svdvals_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_svdvals_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_svdvals_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorinv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorinv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorsolve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vander_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vander_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vecdot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vecdot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vecdot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vector_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vector_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vector_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vector_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logcumsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logdet_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logdet_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_unpack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumsum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_log_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logaddexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logaddexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logsumexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_normalize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_normalize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_normalize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_softmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_std_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_std_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_std_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matrix_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matrix_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matrix_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matrix_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_pool2d_with_indices_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_with_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_binary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_multinomial_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nan_to_num_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nan_to_num_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nan_to_num_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nan_to_num_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmean_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanquantile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_batch_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_batch_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_dropout_backward_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_layer_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nextafter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nextafter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_alpha_dropout_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_alpha_dropout_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_bilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_celu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_celu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv1d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv2d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_ctc_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_elu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_elu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_embedding_bag_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_gaussian_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_gelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_grid_sample_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_group_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_group_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardsigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardsigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardswish_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardswish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hinge_embedding_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_huber_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_huber_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bicubic_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bicubic_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_linear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_trilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_trilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_kl_div_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_kl_div_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_leaky_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_linear_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_local_response_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_logsigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool1d_grad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool2d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mish_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mish_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mse_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mse_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mse_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mse_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multilabel_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_normalize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_normalize_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_normalize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_one_hot_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_reflect_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_reflect_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_prelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_prelu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu6_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu6_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rms_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rms_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rrelu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rrelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_selu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_selu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_silu_complex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_smooth_l1_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softplus_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softsign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softsign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_tanhshrink_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_tanhshrink_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_tanhshrink_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_nearest_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_nearest_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_fro_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_fro_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_fro_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_inf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_inf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_inf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_nuc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_in_place_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_in_place_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_in_place_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_number_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ormqr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pca_lowrank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pca_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pinverse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_qr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_quantile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rad2deg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rad2deg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rand_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rand_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_renorm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rot90_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rot90_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rot90_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rot90_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rot90_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_neg_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_neg_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_sum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_short_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_short_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_short_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_short_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_blackman_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_general_hamming_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_general_hamming_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_hamming_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_hann_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_kaiser_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sparse_mm_reduce_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sparse_sampled_addmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_airy_ai_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_airy_ai_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_airy_ai_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_t_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_u_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_v_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_v_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_w_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_w_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_erfcx_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_erfcx_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_h_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_h_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_h_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i0e_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i0e_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_laguerre_polynomial_l_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_laguerre_polynomial_l_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_laguerre_polynomial_l_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtri_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_spherical_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_xlog1py_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_xlog1py_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tile_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_topk_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_topk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_topk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_topk_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__flash_attention_forward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapz_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapz_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapz_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triangular_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triangular_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_indices_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_uniform_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_uniform_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_uniform_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unravel_index_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vdot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_xlogy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_xlogy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rand___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rand___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rand___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rand___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmatmul___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___ror___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rsub___cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rsub___cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rsub___cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rxor___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__batch_norm_with_update_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__batch_norm_with_update_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__batch_norm_with_update_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__batch_norm_with_update_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_abs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_abs_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_abs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_expm1_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_expm1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_frac_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_frac_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_frac_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lgamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lgamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lgamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log10_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log10_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sigmoid_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sigmoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sigmoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sub_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__native_batch_norm_legit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__native_batch_norm_legit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__native_batch_norm_legit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__segment_reduce_lengths_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__segment_reduce_lengths_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__segment_reduce_offsets_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__upsample_bilinear2d_aa_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__upsample_bilinear2d_aa_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addbmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addbmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addbmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcdiv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcdiv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_decomposed_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___radd___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rand___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rdiv___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rmatmul___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___ror___cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rpow___cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_acos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_asin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_cosh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__native_batch_norm_legit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__softmax_backward_data_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__unsafe_masked_index_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_abs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addmm_decomposed_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addmv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_alias_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_all_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_allclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_angle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_any_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_asin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_atleast_1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_atleast_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_baddbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bfloat16_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bitwise_and_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bitwise_left_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bitwise_xor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_block_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_broadcast_to_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_byte_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cartesian_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cauchy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_clamp_min_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_clone_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cumulative_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_deg2rad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_diagflat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_dot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_double_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_eq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_equal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_erfinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_expand_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_fft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_fftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_fftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_hfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_hfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ifft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ifftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ihfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_floor_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_geometric_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_hash_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_hsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_index_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_index_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_inner_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isfinite_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isnan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isneginf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isposinf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isreal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_jiterator_2inputs_2outputs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_jiterator_4inputs_with_extra_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_kron_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_lcm_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ldexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_eig_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_ldl_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_lstsq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_lu_factor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_matrix_rank_hermitian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_norm_subgradients_at_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_pinv_singular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_vector_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_log_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_logcumsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_logit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_logspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_long_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_lt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_lu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_lu_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mT_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_argmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_cumsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_logsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_normalize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_sum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_matmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_max_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_max_reduction_with_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_minimum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nanmedian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nansum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_narrow_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_native_batch_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_native_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_neg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nextafter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_batch_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_binary_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_cosine_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_ctc_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_dropout2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_elu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_fractional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_gelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_glu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_group_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_hardshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_hardswish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_hinge_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_instance_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_kl_div_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_leaky_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_max_unpool1d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_mish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_multi_head_attention_forward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_multilabel_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_one_hot_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pad_circular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pad_constant_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pad_reflect_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_prelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_rms_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_selu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_smooth_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_softplus_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_softsign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_normal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_normal_number_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ones_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ormqr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_outer_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_pinverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_polar_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_polygamma_polygamma_n_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_polygamma_polygamma_n_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_polygamma_polygamma_n_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_polygamma_polygamma_n_4_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_quantile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_randint_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_randn_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ravel_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_renorm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_repeat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_reshape_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_resolve_neg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_round_decimals_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_round_decimals_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_round_decimals_neg_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_scatter_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_scatter_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_scatter_reduce_sum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sgn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_short_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_bartlett_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_gaussian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_general_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_hann_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sparse_sampled_addmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_bessel_j0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_bessel_y0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_entr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_i1e_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_laguerre_polynomial_l_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_modified_bessel_k0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_scaled_modified_bessel_k1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_spherical_bessel_j0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_split_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_split_with_sizes_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_squeeze_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_std_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_stft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_svd_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_tile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_triangular_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_tril_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_tril_indices_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_triu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_true_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unbind_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unbind_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unique_consecutive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_var_mean_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_var_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_view_as_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_where_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_allclose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_arange_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_arange_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_arange_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_arange_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argsort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argsort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argsort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argsort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_baddbmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_baddbmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bernoulli_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bernoulli_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bincount_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bincount_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bincount_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_and_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_and_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_and_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_not_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_not_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_right_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_right_shift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_xor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_xor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_xor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cauchy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cauchy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cauchy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ceil_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ceil_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cholesky_inverse_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cholesky_inverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cholesky_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_min_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_min_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clone_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_copysign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_copysign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_corrcoef_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_corrcoef_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_corrcoef_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_corrcoef_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumprod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumprod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_scatter_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_digamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_digamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_digamma_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dist_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_floor_rounding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_einsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_einsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exponential_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_float8_e4m3fn, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_float8_e4m3fnuz, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fliplr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fliplr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fliplr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fliplr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_frac_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_frac_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_frac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gcd_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geqrf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geqrf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geqrf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gradient_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gradient_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gradient_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gradient_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_half_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_half_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_histc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hypot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_igamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_imag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_imag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_put_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_mean_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_mean_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_mean_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_inner_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_inner_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_inner_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isposinf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isposinf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_4inputs_with_extra_args_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_4inputs_with_extra_args_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_unary_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_unary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_unary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lcm_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lcm_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lcm_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cholesky_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cholesky_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cond_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cond_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_det_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eig_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigvals_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigvalsh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigvalsh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigvalsh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_householder_product_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_householder_product_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_inv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_inv_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_factor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_factor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_factor_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lstsq_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lstsq_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lstsq_grad_oriented_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_power_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_subgradients_at_zero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_subgradients_at_zero_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_hermitian_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_hermitian_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_hermitian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_singular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_singular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_slogdet_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_slogdet_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_triangular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_triangular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svd_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svdvals_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svdvals_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_tensorinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_tensorinv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_tensorsolve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_tensorsolve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vecdot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vecdot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vector_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_normal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logcumsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logcumsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logcumsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logcumsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_unpack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_unpack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_log_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logaddexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logaddexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_median_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_median_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_softmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_softmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_sum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matmul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matrix_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matrix_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_pool2d_with_indices_backward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_pool2d_with_indices_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_pool2d_with_indices_backward_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_no_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_no_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_no_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_median_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_median_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_median_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_with_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_minimum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_minimum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mode_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mode_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_multinomial_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_batch_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_dropout_backward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_dropout_backward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_layer_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_layer_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nextafter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_alpha_dropout_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_batch_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_channel_shuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_similarity_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_similarity_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cross_entropy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cross_entropy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_elu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_elu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_elu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_elu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_embedding_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_embedding_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_gelu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_gelu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_glu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_glu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_grid_sample_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_grid_sample_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_group_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardsigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardswish_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardswish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardswish_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_huber_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_huber_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_huber_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_area_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_area_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_area_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_bicubic_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_bicubic_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_linear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_linear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_nearest_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_l1_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_layer_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_leaky_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_linear_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_logsigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_logsigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_logsigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_grad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_mish_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_mse_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_mse_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_mse_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multi_head_attention_forward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multi_head_attention_forward_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multi_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multi_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_poisson_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_poisson_nll_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_poisson_nll_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_prelu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu6_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rms_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rms_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rrelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_silu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_smooth_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_soft_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_soft_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_tanhshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_tanhshrink_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_tanhshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_tanhshrink_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_threshold_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_upsample_nearest_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_fro_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_fro_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_fro_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_inf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_inf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_inf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_nuc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_nuc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_number_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_number_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_number_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pca_lowrank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pca_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pinverse_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pinverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pinverse_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_4_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_4_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_4_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_quantile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rand_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rand_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rand_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_remainder_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_remainder_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rot90_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rot90_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rot90_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_neg_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_mean_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_cosine_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_general_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_hamming_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_hamming_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_hann_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sparse_mm_reduce_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sparse_mm_reduce_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sparse_sampled_addmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sparse_sampled_addmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_airy_ai_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_airy_ai_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_u_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_erfcx_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_erfcx_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_erfcx_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_erfcx_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_erfcx_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_h_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_h_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_he_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_he_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_he_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_legendre_polynomial_p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_legendre_polynomial_p_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtri_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtri_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_spherical_bessel_j0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_zeta_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_zeta_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_svd_lowrank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_svd_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensordot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensordot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_topk_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_topk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_indices_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_uniform_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_uniform_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_uniform_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_uniform_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unravel_index_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vdot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vdot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_float16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_xlogy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_xlogy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_xlogy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_embedding_bag_byte_unpack_cuda, test/test_meta.py::TestMetaCUDA::test_embedding_bag_dense_backward_per_sample_weights_cuda, test/test_meta.py::TestMetaCUDA::test_fill_stride_cuda, test/test_meta.py::TestMetaCUDA::test_group_norm_backward_output_mask0_cuda, test/test_meta.py::TestMetaCUDA::test_group_norm_backward_output_mask1_cuda, test/test_meta.py::TestMetaCUDA::test_group_norm_backward_output_mask2_cuda, test/test_meta.py::TestMetaCUDA::test_group_norm_backward_output_mask4_cuda, test/test_meta.py::TestMetaCUDA::test_inplace_masked_fill_error_cuda, test/test_meta.py::TestMetaCUDA::test_meta__fused_moving_avg_obs_fq_helper_cuda, test/test_meta.py::TestMetaCUDA::test_meta_consistency_out_dtype_mismatch_pow_Tensor_Scalar_cuda, test/test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rand___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rdiv___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rdiv___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rdiv___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rdiv___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rdiv___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmatmul___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmatmul___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___ror___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___ror___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace___rxor___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__batch_norm_with_update_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcmul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcmul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcmul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_rsqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_rsqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_rsqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_rsqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace__softmax_backward_data_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace__upsample_bilinear2d_aa_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcdiv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcdiv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcdiv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_decomposed_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_decomposed_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_allclose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_allclose_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_allclose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_angle_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_angle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_arange_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_arange_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_arange_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_partial_views_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_baddbmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bernoulli_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bernoulli_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bincount_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_and_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_and_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_and_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_left_shift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_not_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_not_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_not_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_or_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_or_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_right_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_right_shift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_right_shift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_xor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bmm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_shapes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cauchy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cauchy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_inverse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clone_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clone_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_clone_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cummin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_deg2rad_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dist_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dist_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_trunc_rounding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_trunc_rounding_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_div_trunc_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_einsum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_einsum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_equal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_equal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_equal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_equal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_exponential_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_float8_e4m3fnuz, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flip_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flip_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flip_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_divide_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_frac_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_frexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_frexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_frexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_uint16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_uint32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gcd_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gcd_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_geometric_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_geometric_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_geqrf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_geqrf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gradient_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gradient_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_half_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_half_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_half_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_histc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_histc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_histc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_imag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_imag_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_imag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_inner_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_inner_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_int_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isnan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isnan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isnan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_istft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kthvalue_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kthvalue_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_kthvalue_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lcm_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lcm_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lcm_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_le_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_le_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lerp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lerp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lgamma_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lgamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lgamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lgamma_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cholesky_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_det_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eig_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eig_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eig_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvals_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvalsh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lstsq_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lstsq_grad_oriented_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_factor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_factor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_power_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_rank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_rank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_multi_dot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_hermitian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_singular_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_qr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_slogdet_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_slogdet_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_triangular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svd_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svdvals_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorinv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorsolve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vander_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vander_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vecdot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vecdot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vector_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vector_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vector_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_tensor_overload_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_tensor_overload_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_tensor_overload_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_normal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logcumsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logcumsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logcumsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logdet_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logdet_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logdet_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logdet_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_not_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_not_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lu_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lu_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lu_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_lu_unpack_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mH_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mH_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mH_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_log_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_log_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logaddexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_normalize_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_normalize_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_softmax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_softmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_softmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_matmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_pool2d_with_indices_backward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_pool2d_with_indices_backward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nanmean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nanmean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nanmedian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nanmedian_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_native_dropout_backward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_native_dropout_backward_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_native_layer_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_native_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nextafter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_alpha_dropout_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_alpha_dropout_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_batch_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_bilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_bilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_binary_cross_entropy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_binary_cross_entropy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_celu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_celu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv2d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose1d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_similarity_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cross_entropy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_elu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_embedding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_embedding_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_embedding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gaussian_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gaussian_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gelu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_glu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_glu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_grid_sample_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_grid_sample_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_group_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardshrink_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardsigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hinge_embedding_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_huber_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_instance_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_instance_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_instance_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_area_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_area_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_area_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_bicubic_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_bilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_nearest_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_trilinear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_l1_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_l1_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_l1_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_l1_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_layer_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_leaky_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_leaky_relu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_linear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_linear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_local_response_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_local_response_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_logsigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool1d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool1d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool1d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mish_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mish_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mse_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multi_head_attention_forward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multi_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multi_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_nll_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_normalize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_normalize_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_poisson_nll_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_poisson_nll_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_poisson_nll_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_poisson_nll_loss_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu6_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu6_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rms_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rms_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rrelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_selu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_silu_complex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_smooth_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softplus_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softplus_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_unfold_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_upsample_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_upsample_nearest_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_fro_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_fro_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_inf_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_norm_nuc_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_in_place_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_number_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_normal_number_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ormqr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pca_lowrank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pca_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pinverse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pinverse_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polar_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_3_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_qr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rad2deg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rand_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rand_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rand_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randint_like_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randn_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_remainder_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_remainder_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_renorm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_renorm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize_as__cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resize_as__cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_neg_3_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_neg_3_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_sum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_sum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_searchsorted_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_searchsorted_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_searchsorted_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_searchsorted_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_bartlett_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_blackman_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_gaussian_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_gaussian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_general_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_hamming_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signbit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_signbit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_mm_reduce_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_mm_reduce_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_sampled_addmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_w_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_w_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_he_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_he_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_he_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_laguerre_polynomial_l_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_laguerre_polynomial_l_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_laguerre_polynomial_l_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_legendre_polynomial_p_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_log_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_log_ndtr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_log_ndtr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_log_ndtr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_std_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_stft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_svd_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_svd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_svd_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_svd_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tensordot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__flash_attention_forward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trace_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trace_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triangular_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_indices_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_tril_indices_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_true_divide_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_true_divide_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_true_divide_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_var_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_complex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_real_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rand___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rand___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rand___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rand___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmatmul___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmatmul___cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmatmul___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmod___cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmod___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmod___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmod___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___ror___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___ror___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___ror___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rxor___cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rxor___cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rxor___cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace___rxor___cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tanh_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tanh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__native_batch_norm_legit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__native_batch_norm_legit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__segment_reduce_offsets_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__segment_reduce_offsets_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__segment_reduce_offsets_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__softmax_backward_data_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__softmax_backward_data_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace__upsample_bilinear2d_aa_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace__upsample_bilinear2d_aa_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addbmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addbmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcdiv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcdiv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcdiv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_decomposed_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_decomposed_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_decomposed_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addmv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_allclose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_allclose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_allclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_arange_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_arange_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_arange_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_arange_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_baddbmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_baddbmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bernoulli_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bincount_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bincount_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_and_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_left_shift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_left_shift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_not_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_or_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_right_shift_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_xor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_xor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bmm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bmm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_shapes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_tensors_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cdouble_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cdouble_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cdouble_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_min_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_min_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_min_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_combinations_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_combinations_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_complex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_contiguous_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_contiguous_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_corrcoef_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_corrcoef_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_digamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_digamma_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_digamma_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dist_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dist_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dist_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_trunc_rounding_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_trunc_rounding_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_div_trunc_rounding_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_einsum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_einsum_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_einsum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_einsum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_einsum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_exponential_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_float8_e4m3fnuz, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_float8_e5m2fnuz, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftshift_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftshift_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftshift_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft2_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfftn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfftn_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfftn_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmin_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_frexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gcd_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gcd_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_grid_sampler_2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_grid_sampler_2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_grid_sampler_3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_gt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hypot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_hypot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_igammac_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_igammac_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_imag_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_imag_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_inner_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isclose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isclose_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isclose_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isposinf_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isposinf_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isposinf_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kthvalue_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_kthvalue_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lcm_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lcm_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lcm_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lgamma_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lgamma_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lgamma_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cholesky_ex_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cond_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cross_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cross_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_det_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_det_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eig_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eig_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvals_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvalsh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_factor_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_factor_ex_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lstsq_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lstsq_grad_oriented_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lstsq_grad_oriented_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_factor_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_factor_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_factor_ex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_factor_ex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_solve_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_power_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_rank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_rank_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_multi_dot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_norm_subgradients_at_zero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_norm_subgradients_at_zero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_hermitian_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_singular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_singular_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_qr_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_qr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_slogdet_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_slogdet_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_solve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_solve_triangular_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svd_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svdvals_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svdvals_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorinv_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorinv_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorsolve_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorsolve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vander_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vander_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vander_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vander_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vecdot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vecdot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vecdot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vecdot_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vector_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vector_norm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vector_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vector_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_normal_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_normal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_normal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logcumsumexp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logcumsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logcumsumexp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logdet_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_or_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_or_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_or_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_or_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lu_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lu_solve_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lu_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_lu_unpack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmin_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_log_softmax_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logaddexp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logsumexp_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logsumexp_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logsumexp_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_normalize_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_select_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_select_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_softmax_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_softmin_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_matmul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_matmul_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_matmul_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_pool2d_with_indices_backward_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mean_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_binary_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_binary_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_binary_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_no_dim_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_no_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_no_dim_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_multinomial_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_multinomial_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanmean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanmedian_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nanquantile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_native_batch_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_native_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_native_batch_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_native_layer_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_alpha_dropout_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_batch_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_batch_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_bilinear_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_celu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv1d_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose2d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose2d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose3d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_similarity_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cross_entropy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cross_entropy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_elu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_embedding_bag_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_embedding_bag_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_embedding_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_gaussian_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_gelu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_grid_sample_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_grid_sample_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_grid_sample_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_group_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardsigmoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardsigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardswish_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardtanh_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardtanh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardtanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardtanh_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardtanh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_huber_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_instance_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_instance_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_bicubic_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_bicubic_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_nearest_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_nearest_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_trilinear_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_kl_div_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_kl_div_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_layer_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_leaky_relu_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_linear_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_linear_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_local_response_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_local_response_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_margin_ranking_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_margin_ranking_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool1d_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool3d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool2d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool2d_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool2d_grad_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool2d_grad_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool3d_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool3d_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool3d_grad_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool3d_grad_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mish_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mish_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mse_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mse_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multi_head_attention_forward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multi_head_attention_forward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multi_head_attention_forward_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multi_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multi_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_margin_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_normalize_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_normalize_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_normalize_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_constant_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_constant_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_negative_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pdist_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_prelu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_prelu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_rms_norm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_rms_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_rms_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_rrelu_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_selu_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_selu_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_silu_complex_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_smooth_l1_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_soft_margin_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softplus_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softplus_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softshrink_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_threshold_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_threshold_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_threshold_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_loss_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_loss_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_unfold_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_unfold_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_upsample_bilinear_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_upsample_nearest_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_fro_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_fro_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_inf_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_inf_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_inf_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_norm_nuc_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_in_place_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_in_place_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_number_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_normal_number_mean_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pca_lowrank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pca_lowrank_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pinverse_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pinverse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pinverse_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_3_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_quantile_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rad2deg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rad2deg_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rad2deg_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rad2deg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rad2deg_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rand_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rand_like_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rand_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randn_like_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randn_like_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_randn_like_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_real_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_real_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_real_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_remainder_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_remainder_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_remainder_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_renorm_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_renorm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_renorm_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_0_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_neg_3_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amax_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_prod_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_prod_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_searchsorted_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_searchsorted_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_searchsorted_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_searchsorted_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sign_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sign_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sign_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sign_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sign_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_bartlett_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_blackman_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_cosine_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_gaussian_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_nuttall_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sparse_mm_reduce_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sparse_sampled_addmm_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sparse_sampled_addmm_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sparse_sampled_addmm_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_airy_ai_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_airy_ai_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_airy_ai_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_airy_ai_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_t_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_h_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_h_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_h_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1e_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_laguerre_polynomial_l_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_legendre_polynomial_p_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_legendre_polynomial_p_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k1_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k1_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k1_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k1_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k1_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtr_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtr_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtr_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_zeta_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_zeta_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_special_zeta_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_std_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stft_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_stft_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_svd_lowrank_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_svd_lowrank_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_topk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__efficient_attention_forward_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapz_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapz_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trapz_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_triangular_solve_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trunc_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trunc_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trunc_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trunc_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_trunc_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_uniform_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_uniform_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unravel_index_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_unbiased_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_unbiased_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_unbiased_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_unbiased_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_unbiased_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_var_unbiased_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vdot_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vdot_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vdot_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vdot_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_complex_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_complex_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_real_cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_bool, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_float16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_bfloat16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_int32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_int64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_complex64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_int16, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_complex128, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_complex32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_float32, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_float64, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_int8, test/test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_like_cuda_uint8, test/test_meta.py::TestMetaCUDA::test_mixed_dtype_for_native_layer_norm_backward_float16_bias_dtype2_cuda, test/test_meta.py::TestMetaCUDA::test_triangular_solve_out_cuda 2025-12-04T12:25:43.1027717Z 2025-12-04T12:25:43.1027924Z test_meta.py::TestMetaConverter::test_complex_noncontiguous_bug PASSED [0.0029s] [ 0%] 2025-12-04T12:25:43.1028582Z test_meta.py::TestMetaConverter::test_inplace_set_storage PASSED [0.0007s] [ 0%] 2025-12-04T12:25:43.1028811Z test_meta.py::TestMetaConverter::test_leaf PASSED [0.0008s] [ 0%] 2025-12-04T12:25:43.1029051Z test_meta.py::TestMetaConverter::test_non_leaf_torture PASSED [0.0008s] [ 0%] 2025-12-04T12:25:43.1029285Z test_meta.py::TestMetaConverter::test_requires_grad_false PASSED [0.0007s] [ 0%] 2025-12-04T12:25:43.1029511Z test_meta.py::TestMetaConverter::test_view_of_leaf PASSED [0.0016s] [ 0%] 2025-12-04T12:25:43.1029738Z test_meta.py::TestMetaConverter::test_view_of_view_of_leaf PASSED [0.0013s] [ 0%] 2025-12-04T12:25:43.1030012Z test_meta.py::TestMetaCUDA::test_batch_norm_backward_output_mask0_cuda SKIPPED [0.0445s] (Only runs on cpu) [ 0%] 2025-12-04T12:25:43.1030311Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype___radd___cuda_float32 PASSED [1.2811s] [ 0%] 2025-12-04T12:25:43.1030672Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype___rdiv___cuda_float32 PASSED [1.0099s] [ 0%] 2025-12-04T12:25:43.1030953Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype___rmod___cuda_float32 PASSED [0.9833s] [ 0%] 2025-12-04T12:25:43.1031311Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype___rmul___cuda_float32 PASSED [0.9426s] [ 0%] 2025-12-04T12:25:43.1031649Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype___rpow___cuda_float32 PASSED [0.9961s] [ 0%] 2025-12-04T12:25:43.1031951Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs__conversions_complex_cuda_float32 XFAIL [0.0048s] [ 0%] 2025-12-04T12:25:43.1032278Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs__conversions_polar_cuda_float32 XFAIL [0.9450s] [ 0%] 2025-12-04T12:25:43.1032578Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_eq_cuda_float32 PASSED [0.9777s] [ 0%] 2025-12-04T12:25:43.1032863Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_fmax_cuda_float32 PASSED [0.0304s] [ 0%] 2025-12-04T12:25:43.1033152Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_hypot_cuda_float32 PASSED [0.9553s] [ 0%] 2025-12-04T12:25:43.1033445Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_igammac_cuda_float32 PASSED [0.9599s] [ 0%] 2025-12-04T12:25:43.1033795Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_le_cuda_float32 PASSED [0.9666s] [ 0%] 2025-12-04T12:25:43.1034086Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_logaddexp_cuda_float32 PASSED [0.0951s] [ 0%] 2025-12-04T12:25:43.1034386Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_logical_or_cuda_float32 PASSED [0.0174s] [ 0%] 2025-12-04T12:25:43.1034687Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_minimum_cuda_float32 PASSED [0.9303s] [ 0%] 2025-12-04T12:25:43.1034974Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_mul_cuda_float32 PASSED [0.0064s] [ 0%] 2025-12-04T12:25:43.1035254Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_ne_cuda_float32 PASSED [0.0048s] [ 0%] 2025-12-04T12:25:43.1035535Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_pow_cuda_float32 PASSED [0.9457s] [ 0%] 2025-12-04T12:25:43.1035827Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype__refs_true_divide_cuda_float32 PASSED [0.0170s] [ 0%] 2025-12-04T12:25:43.1036114Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_add_cuda_float32 PASSED [0.0046s] [ 0%] 2025-12-04T12:25:43.1036391Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_clamp_max_cuda_float32 PASSED [0.9526s] [ 0%] 2025-12-04T12:25:43.1036667Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_complex_cuda_float32 XFAIL [0.0053s] [ 0%] 2025-12-04T12:25:43.1036959Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_div_no_rounding_mode_cuda_float32 PASSED [0.9515s] [ 0%] 2025-12-04T12:25:43.1037264Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_div_trunc_rounding_cuda_float32 PASSED [0.0047s] [ 0%] 2025-12-04T12:25:43.1037576Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_eq_cuda_float32 PASSED [0.9418s] [ 0%] 2025-12-04T12:25:43.1037849Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_hypot_cuda_float32 PASSED [0.0065s] [ 0%] 2025-12-04T12:25:43.1038246Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_jiterator_binary_cuda_float32 PASSED [1.1216s] [ 0%] 2025-12-04T12:25:43.1038539Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_logaddexp_cuda_float32 PASSED [0.0188s] [ 0%] 2025-12-04T12:25:43.1038821Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_logical_or_cuda_float32 PASSED [0.9375s] [ 0%] 2025-12-04T12:25:43.1039102Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_maximum_cuda_float32 PASSED [0.0067s] [ 0%] 2025-12-04T12:25:43.1039412Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_chebyshev_polynomial_t_cuda_float32 PASSED [0.0059s] [ 0%] 2025-12-04T12:25:43.1039753Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_chebyshev_polynomial_u_cuda_float32 PASSED [0.9502s] [ 0%] 2025-12-04T12:25:43.1040113Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_chebyshev_polynomial_w_cuda_float32 PASSED [0.0065s] [ 0%] 2025-12-04T12:25:43.1040508Z test_meta.py::TestMetaCUDA::test_binary_ufuncs_mixed_dtype_special_shifted_chebyshev_polynomial_t_cuda_float32 PASSED [0.0044s] [ 0%] 2025-12-04T12:25:43.1040854Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1041196Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1041542Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1041882Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_H_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1042223Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1042595Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1042937Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_T_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1043290Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1043663Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1044030Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___getitem___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1044389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1044741Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1045088Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___radd___cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1045433Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rand___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1045781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rand___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1046127Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rand___cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1046479Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1046850Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1047197Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rdiv___cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1047554Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmatmul___cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1047923Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmatmul___cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1048328Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1048675Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1049037Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmod___cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1049405Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1049761Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1050106Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1050454Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rmul___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1050805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___ror___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1051149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___ror___cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1051515Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1051870Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rpow___cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1052218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rsub___cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1052567Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rsub___cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1052913Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rsub___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1053257Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rxor___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1053607Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rxor___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1053952Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace___rxor___cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1054325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__batch_norm_with_update_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1054699Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1055061Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1055424Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1055793Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1056148Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__chunk_cat_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.1056468Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_bfloat16 PASSED [0.9675s] [ 0%] 2025-12-04T12:25:43.1056752Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_float16 PASSED [0.0168s] [ 0%] 2025-12-04T12:25:43.1057032Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_int64 PASSED [0.0213s] [ 0%] 2025-12-04T12:25:43.1057310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_int8 PASSED [0.0148s] [ 0%] 2025-12-04T12:25:43.1057585Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_abs_cuda_uint8 PASSED [0.0145s] [ 0%] 2025-12-04T12:25:43.1057860Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_bool XFAIL [0.0060s] [ 0%] 2025-12-04T12:25:43.1058216Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_complex128 PASSED [0.9602s] [ 0%] 2025-12-04T12:25:43.1058524Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_float64 PASSED [0.0153s] [ 0%] 2025-12-04T12:25:43.1058808Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_int16 XFAIL [0.0059s] [ 0%] 2025-12-04T12:25:43.1059086Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_acos_cuda_uint8 XFAIL [0.0058s] [ 0%] 2025-12-04T12:25:43.1059363Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_add_cuda_int64 PASSED [1.0229s] [ 0%] 2025-12-04T12:25:43.1059650Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_float64 PASSED [0.1559s] [ 0%] 2025-12-04T12:25:43.1059941Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_int64 XFAIL [0.0073s] [ 0%] 2025-12-04T12:25:43.1060225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcdiv_cuda_int8 XFAIL [0.9462s] [ 0%] 2025-12-04T12:25:43.1060528Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_int32 PASSED [1.0630s] [ 0%] 2025-12-04T12:25:43.1060815Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_addcmul_cuda_uint8 PASSED [0.1034s] [ 0%] 2025-12-04T12:25:43.1061099Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_float16 PASSED [0.0151s] [ 0%] 2025-12-04T12:25:43.1061389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_float32 PASSED [0.0146s] [ 0%] 2025-12-04T12:25:43.1061675Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_float64 PASSED [0.0146s] [ 0%] 2025-12-04T12:25:43.1061954Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_asin_cuda_int64 XFAIL [0.0057s] [ 0%] 2025-12-04T12:25:43.1062229Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_bool XFAIL [0.9473s] [ 0%] 2025-12-04T12:25:43.1062516Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_complex64 PASSED [0.9390s] [ 0%] 2025-12-04T12:25:43.1062805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_float64 PASSED [0.0151s] [ 0%] 2025-12-04T12:25:43.1063113Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_atan_cuda_uint8 XFAIL [0.0058s] [ 0%] 2025-12-04T12:25:43.1063396Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_bfloat16 PASSED [0.0150s] [ 0%] 2025-12-04T12:25:43.1063674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_bool XFAIL [0.0056s] [ 0%] 2025-12-04T12:25:43.1063950Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_int32 PASSED [0.0144s] [ 0%] 2025-12-04T12:25:43.1064229Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_ceil_cuda_int64 PASSED [0.0144s] [ 0%] 2025-12-04T12:25:43.1064529Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_bool XFAIL [0.0143s] [ 0%] 2025-12-04T12:25:43.1064828Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_float64 PASSED [1.0886s] [ 0%] 2025-12-04T12:25:43.1065125Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_int16 PASSED [0.1026s] [ 0%] 2025-12-04T12:25:43.1065415Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_max_cuda_int8 PASSED [0.1021s] [ 0%] 2025-12-04T12:25:43.1065711Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_float32 PASSED [0.1496s] [ 0%] 2025-12-04T12:25:43.1066006Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_clamp_min_cuda_int32 PASSED [0.1019s] [ 0%] 2025-12-04T12:25:43.1066298Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_complex128 PASSED [0.0156s] [ 0%] 2025-12-04T12:25:43.1066586Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_float32 PASSED [0.0153s] [ 0%] 2025-12-04T12:25:43.1066879Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_int8 PASSED [0.0153s] [ 0%] 2025-12-04T12:25:43.1067172Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_copy_cuda_uint8 PASSED [0.0153s] [ 0%] 2025-12-04T12:25:43.1067448Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_bool XFAIL [0.0057s] [ 0%] 2025-12-04T12:25:43.1067727Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_complex128 PASSED [0.0148s] [ 0%] 2025-12-04T12:25:43.1068014Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_float64 PASSED [0.0145s] [ 0%] 2025-12-04T12:25:43.1068349Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_int16 XFAIL [0.0055s] [ 0%] 2025-12-04T12:25:43.1068624Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_int32 XFAIL [0.0055s] [ 0%] 2025-12-04T12:25:43.1068897Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cos_cuda_uint8 XFAIL [0.0056s] [ 0%] 2025-12-04T12:25:43.1069209Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_float16 PASSED [0.9697s] [ 0%] 2025-12-04T12:25:43.1069493Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_int16 XFAIL [0.0061s] [ 0%] 2025-12-04T12:25:43.1069769Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_int32 XFAIL [0.9421s] [ 0%] 2025-12-04T12:25:43.1070042Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_int8 XFAIL [0.9338s] [ 0%] 2025-12-04T12:25:43.1070317Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_cosh_cuda_uint8 XFAIL [0.9359s] [ 0%] 2025-12-04T12:25:43.1070597Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_float16 PASSED [1.0088s] [ 0%] 2025-12-04T12:25:43.1070880Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_float32 PASSED [0.0766s] [ 0%] 2025-12-04T12:25:43.1071157Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_div_cuda_int32 XFAIL [0.0066s] [ 1%] 2025-12-04T12:25:43.1071441Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_complex128 XFAIL [0.9389s] [ 1%] 2025-12-04T12:25:43.1071720Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erf_cuda_int32 XFAIL [0.9355s] [ 1%] 2025-12-04T12:25:43.1071999Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_complex64 XFAIL [0.9368s] [ 1%] 2025-12-04T12:25:43.1072283Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_float64 PASSED [0.9495s] [ 1%] 2025-12-04T12:25:43.1072562Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_int64 XFAIL [0.0060s] [ 1%] 2025-12-04T12:25:43.1072836Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_erfc_cuda_uint8 XFAIL [0.9483s] [ 1%] 2025-12-04T12:25:43.1073126Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_bool XFAIL [0.9318s] [ 1%] 2025-12-04T12:25:43.1073403Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_float16 PASSED [0.9445s] [ 1%] 2025-12-04T12:25:43.1073686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_float64 PASSED [0.0151s] [ 1%] 2025-12-04T12:25:43.1073961Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_int16 XFAIL [0.0059s] [ 1%] 2025-12-04T12:25:43.1074232Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_exp_cuda_int64 XFAIL [0.0057s] [ 1%] 2025-12-04T12:25:43.1074512Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_bfloat16 PASSED [0.9592s] [ 1%] 2025-12-04T12:25:43.1074801Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_float16 PASSED [0.0153s] [ 1%] 2025-12-04T12:25:43.1075087Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_float32 PASSED [0.0147s] [ 1%] 2025-12-04T12:25:43.1075373Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_int16 XFAIL [0.0058s] [ 1%] 2025-12-04T12:25:43.1075700Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_int64 XFAIL [0.9476s] [ 1%] 2025-12-04T12:25:43.1075995Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_expm1_cuda_int8 XFAIL [0.9275s] [ 1%] 2025-12-04T12:25:43.1076277Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_bfloat16 PASSED [0.9542s] [ 1%] 2025-12-04T12:25:43.1076561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_bool XFAIL [0.0062s] [ 1%] 2025-12-04T12:25:43.1076847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_complex128 XFAIL [0.9230s] [ 1%] 2025-12-04T12:25:43.1077136Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_float32 PASSED [0.9408s] [ 1%] 2025-12-04T12:25:43.1077425Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_float64 PASSED [0.0149s] [ 1%] 2025-12-04T12:25:43.1077709Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_int8 PASSED [0.0146s] [ 1%] 2025-12-04T12:25:43.1078005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_floor_cuda_uint8 PASSED [0.0147s] [ 1%] 2025-12-04T12:25:43.1078322Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_frac_cuda_bool XFAIL [0.0059s] [ 1%] 2025-12-04T12:25:43.1078606Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_frac_cuda_complex128 XFAIL [0.0057s] [ 1%] 2025-12-04T12:25:43.1078895Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_frac_cuda_float64 PASSED [0.9830s] [ 1%] 2025-12-04T12:25:43.1079174Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_int16 XFAIL [0.0082s] [ 1%] 2025-12-04T12:25:43.1079449Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_int64 XFAIL [0.0077s] [ 1%] 2025-12-04T12:25:43.1079726Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_int8 XFAIL [0.9797s] [ 1%] 2025-12-04T12:25:43.1080000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lerp_cuda_uint8 XFAIL [0.9737s] [ 1%] 2025-12-04T12:25:43.1080282Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_bool XFAIL [0.9655s] [ 1%] 2025-12-04T12:25:43.1080569Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_float16 PASSED [0.9558s] [ 1%] 2025-12-04T12:25:43.1080862Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_float32 PASSED [0.0151s] [ 1%] 2025-12-04T12:25:43.1081152Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_lgamma_cuda_uint8 XFAIL [0.0058s] [ 1%] 2025-12-04T12:25:43.1081437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log10_cuda_float64 PASSED [0.0151s] [ 1%] 2025-12-04T12:25:43.1081725Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_float32 PASSED [0.0148s] [ 1%] 2025-12-04T12:25:43.1082030Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_float64 PASSED [0.0146s] [ 1%] 2025-12-04T12:25:43.1082315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_int64 XFAIL [0.0057s] [ 1%] 2025-12-04T12:25:43.1082593Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_int8 XFAIL [0.9573s] [ 1%] 2025-12-04T12:25:43.1082870Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log1p_cuda_uint8 XFAIL [0.9611s] [ 1%] 2025-12-04T12:25:43.1083152Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_bfloat16 PASSED [0.9511s] [ 1%] 2025-12-04T12:25:43.1083443Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_complex128 PASSED [0.0155s] [ 1%] 2025-12-04T12:25:43.1083731Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_float64 PASSED [0.0149s] [ 1%] 2025-12-04T12:25:43.1084012Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_int16 XFAIL [0.0059s] [ 1%] 2025-12-04T12:25:43.1084309Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log2_cuda_int32 XFAIL [0.9531s] [ 1%] 2025-12-04T12:25:43.1084607Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_complex64 PASSED [0.9666s] [ 1%] 2025-12-04T12:25:43.1084885Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_log_cuda_int16 XFAIL [0.0061s] [ 1%] 2025-12-04T12:25:43.1085202Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_bool SKIPPED [0.9569s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1085562Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_int8 SKIPPED [0.0017s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1085923Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_max_cuda_uint8 SKIPPED [0.0012s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1086254Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_complex64 XFAIL [0.0083s] [ 1%] 2025-12-04T12:25:43.1086549Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_float32 PASSED [1.0989s] [ 1%] 2025-12-04T12:25:43.1086857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_int64 PASSED [0.1031s] [ 1%] 2025-12-04T12:25:43.1087143Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_maximum_cuda_uint8 PASSED [0.1027s] [ 1%] 2025-12-04T12:25:43.1087434Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_bfloat16 PASSED [0.1517s] [ 1%] 2025-12-04T12:25:43.1087726Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_minimum_cuda_int16 PASSED [0.1024s] [ 1%] 2025-12-04T12:25:43.1088009Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_mul_cuda_float32 PASSED [0.0776s] [ 1%] 2025-12-04T12:25:43.1088335Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_mul_cuda_float64 PASSED [0.0774s] [ 1%] 2025-12-04T12:25:43.1088619Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_mul_cuda_int16 PASSED [0.0574s] [ 1%] 2025-12-04T12:25:43.1088903Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_complex64 PASSED [0.0148s] [ 1%] 2025-12-04T12:25:43.1089185Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_int16 PASSED [0.0146s] [ 1%] 2025-12-04T12:25:43.1089463Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_int32 PASSED [0.0146s] [ 1%] 2025-12-04T12:25:43.1089739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_neg_cuda_int8 PASSED [0.0147s] [ 1%] 2025-12-04T12:25:43.1090068Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1090450Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1090838Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1091171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_bfloat16 PASSED [0.0852s] [ 1%] 2025-12-04T12:25:43.1091458Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_complex64 PASSED [0.1147s] [ 1%] 2025-12-04T12:25:43.1091741Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_int16 PASSED [0.0583s] [ 1%] 2025-12-04T12:25:43.1092020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_pow_cuda_int32 PASSED [0.0579s] [ 1%] 2025-12-04T12:25:43.1092313Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_bfloat16 PASSED [0.0150s] [ 1%] 2025-12-04T12:25:43.1092614Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_bool XFAIL [0.0057s] [ 1%] 2025-12-04T12:25:43.1092914Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_float16 PASSED [0.0148s] [ 1%] 2025-12-04T12:25:43.1093232Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_float32 PASSED [0.0146s] [ 1%] 2025-12-04T12:25:43.1093547Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_int16 XFAIL [0.0056s] [ 1%] 2025-12-04T12:25:43.1093841Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_reciprocal_cuda_int32 XFAIL [0.0055s] [ 1%] 2025-12-04T12:25:43.1094131Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_float16 PASSED [0.0147s] [ 1%] 2025-12-04T12:25:43.1094420Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_float64 PASSED [0.0144s] [ 1%] 2025-12-04T12:25:43.1094707Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_int64 PASSED [0.0145s] [ 1%] 2025-12-04T12:25:43.1094993Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_round_cuda_int8 PASSED [0.0144s] [ 1%] 2025-12-04T12:25:43.1095279Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_bool XFAIL [0.0057s] [ 1%] 2025-12-04T12:25:43.1095586Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_complex128 PASSED [0.9597s] [ 1%] 2025-12-04T12:25:43.1095878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_float64 PASSED [0.0152s] [ 1%] 2025-12-04T12:25:43.1096162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_int64 XFAIL [0.0058s] [ 1%] 2025-12-04T12:25:43.1096440Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_rsqrt_cuda_int8 XFAIL [0.0057s] [ 1%] 2025-12-04T12:25:43.1096725Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sigmoid_cuda_float64 PASSED [0.9745s] [ 1%] 2025-12-04T12:25:43.1097013Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sigmoid_cuda_int32 XFAIL [0.0061s] [ 1%] 2025-12-04T12:25:43.1097299Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sigmoid_cuda_int64 XFAIL [0.0058s] [ 1%] 2025-12-04T12:25:43.1097585Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_bfloat16 PASSED [0.9639s] [ 1%] 2025-12-04T12:25:43.1097873Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_complex64 XFAIL [0.0061s] [ 1%] 2025-12-04T12:25:43.1098186Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_float16 PASSED [0.9654s] [ 1%] 2025-12-04T12:25:43.1098469Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sign_cuda_float64 PASSED [0.0152s] [ 1%] 2025-12-04T12:25:43.1098750Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sin_cuda_bfloat16 PASSED [0.0152s] [ 1%] 2025-12-04T12:25:43.1099031Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_float32 PASSED [0.0149s] [ 1%] 2025-12-04T12:25:43.1099309Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_int16 XFAIL [0.0057s] [ 1%] 2025-12-04T12:25:43.1099686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_int32 XFAIL [0.0058s] [ 1%] 2025-12-04T12:25:43.1099965Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sinh_cuda_uint8 XFAIL [0.9484s] [ 1%] 2025-12-04T12:25:43.1100246Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_float32 PASSED [0.9565s] [ 1%] 2025-12-04T12:25:43.1100530Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_float64 PASSED [0.0151s] [ 1%] 2025-12-04T12:25:43.1100808Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_int8 XFAIL [0.0058s] [ 1%] 2025-12-04T12:25:43.1101082Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sqrt_cuda_uint8 XFAIL [0.0058s] [ 1%] 2025-12-04T12:25:43.1101360Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_bfloat16 XFAIL [0.9719s] [ 1%] 2025-12-04T12:25:43.1101635Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_bool XFAIL [0.9427s] [ 1%] 2025-12-04T12:25:43.1101932Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_float32 XFAIL [0.9563s] [ 1%] 2025-12-04T12:25:43.1102226Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_sub_cuda_int16 XFAIL [0.9639s] [ 1%] 2025-12-04T12:25:43.1102498Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_bool XFAIL [0.9538s] [ 1%] 2025-12-04T12:25:43.1102769Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_int32 XFAIL [0.9458s] [ 1%] 2025-12-04T12:25:43.1103039Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_int8 XFAIL [0.9642s] [ 1%] 2025-12-04T12:25:43.1103309Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tan_cuda_uint8 XFAIL [0.9477s] [ 1%] 2025-12-04T12:25:43.1103593Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_complex128 PASSED [0.9588s] [ 1%] 2025-12-04T12:25:43.1103893Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_complex64 PASSED [0.0150s] [ 1%] 2025-12-04T12:25:43.1104184Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_tanh_cuda_float32 PASSED [0.0147s] [ 1%] 2025-12-04T12:25:43.1104479Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_bool XFAIL [0.0060s] [ 1%] 2025-12-04T12:25:43.1104761Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_trunc_cuda_float32 PASSED [0.9464s] [ 1%] 2025-12-04T12:25:43.1105043Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_bool PASSED [0.0132s] [ 1%] 2025-12-04T12:25:43.1105322Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_float32 PASSED [0.0128s] [ 1%] 2025-12-04T12:25:43.1105604Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__foreach_zero_cuda_uint8 PASSED [0.0126s] [ 1%] 2025-12-04T12:25:43.1105949Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_lengths_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1106355Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_lengths_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1106758Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__segment_reduce_lengths_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1107158Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__softmax_backward_data_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1107559Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__softmax_backward_data_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1107958Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__softmax_backward_data_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1108437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__softmax_backward_data_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1108857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1109250Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1109638Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1110022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1110427Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1110857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace__unsafe_masked_index_put_accumulate_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.1111218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_complex32 XFAIL [0.0028s] [ 1%] 2025-12-04T12:25:43.1111498Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_complex64 XFAIL [0.0028s] [ 1%] 2025-12-04T12:25:43.1111761Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_abs_cuda_int64 PASSED [1.8985s] [ 1%] 2025-12-04T12:25:43.1112129Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_complex128 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 1%] 2025-12-04T12:25:43.1112493Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_float32 PASSED [0.9736s] [ 1%] 2025-12-04T12:25:43.1112756Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_float64 PASSED [0.9503s] [ 2%] 2025-12-04T12:25:43.1113111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_int8 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1113571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acos_cuda_uint8 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1114027Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_complex128 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1114491Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_complex32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1114854Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_float16 PASSED [0.9474s] [ 2%] 2025-12-04T12:25:43.1138548Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_float32 PASSED [0.9174s] [ 2%] 2025-12-04T12:25:43.1138855Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_float64 PASSED [0.0052s] [ 2%] 2025-12-04T12:25:43.1139233Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_acosh_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1139595Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_bool PASSED [0.9021s] [ 2%] 2025-12-04T12:25:43.1139851Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_int16 PASSED [0.0079s] [ 2%] 2025-12-04T12:25:43.1140103Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_int32 PASSED [0.9256s] [ 2%] 2025-12-04T12:25:43.1140352Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_int8 PASSED [0.0081s] [ 2%] 2025-12-04T12:25:43.1140600Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_add_cuda_uint8 PASSED [0.9200s] [ 2%] 2025-12-04T12:25:43.1140957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addbmm_cuda_complex128 PASSED [0.0962s] [ 2%] 2025-12-04T12:25:43.1141231Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addbmm_cuda_float16 PASSED [0.6551s] [ 2%] 2025-12-04T12:25:43.1141498Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addbmm_cuda_float64 PASSED [0.9970s] [ 2%] 2025-12-04T12:25:43.1141765Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcdiv_cuda_float16 PASSED [0.0533s] [ 2%] 2025-12-04T12:25:43.1142035Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_bfloat16 PASSED [0.9489s] [ 2%] 2025-12-04T12:25:43.1142306Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_complex128 PASSED [0.3641s] [ 2%] 2025-12-04T12:25:43.1142582Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_complex64 PASSED [0.9174s] [ 2%] 2025-12-04T12:25:43.1142850Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_float16 PASSED [0.0098s] [ 2%] 2025-12-04T12:25:43.1143113Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_float32 PASSED [0.0080s] [ 2%] 2025-12-04T12:25:43.1143392Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_int32 PASSED [0.0076s] [ 2%] 2025-12-04T12:25:43.1143671Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addcmul_cuda_int64 PASSED [0.0075s] [ 2%] 2025-12-04T12:25:43.1143949Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmm_decomposed_cuda_float16 PASSED [0.0055s] [ 2%] 2025-12-04T12:25:43.1144230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmv_cuda_complex128 PASSED [0.0366s] [ 2%] 2025-12-04T12:25:43.1144496Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmv_cuda_complex64 PASSED [0.0049s] [ 2%] 2025-12-04T12:25:43.1144761Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addmv_cuda_float32 PASSED [0.0046s] [ 2%] 2025-12-04T12:25:43.1145021Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_float16 PASSED [0.0161s] [ 2%] 2025-12-04T12:25:43.1145280Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_addr_cuda_int8 PASSED [0.9300s] [ 2%] 2025-12-04T12:25:43.1145586Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_bool SKIPPED [0.0016s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1145968Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_complex128 SKIPPED [0.0013s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1146335Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1146690Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1147045Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_alias_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1147396Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1147749Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1148131Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1148470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_all_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1148809Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amax_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1149155Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1149497Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1149861Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1150200Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_amin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1150544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1150896Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1151244Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_aminmax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1151586Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1151935Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_complex128 SKIPPED [0.0035s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1152311Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1152680Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1153023Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1153365Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_angle_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1153707Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1154053Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1154402Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_float32 SKIPPED [0.0017s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1154757Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_int64 SKIPPED [0.0020s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1155094Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_any_cuda_int8 SKIPPED [0.0015s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1155439Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_bfloat16 SKIPPED [0.0014s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1155788Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1156132Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1156479Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1156825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1157174Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_arange_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1157524Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmin_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1157874Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmin_cuda_int16 SKIPPED [0.0013s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1158256Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argmin_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1158601Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argsort_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1158971Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1159325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1159679Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1160031Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_argwhere_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1160399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1160777Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1161167Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1161519Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_bfloat16 PASSED [0.0054s] [ 2%] 2025-12-04T12:25:43.1161791Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_int16 PASSED [0.0043s] [ 2%] 2025-12-04T12:25:43.1162059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_int64 PASSED [0.9335s] [ 2%] 2025-12-04T12:25:43.1162325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_cuda_int8 PASSED [0.0060s] [ 2%] 2025-12-04T12:25:43.1162616Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_complex32 XFAIL [0.0045s] [ 2%] 2025-12-04T12:25:43.1162927Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_float16 XFAIL [0.9063s] [ 2%] 2025-12-04T12:25:43.1163234Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_float64 XFAIL [0.9068s] [ 2%] 2025-12-04T12:25:43.1163553Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_int32 XFAIL [0.9031s] [ 2%] 2025-12-04T12:25:43.1163857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_partial_views_cuda_uint8 XFAIL [0.9038s] [ 2%] 2025-12-04T12:25:43.1164202Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_bfloat16 SKIPPED [0.9017s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1164585Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_bool SKIPPED [0.0016s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1164961Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1165337Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1165709Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_as_strided_scatter_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1166118Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1166472Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_float16 PASSED [0.9109s] [ 2%] 2025-12-04T12:25:43.1166734Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_float32 PASSED [0.9179s] [ 2%] 2025-12-04T12:25:43.1167079Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_float64 PASSED [0.0041s] [ 2%] 2025-12-04T12:25:43.1167430Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asin_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1167803Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_bfloat16 PASSED [0.9231s] [ 2%] 2025-12-04T12:25:43.1168204Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_bool SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1168656Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_complex128 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1169115Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_complex64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1169478Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_float16 PASSED [0.9183s] [ 2%] 2025-12-04T12:25:43.1169741Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_float32 PASSED [0.0043s] [ 2%] 2025-12-04T12:25:43.1170111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1170571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_asinh_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1170930Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan2_cuda_bfloat16 PASSED [0.0064s] [ 2%] 2025-12-04T12:25:43.1171192Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_bfloat16 PASSED [0.9195s] [ 2%] 2025-12-04T12:25:43.1171543Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_bool SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1171992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_complex32 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1172466Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_complex64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1172914Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1173354Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1173792Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atan_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1174146Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atanh_cuda_bfloat16 PASSED [0.9032s] [ 2%] 2025-12-04T12:25:43.1174510Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atanh_cuda_complex128 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.1174924Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1175282Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1175636Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_1d_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1175992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1176353Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1176736Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1177103Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1177463Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1177822Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1178281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_2d_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1178639Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1179002Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1179375Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1179743Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1180095Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_atleast_3d_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1180409Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_baddbmm_cuda_bfloat16 PASSED [1.4084s] [ 2%] 2025-12-04T12:25:43.1180685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_baddbmm_cuda_complex128 PASSED [0.0123s] [ 2%] 2025-12-04T12:25:43.1180959Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_baddbmm_cuda_complex64 PASSED [0.0747s] [ 2%] 2025-12-04T12:25:43.1181282Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bernoulli_cuda_bfloat16 SKIPPED [0.0012s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1181657Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bernoulli_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1182015Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1182366Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1182715Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bfloat16_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1183067Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bincount_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.1183418Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bincount_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1183728Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_and_cuda_bool PASSED [0.0061s] [ 3%] 2025-12-04T12:25:43.1183995Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_and_cuda_int64 PASSED [0.0057s] [ 3%] 2025-12-04T12:25:43.1184274Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_left_shift_cuda_int32 PASSED [0.0143s] [ 3%] 2025-12-04T12:25:43.1184564Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_left_shift_cuda_int64 PASSED [0.0057s] [ 3%] 2025-12-04T12:25:43.1184844Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_not_cuda_int8 PASSED [0.9181s] [ 3%] 2025-12-04T12:25:43.1185112Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_not_cuda_uint8 PASSED [0.0055s] [ 3%] 2025-12-04T12:25:43.1185376Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_or_cuda_int16 PASSED [0.0062s] [ 3%] 2025-12-04T12:25:43.1185655Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_or_cuda_int32 PASSED [0.0057s] [ 3%] 2025-12-04T12:25:43.1185923Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_or_cuda_int8 PASSED [0.0056s] [ 3%] 2025-12-04T12:25:43.1186188Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_or_cuda_uint8 PASSED [0.0056s] [ 3%] 2025-12-04T12:25:43.1186467Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_right_shift_cuda_int16 PASSED [0.0057s] [ 3%] 2025-12-04T12:25:43.1186751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_xor_cuda_int16 PASSED [0.0057s] [ 3%] 2025-12-04T12:25:43.1187020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_xor_cuda_int32 PASSED [0.0056s] [ 3%] 2025-12-04T12:25:43.1187289Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bitwise_xor_cuda_int8 PASSED [0.0056s] [ 3%] 2025-12-04T12:25:43.1187598Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1187970Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1188391Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1188746Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1189099Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_block_diag_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1189450Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bmm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1189799Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bmm_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1190144Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bmm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1190505Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1190849Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1191195Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1191550Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1191904Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1192251Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1192594Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1192933Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bool_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1193293Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_shapes_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1193676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1194064Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_tensors_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1194458Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1194832Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1195197Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1195565Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1195929Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_broadcast_to_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1196293Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1196659Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1197037Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1197409Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1197766Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_bucketize_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1198152Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1198504Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1198854Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1199202Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1199562Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_byte_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1199926Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1200311Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1200691Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1201064Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cartesian_prod_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1201434Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1201793Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1202143Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1202487Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cat_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1202794Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cauchy_cuda_float64 PASSED [0.9155s] [ 3%] 2025-12-04T12:25:43.1203111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_bfloat16 SKIPPED [0.0016s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1203484Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_int16 SKIPPED [0.0013s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1203839Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cdouble_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1204153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_float32 PASSED [0.8966s] [ 3%] 2025-12-04T12:25:43.1204419Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_float64 PASSED [0.0046s] [ 3%] 2025-12-04T12:25:43.1204683Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_int16 PASSED [0.9065s] [ 3%] 2025-12-04T12:25:43.1204944Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_int32 PASSED [0.0043s] [ 3%] 2025-12-04T12:25:43.1205202Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ceil_cuda_int8 PASSED [0.9189s] [ 3%] 2025-12-04T12:25:43.1205515Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_complex128 SKIPPED [0.0016s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1205893Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1206269Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cfloat_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1206628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1206986Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1207340Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1207686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chalf_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1208031Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1208442Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1208792Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1209136Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1209475Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1209812Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1210150Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_char_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1210508Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_inverse_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1210885Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_solve_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1211264Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cholesky_solve_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1211629Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1211981Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1212333Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1212691Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_chunk_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1212996Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_cuda_bfloat16 PASSED [0.9328s] [ 3%] 2025-12-04T12:25:43.1213259Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_cuda_float64 PASSED [0.0105s] [ 3%] 2025-12-04T12:25:43.1213524Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_cuda_int16 PASSED [0.0086s] [ 3%] 2025-12-04T12:25:43.1213785Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_cuda_int8 PASSED [0.0084s] [ 3%] 2025-12-04T12:25:43.1214049Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_bfloat16 PASSED [0.0067s] [ 3%] 2025-12-04T12:25:43.1214322Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_float64 PASSED [0.0065s] [ 3%] 2025-12-04T12:25:43.1214591Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_max_cuda_int32 PASSED [0.0065s] [ 3%] 2025-12-04T12:25:43.1214878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clamp_min_cuda_bfloat16 PASSED [0.0066s] [ 3%] 2025-12-04T12:25:43.1215276Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1215623Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1215973Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1216329Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1216676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1217023Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_clone_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1217400Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1217777Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1218181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1218545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1218905Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_column_stack_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1219270Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1219643Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1220015Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1220384Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_combinations_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1220746Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1221089Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1221451Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1221766Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_bool PASSED [0.9186s] [ 3%] 2025-12-04T12:25:43.1222041Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_conj_physical_cuda_int8 PASSED [0.0039s] [ 3%] 2025-12-04T12:25:43.1222362Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1222740Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1223117Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1223486Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1223866Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_constant_pad_nd_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1224244Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1224603Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1224968Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1225329Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1225685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_contiguous_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1226100Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T12:25:43.1226576Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_copysign_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T12:25:43.1226989Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1227351Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1227708Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1228061Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_corrcoef_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.1228492Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T12:25:43.1228937Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T12:25:43.1229390Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T12:25:43.1229749Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_float64 PASSED [0.0082s] [ 3%] 2025-12-04T12:25:43.1230102Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T12:25:43.1230563Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cos_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 3%] 2025-12-04T12:25:43.1231018Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_complex128 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T12:25:43.1231380Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_float16 PASSED [0.9142s] [ 4%] 2025-12-04T12:25:43.1231731Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cosh_cuda_int8 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T12:25:43.1232140Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_bfloat16 SKIPPED [0.0013s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1232518Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1232912Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1233301Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_count_nonzero_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1233655Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1234000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1234339Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cov_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1234682Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1235039Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1235402Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1235745Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1236090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cross_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1236437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1236785Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1237134Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1237486Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1237835Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1238218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1238562Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1238913Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1239265Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1239632Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1239981Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1240326Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cummin_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1240687Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_complex128 SKIPPED [0.0010s] (Function is in dispatch early skips) [ 4%] 2025-12-04T12:25:43.1241059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_float32 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 4%] 2025-12-04T12:25:43.1241421Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_int32 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 4%] 2025-12-04T12:25:43.1241792Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumprod_cuda_uint8 SKIPPED [0.0010s] (Function is in dispatch early skips) [ 4%] 2025-12-04T12:25:43.1242168Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumsum_cuda_float16 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 4%] 2025-12-04T12:25:43.1242528Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumsum_cuda_int16 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 4%] 2025-12-04T12:25:43.1242880Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumsum_cuda_uint8 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 4%] 2025-12-04T12:25:43.1243251Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_cumulative_trapezoid_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1243673Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T12:25:43.1244130Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T12:25:43.1244596Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T12:25:43.1245048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_deg2rad_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T12:25:43.1245453Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1245806Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1246154Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1246499Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1246853Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1247208Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1247563Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1247917Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diag_embed_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1248335Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1248721Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1249090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagflat_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1249456Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1249825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1250192Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1250555Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1250931Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1251306Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1251674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1252035Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1252393Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1252748Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1253104Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1253487Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1253873Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1254260Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1254642Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1255016Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1255392Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1255768Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diagonal_scatter_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1256132Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1256488Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1256842Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1257197Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1257560Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_diff_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1257963Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T12:25:43.1258371Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_float16 PASSED [0.9133s] [ 4%] 2025-12-04T12:25:43.1258740Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_digamma_cuda_int8 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T12:25:43.1259143Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dist_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1259472Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_float16 PASSED [0.9470s] [ 4%] 2025-12-04T12:25:43.1259771Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_int16 PASSED [0.0109s] [ 4%] 2025-12-04T12:25:43.1260074Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_int32 PASSED [0.0084s] [ 4%] 2025-12-04T12:25:43.1260377Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_floor_rounding_cuda_int64 PASSED [0.0080s] [ 4%] 2025-12-04T12:25:43.1260766Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_bool SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T12:25:43.1261263Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T12:25:43.1261665Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_no_rounding_mode_cuda_float64 PASSED [0.0059s] [ 4%] 2025-12-04T12:25:43.1261979Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_bfloat16 PASSED [0.0189s] [ 4%] 2025-12-04T12:25:43.1262282Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_float32 PASSED [0.0064s] [ 4%] 2025-12-04T12:25:43.1262604Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_int16 PASSED [0.0061s] [ 4%] 2025-12-04T12:25:43.1262899Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_int32 PASSED [0.0061s] [ 4%] 2025-12-04T12:25:43.1263194Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_div_trunc_rounding_cuda_int8 PASSED [0.0061s] [ 4%] 2025-12-04T12:25:43.1263522Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1263888Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1264249Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1264603Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_double_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1264961Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1265325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1265685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1266040Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1266389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dsplit_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1266757Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1267110Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_dstack_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1267466Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_einsum_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1267828Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_einsum_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1268224Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_einsum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1268581Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1268965Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1269347Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1269716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1270081Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1270446Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1270809Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1271169Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_like_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1271555Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1271944Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1272330Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1272718Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1273096Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_permuted_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1273474Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1273855Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1274238Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1274613Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1274981Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_empty_strided_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1275295Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_bool PASSED [0.9172s] [ 4%] 2025-12-04T12:25:43.1275572Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_complex32 PASSED [0.0087s] [ 4%] 2025-12-04T12:25:43.1275842Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_complex64 PASSED [0.9008s] [ 4%] 2025-12-04T12:25:43.1276107Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_int16 PASSED [0.0080s] [ 4%] 2025-12-04T12:25:43.1276361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eq_cuda_int32 PASSED [0.9089s] [ 4%] 2025-12-04T12:25:43.1276669Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_bfloat16 SKIPPED [0.0016s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1277027Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_complex128 SKIPPED [0.0013s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1277378Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1277723Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1278129Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1278493Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_equal_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.1278802Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_float32 PASSED [0.9217s] [ 4%] 2025-12-04T12:25:43.1279160Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_int32 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T12:25:43.1279608Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_int64 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T12:25:43.1280050Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erf_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 4%] 2025-12-04T12:25:43.1280407Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_float16 PASSED [0.9097s] [ 4%] 2025-12-04T12:25:43.1280689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_float64 PASSED [0.9090s] [ 5%] 2025-12-04T12:25:43.1281048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfc_cuda_int32 SKIPPED [0.0017s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1281500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_erfinv_cuda_int16 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1281865Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_bfloat16 PASSED [1.0277s] [ 5%] 2025-12-04T12:25:43.1282132Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_float64 PASSED [0.9048s] [ 5%] 2025-12-04T12:25:43.1282490Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp2_cuda_uint8 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1282846Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_bfloat16 PASSED [0.0048s] [ 5%] 2025-12-04T12:25:43.1283199Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_bool SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1283551Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_float16 PASSED [0.9023s] [ 5%] 2025-12-04T12:25:43.1283906Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1284349Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_int64 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1284806Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exp_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1285214Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_as_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1285582Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_as_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1285947Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_copy_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1286315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_copy_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1286690Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1287068Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1287441Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_copy_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1287796Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expand_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1288218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1288676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_complex128 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1289044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_float16 PASSED [0.9010s] [ 5%] 2025-12-04T12:25:43.1289314Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_float32 PASSED [0.0041s] [ 5%] 2025-12-04T12:25:43.1289691Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1290141Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_expm1_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1290511Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exponential_cuda_float16 PASSED [0.0085s] [ 5%] 2025-12-04T12:25:43.1290800Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_exponential_cuda_float64 PASSED [0.0044s] [ 5%] 2025-12-04T12:25:43.1291130Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_float8_e5m2fnuz SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1291485Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1291821Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1292156Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_eye_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1292501Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft2_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1292860Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1293218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fft_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1293572Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1293946Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1294304Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftn_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1294657Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1295022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1295394Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1295761Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1296136Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1296516Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_fftshift_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1296871Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1297225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1297578Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1297934Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft2_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1298333Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1298705Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfft_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1299067Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1299431Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_complex32 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1299793Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1300148Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1300500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_hfftn_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1300862Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1301227Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1301584Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1301937Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft2_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1302289Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1302662Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1303030Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1303389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1303739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifft_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1304090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1304452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1304825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftn_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1305212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1305597Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1305962Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ifftshift_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1306327Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1306692Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1307043Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1307399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft2_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1307783Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1308180Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1308538Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1308894Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfft_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1309254Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1309621Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1309985Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_ihfftn_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1310343Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1310706Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1311077Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1311444Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1311825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1312185Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft2_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1312552Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1312913Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1313268Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1313623Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfft_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1313979Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1314361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1314737Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1315095Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1315451Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_irfftn_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1315808Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1316165Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1316533Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1316964Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft2_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1317310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1317656Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfft_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1318009Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfftn_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1318479Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfftn_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1318836Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fft_rfftn_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1319151Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_bfloat16 PASSED [0.9204s] [ 5%] 2025-12-04T12:25:43.1319419Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_float16 PASSED [0.0053s] [ 5%] 2025-12-04T12:25:43.1319683Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_float32 PASSED [0.0037s] [ 5%] 2025-12-04T12:25:43.1319947Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_float64 PASSED [0.9047s] [ 5%] 2025-12-04T12:25:43.1320209Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fill_cuda_int8 PASSED [0.0051s] [ 5%] 2025-12-04T12:25:43.1320523Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1320905Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flatten_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1321261Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1321615Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1321971Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1322323Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1322670Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1323015Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flip_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1323374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1323747Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fliplr_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1324108Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flipud_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1324463Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flipud_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1324813Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_flipud_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1325166Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1325523Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1325888Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.1326255Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_bfloat16 SKIPPED [0.0010s] (Function is in dispatch early skips) [ 5%] 2025-12-04T12:25:43.1326687Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1327171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1327604Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_float32 SKIPPED [0.0010s] (Function is in dispatch early skips) [ 5%] 2025-12-04T12:25:43.1328030Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1328540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_float_power_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.1328907Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_float64 PASSED [0.9137s] [ 5%] 2025-12-04T12:25:43.1329176Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_int16 PASSED [0.0041s] [ 5%] 2025-12-04T12:25:43.1329439Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_int8 PASSED [0.9065s] [ 5%] 2025-12-04T12:25:43.1329701Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_cuda_uint8 PASSED [0.0044s] [ 5%] 2025-12-04T12:25:43.1329991Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_divide_cuda_int64 PASSED [0.0090s] [ 5%] 2025-12-04T12:25:43.1330271Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_floor_divide_cuda_uint8 PASSED [0.0062s] [ 6%] 2025-12-04T12:25:43.1330585Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1330938Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1331288Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1331636Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1331980Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1332347Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1332706Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1333048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmin_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1333354Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmod_cuda_bfloat16 PASSED [0.9108s] [ 6%] 2025-12-04T12:25:43.1333619Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_fmod_cuda_float16 PASSED [0.0086s] [ 6%] 2025-12-04T12:25:43.1333884Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_frac_cuda_float64 PASSED [0.0033s] [ 6%] 2025-12-04T12:25:43.1334199Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_frexp_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1334557Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_frexp_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1334919Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1335267Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1335617Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1335967Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1336312Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1336656Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1337012Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1337378Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1337743Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1338418Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1338815Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1339234Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_uint16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1339642Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_full_like_cuda_uint32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1340046Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gather_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1340459Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gather_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1340860Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gather_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1341241Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gather_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1341651Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gather_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1342012Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gcd_cuda_int16 PASSED [0.3232s] [ 6%] 2025-12-04T12:25:43.1342330Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gcd_cuda_int64 PASSED [0.0078s] [ 6%] 2025-12-04T12:25:43.1342640Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ge_cuda_float16 PASSED [0.0060s] [ 6%] 2025-12-04T12:25:43.1342935Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ge_cuda_float32 PASSED [0.0058s] [ 6%] 2025-12-04T12:25:43.1343249Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ge_cuda_float64 PASSED [0.0057s] [ 6%] 2025-12-04T12:25:43.1343540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ge_cuda_int64 PASSED [0.0056s] [ 6%] 2025-12-04T12:25:43.1343822Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ge_cuda_int8 PASSED [0.0056s] [ 6%] 2025-12-04T12:25:43.1344148Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_float16 PASSED [0.0082s] [ 6%] 2025-12-04T12:25:43.1344464Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_float32 PASSED [0.0043s] [ 6%] 2025-12-04T12:25:43.1344879Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_geometric_cuda_uint8 PASSED [0.0042s] [ 6%] 2025-12-04T12:25:43.1345250Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1345653Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1346068Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1346460Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gradient_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1346898Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_grid_sampler_2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1347331Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_grid_sampler_2d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1347726Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_grid_sampler_3d_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 6%] 2025-12-04T12:25:43.1348109Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_float32 PASSED [0.9142s] [ 6%] 2025-12-04T12:25:43.1348405Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_int16 PASSED [0.0073s] [ 6%] 2025-12-04T12:25:43.1348695Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_gt_cuda_int8 PASSED [0.0058s] [ 6%] 2025-12-04T12:25:43.1349051Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1349464Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1349862Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1350254Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1350639Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1351030Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1351412Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_half_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1351793Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1352228Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1352646Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hash_tensor_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1353005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_heaviside_cuda_float64 PASSED [0.0307s] [ 6%] 2025-12-04T12:25:43.1353327Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_heaviside_cuda_uint8 PASSED [0.0077s] [ 6%] 2025-12-04T12:25:43.1353671Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_histc_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1354082Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1354475Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1354884Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1355302Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1355689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hsplit_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1356089Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1356490Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1356881Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1357290Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_hstack_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1357725Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 6%] 2025-12-04T12:25:43.1358265Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_i0_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 6%] 2025-12-04T12:25:43.1358664Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_igamma_cuda_float32 PASSED [0.9204s] [ 6%] 2025-12-04T12:25:43.1358977Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_igamma_cuda_float64 PASSED [0.0077s] [ 6%] 2025-12-04T12:25:43.1359294Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_igammac_cuda_float32 PASSED [0.0060s] [ 6%] 2025-12-04T12:25:43.1359619Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_bfloat16 PASSED [0.0717s] [ 6%] 2025-12-04T12:25:43.1359932Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_complex32 PASSED [0.9272s] [ 6%] 2025-12-04T12:25:43.1360265Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_add_cuda_float32 PASSED [0.0103s] [ 6%] 2025-12-04T12:25:43.1360576Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_bool PASSED [0.9203s] [ 6%] 2025-12-04T12:25:43.1360890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_float16 PASSED [0.9109s] [ 6%] 2025-12-04T12:25:43.1361207Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_float64 PASSED [0.0056s] [ 6%] 2025-12-04T12:25:43.1361519Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_int32 PASSED [0.0041s] [ 6%] 2025-12-04T12:25:43.1361835Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_copy_cuda_int64 PASSED [0.0039s] [ 6%] 2025-12-04T12:25:43.1362149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_fill_cuda_complex128 PASSED [0.0058s] [ 6%] 2025-12-04T12:25:43.1362484Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_fill_cuda_complex64 PASSED [0.0054s] [ 6%] 2025-12-04T12:25:43.1362839Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_fill_cuda_float64 PASSED [0.0053s] [ 6%] 2025-12-04T12:25:43.1363147Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_fill_cuda_int64 PASSED [0.0053s] [ 6%] 2025-12-04T12:25:43.1363456Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_bool PASSED [0.0149s] [ 6%] 2025-12-04T12:25:43.1363778Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_complex128 PASSED [0.9174s] [ 6%] 2025-12-04T12:25:43.1364091Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_float16 PASSED [0.0067s] [ 6%] 2025-12-04T12:25:43.1364412Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_float32 PASSED [0.0048s] [ 6%] 2025-12-04T12:25:43.1364723Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_float64 PASSED [0.0046s] [ 6%] 2025-12-04T12:25:43.1365042Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_int16 PASSED [0.0045s] [ 6%] 2025-12-04T12:25:43.1365366Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_put_cuda_int32 PASSED [0.9073s] [ 6%] 2025-12-04T12:25:43.1365685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_bfloat16 PASSED [0.0084s] [ 6%] 2025-12-04T12:25:43.1366022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amax_cuda_int16 PASSED [0.0065s] [ 6%] 2025-12-04T12:25:43.1366359Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_float16 PASSED [0.0064s] [ 6%] 2025-12-04T12:25:43.1366686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_int16 PASSED [0.0063s] [ 6%] 2025-12-04T12:25:43.1367034Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_int64 PASSED [0.0062s] [ 6%] 2025-12-04T12:25:43.1367358Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_amin_cuda_int8 PASSED [0.0063s] [ 6%] 2025-12-04T12:25:43.1367688Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_bfloat16 PASSED [0.0064s] [ 6%] 2025-12-04T12:25:43.1368038Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_float16 PASSED [0.0063s] [ 6%] 2025-12-04T12:25:43.1368399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_float32 PASSED [0.0063s] [ 6%] 2025-12-04T12:25:43.1368742Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_int32 PASSED [0.0063s] [ 6%] 2025-12-04T12:25:43.1369071Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_int64 PASSED [0.0063s] [ 6%] 2025-12-04T12:25:43.1369403Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_reduce_prod_cuda_int8 PASSED [0.0063s] [ 6%] 2025-12-04T12:25:43.1369810Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1370212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1370616Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1371042Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1371452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1371859Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1372286Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_index_select_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1372697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_inner_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1373096Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1373492Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1373902Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1374297Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1374676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_int_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1375091Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1375486Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isclose_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1375871Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1376285Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isfinite_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1376672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isin_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1377060Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1377452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1377839Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1378283Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1404751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1405159Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1405510Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1405964Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isinf_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1406313Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.1406661Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isnan_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1407015Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1407373Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1407726Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1408124Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1408499Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isneginf_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1408868Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1409218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isposinf_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1409571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1409930Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1410290Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1410665Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_isreal_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1411018Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_istft_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1411368Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1411710Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_item_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1412086Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1412501Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1412914Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1413318Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_2inputs_2outputs_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1413725Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1414141Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1414561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1414996Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1415389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1415767Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1416145Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1416540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1416953Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1417502Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1417927Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1418369Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_binary_return_by_ref_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1418759Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1419129Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1419496Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1419925Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1420291Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_jiterator_unary_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1420650Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kron_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1420995Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kron_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1421337Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kron_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1421686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1422046Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1422399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1422749Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1423102Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_kthvalue_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1423406Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lcm_cuda_uint8 PASSED [0.3039s] [ 7%] 2025-12-04T12:25:43.1423668Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_float16 PASSED [0.0079s] [ 7%] 2025-12-04T12:25:43.1423960Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_float32 PASSED [0.9367s] [ 7%] 2025-12-04T12:25:43.1424318Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_int16 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T12:25:43.1424773Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ldexp_cuda_int32 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T12:25:43.1425129Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_bfloat16 PASSED [0.0071s] [ 7%] 2025-12-04T12:25:43.1425387Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_bool PASSED [0.0057s] [ 7%] 2025-12-04T12:25:43.1425636Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_le_cuda_int16 PASSED [0.0056s] [ 7%] 2025-12-04T12:25:43.1425890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_float32 PASSED [0.0142s] [ 7%] 2025-12-04T12:25:43.1426148Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lerp_cuda_float64 PASSED [0.0074s] [ 7%] 2025-12-04T12:25:43.1426518Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T12:25:43.1426891Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_float16 PASSED [0.9137s] [ 7%] 2025-12-04T12:25:43.1427155Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_float64 PASSED [0.0065s] [ 7%] 2025-12-04T12:25:43.1427509Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lgamma_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T12:25:43.1427932Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cholesky_ex_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1428365Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cond_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1428732Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cond_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1429111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1429474Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1429837Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1430197Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1430359Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1430523Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_cross_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1430689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_det_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1430856Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_diagonal_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1431020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_diagonal_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1431184Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_diagonal_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1431351Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigh_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1431532Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigh_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1431701Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigvals_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1431867Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigvals_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1432096Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigvalsh_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1432264Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_eigvalsh_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1432496Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_householder_product_cuda_complex64 SKIPPED [0.0006s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 7%] 2025-12-04T12:25:43.1432737Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_householder_product_cuda_float32 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 7%] 2025-12-04T12:25:43.1432924Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_inv_ex_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1433090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_inv_ex_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1433252Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_inv_ex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1433422Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_factor_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1433593Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_factor_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1433786Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_factor_ex_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1433999Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_solve_cuda_complex128 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 7%] 2025-12-04T12:25:43.1434208Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_ldl_solve_cuda_float32 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 7%] 2025-12-04T12:25:43.1434370Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lstsq_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1434534Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1434696Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1434865Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_factor_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1435036Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_factor_ex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1435207Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_lu_solve_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1435381Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1435558Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_power_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1435742Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1435933Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_hermitian_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1436119Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_matrix_rank_hermitian_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1436289Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_multi_dot_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1436457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_multi_dot_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1436625Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_multi_dot_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1436803Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_multi_dot_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1436978Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1437141Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1437303Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1437469Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1437635Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1437829Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_hermitian_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1438009Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_hermitian_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1438218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_hermitian_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1438397Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_pinv_hermitian_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1438612Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_qr_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1438783Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_slogdet_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1438955Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_slogdet_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1439122Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1439290Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1439453Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1439618Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_ex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1439802Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_ex_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1439988Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_triangular_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1440169Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_solve_triangular_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1440330Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_svd_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1440496Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_svdvals_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1440662Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_svdvals_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1440851Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_tensorinv_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1441034Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_tensorinv_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1441212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_tensorsolve_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1441381Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1441546Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1441706Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1441871Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vander_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1442062Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vecdot_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1442227Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vecdot_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1442390Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vecdot_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1442563Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1442734Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linalg_vector_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1442896Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.1443057Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1443212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1443394Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1443575Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1443752Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1443941Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1444119Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_linspace_tensor_overload_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1444333Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log10_cuda_complex128 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1444450Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_bfloat16 PASSED [0.9244s] [ 8%] 2025-12-04T12:25:43.1444662Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_complex128 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1444871Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_complex64 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1444998Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_float64 PASSED [0.9106s] [ 8%] 2025-12-04T12:25:43.1445212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log1p_cuda_int32 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1445403Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_bfloat16 PASSED [0.9277s] [ 8%] 2025-12-04T12:25:43.1445615Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_complex128 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1445728Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_float16 PASSED [0.9285s] [ 8%] 2025-12-04T12:25:43.1445838Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_float32 PASSED [0.0055s] [ 8%] 2025-12-04T12:25:43.1446163Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1446381Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_int32 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1446583Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log2_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1446792Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1446902Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_float16 PASSED [0.9130s] [ 8%] 2025-12-04T12:25:43.1447013Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_float64 PASSED [0.0053s] [ 8%] 2025-12-04T12:25:43.1447215Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1447416Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1447615Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1447735Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_normal_cuda_float16 PASSED [0.0087s] [ 8%] 2025-12-04T12:25:43.1447851Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_normal_cuda_float64 PASSED [0.0045s] [ 8%] 2025-12-04T12:25:43.1448018Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1448228Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1448392Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1448569Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_with_dtype_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1448741Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_log_softmax_with_dtype_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1448905Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp2_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1449067Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1449250Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1449429Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1449589Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logaddexp_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1449756Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1449921Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logcumsumexp_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1450037Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_bool PASSED [0.9318s] [ 8%] 2025-12-04T12:25:43.1450164Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_complex128 PASSED [0.2840s] [ 8%] 2025-12-04T12:25:43.1450301Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_float16 PASSED [0.0056s] [ 8%] 2025-12-04T12:25:43.1450425Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_float64 PASSED [0.0052s] [ 8%] 2025-12-04T12:25:43.1450543Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_int16 PASSED [0.0052s] [ 8%] 2025-12-04T12:25:43.1450658Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_and_cuda_int32 PASSED [0.0052s] [ 8%] 2025-12-04T12:25:43.1450781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_complex128 PASSED [0.9259s] [ 8%] 2025-12-04T12:25:43.1450902Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_not_cuda_complex64 PASSED [0.0052s] [ 8%] 2025-12-04T12:25:43.1451020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_bfloat16 PASSED [0.0058s] [ 8%] 2025-12-04T12:25:43.1451137Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_bool PASSED [0.0054s] [ 8%] 2025-12-04T12:25:43.1451254Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_int16 PASSED [0.0053s] [ 8%] 2025-12-04T12:25:43.1451366Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_int8 PASSED [0.0052s] [ 8%] 2025-12-04T12:25:43.1451479Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_or_cuda_uint8 PASSED [0.0051s] [ 8%] 2025-12-04T12:25:43.1451600Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_xor_cuda_complex128 PASSED [0.2810s] [ 8%] 2025-12-04T12:25:43.1451718Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_xor_cuda_float32 PASSED [0.0055s] [ 8%] 2025-12-04T12:25:43.1451834Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logical_xor_cuda_float64 PASSED [0.0053s] [ 8%] 2025-12-04T12:25:43.1452055Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_bool SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1452168Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_float16 PASSED [0.0060s] [ 8%] 2025-12-04T12:25:43.1452279Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_float64 PASSED [0.0054s] [ 8%] 2025-12-04T12:25:43.1452485Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1452693Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logit_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.1452858Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1453021Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1453193Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1453361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1453519Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1453700Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1453878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logspace_tensor_overload_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1454036Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1454207Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1454365Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_logsumexp_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1454521Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_long_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1454631Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_float32 PASSED [0.0058s] [ 8%] 2025-12-04T12:25:43.1454739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lt_cuda_uint8 PASSED [0.0057s] [ 8%] 2025-12-04T12:25:43.1454894Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1455053Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_lu_solve_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1455202Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1455357Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1455511Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1455661Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1455810Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1455958Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mH_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1456125Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1456278Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1456430Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1456579Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1456728Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1456877Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1457027Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mT_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1457204Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1457377Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amax_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1457541Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1457701Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1457857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_amin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1458022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1458231Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1458396Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1458557Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmax_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1458722Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1458887Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1459050Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1459212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_argmin_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1459377Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumprod_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1459544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1459705Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_cumsum_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1459826Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_bfloat16 PASSED [0.0069s] [ 8%] 2025-12-04T12:25:43.1459941Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_bool PASSED [0.0062s] [ 8%] 2025-12-04T12:25:43.1460082Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_complex128 PASSED [0.0064s] [ 8%] 2025-12-04T12:25:43.1460204Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_float32 PASSED [0.0064s] [ 8%] 2025-12-04T12:25:43.1460319Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_fill_cuda_int16 PASSED [0.0063s] [ 8%] 2025-12-04T12:25:43.1460491Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_log_softmax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1460663Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_log_softmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1460833Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logaddexp_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1461003Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logaddexp_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1461185Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1461367Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1461534Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1461700Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1461865Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_logsumexp_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1462032Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_mean_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1462205Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_mean_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1462377Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_median_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1462542Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_median_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1462707Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1462876Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_normalize_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1463051Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_normalize_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1463227Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_normalize_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1463386Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1463549Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1463709Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1463869Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1464038Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_prod_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.1464166Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_bfloat16 PASSED [0.9272s] [ 8%] 2025-12-04T12:25:43.1464297Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_complex64 PASSED [0.0057s] [ 8%] 2025-12-04T12:25:43.1464422Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_float16 PASSED [0.0043s] [ 8%] 2025-12-04T12:25:43.1464544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_float64 PASSED [0.0041s] [ 8%] 2025-12-04T12:25:43.1464665Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_scatter_cuda_int16 PASSED [0.9313s] [ 8%] 2025-12-04T12:25:43.1464838Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_complex128 SKIPPED [0.0015s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1465003Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_float16 SKIPPED [0.0012s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1465181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1465353Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_select_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1465520Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_softmin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1465686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_softmin_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1465847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1466005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1466178Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_std_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1466339Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1466495Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1466660Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1466823Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1466983Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1467142Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_sum_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1467304Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1467461Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1467618Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_masked_var_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1467773Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matmul_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1467935Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_matrix_exp_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1468130Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1468291Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1468448Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_binary_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1468637Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_pool2d_with_indices_backward_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1468813Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_no_dim_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1468988Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_with_dim_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1469179Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_max_reduction_with_dim_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1469351Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_maximum_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1469508Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mean_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1469661Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_median_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1469848Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1470031Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1470211Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1470400Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_list_of_tensors_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1470583Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1470764Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1470950Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1471138Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1471316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1471494Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_meshgrid_variadic_tensors_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1471650Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1471811Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1471971Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_binary_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1472164Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_no_dim_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1472337Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_no_dim_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1472505Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_no_dim_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1472675Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_no_dim_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1472852Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1473023Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1473217Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1473408Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_min_reduction_with_dim_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1473564Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1473724Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1473878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1474032Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1474188Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_minimum_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1474356Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1474513Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1474664Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mm_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1474818Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1474971Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mode_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1475122Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mode_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1475272Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mode_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1475437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1475601Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1475760Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1475915Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1476072Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1476237Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_movedim_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1476394Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1476549Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_msort_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1476660Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_float16 PASSED [0.0062s] [ 9%] 2025-12-04T12:25:43.1476773Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_float64 PASSED [0.0053s] [ 9%] 2025-12-04T12:25:43.1476884Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_int16 PASSED [0.0053s] [ 9%] 2025-12-04T12:25:43.1476991Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_int32 PASSED [0.0053s] [ 9%] 2025-12-04T12:25:43.1477097Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_int64 PASSED [0.0052s] [ 9%] 2025-12-04T12:25:43.1477216Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mul_cuda_int8 PASSED [0.0052s] [ 9%] 2025-12-04T12:25:43.1477386Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mv_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1477539Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mv_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1477674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_float64 PASSED [0.9430s] [ 9%] 2025-12-04T12:25:43.1477906Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 9%] 2025-12-04T12:25:43.1478172Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int32 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 9%] 2025-12-04T12:25:43.1478324Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_bfloat16 PASSED [0.0132s] [ 9%] 2025-12-04T12:25:43.1478457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float32 PASSED [0.0111s] [ 9%] 2025-12-04T12:25:43.1478594Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float64 PASSED [0.9359s] [ 9%] 2025-12-04T12:25:43.1478716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_float32 PASSED [0.0068s] [ 9%] 2025-12-04T12:25:43.1478832Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_int16 PASSED [0.0036s] [ 9%] 2025-12-04T12:25:43.1478948Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nan_to_num_cuda_int64 PASSED [0.9061s] [ 9%] 2025-12-04T12:25:43.1479110Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_complex32 SKIPPED [0.0016s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1479274Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmean_cuda_complex64 SKIPPED [0.0013s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1479437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmedian_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1479597Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmedian_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1479755Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nanmedian_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1479913Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1480071Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1480243Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1480397Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1480553Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nansum_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1480719Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1480881Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1481045Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1481216Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1481388Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1481542Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1481699Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1481850Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1482002Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1482156Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1482323Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_narrow_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1482494Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_batch_norm_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1482678Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_dropout_backward_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1482858Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_dropout_backward_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1483029Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_layer_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1483202Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_native_layer_norm_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1483318Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_bfloat16 PASSED [0.0070s] [ 9%] 2025-12-04T12:25:43.1483435Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_complex128 PASSED [0.0058s] [ 9%] 2025-12-04T12:25:43.1483546Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_float16 PASSED [0.0058s] [ 9%] 2025-12-04T12:25:43.1483656Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_float32 PASSED [0.0057s] [ 9%] 2025-12-04T12:25:43.1483760Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ne_cuda_int16 PASSED [0.0056s] [ 9%] 2025-12-04T12:25:43.1483873Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_complex32 PASSED [0.9209s] [ 9%] 2025-12-04T12:25:43.1483983Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_float16 PASSED [0.0043s] [ 9%] 2025-12-04T12:25:43.1484105Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_neg_cuda_int32 PASSED [0.9131s] [ 9%] 2025-12-04T12:25:43.1484268Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_bfloat16 SKIPPED [0.0016s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1484434Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_complex32 SKIPPED [0.0013s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1484592Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1484751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1484908Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1485083Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1485273Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1485457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1485628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1485793Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1485961Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1486126Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1486306Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_empty_strided_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1486460Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1486623Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1486780Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.1486935Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1487093Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1487245Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1487403Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_full_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1487566Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1487727Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1487885Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_ones_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1488051Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1488274Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1488432Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1488585Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_new_zeros_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1488707Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nextafter_cuda_bfloat16 PASSED [0.0069s] [ 10%] 2025-12-04T12:25:43.1488822Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nextafter_cuda_float16 PASSED [0.0057s] [ 10%] 2025-12-04T12:25:43.1489017Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1489228Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1489432Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1489626Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1489814Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1490008Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1490201Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1490408Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1490597Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1490776Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool1d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1490958Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool1d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1491135Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool2d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1491321Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1491502Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_avg_pool3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1491681Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_batch_norm_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1491878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1492073Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_batch_norm_without_cudnn_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1492251Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_bilinear_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1492449Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_bilinear_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1492641Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_binary_cross_entropy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1492774Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_celu_cuda_float32 PASSED [0.0118s] [ 10%] 2025-12-04T12:25:43.1492905Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_celu_cuda_float64 PASSED [0.0040s] [ 10%] 2025-12-04T12:25:43.1493095Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1493281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1493479Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_channel_shuffle_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1493672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv1d_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1493850Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv1d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1494029Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv2d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1494204Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1494379Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv2d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1494569Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv3d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1494749Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv3d_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1494925Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1495116Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose1d_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1495309Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose1d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1495497Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1495689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose1d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1495878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose2d_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1496073Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose2d_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1496262Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1496465Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose3d_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1496654Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose3d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1496840Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_conv_transpose3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1497033Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1497223Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_similarity_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1497411Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cosine_similarity_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1497617Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_cross_entropy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1497768Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout2d_cuda_float32 PASSED [0.0194s] [ 10%] 2025-12-04T12:25:43.1497906Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout3d_cuda_bfloat16 PASSED [0.9389s] [ 10%] 2025-12-04T12:25:43.1498044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout3d_cuda_float16 PASSED [0.0132s] [ 10%] 2025-12-04T12:25:43.1498412Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_dropout_cuda_float64 PASSED [0.0134s] [ 10%] 2025-12-04T12:25:43.1498542Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_elu_cuda_bfloat16 PASSED [0.0049s] [ 10%] 2025-12-04T12:25:43.1498667Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_elu_cuda_float16 PASSED [0.0044s] [ 10%] 2025-12-04T12:25:43.1498797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_elu_cuda_float32 PASSED [0.0043s] [ 10%] 2025-12-04T12:25:43.1498942Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_elu_cuda_float64 PASSED [0.0043s] [ 10%] 2025-12-04T12:25:43.1499124Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_embedding_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1499303Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_embedding_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1499483Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_embedding_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1499649Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_bool PASSED [0.9303s] [ 10%] 2025-12-04T12:25:43.1499823Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex128 PASSED [0.0077s] [ 10%] 2025-12-04T12:25:43.1499992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_float32 PASSED [0.0063s] [ 10%] 2025-12-04T12:25:43.1500157Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int32 PASSED [0.0063s] [ 10%] 2025-12-04T12:25:43.1500325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int8 PASSED [0.9372s] [ 10%] 2025-12-04T12:25:43.1500518Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float64 SKIPPED [0.0016s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1500714Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_fractional_max_pool3d_cuda_float32 SKIPPED [0.0014s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1500902Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_gelu_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1501073Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_glu_cuda_float32 SKIPPED [0.0013s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1501253Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_grid_sample_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1501432Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_grid_sample_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1501614Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_group_norm_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1501798Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_group_norm_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1501995Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardshrink_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1502148Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_hardsigmoid_cuda_float32 PASSED [0.0134s] [ 10%] 2025-12-04T12:25:43.1502328Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_huber_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1502512Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_instance_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1502698Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_instance_norm_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1502891Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_area_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1503101Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_linear_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1503296Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_linear_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1503489Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_linear_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1503693Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1503895Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1504094Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_interpolate_nearest_cuda_float32 SKIPPED [0.0020s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1504268Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_kl_div_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1504445Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_l1_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1504624Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_l1_loss_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1504802Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_layer_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1504959Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_leaky_relu_cuda_bfloat16 PASSED [0.0155s] [ 10%] 2025-12-04T12:25:43.1505138Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_linear_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1505333Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_local_response_norm_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1505514Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_logsigmoid_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1505706Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1505896Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1506096Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_margin_ranking_loss_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1506286Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool1d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1506467Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1506646Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool2d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1506824Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_pool3d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1507006Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1507206Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool2d_grad_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1507397Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool3d_grad_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1507583Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1507717Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mish_cuda_bfloat16 PASSED [0.0152s] [ 10%] 2025-12-04T12:25:43.1507847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mish_cuda_float16 PASSED [0.0044s] [ 10%] 2025-12-04T12:25:43.1507978Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mish_cuda_float64 PASSED [0.0042s] [ 10%] 2025-12-04T12:25:43.1508206Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mse_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1508384Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_mse_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1508587Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multi_head_attention_forward_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1508774Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multi_margin_loss_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1508960Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multi_margin_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1509171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1509351Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_nll_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1509533Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_normalize_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1509715Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_normalize_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1509894Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_normalize_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1510072Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_circular_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1510275Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1510466Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1510648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_constant_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1510832Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1511011Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1511190Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1511384Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_reflect_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1511576Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.1511760Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1511939Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1512121Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1512324Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1512526Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1512725Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1512921Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1513119Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1513323Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pad_replicate_negative_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1513516Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1513701Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pairwise_distance_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1513877Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pdist_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1514063Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1514242Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1514442Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1514632Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1514813Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1514992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_shuffle_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1515175Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1515361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1515563Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1515746Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_pixel_unshuffle_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1515934Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1516118Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_poisson_nll_loss_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1516291Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_prelu_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1516465Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_prelu_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1516637Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu6_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1516806Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu6_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1516974Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu6_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1517148Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1517315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_relu_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1517502Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_rms_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1517683Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_rms_norm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1517817Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_rrelu_cuda_float16 PASSED [0.9504s] [ 11%] 2025-12-04T12:25:43.1517962Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_silu_complex_cuda_complex128 PASSED [0.0128s] [ 11%] 2025-12-04T12:25:43.1518470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_silu_cuda_float16 PASSED [0.0038s] [ 11%] 2025-12-04T12:25:43.1518659Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_smooth_l1_loss_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1518861Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_smooth_l1_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1519072Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_soft_margin_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1519258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_soft_margin_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1519444Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_soft_margin_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1519635Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1519826Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1520031Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softmin_with_dtype_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1520208Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softplus_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1520389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softshrink_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1520561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1520738Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_softsign_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1520920Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1521110Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1521294Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1521476Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1521654Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1521833Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_tanhshrink_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1521983Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_threshold_cuda_int16 PASSED [0.9614s] [ 11%] 2025-12-04T12:25:43.1522116Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_threshold_cuda_uint8 PASSED [0.0068s] [ 11%] 2025-12-04T12:25:43.1522312Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1522502Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1522694Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1522882Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_loss_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1523112Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1523335Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1523546Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1523752Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1523929Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_unfold_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1524109Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_unfold_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1524297Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_unfold_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1524487Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_upsample_nearest_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1524674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_upsample_nearest_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1524863Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nn_functional_upsample_nearest_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1525025Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1525188Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1525348Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1525506Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1525657Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_bool SKIPPED [0.0005s] (Only runs on cpu) [ 11%] 2025-12-04T12:25:43.1525811Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_complex32 SKIPPED [0.0005s] (Only runs on cpu) [ 11%] 2025-12-04T12:25:43.1525964Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_complex64 SKIPPED [0.0005s] (Only runs on cpu) [ 11%] 2025-12-04T12:25:43.1526123Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_int16 SKIPPED [0.0005s] (Only runs on cpu) [ 11%] 2025-12-04T12:25:43.1526270Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_int64 SKIPPED [0.0005s] (Only runs on cpu) [ 11%] 2025-12-04T12:25:43.1526415Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_nonzero_static_cuda_int8 SKIPPED [0.0005s] (Only runs on cpu) [ 11%] 2025-12-04T12:25:43.1526575Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1526738Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_fro_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1526897Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_inf_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1527059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_inf_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1527232Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_inf_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1527404Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_nuc_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1527565Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_norm_nuc_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1527698Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_in_place_cuda_complex128 PASSED [0.9473s] [ 11%] 2025-12-04T12:25:43.1527825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_in_place_cuda_complex64 PASSED [0.0054s] [ 11%] 2025-12-04T12:25:43.1527952Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_in_place_cuda_float16 PASSED [0.0038s] [ 11%] 2025-12-04T12:25:43.1528090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_in_place_cuda_float64 PASSED [0.9593s] [ 11%] 2025-12-04T12:25:43.1528286Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_normal_number_mean_cuda_float32 SKIPPED [0.0016s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1528443Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1528596Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1528747Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1528900Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1529064Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1529224Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1529383Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1529544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1529701Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1529858Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ones_like_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1530016Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ormqr_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1530189Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1530347Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1530501Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1530656Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1530805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_outer_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1530973Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pca_lowrank_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1531153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pca_lowrank_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1531330Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1531497Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1531665Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1531833Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1531995Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1532158Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1532330Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_copy_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1532493Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1532650Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1532809Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_permute_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1532973Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pinverse_cuda_complex128 SKIPPED [0.0008s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1533132Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pinverse_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1533292Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pinverse_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1533447Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polar_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.1533584Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_float64 PASSED [0.0098s] [ 11%] 2025-12-04T12:25:43.1533817Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_0_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 11%] 2025-12-04T12:25:43.1533956Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_1_cuda_float16 PASSED [0.0077s] [ 11%] 2025-12-04T12:25:43.1534202Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_1_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 11%] 2025-12-04T12:25:43.1534434Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_1_cuda_int64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 11%] 2025-12-04T12:25:43.1534659Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_1_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 11%] 2025-12-04T12:25:43.1534801Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_bfloat16 PASSED [0.1662s] [ 11%] 2025-12-04T12:25:43.1534937Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_float16 PASSED [0.0068s] [ 11%] 2025-12-04T12:25:43.1535075Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_float64 PASSED [0.0064s] [ 11%] 2025-12-04T12:25:43.1535319Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_2_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 11%] 2025-12-04T12:25:43.1535469Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_3_cuda_float16 PASSED [0.9482s] [ 11%] 2025-12-04T12:25:43.1535603Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_3_cuda_float64 PASSED [0.0086s] [ 11%] 2025-12-04T12:25:43.1535829Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_bool SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 11%] 2025-12-04T12:25:43.1535967Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_float32 PASSED [0.0085s] [ 12%] 2025-12-04T12:25:43.1536193Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T12:25:43.1536436Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_polygamma_polygamma_n_4_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T12:25:43.1536604Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1536765Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1536921Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_positive_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1537039Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_complex128 PASSED [0.9581s] [ 12%] 2025-12-04T12:25:43.1537152Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_complex32 PASSED [0.4906s] [ 12%] 2025-12-04T12:25:43.1537267Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_complex64 PASSED [0.0060s] [ 12%] 2025-12-04T12:25:43.1537379Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_float16 PASSED [0.0057s] [ 12%] 2025-12-04T12:25:43.1537489Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_pow_cuda_int16 PASSED [0.0057s] [ 12%] 2025-12-04T12:25:43.1537647Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1537805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1537957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1538197Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1538366Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_prod_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1538479Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_bfloat16 PASSED [0.0166s] [ 12%] 2025-12-04T12:25:43.1538586Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_bool PASSED [0.0151s] [ 12%] 2025-12-04T12:25:43.1538692Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_int64 PASSED [0.0151s] [ 12%] 2025-12-04T12:25:43.1538798Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_put_cuda_int8 PASSED [0.0149s] [ 12%] 2025-12-04T12:25:43.1538959Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_quantile_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1539080Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_float16 PASSED [0.0030s] [ 12%] 2025-12-04T12:25:43.1539195Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_float32 PASSED [0.9695s] [ 12%] 2025-12-04T12:25:43.1539423Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T12:25:43.1539644Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rad2deg_cuda_int32 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T12:25:43.1539808Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1539973Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1540134Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1540296Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rand_like_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1540466Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1540630Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_like_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1540790Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randint_like_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1540951Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1541108Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1541265Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1541430Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_like_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1541594Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_like_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1541754Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_like_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1541916Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_randn_like_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1542073Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1542225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1542396Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1542551Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1542705Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1542859Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1543015Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1543168Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1543333Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1543497Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_ravel_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1543651Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1543805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1543959Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1544110Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1544261Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_real_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1544500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T12:25:43.1544621Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reciprocal_cuda_float32 PASSED [0.9461s] [ 12%] 2025-12-04T12:25:43.1544740Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_float32 PASSED [0.0085s] [ 12%] 2025-12-04T12:25:43.1544857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_float64 PASSED [0.0063s] [ 12%] 2025-12-04T12:25:43.1544974Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_int32 PASSED [0.0061s] [ 12%] 2025-12-04T12:25:43.1545089Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_remainder_cuda_uint8 PASSED [0.0061s] [ 12%] 2025-12-04T12:25:43.1545207Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_renorm_cuda_complex128 PASSED [0.0265s] [ 12%] 2025-12-04T12:25:43.1545321Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_renorm_cuda_complex64 PASSED [0.0065s] [ 12%] 2025-12-04T12:25:43.1545440Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_renorm_cuda_float16 PASSED [0.0069s] [ 12%] 2025-12-04T12:25:43.1545600Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1545763Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1545915Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1546096Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1546282Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1546454Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1546625Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_repeat_interleave_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1546788Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1546950Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_as_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1547112Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1547272Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1547441Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1547609Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_reshape_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1547728Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_bfloat16 PASSED [0.9308s] [ 12%] 2025-12-04T12:25:43.1547842Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_bool PASSED [0.0049s] [ 12%] 2025-12-04T12:25:43.1547957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_float32 PASSED [0.0036s] [ 12%] 2025-12-04T12:25:43.1548099Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize__cuda_int64 PASSED [0.9377s] [ 12%] 2025-12-04T12:25:43.1548220Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_bfloat16 PASSED [0.0055s] [ 12%] 2025-12-04T12:25:43.1548339Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_float16 PASSED [0.0039s] [ 12%] 2025-12-04T12:25:43.1548471Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_int64 PASSED [0.9289s] [ 12%] 2025-12-04T12:25:43.1548583Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resize_as__cuda_uint8 PASSED [0.0052s] [ 12%] 2025-12-04T12:25:43.1548751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1548915Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1549076Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_conj_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1549241Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1549401Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1549565Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1549729Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1549888Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1550049Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1550225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1550389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_resolve_neg_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1550547Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1550702Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1550855Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_roll_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1551011Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1551168Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1551337Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1551505Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rot90_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1551618Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_float16 PASSED [0.9384s] [ 12%] 2025-12-04T12:25:43.1551730Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_int32 PASSED [0.0044s] [ 12%] 2025-12-04T12:25:43.1551841Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_int64 PASSED [0.9297s] [ 12%] 2025-12-04T12:25:43.1551951Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_int8 PASSED [0.0043s] [ 12%] 2025-12-04T12:25:43.1552062Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_cuda_uint8 PASSED [0.9211s] [ 12%] 2025-12-04T12:25:43.1552193Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_0_cuda_bfloat16 PASSED [0.0056s] [ 12%] 2025-12-04T12:25:43.1552332Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_3_cuda_float32 PASSED [0.0040s] [ 12%] 2025-12-04T12:25:43.1552461Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_3_cuda_float64 PASSED [0.9226s] [ 12%] 2025-12-04T12:25:43.1552590Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_round_decimals_neg_3_cuda_float64 PASSED [0.0056s] [ 12%] 2025-12-04T12:25:43.1552803Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsqrt_cuda_complex32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T12:25:43.1553016Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsqrt_cuda_complex64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T12:25:43.1553129Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsqrt_cuda_float32 PASSED [0.9313s] [ 12%] 2025-12-04T12:25:43.1553335Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsqrt_cuda_int32 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T12:25:43.1553540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsqrt_cuda_int8 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 12%] 2025-12-04T12:25:43.1553699Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1553858Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1554012Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1554175Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_rsub_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1554351Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1554526Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1554689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scalar_tensor_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.1554811Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_bfloat16 PASSED [0.0148s] [ 12%] 2025-12-04T12:25:43.1554934Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_complex64 PASSED [0.0071s] [ 12%] 2025-12-04T12:25:43.1555057Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_float16 PASSED [0.0070s] [ 12%] 2025-12-04T12:25:43.1555175Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_add_cuda_int64 PASSED [0.0070s] [ 12%] 2025-12-04T12:25:43.1555316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_bfloat16 PASSED [0.0163s] [ 12%] 2025-12-04T12:25:43.1555451Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_complex128 PASSED [0.0120s] [ 12%] 2025-12-04T12:25:43.1555566Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_float16 PASSED [0.0160s] [ 12%] 2025-12-04T12:25:43.1555679Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_cuda_float32 PASSED [0.0160s] [ 12%] 2025-12-04T12:25:43.1555810Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amax_cuda_int64 PASSED [0.9406s] [ 12%] 2025-12-04T12:25:43.1555940Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_float16 PASSED [0.0158s] [ 12%] 2025-12-04T12:25:43.1556070Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_float32 PASSED [0.0141s] [ 12%] 2025-12-04T12:25:43.1556200Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_float64 PASSED [0.0139s] [ 12%] 2025-12-04T12:25:43.1556338Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_int64 PASSED [0.0139s] [ 12%] 2025-12-04T12:25:43.1556463Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_int8 PASSED [0.0139s] [ 12%] 2025-12-04T12:25:43.1556591Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_amin_cuda_uint8 PASSED [0.0139s] [ 13%] 2025-12-04T12:25:43.1556717Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_int16 PASSED [0.0148s] [ 13%] 2025-12-04T12:25:43.1556844Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_int64 PASSED [0.0146s] [ 13%] 2025-12-04T12:25:43.1556972Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_mean_cuda_uint8 PASSED [0.0145s] [ 13%] 2025-12-04T12:25:43.1557103Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_bfloat16 PASSED [0.0145s] [ 13%] 2025-12-04T12:25:43.1557236Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_float16 PASSED [0.0146s] [ 13%] 2025-12-04T12:25:43.1557360Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_int32 PASSED [0.0140s] [ 13%] 2025-12-04T12:25:43.1557486Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_int8 PASSED [0.0138s] [ 13%] 2025-12-04T12:25:43.1557611Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_prod_cuda_uint8 PASSED [0.0137s] [ 13%] 2025-12-04T12:25:43.1557741Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_float32 PASSED [0.0138s] [ 13%] 2025-12-04T12:25:43.1557865Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_int64 PASSED [0.0137s] [ 13%] 2025-12-04T12:25:43.1558006Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_scatter_reduce_sum_cuda_uint8 PASSED [0.0137s] [ 13%] 2025-12-04T12:25:43.1558215Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_searchsorted_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1558380Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_searchsorted_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1558536Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1558695Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1558853Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1559008Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1559195Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_scatter_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1559373Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_scatter_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1559537Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_select_scatter_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1559648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_float32 PASSED [0.9361s] [ 13%] 2025-12-04T12:25:43.1559759Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sgn_cuda_int16 PASSED [0.0043s] [ 13%] 2025-12-04T12:25:43.1559918Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1560074Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1560230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1560400Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_short_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1560617Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_complex32 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1560733Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_float16 PASSED [0.0045s] [ 13%] 2025-12-04T12:25:43.1560848Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_float64 PASSED [0.9226s] [ 13%] 2025-12-04T12:25:43.1561059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_int32 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1561268Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_int64 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1561473Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sigmoid_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1561587Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_bfloat16 PASSED [0.9221s] [ 13%] 2025-12-04T12:25:43.1561697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_bool PASSED [0.0043s] [ 13%] 2025-12-04T12:25:43.1561810Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_float16 PASSED [0.9351s] [ 13%] 2025-12-04T12:25:43.1561920Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_float32 PASSED [0.0044s] [ 13%] 2025-12-04T12:25:43.1562047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sign_cuda_float64 PASSED [0.9282s] [ 13%] 2025-12-04T12:25:43.1562230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_blackman_cuda_float32 SKIPPED [0.0016s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1562414Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_blackman_cuda_float64 SKIPPED [0.0013s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1562591Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_cosine_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1562771Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_gaussian_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1562949Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_gaussian_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1563139Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_general_cosine_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1563329Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_hamming_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1563516Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signal_windows_nuttall_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1563674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1563833Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1563991Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1564145Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1564319Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1564473Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_signbit_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1564676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1564885Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_complex128 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1565097Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1565306Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1565419Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_float16 PASSED [0.9372s] [ 13%] 2025-12-04T12:25:43.1565618Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sin_cuda_int8 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1565731Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_bfloat16 PASSED [0.1926s] [ 13%] 2025-12-04T12:25:43.1565934Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_bool SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1566046Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_float64 PASSED [0.9360s] [ 13%] 2025-12-04T12:25:43.1566262Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1566463Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinc_cuda_int8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1566578Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_bfloat16 PASSED [0.0080s] [ 13%] 2025-12-04T12:25:43.1566689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_float16 PASSED [0.9288s] [ 13%] 2025-12-04T12:25:43.1566801Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_float64 PASSED [0.0042s] [ 13%] 2025-12-04T12:25:43.1567003Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1567206Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sinh_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.1567372Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1567540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1567697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1567850Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1568016Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1568208Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_slice_scatter_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1568386Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1568561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_with_dtype_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1568740Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_with_dtype_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1568916Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_with_dtype_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1569088Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_softmax_with_dtype_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1569248Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1569399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sort_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1569553Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sparse_mm_reduce_cuda_bfloat16 SKIPPED [0.0007s] (Only runs on cpu) [ 13%] 2025-12-04T12:25:43.1569703Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sparse_mm_reduce_cuda_float64 SKIPPED [0.0005s] (Only runs on cpu) [ 13%] 2025-12-04T12:25:43.1569879Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sparse_sampled_addmm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1570044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_airy_ai_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1570210Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_airy_ai_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1570394Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1570563Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1570727Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1570892Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j0_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1571058Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1571222Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1571405Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_j1_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1571586Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1571755Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y0_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1571919Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y0_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1572083Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1572251Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1572436Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1572602Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_bessel_y1_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1572792Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_t_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1572981Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_t_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1573170Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_t_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1573362Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_t_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1573549Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1573735Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1573920Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1574105Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_u_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1574290Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1574488Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1574674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1574860Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_v_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1575047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_w_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1575238Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_w_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1575423Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_chebyshev_polynomial_w_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1575594Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1575767Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_entr_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1575948Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_h_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1576131Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_h_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1576310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_h_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1576500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1576695Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1576878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1577064Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1577247Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_hermite_polynomial_he_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1577412Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1577574Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1577736Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1577895Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1578053Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1578280Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i0e_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1578437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1578622Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1578781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1578941Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_i1e_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1579125Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1579314Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.1579498Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1579697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1579901Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1580086Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_laguerre_polynomial_l_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1580269Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_legendre_polynomial_p_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1580433Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_log_ndtr_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1580602Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_log_ndtr_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1580797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i0_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1580977Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i0_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1581153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i0_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1581333Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i1_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1581511Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_i1_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1581692Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k0_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1581871Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k0_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1582054Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k1_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1582230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k1_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1582408Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_modified_bessel_k1_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1582574Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1582748Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1582916Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1583076Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtr_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1583237Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_ndtri_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1583437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1583638Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1583840Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1584042Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_scaled_modified_bessel_k1_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1584238Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1584439Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1584638Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1584834Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1585046Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1585241Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1585435Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1585629Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1585825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1586009Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_spherical_bessel_j0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1586193Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_spherical_bessel_j0_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1586374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_spherical_bessel_j0_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1586539Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1586707Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1586886Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_xlog1py_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1587053Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1587213Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_special_zeta_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1587372Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1587528Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1587702Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1587882Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1588059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1588273Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1588437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_list_args_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1588615Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1588796Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1588977Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1589166Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1589339Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1589507Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1589674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1589836Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_split_with_sizes_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1590043Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T12:25:43.1590258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_complex64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T12:25:43.1590371Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sqrt_cuda_float16 PASSED [0.9346s] [ 14%] 2025-12-04T12:25:43.1590490Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_complex64 PASSED [0.0063s] [ 14%] 2025-12-04T12:25:43.1590605Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_float16 PASSED [0.0041s] [ 14%] 2025-12-04T12:25:43.1590720Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_float32 PASSED [0.9238s] [ 14%] 2025-12-04T12:25:43.1590847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_int8 PASSED [0.0058s] [ 14%] 2025-12-04T12:25:43.1590962Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_square_cuda_uint8 PASSED [0.0041s] [ 14%] 2025-12-04T12:25:43.1591129Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1591301Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1591466Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1591628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1591747Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_complex128 PASSED [0.0056s] [ 14%] 2025-12-04T12:25:43.1591868Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_complex32 PASSED [0.0053s] [ 14%] 2025-12-04T12:25:43.1591997Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_float64 PASSED [0.0053s] [ 14%] 2025-12-04T12:25:43.1592130Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_int32 PASSED [0.0052s] [ 14%] 2025-12-04T12:25:43.1592244Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_cuda_int64 PASSED [0.0052s] [ 14%] 2025-12-04T12:25:43.1592373Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_complex64 PASSED [0.0047s] [ 14%] 2025-12-04T12:25:43.1592501Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_float32 PASSED [0.0045s] [ 14%] 2025-12-04T12:25:43.1592623Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_int32 PASSED [0.9479s] [ 14%] 2025-12-04T12:25:43.1592746Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_int8 PASSED [0.0063s] [ 14%] 2025-12-04T12:25:43.1592870Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_squeeze_multiple_cuda_uint8 PASSED [0.0048s] [ 14%] 2025-12-04T12:25:43.1593038Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stack_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1593197Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1593353Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1593508Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1593663Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1593813Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1593974Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1594145Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_unbiased_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1594316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_unbiased_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1594486Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_mean_unbiased_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1594651Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1594819Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1594993Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_std_unbiased_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1595151Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_stft_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1595264Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_complex128 PASSED [0.0065s] [ 14%] 2025-12-04T12:25:43.1595379Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_float32 PASSED [0.0062s] [ 14%] 2025-12-04T12:25:43.1595489Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_float64 PASSED [0.0061s] [ 14%] 2025-12-04T12:25:43.1595598Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_int32 PASSED [0.0061s] [ 14%] 2025-12-04T12:25:43.1595702Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sub_cuda_int8 PASSED [0.0062s] [ 14%] 2025-12-04T12:25:43.1595880Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1596041Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1596191Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1596354Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1596515Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1596676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_sum_to_size_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1596844Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_svd_lowrank_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1597028Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_svd_lowrank_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1597190Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_svd_lowrank_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1597352Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_svd_lowrank_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1597508Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1597662Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1597818Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1597973Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1598162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1598314Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1598422Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_float32 PASSED [0.0035s] [ 14%] 2025-12-04T12:25:43.1598531Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_int16 PASSED [0.9340s] [ 14%] 2025-12-04T12:25:43.1598639Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_t_cuda_uint8 PASSED [0.0048s] [ 14%] 2025-12-04T12:25:43.1598815Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_along_dim_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1598987Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_along_dim_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1599152Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_along_dim_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1599310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1599463Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1599612Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_take_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1599723Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_bfloat16 PASSED [0.0077s] [ 14%] 2025-12-04T12:25:43.1599954Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T12:25:43.1600177Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T12:25:43.1600385Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_complex32 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T12:25:43.1600594Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T12:25:43.1600794Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tan_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T12:25:43.1600998Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_bool SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T12:25:43.1601220Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T12:25:43.1601336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_float32 PASSED [0.9471s] [ 14%] 2025-12-04T12:25:43.1601447Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_float64 PASSED [0.0042s] [ 14%] 2025-12-04T12:25:43.1601648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_int32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T12:25:43.1601848Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tanh_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 14%] 2025-12-04T12:25:43.1602018Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.1602181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1602346Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1602506Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensor_split_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1602666Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tensordot_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1602820Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tile_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1602985Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1603153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1603312Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1603475Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1603632Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_to_sparse_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1603788Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_topk_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1603940Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_topk_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1604121Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch__scaled_mm_v2_cuda_float8_e4m3fn SKIPPED [0.0006s] (Requires CUDA SM >= 8.9) [ 15%] 2025-12-04T12:25:43.1604381Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_float16 SKIPPED [0.0005s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 15%] 2025-12-04T12:25:43.1604575Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1604769Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1604928Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1605083Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1605254Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1605416Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1605570Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trace_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1605740Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1605911Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1606085Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1606255Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1606423Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1606590Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_copy_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1606710Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_bfloat16 PASSED [0.0057s] [ 15%] 2025-12-04T12:25:43.1606827Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_bool PASSED [0.0067s] [ 15%] 2025-12-04T12:25:43.1606964Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_complex128 PASSED [0.0057s] [ 15%] 2025-12-04T12:25:43.1607088Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_complex32 PASSED [0.0054s] [ 15%] 2025-12-04T12:25:43.1607209Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_complex64 PASSED [0.0053s] [ 15%] 2025-12-04T12:25:43.1607325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_float32 PASSED [0.0052s] [ 15%] 2025-12-04T12:25:43.1607441Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_float64 PASSED [0.0053s] [ 15%] 2025-12-04T12:25:43.1607556Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_int32 PASSED [0.0053s] [ 15%] 2025-12-04T12:25:43.1607669Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_transpose_cuda_int8 PASSED [0.0053s] [ 15%] 2025-12-04T12:25:43.1607836Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1608017Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1608233Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1608394Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1608555Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapezoid_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1608715Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1608873Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1609031Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1609206Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1609362Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trapz_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1609537Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triangular_solve_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1609651Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_bfloat16 PASSED [0.0145s] [ 15%] 2025-12-04T12:25:43.1609760Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_bool PASSED [0.0078s] [ 15%] 2025-12-04T12:25:43.1609877Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_complex128 PASSED [0.0077s] [ 15%] 2025-12-04T12:25:43.1609991Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_complex64 PASSED [0.0077s] [ 15%] 2025-12-04T12:25:43.1610103Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_tril_cuda_int64 PASSED [0.0076s] [ 15%] 2025-12-04T12:25:43.1610209Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_bool PASSED [0.9488s] [ 15%] 2025-12-04T12:25:43.1610320Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_complex32 PASSED [0.0099s] [ 15%] 2025-12-04T12:25:43.1610430Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_float64 PASSED [0.0080s] [ 15%] 2025-12-04T12:25:43.1610540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_int16 PASSED [0.0079s] [ 15%] 2025-12-04T12:25:43.1610646Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_int32 PASSED [0.0077s] [ 15%] 2025-12-04T12:25:43.1610756Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_int64 PASSED [0.0077s] [ 15%] 2025-12-04T12:25:43.1610893Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_cuda_uint8 PASSED [0.9394s] [ 15%] 2025-12-04T12:25:43.1611059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_triu_indices_cuda_int32 SKIPPED [0.0015s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1611183Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_bfloat16 PASSED [0.9303s] [ 15%] 2025-12-04T12:25:43.1611408Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_complex128 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T12:25:43.1611631Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_complex32 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T12:25:43.1611847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_true_divide_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 15%] 2025-12-04T12:25:43.1611964Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_bfloat16 PASSED [0.9417s] [ 15%] 2025-12-04T12:25:43.1612090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_float32 PASSED [0.0042s] [ 15%] 2025-12-04T12:25:43.1612218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_trunc_cuda_int64 PASSED [0.9475s] [ 15%] 2025-12-04T12:25:43.1612385Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_complex128 SKIPPED [0.0016s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1612544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_int16 SKIPPED [0.0013s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1612705Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_copy_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1612868Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1613033Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1613204Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1613361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1613515Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1613670Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unbind_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1613825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1613992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unflatten_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1614157Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1614318Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1614481Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1614644Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1614803Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_copy_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1614968Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1615133Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1615293Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1615457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1615615Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1615771Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1615925Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1616090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1616255Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unfold_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1616375Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_uniform_cuda_bfloat16 PASSED [0.9454s] [ 15%] 2025-12-04T12:25:43.1616494Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_uniform_cuda_complex128 PASSED [0.0050s] [ 15%] 2025-12-04T12:25:43.1616610Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_uniform_cuda_float64 PASSED [0.0037s] [ 15%] 2025-12-04T12:25:43.1616784Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1616960Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1617141Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_consecutive_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1617294Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1617455Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1617611Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1617769Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1617922Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_uint16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1618119Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_uint32 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1618274Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unique_cuda_uint64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1618437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unravel_index_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1618604Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1618766Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1618930Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1619111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_chunk_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1619278Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1619438Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1619600Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsafe_split_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1619772Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1619938Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1620116Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1620258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_complex64 PASSED [0.0058s] [ 15%] 2025-12-04T12:25:43.1620378Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_unsqueeze_cuda_float16 PASSED [0.0056s] [ 15%] 2025-12-04T12:25:43.1620538Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1620693Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1620848Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1621002Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1621179Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1621358Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_unbiased_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1621527Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_mean_unbiased_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1621695Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_unbiased_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1621862Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_unbiased_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1622033Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_unbiased_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1622200Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_var_unbiased_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1622360Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vdot_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1622513Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vdot_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1622682Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_complex_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1622847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_complex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1623024Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1623188Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1623344Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1623501Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_as_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.1623655Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1623816Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1623974Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1624147Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1624316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1624469Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1624623Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1624779Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1624930Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_view_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1625092Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1625275Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1625431Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1625589Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1625744Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1625898Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vsplit_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1626055Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1626211Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1632967Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1633180Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1633342Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1633503Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1633660Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1633878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_vstack_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1634043Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1634203Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1634362Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1634517Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1634670Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_where_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1634785Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_float32 PASSED [0.0341s] [ 16%] 2025-12-04T12:25:43.1634917Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_float64 PASSED [0.0072s] [ 16%] 2025-12-04T12:25:43.1635142Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 16%] 2025-12-04T12:25:43.1635346Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 16%] 2025-12-04T12:25:43.1635548Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_xlogy_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 16%] 2025-12-04T12:25:43.1635663Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zero__cuda_float16 PASSED [0.9419s] [ 16%] 2025-12-04T12:25:43.1635773Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zero__cuda_float32 PASSED [0.0052s] [ 16%] 2025-12-04T12:25:43.1635950Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1636110Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_complex64 SKIPPED [0.0012s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1636262Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1636417Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1636572Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1636740Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1636904Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1637066Z test_meta.py::TestMetaCUDA::test_dispatch_meta_inplace_zeros_like_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.1637178Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_complex64 PASSED [0.9456s] [ 16%] 2025-12-04T12:25:43.1637291Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_float64 PASSED [0.0044s] [ 16%] 2025-12-04T12:25:43.1637396Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_int64 PASSED [0.9547s] [ 16%] 2025-12-04T12:25:43.1637503Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_H_cuda_int8 PASSED [0.0044s] [ 16%] 2025-12-04T12:25:43.1637612Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_bfloat16 PASSED [0.9343s] [ 16%] 2025-12-04T12:25:43.1637739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_complex128 PASSED [0.0045s] [ 16%] 2025-12-04T12:25:43.1637848Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_float32 PASSED [0.9276s] [ 16%] 2025-12-04T12:25:43.1637956Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_int16 PASSED [0.0043s] [ 16%] 2025-12-04T12:25:43.1638061Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_int32 PASSED [0.9312s] [ 16%] 2025-12-04T12:25:43.1638228Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_T_cuda_int8 PASSED [0.0045s] [ 16%] 2025-12-04T12:25:43.1638353Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_complex32 PASSED [0.0099s] [ 16%] 2025-12-04T12:25:43.1638471Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___getitem___cuda_int16 PASSED [0.0083s] [ 16%] 2025-12-04T12:25:43.1638594Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_complex128 PASSED [0.0094s] [ 16%] 2025-12-04T12:25:43.1638708Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_float16 PASSED [0.0089s] [ 16%] 2025-12-04T12:25:43.1638846Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_float32 PASSED [0.0088s] [ 16%] 2025-12-04T12:25:43.1638973Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_int16 PASSED [0.0087s] [ 16%] 2025-12-04T12:25:43.1639089Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_int64 PASSED [0.0088s] [ 16%] 2025-12-04T12:25:43.1639201Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_int8 PASSED [0.0088s] [ 16%] 2025-12-04T12:25:43.1639313Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___radd___cuda_uint8 PASSED [0.0087s] [ 16%] 2025-12-04T12:25:43.1639428Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rand___cuda_bool PASSED [0.0087s] [ 16%] 2025-12-04T12:25:43.1639539Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rand___cuda_int16 PASSED [0.0086s] [ 16%] 2025-12-04T12:25:43.1639651Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rand___cuda_int64 PASSED [0.0086s] [ 16%] 2025-12-04T12:25:43.1639782Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_bfloat16 PASSED [0.0126s] [ 16%] 2025-12-04T12:25:43.1639894Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_bool PASSED [0.0124s] [ 16%] 2025-12-04T12:25:43.1640015Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_complex64 PASSED [0.0126s] [ 16%] 2025-12-04T12:25:43.1640128Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_float64 PASSED [0.0122s] [ 16%] 2025-12-04T12:25:43.1640240Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rdiv___cuda_uint8 PASSED [0.0124s] [ 16%] 2025-12-04T12:25:43.1640361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmatmul___cuda_bfloat16 PASSED [0.0356s] [ 16%] 2025-12-04T12:25:43.1640482Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmatmul___cuda_float16 PASSED [0.0226s] [ 16%] 2025-12-04T12:25:43.1640603Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmatmul___cuda_float32 PASSED [1.1919s] [ 16%] 2025-12-04T12:25:43.1640721Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmod___cuda_bfloat16 PASSED [0.0095s] [ 16%] 2025-12-04T12:25:43.1640835Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmod___cuda_float32 PASSED [0.0089s] [ 16%] 2025-12-04T12:25:43.1640945Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmod___cuda_int32 PASSED [0.0088s] [ 16%] 2025-12-04T12:25:43.1641061Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_bfloat16 PASSED [0.0089s] [ 16%] 2025-12-04T12:25:43.1641171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_bool PASSED [0.0087s] [ 16%] 2025-12-04T12:25:43.1641289Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_complex128 PASSED [0.0090s] [ 16%] 2025-12-04T12:25:43.1641420Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_complex64 PASSED [0.0088s] [ 16%] 2025-12-04T12:25:43.1641537Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_float16 PASSED [0.0088s] [ 16%] 2025-12-04T12:25:43.1641648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rmul___cuda_int64 PASSED [0.0087s] [ 16%] 2025-12-04T12:25:43.1641759Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___ror___cuda_int32 PASSED [0.0088s] [ 16%] 2025-12-04T12:25:43.1641868Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___ror___cuda_int64 PASSED [0.0087s] [ 16%] 2025-12-04T12:25:43.1641981Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rpow___cuda_int16 PASSED [0.0088s] [ 16%] 2025-12-04T12:25:43.1642092Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rsub___cuda_int8 PASSED [0.0089s] [ 16%] 2025-12-04T12:25:43.1642203Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rsub___cuda_uint8 PASSED [0.0088s] [ 16%] 2025-12-04T12:25:43.1642314Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace___rxor___cuda_int8 PASSED [0.0089s] [ 16%] 2025-12-04T12:25:43.1642447Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_bfloat16 PASSED [0.0230s] [ 16%] 2025-12-04T12:25:43.1642581Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_bool PASSED [0.0111s] [ 16%] 2025-12-04T12:25:43.1642704Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_complex128 PASSED [0.0114s] [ 16%] 2025-12-04T12:25:43.1642821Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__chunk_cat_cuda_float16 PASSED [0.0111s] [ 16%] 2025-12-04T12:25:43.1642943Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_bfloat16 PASSED [0.0084s] [ 16%] 2025-12-04T12:25:43.1643066Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_abs_cuda_complex64 PASSED [0.0098s] [ 16%] 2025-12-04T12:25:43.1643194Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_acos_cuda_complex128 PASSED [0.0083s] [ 16%] 2025-12-04T12:25:43.1643318Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_acos_cuda_float32 PASSED [0.0081s] [ 16%] 2025-12-04T12:25:43.1643449Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_acos_cuda_int32 PASSED [0.0082s] [ 16%] 2025-12-04T12:25:43.1643571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_bfloat16 PASSED [0.0509s] [ 16%] 2025-12-04T12:25:43.1643694Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_complex64 PASSED [0.0639s] [ 16%] 2025-12-04T12:25:43.1643815Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_float32 PASSED [0.0510s] [ 16%] 2025-12-04T12:25:43.1643933Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_int16 PASSED [0.0387s] [ 16%] 2025-12-04T12:25:43.1644049Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_add_cuda_int8 PASSED [0.0384s] [ 16%] 2025-12-04T12:25:43.1644169Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_bool XFAIL [0.0054s] [ 16%] 2025-12-04T12:25:43.1644304Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_complex64 PASSED [1.0887s] [ 16%] 2025-12-04T12:25:43.1644431Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcdiv_cuda_float16 PASSED [0.0994s] [ 16%] 2025-12-04T12:25:43.1644564Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_complex128 PASSED [0.1305s] [ 16%] 2025-12-04T12:25:43.1644690Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_float32 PASSED [0.1002s] [ 16%] 2025-12-04T12:25:43.1644815Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_int64 PASSED [0.0690s] [ 16%] 2025-12-04T12:25:43.1684457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_addcmul_cuda_int8 PASSED [0.0681s] [ 16%] 2025-12-04T12:25:43.1684627Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_complex128 PASSED [0.0086s] [ 16%] 2025-12-04T12:25:43.1684796Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_float64 PASSED [0.0081s] [ 16%] 2025-12-04T12:25:43.1684923Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_int32 PASSED [0.0083s] [ 16%] 2025-12-04T12:25:43.1685044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_asin_cuda_int64 PASSED [0.0083s] [ 16%] 2025-12-04T12:25:43.1685163Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_bool PASSED [0.0084s] [ 16%] 2025-12-04T12:25:43.1685281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_int16 PASSED [0.0083s] [ 16%] 2025-12-04T12:25:43.1685398Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_int32 PASSED [0.0083s] [ 16%] 2025-12-04T12:25:43.1685516Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_int8 PASSED [0.0083s] [ 16%] 2025-12-04T12:25:43.1685636Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_atan_cuda_uint8 PASSED [0.0083s] [ 16%] 2025-12-04T12:25:43.1685774Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_float32 PASSED [0.0082s] [ 16%] 2025-12-04T12:25:43.1685908Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_int16 PASSED [0.0082s] [ 16%] 2025-12-04T12:25:43.1686025Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_ceil_cuda_int8 PASSED [0.0082s] [ 16%] 2025-12-04T12:25:43.1686150Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_bool XFAIL [0.0158s] [ 16%] 2025-12-04T12:25:43.1686281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_complex64 XFAIL [0.9545s] [ 16%] 2025-12-04T12:25:43.1686411Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_float16 PASSED [0.1159s] [ 16%] 2025-12-04T12:25:43.1686540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_float32 PASSED [0.1115s] [ 16%] 2025-12-04T12:25:43.1686667Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_float64 PASSED [0.1113s] [ 16%] 2025-12-04T12:25:43.1686813Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_int16 PASSED [0.0766s] [ 16%] 2025-12-04T12:25:43.1686939Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_int32 PASSED [0.0763s] [ 16%] 2025-12-04T12:25:43.1687064Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_int8 PASSED [0.0764s] [ 16%] 2025-12-04T12:25:43.1687188Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_max_cuda_uint8 PASSED [0.0763s] [ 16%] 2025-12-04T12:25:43.1687318Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_min_cuda_bfloat16 PASSED [0.1168s] [ 16%] 2025-12-04T12:25:43.1687442Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_clamp_min_cuda_int16 PASSED [0.0769s] [ 16%] 2025-12-04T12:25:43.1687565Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_bfloat16 PASSED [0.0092s] [ 16%] 2025-12-04T12:25:43.1687684Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_bool PASSED [0.0123s] [ 16%] 2025-12-04T12:25:43.1687808Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_float16 PASSED [0.0089s] [ 16%] 2025-12-04T12:25:43.1687926Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_int16 PASSED [0.0085s] [ 16%] 2025-12-04T12:25:43.1688044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_int32 PASSED [0.0087s] [ 16%] 2025-12-04T12:25:43.1688193Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_copy_cuda_int8 PASSED [0.0086s] [ 17%] 2025-12-04T12:25:43.1688316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_complex128 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1688440Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_complex64 PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1688574Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_float16 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1688697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_float64 PASSED [0.0081s] [ 17%] 2025-12-04T12:25:43.1688815Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_int16 PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1688932Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_int8 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1689048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cos_cuda_uint8 PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1689166Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_cosh_cuda_int16 PASSED [0.0085s] [ 17%] 2025-12-04T12:25:43.1689281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_bool PASSED [0.0434s] [ 17%] 2025-12-04T12:25:43.1689404Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_complex128 PASSED [0.0703s] [ 17%] 2025-12-04T12:25:43.1689539Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_complex64 PASSED [0.0701s] [ 17%] 2025-12-04T12:25:43.1689669Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_int32 PASSED [0.0437s] [ 17%] 2025-12-04T12:25:43.1689785Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_div_cuda_int8 PASSED [0.0473s] [ 17%] 2025-12-04T12:25:43.1689904Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_float16 PASSED [0.0088s] [ 17%] 2025-12-04T12:25:43.1690024Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_float32 PASSED [0.0081s] [ 17%] 2025-12-04T12:25:43.1690139Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erf_cuda_int64 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1690257Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_bool PASSED [0.0096s] [ 17%] 2025-12-04T12:25:43.1690380Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_complex128 XFAIL [0.0035s] [ 17%] 2025-12-04T12:25:43.1690503Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_complex64 XFAIL [0.9664s] [ 17%] 2025-12-04T12:25:43.1690639Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_float16 PASSED [0.9454s] [ 17%] 2025-12-04T12:25:43.1690759Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_int64 PASSED [0.0090s] [ 17%] 2025-12-04T12:25:43.1690876Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_int8 PASSED [0.0086s] [ 17%] 2025-12-04T12:25:43.1690995Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_erfc_cuda_uint8 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1691115Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_bfloat16 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1691230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_bool PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1691352Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_complex64 PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1691475Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_float16 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1691595Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_float32 PASSED [0.0081s] [ 17%] 2025-12-04T12:25:43.1691712Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_exp_cuda_int32 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1691835Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_bfloat16 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1691953Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_bool PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1692076Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_float32 PASSED [0.0081s] [ 17%] 2025-12-04T12:25:43.1692205Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_expm1_cuda_int16 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1692332Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_floor_cuda_complex128 XFAIL [0.0037s] [ 17%] 2025-12-04T12:25:43.1692456Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_floor_cuda_complex64 XFAIL [0.9291s] [ 17%] 2025-12-04T12:25:43.1692578Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_floor_cuda_float16 PASSED [0.9357s] [ 17%] 2025-12-04T12:25:43.1692696Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_floor_cuda_int32 PASSED [0.0090s] [ 17%] 2025-12-04T12:25:43.1692817Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_bfloat16 PASSED [0.0196s] [ 17%] 2025-12-04T12:25:43.1692942Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_complex128 XFAIL [0.0037s] [ 17%] 2025-12-04T12:25:43.1693063Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_float64 PASSED [0.9517s] [ 17%] 2025-12-04T12:25:43.1693181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_int16 XFAIL [0.0041s] [ 17%] 2025-12-04T12:25:43.1693310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_frac_cuda_uint8 XFAIL [0.9435s] [ 17%] 2025-12-04T12:25:43.1693441Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_bfloat16 PASSED [1.0066s] [ 17%] 2025-12-04T12:25:43.1693564Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_complex64 PASSED [0.0797s] [ 17%] 2025-12-04T12:25:43.1693684Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_float16 PASSED [0.0613s] [ 17%] 2025-12-04T12:25:43.1693801Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lerp_cuda_int64 XFAIL [0.0054s] [ 17%] 2025-12-04T12:25:43.1693922Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_bool PASSED [0.9326s] [ 17%] 2025-12-04T12:25:43.1694047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_float64 PASSED [0.0088s] [ 17%] 2025-12-04T12:25:43.1694168Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_int32 PASSED [0.0125s] [ 17%] 2025-12-04T12:25:43.1694314Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_lgamma_cuda_int8 PASSED [0.0098s] [ 17%] 2025-12-04T12:25:43.1694440Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_complex64 PASSED [0.0088s] [ 17%] 2025-12-04T12:25:43.1694561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_float32 PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1694681Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_int16 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1694799Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log10_cuda_int64 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1694922Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_bfloat16 PASSED [0.0085s] [ 17%] 2025-12-04T12:25:43.1695043Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_int8 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1695167Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log1p_cuda_uint8 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1695291Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_bfloat16 PASSED [0.0085s] [ 17%] 2025-12-04T12:25:43.1695418Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_complex128 PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1695538Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_float16 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1695657Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_int16 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1695781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_int32 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1695899Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log2_cuda_int64 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1696036Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_float64 PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1696155Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_log_cuda_int32 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1696276Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_float16 PASSED [0.0120s] [ 17%] 2025-12-04T12:25:43.1696394Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_float64 PASSED [0.0051s] [ 17%] 2025-12-04T12:25:43.1696509Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_int16 PASSED [0.0050s] [ 17%] 2025-12-04T12:25:43.1696628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_int64 PASSED [0.0051s] [ 17%] 2025-12-04T12:25:43.1696743Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_max_cuda_int8 PASSED [0.0050s] [ 17%] 2025-12-04T12:25:43.1696875Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_bfloat16 PASSED [0.1120s] [ 17%] 2025-12-04T12:25:43.1697005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_bool XFAIL [0.0155s] [ 17%] 2025-12-04T12:25:43.1697155Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_float64 PASSED [1.0546s] [ 17%] 2025-12-04T12:25:43.1697280Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_int64 PASSED [0.0775s] [ 17%] 2025-12-04T12:25:43.1697405Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_int8 PASSED [0.0778s] [ 17%] 2025-12-04T12:25:43.1697526Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_maximum_cuda_uint8 PASSED [0.0762s] [ 17%] 2025-12-04T12:25:43.1697648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_minimum_cuda_bool XFAIL [0.0161s] [ 17%] 2025-12-04T12:25:43.1697769Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_minimum_cuda_int16 PASSED [1.0237s] [ 17%] 2025-12-04T12:25:43.1697895Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_bfloat16 PASSED [0.0579s] [ 17%] 2025-12-04T12:25:43.1698030Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_complex128 PASSED [0.0701s] [ 17%] 2025-12-04T12:25:43.1698186Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_float16 PASSED [0.0573s] [ 17%] 2025-12-04T12:25:43.1698305Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_float32 PASSED [0.0559s] [ 17%] 2025-12-04T12:25:43.1698428Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_float64 PASSED [0.0557s] [ 17%] 2025-12-04T12:25:43.1698544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_mul_cuda_int64 PASSED [0.0420s] [ 17%] 2025-12-04T12:25:43.1698672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_complex128 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1698795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_float32 PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1698913Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_int64 PASSED [0.0081s] [ 17%] 2025-12-04T12:25:43.1699032Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_neg_cuda_uint8 PASSED [0.0080s] [ 17%] 2025-12-04T12:25:43.1699156Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_complex128 PASSED [0.1544s] [ 17%] 2025-12-04T12:25:43.1699281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_float16 PASSED [0.0773s] [ 17%] 2025-12-04T12:25:43.1699397Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_int8 XFAIL [0.0036s] [ 17%] 2025-12-04T12:25:43.1699517Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_norm_cuda_uint8 XFAIL [0.9468s] [ 17%] 2025-12-04T12:25:43.1699640Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_complex64 PASSED [1.0005s] [ 17%] 2025-12-04T12:25:43.1699773Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_int64 PASSED [0.0342s] [ 17%] 2025-12-04T12:25:43.1699890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_int8 PASSED [0.0340s] [ 17%] 2025-12-04T12:25:43.1700012Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_pow_cuda_uint8 PASSED [0.0337s] [ 17%] 2025-12-04T12:25:43.1700145Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_complex64 PASSED [0.0085s] [ 17%] 2025-12-04T12:25:43.1700277Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_float16 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1700403Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_reciprocal_cuda_uint8 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1700528Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_float32 PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1700651Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_round_cuda_float64 PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1700787Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_bfloat16 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1700924Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_complex64 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1701047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_int16 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1701169Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_int64 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1701288Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_rsqrt_cuda_uint8 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1701420Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_complex64 PASSED [0.0076s] [ 17%] 2025-12-04T12:25:43.1701545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_float32 PASSED [0.0073s] [ 17%] 2025-12-04T12:25:43.1701672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sigmoid_cuda_uint8 PASSED [0.0075s] [ 17%] 2025-12-04T12:25:43.1701806Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sign_cuda_float16 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1701930Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sign_cuda_float64 PASSED [0.0081s] [ 17%] 2025-12-04T12:25:43.1702048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sign_cuda_int8 PASSED [0.0081s] [ 17%] 2025-12-04T12:25:43.1702171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_float32 PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1702288Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_int16 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1702409Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_int64 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1702525Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sin_cuda_int8 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1702645Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_bool PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1702768Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_float32 PASSED [0.0081s] [ 17%] 2025-12-04T12:25:43.1702889Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_int64 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1703006Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sinh_cuda_int8 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1703129Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_float16 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1703249Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_float32 PASSED [0.0081s] [ 17%] 2025-12-04T12:25:43.1703370Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_int32 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1703499Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sqrt_cuda_int64 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1703622Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_complex64 XFAIL [0.0110s] [ 17%] 2025-12-04T12:25:43.1703740Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_sub_cuda_int16 XFAIL [0.0090s] [ 17%] 2025-12-04T12:25:43.1703864Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_complex128 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1703985Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_float16 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1704107Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_float32 PASSED [0.0080s] [ 17%] 2025-12-04T12:25:43.1704227Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tan_cuda_int16 PASSED [0.0084s] [ 17%] 2025-12-04T12:25:43.1704350Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_bfloat16 PASSED [0.0083s] [ 17%] 2025-12-04T12:25:43.1704486Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_complex64 PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1704617Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_float64 PASSED [0.0081s] [ 17%] 2025-12-04T12:25:43.1704736Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_tanh_cuda_int16 PASSED [0.0082s] [ 17%] 2025-12-04T12:25:43.1704853Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_trunc_cuda_bool XFAIL [0.0035s] [ 17%] 2025-12-04T12:25:43.1704976Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_trunc_cuda_float16 PASSED [0.9580s] [ 17%] 2025-12-04T12:25:43.1705096Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_trunc_cuda_int64 PASSED [0.0088s] [ 18%] 2025-12-04T12:25:43.1705214Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_bool PASSED [0.0058s] [ 18%] 2025-12-04T12:25:43.1705335Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_float32 PASSED [0.0060s] [ 18%] 2025-12-04T12:25:43.1705454Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__foreach_zero_cuda_int8 PASSED [0.0058s] [ 18%] 2025-12-04T12:25:43.1705603Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__native_batch_norm_legit_cuda_float32 PASSED [0.0347s] [ 18%] 2025-12-04T12:25:43.1705745Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__native_batch_norm_legit_cuda_float64 PASSED [0.0134s] [ 18%] 2025-12-04T12:25:43.1705884Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__segment_reduce_lengths_cuda_float16 PASSED [1.0512s] [ 18%] 2025-12-04T12:25:43.1706021Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__segment_reduce_lengths_cuda_float32 PASSED [0.0731s] [ 18%] 2025-12-04T12:25:43.1706158Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__segment_reduce_offsets_cuda_bfloat16 PASSED [0.0623s] [ 18%] 2025-12-04T12:25:43.1706294Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__softmax_backward_data_cuda_float32 PASSED [0.0253s] [ 18%] 2025-12-04T12:25:43.1706423Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_bool PASSED [0.0140s] [ 18%] 2025-12-04T12:25:43.1706561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_complex128 PASSED [0.0136s] [ 18%] 2025-12-04T12:25:43.1706694Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_float16 PASSED [0.0134s] [ 18%] 2025-12-04T12:25:43.1706827Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_float64 PASSED [0.0135s] [ 18%] 2025-12-04T12:25:43.1706957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_cuda_int8 PASSED [0.0134s] [ 18%] 2025-12-04T12:25:43.1707106Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_bool PASSED [0.0145s] [ 18%] 2025-12-04T12:25:43.1707277Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_complex128 PASSED [0.0144s] [ 18%] 2025-12-04T12:25:43.1707433Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_complex64 PASSED [0.0143s] [ 18%] 2025-12-04T12:25:43.1707587Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_float32 PASSED [0.0144s] [ 18%] 2025-12-04T12:25:43.1707736Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int32 PASSED [0.0143s] [ 18%] 2025-12-04T12:25:43.1707886Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int64 PASSED [0.0143s] [ 18%] 2025-12-04T12:25:43.1708025Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__upsample_bilinear2d_aa_cuda_bfloat16 PASSED [0.0099s] [ 18%] 2025-12-04T12:25:43.1708187Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace__upsample_bilinear2d_aa_cuda_float32 PASSED [0.0056s] [ 18%] 2025-12-04T12:25:43.1708305Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_bfloat16 PASSED [0.9632s] [ 18%] 2025-12-04T12:25:43.1708434Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_complex128 PASSED [0.0046s] [ 18%] 2025-12-04T12:25:43.1708557Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_int16 PASSED [0.9351s] [ 18%] 2025-12-04T12:25:43.1708665Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_abs_cuda_uint8 PASSED [0.0045s] [ 18%] 2025-12-04T12:25:43.1708779Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_bfloat16 PASSED [0.0043s] [ 18%] 2025-12-04T12:25:43.1708890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_float16 PASSED [0.9410s] [ 18%] 2025-12-04T12:25:43.1709000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_int64 PASSED [0.0058s] [ 18%] 2025-12-04T12:25:43.1709110Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acos_cuda_uint8 PASSED [0.0040s] [ 18%] 2025-12-04T12:25:43.1709234Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_complex128 PASSED [0.0041s] [ 18%] 2025-12-04T12:25:43.1709347Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_float16 PASSED [0.9492s] [ 18%] 2025-12-04T12:25:43.1709472Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_float64 PASSED [0.0055s] [ 18%] 2025-12-04T12:25:43.1709583Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_int64 PASSED [0.0041s] [ 18%] 2025-12-04T12:25:43.1709694Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_acosh_cuda_uint8 PASSED [0.9569s] [ 18%] 2025-12-04T12:25:43.1709803Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_bfloat16 PASSED [0.0124s] [ 18%] 2025-12-04T12:25:43.1709913Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_bool PASSED [0.0102s] [ 18%] 2025-12-04T12:25:43.1710020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_int16 PASSED [0.0102s] [ 18%] 2025-12-04T12:25:43.1710129Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_int32 PASSED [0.0100s] [ 18%] 2025-12-04T12:25:43.1710236Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_int64 PASSED [0.9592s] [ 18%] 2025-12-04T12:25:43.1710345Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_add_cuda_int8 PASSED [0.0123s] [ 18%] 2025-12-04T12:25:43.1710462Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addbmm_cuda_bfloat16 PASSED [0.0081s] [ 18%] 2025-12-04T12:25:43.1710576Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addbmm_cuda_float16 PASSED [0.0074s] [ 18%] 2025-12-04T12:25:43.1710691Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addbmm_cuda_float32 PASSED [0.0075s] [ 18%] 2025-12-04T12:25:43.1710808Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcdiv_cuda_complex64 PASSED [0.3867s] [ 18%] 2025-12-04T12:25:43.1710925Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcdiv_cuda_float32 PASSED [0.0129s] [ 18%] 2025-12-04T12:25:43.1711056Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_complex128 PASSED [0.0129s] [ 18%] 2025-12-04T12:25:43.1711171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_int32 PASSED [0.0125s] [ 18%] 2025-12-04T12:25:43.1711284Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_int64 PASSED [0.0124s] [ 18%] 2025-12-04T12:25:43.1711397Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addcmul_cuda_uint8 PASSED [0.0124s] [ 18%] 2025-12-04T12:25:43.1711510Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_cuda_complex64 PASSED [0.0141s] [ 18%] 2025-12-04T12:25:43.1711622Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_cuda_float32 PASSED [0.0099s] [ 18%] 2025-12-04T12:25:43.1711753Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmm_decomposed_cuda_complex64 PASSED [0.0097s] [ 18%] 2025-12-04T12:25:43.1711871Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addmv_cuda_complex64 PASSED [0.0101s] [ 18%] 2025-12-04T12:25:43.1711983Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_bfloat16 PASSED [0.0090s] [ 18%] 2025-12-04T12:25:43.1712114Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_int32 PASSED [0.9447s] [ 18%] 2025-12-04T12:25:43.1712236Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_addr_cuda_int8 PASSED [0.0093s] [ 18%] 2025-12-04T12:25:43.1712361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_bfloat16 PASSED [0.9438s] [ 18%] 2025-12-04T12:25:43.1712484Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_complex128 PASSED [0.0048s] [ 18%] 2025-12-04T12:25:43.1712608Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_complex64 PASSED [0.0034s] [ 18%] 2025-12-04T12:25:43.1712727Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_float32 PASSED [0.9490s] [ 18%] 2025-12-04T12:25:43.1712846Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_int16 PASSED [0.0047s] [ 18%] 2025-12-04T12:25:43.1712965Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_int32 PASSED [0.0033s] [ 18%] 2025-12-04T12:25:43.1713091Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_alias_copy_cuda_int64 PASSED [0.9713s] [ 18%] 2025-12-04T12:25:43.1713207Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_complex128 PASSED [0.0214s] [ 18%] 2025-12-04T12:25:43.1713320Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_complex64 PASSED [0.9691s] [ 18%] 2025-12-04T12:25:43.1713431Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_float32 PASSED [0.0209s] [ 18%] 2025-12-04T12:25:43.1713539Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_int32 PASSED [0.0190s] [ 18%] 2025-12-04T12:25:43.1713648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_all_cuda_int8 PASSED [0.0189s] [ 18%] 2025-12-04T12:25:43.1713765Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_allclose_cuda_bfloat16 PASSED [0.0293s] [ 18%] 2025-12-04T12:25:43.1713887Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_allclose_cuda_complex64 PASSED [0.0291s] [ 18%] 2025-12-04T12:25:43.1714002Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_allclose_cuda_float64 PASSED [0.0281s] [ 18%] 2025-12-04T12:25:43.1714116Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amax_cuda_float64 PASSED [0.0123s] [ 18%] 2025-12-04T12:25:43.1714226Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amax_cuda_int16 PASSED [0.0121s] [ 18%] 2025-12-04T12:25:43.1714336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amax_cuda_int8 PASSED [0.0120s] [ 18%] 2025-12-04T12:25:43.1714449Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amin_cuda_bfloat16 PASSED [0.0133s] [ 18%] 2025-12-04T12:25:43.1714561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_amin_cuda_float64 PASSED [0.0121s] [ 18%] 2025-12-04T12:25:43.1714674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_bool PASSED [0.0091s] [ 18%] 2025-12-04T12:25:43.1714799Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_float32 PASSED [0.0047s] [ 18%] 2025-12-04T12:25:43.1714911Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_int16 PASSED [0.0047s] [ 18%] 2025-12-04T12:25:43.1715024Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_int32 PASSED [0.0046s] [ 18%] 2025-12-04T12:25:43.1715134Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_int64 PASSED [0.0045s] [ 18%] 2025-12-04T12:25:43.1715247Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_aminmax_cuda_int8 PASSED [0.0046s] [ 18%] 2025-12-04T12:25:43.1715357Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_bool PASSED [0.0028s] [ 18%] 2025-12-04T12:25:43.1715474Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_complex128 PASSED [1.1081s] [ 18%] 2025-12-04T12:25:43.1715588Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_int16 PASSED [0.0045s] [ 18%] 2025-12-04T12:25:43.1715700Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_angle_cuda_int8 PASSED [0.9529s] [ 18%] 2025-12-04T12:25:43.1715822Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_any_cuda_float16 PASSED [0.0177s] [ 18%] 2025-12-04T12:25:43.1715939Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_any_cuda_float32 PASSED [0.0149s] [ 18%] 2025-12-04T12:25:43.1716048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_any_cuda_int16 PASSED [0.0148s] [ 18%] 2025-12-04T12:25:43.1716162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_float16 PASSED [0.0136s] [ 18%] 2025-12-04T12:25:43.1716277Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_arange_cuda_float64 PASSED [0.0133s] [ 18%] 2025-12-04T12:25:43.1716390Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_float16 PASSED [0.0132s] [ 18%] 2025-12-04T12:25:43.1716506Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_float32 PASSED [0.9604s] [ 18%] 2025-12-04T12:25:43.1716615Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmax_cuda_int8 PASSED [0.0129s] [ 18%] 2025-12-04T12:25:43.1716740Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_bfloat16 PASSED [0.0140s] [ 18%] 2025-12-04T12:25:43.1716852Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_int32 PASSED [0.0081s] [ 18%] 2025-12-04T12:25:43.1716964Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argmin_cuda_uint8 PASSED [0.9915s] [ 18%] 2025-12-04T12:25:43.1717078Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argsort_cuda_float64 PASSED [0.2249s] [ 18%] 2025-12-04T12:25:43.1717198Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_bfloat16 PASSED [0.1132s] [ 18%] 2025-12-04T12:25:43.1717319Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_complex64 PASSED [0.9443s] [ 18%] 2025-12-04T12:25:43.1717433Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_int16 PASSED [0.0063s] [ 18%] 2025-12-04T12:25:43.1717546Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_argwhere_cuda_int8 PASSED [0.0045s] [ 18%] 2025-12-04T12:25:43.1717679Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_bfloat16 PASSED [0.0053s] [ 18%] 2025-12-04T12:25:43.1717802Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_bool PASSED [0.0048s] [ 18%] 2025-12-04T12:25:43.1717934Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_complex128 PASSED [0.0048s] [ 18%] 2025-12-04T12:25:43.1718186Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_complex32 PASSED [0.0048s] [ 18%] 2025-12-04T12:25:43.1718311Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_float32 PASSED [0.0048s] [ 18%] 2025-12-04T12:25:43.1718437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_copy_cuda_int32 PASSED [0.0048s] [ 18%] 2025-12-04T12:25:43.1718573Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_bfloat16 PASSED [0.9628s] [ 18%] 2025-12-04T12:25:43.1718698Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_complex128 PASSED [0.0056s] [ 18%] 2025-12-04T12:25:43.1718822Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_complex32 PASSED [0.0040s] [ 18%] 2025-12-04T12:25:43.1718942Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_float64 PASSED [0.9499s] [ 18%] 2025-12-04T12:25:43.1719057Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_int32 PASSED [0.0057s] [ 18%] 2025-12-04T12:25:43.1719173Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_cuda_int64 PASSED [0.0041s] [ 18%] 2025-12-04T12:25:43.1719307Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_bool PASSED [0.9485s] [ 18%] 2025-12-04T12:25:43.1719445Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_int8 PASSED [0.0049s] [ 18%] 2025-12-04T12:25:43.1719582Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_partial_views_cuda_uint8 PASSED [0.0035s] [ 18%] 2025-12-04T12:25:43.1719728Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_bfloat16 PASSED [0.0070s] [ 18%] 2025-12-04T12:25:43.1719874Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_complex128 PASSED [0.0067s] [ 18%] 2025-12-04T12:25:43.1720008Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_complex32 PASSED [0.0067s] [ 18%] 2025-12-04T12:25:43.1720137Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_complex64 PASSED [0.0066s] [ 18%] 2025-12-04T12:25:43.1720265Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_float64 PASSED [0.0066s] [ 18%] 2025-12-04T12:25:43.1720392Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_int32 PASSED [0.0066s] [ 18%] 2025-12-04T12:25:43.1720518Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_as_strided_scatter_cuda_uint8 PASSED [0.0066s] [ 18%] 2025-12-04T12:25:43.1720648Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_complex32 PASSED [0.0030s] [ 18%] 2025-12-04T12:25:43.1720759Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asin_cuda_uint8 PASSED [0.9593s] [ 18%] 2025-12-04T12:25:43.1720880Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_complex128 PASSED [0.0045s] [ 18%] 2025-12-04T12:25:43.1720994Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_complex32 PASSED [0.9649s] [ 18%] 2025-12-04T12:25:43.1721112Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_complex64 PASSED [0.0046s] [ 18%] 2025-12-04T12:25:43.1721223Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_float16 PASSED [0.9614s] [ 18%] 2025-12-04T12:25:43.1721336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_asinh_cuda_int64 PASSED [0.0046s] [ 18%] 2025-12-04T12:25:43.1721449Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_bfloat16 PASSED [0.0097s] [ 18%] 2025-12-04T12:25:43.1721562Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_float64 PASSED [0.0088s] [ 18%] 2025-12-04T12:25:43.1721672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_int32 PASSED [0.0088s] [ 18%] 2025-12-04T12:25:43.1721782Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_int64 PASSED [0.0087s] [ 18%] 2025-12-04T12:25:43.1721890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan2_cuda_int8 PASSED [0.0087s] [ 18%] 2025-12-04T12:25:43.1722000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_bool PASSED [0.9830s] [ 19%] 2025-12-04T12:25:43.1722108Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_int16 PASSED [0.0045s] [ 19%] 2025-12-04T12:25:43.1722220Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atan_cuda_uint8 PASSED [0.9517s] [ 19%] 2025-12-04T12:25:43.1722336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_bool PASSED [0.0046s] [ 19%] 2025-12-04T12:25:43.1722450Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_float64 PASSED [0.9384s] [ 19%] 2025-12-04T12:25:43.1722560Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atanh_cuda_int64 PASSED [0.0045s] [ 19%] 2025-12-04T12:25:43.1722677Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_bool PASSED [0.9527s] [ 19%] 2025-12-04T12:25:43.1722799Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_complex128 PASSED [0.0050s] [ 19%] 2025-12-04T12:25:43.1722921Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_float16 PASSED [0.9672s] [ 19%] 2025-12-04T12:25:43.1723039Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_int16 PASSED [0.0050s] [ 19%] 2025-12-04T12:25:43.1723153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_int32 PASSED [0.9372s] [ 19%] 2025-12-04T12:25:43.1723269Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_1d_cuda_int8 PASSED [0.0047s] [ 19%] 2025-12-04T12:25:43.1723401Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_complex64 PASSED [0.9328s] [ 19%] 2025-12-04T12:25:43.1723525Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_2d_cuda_int16 PASSED [0.0050s] [ 19%] 2025-12-04T12:25:43.1723641Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_3d_cuda_float32 PASSED [0.0045s] [ 19%] 2025-12-04T12:25:43.1723756Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_atleast_3d_cuda_int8 PASSED [0.0042s] [ 19%] 2025-12-04T12:25:43.1723874Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_baddbmm_cuda_complex128 PASSED [0.0150s] [ 19%] 2025-12-04T12:25:43.1723994Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_baddbmm_cuda_complex64 PASSED [0.0142s] [ 19%] 2025-12-04T12:25:43.1724110Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bernoulli_cuda_float16 PASSED [0.0054s] [ 19%] 2025-12-04T12:25:43.1724233Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bernoulli_cuda_float32 PASSED [0.9765s] [ 19%] 2025-12-04T12:25:43.1724357Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_bool PASSED [0.0042s] [ 19%] 2025-12-04T12:25:43.1724479Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_complex128 PASSED [0.9699s] [ 19%] 2025-12-04T12:25:43.1724596Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_complex32 PASSED [0.0036s] [ 19%] 2025-12-04T12:25:43.1724713Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_float32 PASSED [0.9492s] [ 19%] 2025-12-04T12:25:43.1724825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bfloat16_cuda_float64 PASSED [0.0041s] [ 19%] 2025-12-04T12:25:43.1724940Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bincount_cuda_int32 PASSED [0.0145s] [ 19%] 2025-12-04T12:25:43.1725054Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bincount_cuda_int64 PASSED [0.0077s] [ 19%] 2025-12-04T12:25:43.1725168Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bincount_cuda_uint8 PASSED [0.0071s] [ 19%] 2025-12-04T12:25:43.1725284Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_and_cuda_bool PASSED [0.0088s] [ 19%] 2025-12-04T12:25:43.1725406Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_and_cuda_int64 PASSED [0.0085s] [ 19%] 2025-12-04T12:25:43.1725536Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_left_shift_cuda_int16 PASSED [0.0085s] [ 19%] 2025-12-04T12:25:43.1725662Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_left_shift_cuda_int64 PASSED [0.0084s] [ 19%] 2025-12-04T12:25:43.1725780Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_not_cuda_int64 PASSED [0.0038s] [ 19%] 2025-12-04T12:25:43.1725894Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_not_cuda_int8 PASSED [0.9505s] [ 19%] 2025-12-04T12:25:43.1726027Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_not_cuda_uint8 PASSED [0.0055s] [ 19%] 2025-12-04T12:25:43.1726142Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_or_cuda_int16 PASSED [0.0091s] [ 19%] 2025-12-04T12:25:43.1726261Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_or_cuda_uint8 PASSED [0.0086s] [ 19%] 2025-12-04T12:25:43.1726393Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_right_shift_cuda_int16 PASSED [0.0087s] [ 19%] 2025-12-04T12:25:43.1726525Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_right_shift_cuda_int32 PASSED [0.0085s] [ 19%] 2025-12-04T12:25:43.1726653Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_right_shift_cuda_int64 PASSED [0.0085s] [ 19%] 2025-12-04T12:25:43.1726783Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_right_shift_cuda_uint8 PASSED [0.0085s] [ 19%] 2025-12-04T12:25:43.1726900Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_xor_cuda_int64 PASSED [0.0085s] [ 19%] 2025-12-04T12:25:43.1727022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bitwise_xor_cuda_uint8 PASSED [0.0084s] [ 19%] 2025-12-04T12:25:43.1727153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_bfloat16 PASSED [0.0068s] [ 19%] 2025-12-04T12:25:43.1727281Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_bool PASSED [0.0066s] [ 19%] 2025-12-04T12:25:43.1727403Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_block_diag_cuda_complex32 PASSED [0.0113s] [ 19%] 2025-12-04T12:25:43.1727519Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bmm_cuda_bfloat16 PASSED [0.9440s] [ 19%] 2025-12-04T12:25:43.1727631Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bmm_cuda_complex128 PASSED [0.0046s] [ 19%] 2025-12-04T12:25:43.1727743Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bmm_cuda_float64 PASSED [0.9333s] [ 19%] 2025-12-04T12:25:43.1727857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_bfloat16 PASSED [0.0042s] [ 19%] 2025-12-04T12:25:43.1727972Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_complex32 PASSED [0.9527s] [ 19%] 2025-12-04T12:25:43.1728176Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_complex64 PASSED [0.0041s] [ 19%] 2025-12-04T12:25:43.1728287Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bool_cuda_int16 PASSED [0.9528s] [ 19%] 2025-12-04T12:25:43.1728417Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_shapes_cuda_float32 PASSED [0.0040s] [ 19%] 2025-12-04T12:25:43.1728550Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_complex128 PASSED [0.9625s] [ 19%] 2025-12-04T12:25:43.1728683Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_complex64 PASSED [0.0045s] [ 19%] 2025-12-04T12:25:43.1728809Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_int64 PASSED [0.0031s] [ 19%] 2025-12-04T12:25:43.1728938Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_tensors_cuda_uint8 PASSED [0.9558s] [ 19%] 2025-12-04T12:25:43.1729054Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_int8 PASSED [0.9582s] [ 19%] 2025-12-04T12:25:43.1729178Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_broadcast_to_cuda_uint8 PASSED [0.9441s] [ 19%] 2025-12-04T12:25:43.1729292Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bucketize_cuda_int16 PASSED [0.0221s] [ 19%] 2025-12-04T12:25:43.1729407Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_bucketize_cuda_int64 PASSED [0.0162s] [ 19%] 2025-12-04T12:25:43.1729520Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_bfloat16 PASSED [0.9514s] [ 19%] 2025-12-04T12:25:43.1729635Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_complex128 PASSED [0.0040s] [ 19%] 2025-12-04T12:25:43.1729748Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_float32 PASSED [0.9453s] [ 19%] 2025-12-04T12:25:43.1729873Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_float64 PASSED [0.0041s] [ 19%] 2025-12-04T12:25:43.1729983Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_byte_cuda_int32 PASSED [0.9424s] [ 19%] 2025-12-04T12:25:43.1730115Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_complex64 PASSED [0.0102s] [ 19%] 2025-12-04T12:25:43.1730242Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_float64 PASSED [0.0081s] [ 19%] 2025-12-04T12:25:43.1730365Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_int32 PASSED [0.0077s] [ 19%] 2025-12-04T12:25:43.1730488Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_int64 PASSED [0.0077s] [ 19%] 2025-12-04T12:25:43.1730609Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cartesian_prod_cuda_uint8 PASSED [0.0076s] [ 19%] 2025-12-04T12:25:43.1730720Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_float16 PASSED [0.0084s] [ 19%] 2025-12-04T12:25:43.1730829Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_int64 PASSED [0.9480s] [ 19%] 2025-12-04T12:25:43.1730950Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cat_cuda_int8 PASSED [0.0101s] [ 19%] 2025-12-04T12:25:43.1731076Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cauchy_cuda_bfloat16 PASSED [0.0049s] [ 19%] 2025-12-04T12:25:43.1731191Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdist_cuda_float64 PASSED [0.2628s] [ 19%] 2025-12-04T12:25:43.1731310Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_complex128 PASSED [0.9526s] [ 19%] 2025-12-04T12:25:43.1731430Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_complex32 PASSED [0.0042s] [ 19%] 2025-12-04T12:25:43.1731546Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_complex64 PASSED [0.9482s] [ 19%] 2025-12-04T12:25:43.1731662Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cdouble_cuda_float16 PASSED [0.0042s] [ 19%] 2025-12-04T12:25:43.1731776Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_float16 PASSED [0.9381s] [ 19%] 2025-12-04T12:25:43.1731899Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_int32 PASSED [0.0045s] [ 19%] 2025-12-04T12:25:43.1732008Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ceil_cuda_int8 PASSED [0.9510s] [ 19%] 2025-12-04T12:25:43.1732123Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_complex32 PASSED [0.0041s] [ 19%] 2025-12-04T12:25:43.1732234Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_float64 PASSED [0.9531s] [ 19%] 2025-12-04T12:25:43.1732348Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_int16 PASSED [0.0041s] [ 19%] 2025-12-04T12:25:43.1732458Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cfloat_cuda_int8 PASSED [0.9459s] [ 19%] 2025-12-04T12:25:43.1732572Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chalf_cuda_float16 PASSED [0.0037s] [ 19%] 2025-12-04T12:25:43.1732686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chalf_cuda_int64 PASSED [0.9429s] [ 19%] 2025-12-04T12:25:43.1732799Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_bfloat16 PASSED [0.0041s] [ 19%] 2025-12-04T12:25:43.1732915Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_complex64 PASSED [0.9385s] [ 19%] 2025-12-04T12:25:43.1733024Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_float64 PASSED [0.0041s] [ 19%] 2025-12-04T12:25:43.1733137Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_int16 PASSED [0.9340s] [ 19%] 2025-12-04T12:25:43.1733247Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_int64 PASSED [0.0039s] [ 19%] 2025-12-04T12:25:43.1733358Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_int8 PASSED [0.9524s] [ 19%] 2025-12-04T12:25:43.1733467Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_char_cuda_uint8 PASSED [0.0038s] [ 19%] 2025-12-04T12:25:43.1733600Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_cuda_complex128 PASSED [0.0274s] [ 19%] 2025-12-04T12:25:43.1733716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_cuda_float32 PASSED [1.3717s] [ 19%] 2025-12-04T12:25:43.1733844Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_inverse_cuda_float32 PASSED [1.4720s] [ 19%] 2025-12-04T12:25:43.1733972Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_solve_cuda_complex128 PASSED [1.0062s] [ 19%] 2025-12-04T12:25:43.1734097Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cholesky_solve_cuda_float64 PASSED [0.0628s] [ 19%] 2025-12-04T12:25:43.1734205Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_bool PASSED [0.0037s] [ 19%] 2025-12-04T12:25:43.1734316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_chunk_cuda_int64 PASSED [0.9459s] [ 19%] 2025-12-04T12:25:43.1734426Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_cuda_float16 PASSED [0.0142s] [ 19%] 2025-12-04T12:25:43.1734552Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_cuda_int16 PASSED [0.9547s] [ 19%] 2025-12-04T12:25:43.1734675Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_bool PASSED [0.0132s] [ 19%] 2025-12-04T12:25:43.1734793Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_float16 PASSED [0.0121s] [ 19%] 2025-12-04T12:25:43.1734907Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_max_cuda_int8 PASSED [0.0117s] [ 19%] 2025-12-04T12:25:43.1735024Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_min_cuda_float32 PASSED [0.0118s] [ 19%] 2025-12-04T12:25:43.1735138Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_min_cuda_int32 PASSED [0.0116s] [ 19%] 2025-12-04T12:25:43.1735253Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clamp_min_cuda_int64 PASSED [0.0116s] [ 19%] 2025-12-04T12:25:43.1735371Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_complex128 PASSED [0.9594s] [ 19%] 2025-12-04T12:25:43.1735483Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_complex32 PASSED [0.0052s] [ 19%] 2025-12-04T12:25:43.1735609Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_float16 PASSED [0.0035s] [ 19%] 2025-12-04T12:25:43.1735720Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_clone_cuda_float64 PASSED [0.9547s] [ 19%] 2025-12-04T12:25:43.1735844Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_bfloat16 PASSED [0.0073s] [ 19%] 2025-12-04T12:25:43.1735970Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_complex128 PASSED [0.0053s] [ 19%] 2025-12-04T12:25:43.1736096Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_column_stack_cuda_complex64 PASSED [0.0052s] [ 19%] 2025-12-04T12:25:43.1736219Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_bfloat16 PASSED [0.0829s] [ 19%] 2025-12-04T12:25:43.1736346Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_complex64 PASSED [0.0798s] [ 19%] 2025-12-04T12:25:43.1736470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_float16 PASSED [0.0792s] [ 19%] 2025-12-04T12:25:43.1736595Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_float64 PASSED [1.0256s] [ 19%] 2025-12-04T12:25:43.1736716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_int32 PASSED [0.0822s] [ 19%] 2025-12-04T12:25:43.1736835Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_int64 PASSED [0.0799s] [ 19%] 2025-12-04T12:25:43.1736953Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_int8 PASSED [0.0795s] [ 19%] 2025-12-04T12:25:43.1737074Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_combinations_cuda_uint8 PASSED [0.0795s] [ 19%] 2025-12-04T12:25:43.1737186Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_cuda_float16 PASSED [0.9676s] [ 19%] 2025-12-04T12:25:43.1737324Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_bfloat16 PASSED [0.0042s] [ 19%] 2025-12-04T12:25:43.1737452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_complex64 PASSED [0.9556s] [ 19%] 2025-12-04T12:25:43.1737576Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_float64 PASSED [0.0038s] [ 19%] 2025-12-04T12:25:43.1737701Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_conj_physical_cuda_int32 PASSED [0.9382s] [ 19%] 2025-12-04T12:25:43.1737823Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_bool PASSED [0.0290s] [ 19%] 2025-12-04T12:25:43.1737950Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_int16 PASSED [0.0274s] [ 19%] 2025-12-04T12:25:43.1738111Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_int8 PASSED [0.0271s] [ 19%] 2025-12-04T12:25:43.1738239Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_constant_pad_nd_cuda_uint8 PASSED [0.9675s] [ 19%] 2025-12-04T12:25:43.1738375Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_contiguous_cuda_bfloat16 PASSED [0.0033s] [ 19%] 2025-12-04T12:25:43.1738506Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_contiguous_cuda_bool PASSED [0.9341s] [ 20%] 2025-12-04T12:25:43.1738623Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_contiguous_cuda_uint8 PASSED [0.0032s] [ 20%] 2025-12-04T12:25:43.1738738Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_copysign_cuda_bool PASSED [0.0192s] [ 20%] 2025-12-04T12:25:43.1738854Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_copysign_cuda_float64 PASSED [0.0124s] [ 20%] 2025-12-04T12:25:43.1738969Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_copysign_cuda_int32 PASSED [0.0114s] [ 20%] 2025-12-04T12:25:43.1739084Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_corrcoef_cuda_float16 PASSED [0.5027s] [ 20%] 2025-12-04T12:25:43.1739201Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_corrcoef_cuda_float32 PASSED [0.0297s] [ 20%] 2025-12-04T12:25:43.1739314Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_corrcoef_cuda_int16 PASSED [0.0294s] [ 20%] 2025-12-04T12:25:43.1739444Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_corrcoef_cuda_int8 PASSED [0.0293s] [ 20%] 2025-12-04T12:25:43.1739556Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_corrcoef_cuda_uint8 PASSED [0.0293s] [ 20%] 2025-12-04T12:25:43.1739671Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_float32 PASSED [0.0039s] [ 20%] 2025-12-04T12:25:43.1739781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_float64 PASSED [0.0038s] [ 20%] 2025-12-04T12:25:43.1739895Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cos_cuda_int64 PASSED [0.9557s] [ 20%] 2025-12-04T12:25:43.1740005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_bool PASSED [0.0057s] [ 20%] 2025-12-04T12:25:43.1740124Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_complex128 PASSED [0.0057s] [ 20%] 2025-12-04T12:25:43.1740239Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_float16 PASSED [0.9587s] [ 20%] 2025-12-04T12:25:43.1740352Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_float32 PASSED [0.0056s] [ 20%] 2025-12-04T12:25:43.1740465Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_float64 PASSED [0.0041s] [ 20%] 2025-12-04T12:25:43.1740573Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_int16 PASSED [0.9515s] [ 20%] 2025-12-04T12:25:43.1740683Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_int32 PASSED [0.0053s] [ 20%] 2025-12-04T12:25:43.1740790Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cosh_cuda_uint8 PASSED [0.0039s] [ 20%] 2025-12-04T12:25:43.1740912Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_int32 PASSED [0.0162s] [ 20%] 2025-12-04T12:25:43.1741049Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_count_nonzero_cuda_int64 PASSED [0.0156s] [ 20%] 2025-12-04T12:25:43.1741162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_bfloat16 PASSED [0.7419s] [ 20%] 2025-12-04T12:25:43.1741276Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_complex128 PASSED [0.3253s] [ 20%] 2025-12-04T12:25:43.1741391Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_complex64 PASSED [0.3220s] [ 20%] 2025-12-04T12:25:43.1741499Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_int64 PASSED [0.3213s] [ 20%] 2025-12-04T12:25:43.1741608Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cov_cuda_int8 PASSED [0.3168s] [ 20%] 2025-12-04T12:25:43.1741721Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_float64 PASSED [0.9609s] [ 20%] 2025-12-04T12:25:43.1741830Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cross_cuda_uint8 PASSED [0.0058s] [ 20%] 2025-12-04T12:25:43.1741941Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummax_cuda_bool PASSED [0.0085s] [ 20%] 2025-12-04T12:25:43.1742066Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummax_cuda_int64 PASSED [0.9531s] [ 20%] 2025-12-04T12:25:43.1742186Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummax_cuda_uint8 PASSED [0.0050s] [ 20%] 2025-12-04T12:25:43.1742297Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cummin_cuda_uint8 PASSED [0.9535s] [ 20%] 2025-12-04T12:25:43.1742413Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_bfloat16 PASSED [0.0271s] [ 20%] 2025-12-04T12:25:43.1742532Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_complex64 PASSED [0.9580s] [ 20%] 2025-12-04T12:25:43.1742644Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_int32 PASSED [0.0130s] [ 20%] 2025-12-04T12:25:43.1742758Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_int8 PASSED [0.0111s] [ 20%] 2025-12-04T12:25:43.1742872Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumprod_cuda_uint8 PASSED [0.9560s] [ 20%] 2025-12-04T12:25:43.1742988Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_bfloat16 PASSED [0.0083s] [ 20%] 2025-12-04T12:25:43.1743118Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_complex64 PASSED [0.9676s] [ 20%] 2025-12-04T12:25:43.1743230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_int16 PASSED [0.0081s] [ 20%] 2025-12-04T12:25:43.1743340Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumsum_cuda_int8 PASSED [0.0059s] [ 20%] 2025-12-04T12:25:43.1743475Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumulative_trapezoid_cuda_bfloat16 PASSED [0.0346s] [ 20%] 2025-12-04T12:25:43.1743614Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumulative_trapezoid_cuda_complex64 PASSED [0.0347s] [ 20%] 2025-12-04T12:25:43.1743746Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumulative_trapezoid_cuda_float64 PASSED [0.9841s] [ 20%] 2025-12-04T12:25:43.1743879Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_cumulative_trapezoid_cuda_int64 PASSED [0.0347s] [ 20%] 2025-12-04T12:25:43.1743992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_deg2rad_cuda_int64 PASSED [0.0033s] [ 20%] 2025-12-04T12:25:43.1744107Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_float16 PASSED [0.0111s] [ 20%] 2025-12-04T12:25:43.1744216Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_int64 PASSED [0.9868s] [ 20%] 2025-12-04T12:25:43.1744327Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_cuda_int8 PASSED [0.0129s] [ 20%] 2025-12-04T12:25:43.1744446Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_bfloat16 PASSED [0.0182s] [ 20%] 2025-12-04T12:25:43.1744568Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_complex128 PASSED [0.0179s] [ 20%] 2025-12-04T12:25:43.1744688Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_complex32 PASSED [1.0064s] [ 20%] 2025-12-04T12:25:43.1744824Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_complex64 PASSED [0.0208s] [ 20%] 2025-12-04T12:25:43.1744943Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_float32 PASSED [0.0180s] [ 20%] 2025-12-04T12:25:43.1745062Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diag_embed_cuda_int16 PASSED [0.0179s] [ 20%] 2025-12-04T12:25:43.1745182Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_bfloat16 PASSED [0.0086s] [ 20%] 2025-12-04T12:25:43.1745294Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_bool PASSED [0.9912s] [ 20%] 2025-12-04T12:25:43.1745416Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_complex128 PASSED [0.0117s] [ 20%] 2025-12-04T12:25:43.1745531Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_float16 PASSED [0.0089s] [ 20%] 2025-12-04T12:25:43.1745647Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_float64 PASSED [0.0086s] [ 20%] 2025-12-04T12:25:43.1745770Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagflat_cuda_int64 PASSED [1.0118s] [ 20%] 2025-12-04T12:25:43.1745912Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_complex128 PASSED [0.0115s] [ 20%] 2025-12-04T12:25:43.1746033Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_int32 PASSED [0.0095s] [ 20%] 2025-12-04T12:25:43.1746154Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_int8 PASSED [0.0092s] [ 20%] 2025-12-04T12:25:43.1746274Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_copy_cuda_uint8 PASSED [0.0093s] [ 20%] 2025-12-04T12:25:43.1746395Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_bfloat16 PASSED [0.0062s] [ 20%] 2025-12-04T12:25:43.1746509Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_bool PASSED [0.0061s] [ 20%] 2025-12-04T12:25:43.1746632Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_complex128 PASSED [0.0062s] [ 20%] 2025-12-04T12:25:43.1746752Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_complex32 PASSED [0.0062s] [ 20%] 2025-12-04T12:25:43.1746879Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_float32 PASSED [0.0062s] [ 20%] 2025-12-04T12:25:43.1746993Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_cuda_uint8 PASSED [0.0061s] [ 20%] 2025-12-04T12:25:43.1747124Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_bfloat16 PASSED [0.0115s] [ 20%] 2025-12-04T12:25:43.1747252Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_float16 PASSED [0.0112s] [ 20%] 2025-12-04T12:25:43.1747380Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diagonal_scatter_cuda_int32 PASSED [0.9773s] [ 20%] 2025-12-04T12:25:43.1747493Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_float16 PASSED [0.2181s] [ 20%] 2025-12-04T12:25:43.1747604Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_float32 PASSED [0.2136s] [ 20%] 2025-12-04T12:25:43.1747716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_int8 PASSED [0.2129s] [ 20%] 2025-12-04T12:25:43.1747826Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_diff_cuda_uint8 PASSED [0.2165s] [ 20%] 2025-12-04T12:25:43.1747940Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_float16 PASSED [0.0045s] [ 20%] 2025-12-04T12:25:43.1748054Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_float32 PASSED [0.9619s] [ 20%] 2025-12-04T12:25:43.1748199Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_float64 PASSED [0.0073s] [ 20%] 2025-12-04T12:25:43.1748311Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_int16 PASSED [0.0054s] [ 20%] 2025-12-04T12:25:43.1748424Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_int8 PASSED [0.9466s] [ 20%] 2025-12-04T12:25:43.1748551Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_digamma_cuda_uint8 PASSED [0.0058s] [ 20%] 2025-12-04T12:25:43.1748666Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dist_cuda_bfloat16 PASSED [0.0561s] [ 20%] 2025-12-04T12:25:43.1748779Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dist_cuda_complex64 PASSED [0.0581s] [ 20%] 2025-12-04T12:25:43.1748891Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dist_cuda_float64 PASSED [0.0544s] [ 20%] 2025-12-04T12:25:43.1749019Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_floor_rounding_cuda_float16 PASSED [0.0313s] [ 20%] 2025-12-04T12:25:43.1749150Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_floor_rounding_cuda_int32 PASSED [0.0162s] [ 20%] 2025-12-04T12:25:43.1749276Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_floor_rounding_cuda_uint8 PASSED [0.0095s] [ 20%] 2025-12-04T12:25:43.1749409Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_bfloat16 PASSED [0.0093s] [ 20%] 2025-12-04T12:25:43.1749556Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_complex32 PASSED [0.3361s] [ 20%] 2025-12-04T12:25:43.1749703Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_complex64 PASSED [0.0096s] [ 20%] 2025-12-04T12:25:43.1749833Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_float32 PASSED [0.0092s] [ 20%] 2025-12-04T12:25:43.1749964Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_float64 PASSED [0.0092s] [ 20%] 2025-12-04T12:25:43.1750093Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_no_rounding_mode_cuda_uint8 PASSED [0.0093s] [ 20%] 2025-12-04T12:25:43.1750220Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_float32 PASSED [0.0097s] [ 20%] 2025-12-04T12:25:43.1750346Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_int16 PASSED [0.0091s] [ 20%] 2025-12-04T12:25:43.1750474Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_int64 PASSED [0.0091s] [ 20%] 2025-12-04T12:25:43.1750612Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_div_trunc_rounding_cuda_int8 PASSED [0.0091s] [ 20%] 2025-12-04T12:25:43.1750727Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dot_cuda_complex128 PASSED [0.9687s] [ 20%] 2025-12-04T12:25:43.1750840Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dot_cuda_float16 PASSED [0.0050s] [ 20%] 2025-12-04T12:25:43.1750957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_complex128 PASSED [0.9432s] [ 20%] 2025-12-04T12:25:43.1751073Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_complex32 PASSED [0.0042s] [ 20%] 2025-12-04T12:25:43.1751190Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_float32 PASSED [0.9520s] [ 20%] 2025-12-04T12:25:43.1751306Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_int32 PASSED [0.0043s] [ 20%] 2025-12-04T12:25:43.1751418Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_double_cuda_uint8 PASSED [0.9653s] [ 20%] 2025-12-04T12:25:43.1751537Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_bfloat16 PASSED [0.0049s] [ 20%] 2025-12-04T12:25:43.1751650Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dsplit_cuda_complex32 PASSED [0.0035s] [ 20%] 2025-12-04T12:25:43.1751769Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_complex128 PASSED [0.0069s] [ 20%] 2025-12-04T12:25:43.1751881Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_float16 PASSED [0.0066s] [ 20%] 2025-12-04T12:25:43.1751996Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_float32 PASSED [0.0064s] [ 20%] 2025-12-04T12:25:43.1752106Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_dstack_cuda_int16 PASSED [0.0065s] [ 20%] 2025-12-04T12:25:43.1752231Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_einsum_cuda_complex64 PASSED [1.0501s] [ 20%] 2025-12-04T12:25:43.1752346Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_einsum_cuda_float16 PASSED [0.4267s] [ 20%] 2025-12-04T12:25:43.1752462Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_complex128 PASSED [0.0042s] [ 20%] 2025-12-04T12:25:43.1752577Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_float32 PASSED [0.0037s] [ 20%] 2025-12-04T12:25:43.1752688Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_int8 PASSED [0.0038s] [ 20%] 2025-12-04T12:25:43.1752801Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_cuda_uint8 PASSED [0.0038s] [ 20%] 2025-12-04T12:25:43.1752919Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_bfloat16 PASSED [0.0061s] [ 20%] 2025-12-04T12:25:43.1753043Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_complex32 PASSED [0.0061s] [ 20%] 2025-12-04T12:25:43.1753164Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_like_cuda_float16 PASSED [0.0060s] [ 20%] 2025-12-04T12:25:43.1753300Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_float64 PASSED [0.0139s] [ 20%] 2025-12-04T12:25:43.1753439Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_int32 PASSED [0.0137s] [ 20%] 2025-12-04T12:25:43.1753560Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_permuted_cuda_int64 PASSED [0.0137s] [ 20%] 2025-12-04T12:25:43.1753679Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_empty_strided_cuda_float16 XFAIL [0.0045s] [ 20%] 2025-12-04T12:25:43.1753794Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_bfloat16 PASSED [0.9756s] [ 20%] 2025-12-04T12:25:43.1753905Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_complex32 PASSED [0.0098s] [ 20%] 2025-12-04T12:25:43.1754015Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_int32 PASSED [0.9621s] [ 20%] 2025-12-04T12:25:43.1754124Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_int64 PASSED [0.0113s] [ 20%] 2025-12-04T12:25:43.1754235Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eq_cuda_uint8 PASSED [0.0095s] [ 20%] 2025-12-04T12:25:43.1754358Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_bfloat16 PASSED [0.9500s] [ 20%] 2025-12-04T12:25:43.1754470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_bool PASSED [0.0067s] [ 20%] 2025-12-04T12:25:43.1754578Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_int64 PASSED [0.0051s] [ 20%] 2025-12-04T12:25:43.1754689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_int8 PASSED [0.0049s] [ 20%] 2025-12-04T12:25:43.1754800Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_equal_cuda_uint8 PASSED [0.0048s] [ 21%] 2025-12-04T12:25:43.1754911Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_bfloat16 PASSED [0.9529s] [ 21%] 2025-12-04T12:25:43.1755022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_float16 PASSED [0.0045s] [ 21%] 2025-12-04T12:25:43.1755131Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_float32 PASSED [0.9391s] [ 21%] 2025-12-04T12:25:43.1755243Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_float64 PASSED [0.0045s] [ 21%] 2025-12-04T12:25:43.1755351Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_int16 PASSED [0.9450s] [ 21%] 2025-12-04T12:25:43.1755463Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_int32 PASSED [0.0045s] [ 21%] 2025-12-04T12:25:43.1755571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erf_cuda_uint8 PASSED [0.9578s] [ 21%] 2025-12-04T12:25:43.1755685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfc_cuda_float32 PASSED [0.0086s] [ 21%] 2025-12-04T12:25:43.1755793Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfc_cuda_int16 PASSED [0.0043s] [ 21%] 2025-12-04T12:25:43.1755915Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfinv_cuda_int64 PASSED [0.9547s] [ 21%] 2025-12-04T12:25:43.1756026Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_erfinv_cuda_uint8 PASSED [0.0045s] [ 21%] 2025-12-04T12:25:43.1756141Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_bfloat16 PASSED [0.0047s] [ 21%] 2025-12-04T12:25:43.1756257Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_complex128 PASSED [1.1448s] [ 21%] 2025-12-04T12:25:43.1756369Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_int16 PASSED [0.0071s] [ 21%] 2025-12-04T12:25:43.1756479Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_int32 PASSED [0.0040s] [ 21%] 2025-12-04T12:25:43.1756589Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp2_cuda_uint8 PASSED [0.9589s] [ 21%] 2025-12-04T12:25:43.1756699Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_bfloat16 PASSED [0.0060s] [ 21%] 2025-12-04T12:25:43.1756813Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_complex64 PASSED [0.0057s] [ 21%] 2025-12-04T12:25:43.1756936Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_float32 PASSED [0.9492s] [ 21%] 2025-12-04T12:25:43.1757054Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exp_cuda_uint8 PASSED [0.0055s] [ 21%] 2025-12-04T12:25:43.1757172Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_float32 PASSED [0.0034s] [ 21%] 2025-12-04T12:25:43.1757288Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_int32 PASSED [0.9631s] [ 21%] 2025-12-04T12:25:43.1757401Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_as_cuda_int8 PASSED [0.0051s] [ 21%] 2025-12-04T12:25:43.1757525Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_bfloat16 PASSED [0.0077s] [ 21%] 2025-12-04T12:25:43.1757643Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_bool PASSED [0.0067s] [ 21%] 2025-12-04T12:25:43.1757770Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_complex128 PASSED [0.0067s] [ 21%] 2025-12-04T12:25:43.1757890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_int64 PASSED [0.0066s] [ 21%] 2025-12-04T12:25:43.1758018Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_copy_cuda_uint8 PASSED [0.0066s] [ 21%] 2025-12-04T12:25:43.1758174Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_complex128 PASSED [0.0047s] [ 21%] 2025-12-04T12:25:43.1758289Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_complex64 PASSED [0.0047s] [ 21%] 2025-12-04T12:25:43.1758403Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_int64 PASSED [0.9572s] [ 21%] 2025-12-04T12:25:43.1758513Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expand_cuda_uint8 PASSED [0.0064s] [ 21%] 2025-12-04T12:25:43.1758627Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_bfloat16 PASSED [0.0033s] [ 21%] 2025-12-04T12:25:43.1758742Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_complex64 PASSED [0.9550s] [ 21%] 2025-12-04T12:25:43.1758860Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_float64 PASSED [0.0043s] [ 21%] 2025-12-04T12:25:43.1758971Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_expm1_cuda_int8 PASSED [0.9532s] [ 21%] 2025-12-04T12:25:43.1759094Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_exponential_cuda_float64 PASSED [0.0066s] [ 21%] 2025-12-04T12:25:43.1759201Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_bool PASSED [0.0412s] [ 21%] 2025-12-04T12:25:43.1759313Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_float32 PASSED [0.0408s] [ 21%] 2025-12-04T12:25:43.1759433Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_float8_e5m2fnuz PASSED [0.0407s] [ 21%] 2025-12-04T12:25:43.1759543Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_int32 PASSED [0.0406s] [ 21%] 2025-12-04T12:25:43.1759665Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_eye_cuda_int64 PASSED [0.0408s] [ 21%] 2025-12-04T12:25:43.1759780Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_bool PASSED [3.4205s] [ 21%] 2025-12-04T12:25:43.1759896Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_float16 PASSED [2.8992s] [ 21%] 2025-12-04T12:25:43.1760010Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_int16 PASSED [0.0227s] [ 21%] 2025-12-04T12:25:43.1760125Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft2_cuda_uint8 PASSED [0.0161s] [ 21%] 2025-12-04T12:25:43.1760241Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_complex32 PASSED [1.2218s] [ 21%] 2025-12-04T12:25:43.1760358Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_complex64 PASSED [2.1362s] [ 21%] 2025-12-04T12:25:43.1760471Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fft_cuda_float16 PASSED [1.3684s] [ 21%] 2025-12-04T12:25:43.1760592Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_complex32 PASSED [0.0269s] [ 21%] 2025-12-04T12:25:43.1760719Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_float64 PASSED [3.9031s] [ 21%] 2025-12-04T12:25:43.1760846Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftn_cuda_uint8 PASSED [1.6413s] [ 21%] 2025-12-04T12:25:43.1760961Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_bool PASSED [0.0143s] [ 21%] 2025-12-04T12:25:43.1761084Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_float32 PASSED [1.0242s] [ 21%] 2025-12-04T12:25:43.1761205Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_float64 PASSED [0.0092s] [ 21%] 2025-12-04T12:25:43.1761325Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_int16 PASSED [1.0289s] [ 21%] 2025-12-04T12:25:43.1761440Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_fftshift_cuda_int8 PASSED [0.0093s] [ 21%] 2025-12-04T12:25:43.1761557Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_bool PASSED [2.5511s] [ 21%] 2025-12-04T12:25:43.1761691Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_complex128 PASSED [1.8143s] [ 21%] 2025-12-04T12:25:43.1761812Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft2_cuda_complex64 PASSED [0.0356s] [ 21%] 2025-12-04T12:25:43.1761924Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_bool PASSED [0.6227s] [ 21%] 2025-12-04T12:25:43.1762050Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_complex128 PASSED [0.5155s] [ 21%] 2025-12-04T12:25:43.1762172Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_complex64 PASSED [0.0306s] [ 21%] 2025-12-04T12:25:43.1762286Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_int16 PASSED [0.0136s] [ 21%] 2025-12-04T12:25:43.1762399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_int32 PASSED [0.0111s] [ 21%] 2025-12-04T12:25:43.1762512Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_int64 PASSED [0.0111s] [ 21%] 2025-12-04T12:25:43.1762626Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfft_cuda_uint8 PASSED [0.0111s] [ 21%] 2025-12-04T12:25:43.1762744Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_complex64 PASSED [0.0137s] [ 21%] 2025-12-04T12:25:43.1762861Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_float16 PASSED [1.4963s] [ 21%] 2025-12-04T12:25:43.1762976Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_float32 PASSED [0.0373s] [ 21%] 2025-12-04T12:25:43.1763092Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_hfftn_cuda_float64 PASSED [0.0145s] [ 21%] 2025-12-04T12:25:43.1763210Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_complex64 PASSED [0.9920s] [ 21%] 2025-12-04T12:25:43.1763339Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_int16 PASSED [0.0093s] [ 21%] 2025-12-04T12:25:43.1763452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft2_cuda_int64 PASSED [0.9781s] [ 21%] 2025-12-04T12:25:43.1763568Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_bool PASSED [0.9012s] [ 21%] 2025-12-04T12:25:43.1763686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_complex128 PASSED [1.5927s] [ 21%] 2025-12-04T12:25:43.1763800Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_float64 PASSED [0.8748s] [ 21%] 2025-12-04T12:25:43.1763911Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifft_cuda_int16 PASSED [0.0122s] [ 21%] 2025-12-04T12:25:43.1764024Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_bool PASSED [0.0083s] [ 21%] 2025-12-04T12:25:43.1764138Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftn_cuda_uint8 PASSED [0.0081s] [ 21%] 2025-12-04T12:25:43.1764265Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_bfloat16 PASSED [0.0070s] [ 21%] 2025-12-04T12:25:43.1764401Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_complex128 PASSED [0.0068s] [ 21%] 2025-12-04T12:25:43.1764538Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_complex64 PASSED [0.0068s] [ 21%] 2025-12-04T12:25:43.1764660Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_float16 PASSED [0.0067s] [ 21%] 2025-12-04T12:25:43.1764786Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ifftshift_cuda_float64 PASSED [0.0068s] [ 21%] 2025-12-04T12:25:43.1764905Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft2_cuda_float32 PASSED [0.3385s] [ 21%] 2025-12-04T12:25:43.1765020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft_cuda_float32 PASSED [0.0332s] [ 21%] 2025-12-04T12:25:43.1765137Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfft_cuda_int32 PASSED [0.0171s] [ 21%] 2025-12-04T12:25:43.1765251Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_ihfftn_cuda_int8 PASSED [0.0124s] [ 21%] 2025-12-04T12:25:43.1765389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_complex32 PASSED [1.2910s] [ 21%] 2025-12-04T12:25:43.1765508Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_float16 PASSED [0.0081s] [ 21%] 2025-12-04T12:25:43.1765623Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_int16 PASSED [0.4662s] [ 21%] 2025-12-04T12:25:43.1765738Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft2_cuda_int32 PASSED [0.0249s] [ 21%] 2025-12-04T12:25:43.1765853Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_bool PASSED [0.0142s] [ 21%] 2025-12-04T12:25:43.1765972Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_complex32 PASSED [0.0154s] [ 21%] 2025-12-04T12:25:43.1766094Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_complex64 PASSED [0.0069s] [ 21%] 2025-12-04T12:25:43.1766211Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_float16 PASSED [0.0075s] [ 21%] 2025-12-04T12:25:43.1766329Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_int16 PASSED [0.0071s] [ 21%] 2025-12-04T12:25:43.1766442Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfft_cuda_uint8 PASSED [0.0071s] [ 21%] 2025-12-04T12:25:43.1766568Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_complex128 PASSED [0.3085s] [ 21%] 2025-12-04T12:25:43.1766685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_float64 PASSED [0.0276s] [ 21%] 2025-12-04T12:25:43.1766800Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_int16 PASSED [0.9703s] [ 21%] 2025-12-04T12:25:43.1766914Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_irfftn_cuda_uint8 PASSED [0.0106s] [ 21%] 2025-12-04T12:25:43.1767041Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft2_cuda_float32 PASSED [0.4467s] [ 21%] 2025-12-04T12:25:43.1767153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft2_cuda_int16 PASSED [0.0234s] [ 21%] 2025-12-04T12:25:43.1767272Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_float32 PASSED [0.0131s] [ 21%] 2025-12-04T12:25:43.1767385Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_int32 PASSED [0.0069s] [ 21%] 2025-12-04T12:25:43.1767497Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfft_cuda_int8 PASSED [0.0070s] [ 21%] 2025-12-04T12:25:43.1767611Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fft_rfftn_cuda_int32 PASSED [0.0083s] [ 21%] 2025-12-04T12:25:43.1767722Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_complex32 PASSED [0.0038s] [ 21%] 2025-12-04T12:25:43.1767833Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_int16 PASSED [1.0059s] [ 21%] 2025-12-04T12:25:43.1767942Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fill_cuda_int64 PASSED [0.0055s] [ 21%] 2025-12-04T12:25:43.1768092Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_bfloat16 PASSED [0.0039s] [ 21%] 2025-12-04T12:25:43.1768224Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_complex128 PASSED [0.9948s] [ 21%] 2025-12-04T12:25:43.1768343Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_complex64 PASSED [0.0056s] [ 21%] 2025-12-04T12:25:43.1768457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_float32 PASSED [0.0040s] [ 21%] 2025-12-04T12:25:43.1768573Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_int16 PASSED [0.9856s] [ 21%] 2025-12-04T12:25:43.1768686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flatten_cuda_int8 PASSED [0.0054s] [ 21%] 2025-12-04T12:25:43.1768803Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_bfloat16 PASSED [0.0080s] [ 21%] 2025-12-04T12:25:43.1768912Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_bool PASSED [0.0074s] [ 21%] 2025-12-04T12:25:43.1769030Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_complex64 PASSED [0.0074s] [ 21%] 2025-12-04T12:25:43.1769155Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_int16 PASSED [0.0072s] [ 21%] 2025-12-04T12:25:43.1769264Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_int8 PASSED [0.0073s] [ 21%] 2025-12-04T12:25:43.1769371Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flip_cuda_uint8 PASSED [0.0073s] [ 21%] 2025-12-04T12:25:43.1769487Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_float16 PASSED [0.0033s] [ 21%] 2025-12-04T12:25:43.1769601Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_float32 PASSED [0.0031s] [ 21%] 2025-12-04T12:25:43.1769716Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_float64 PASSED [0.0032s] [ 21%] 2025-12-04T12:25:43.1769830Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fliplr_cuda_int16 PASSED [0.0032s] [ 21%] 2025-12-04T12:25:43.1769946Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_complex64 PASSED [0.0031s] [ 21%] 2025-12-04T12:25:43.1770060Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_int64 PASSED [0.0033s] [ 21%] 2025-12-04T12:25:43.1770170Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_flipud_cuda_uint8 PASSED [0.0032s] [ 21%] 2025-12-04T12:25:43.1770285Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_bfloat16 PASSED [1.0073s] [ 21%] 2025-12-04T12:25:43.1770395Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_bool PASSED [0.0040s] [ 21%] 2025-12-04T12:25:43.1770515Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_complex128 PASSED [0.9824s] [ 21%] 2025-12-04T12:25:43.1770628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_complex32 PASSED [0.0039s] [ 21%] 2025-12-04T12:25:43.1770755Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_float16 PASSED [0.9787s] [ 21%] 2025-12-04T12:25:43.1770866Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_int16 PASSED [0.0041s] [ 22%] 2025-12-04T12:25:43.1770979Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_int32 PASSED [0.9857s] [ 22%] 2025-12-04T12:25:43.1771088Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_cuda_int64 PASSED [0.0041s] [ 22%] 2025-12-04T12:25:43.1771213Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_complex64 PASSED [0.0106s] [ 22%] 2025-12-04T12:25:43.1771333Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_float32 PASSED [0.0092s] [ 22%] 2025-12-04T12:25:43.1771452Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_float64 PASSED [0.0085s] [ 22%] 2025-12-04T12:25:43.1771568Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_int16 PASSED [0.0091s] [ 22%] 2025-12-04T12:25:43.1771686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_float_power_cuda_int64 PASSED [0.0090s] [ 22%] 2025-12-04T12:25:43.1771810Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_cuda_float16 PASSED [0.9913s] [ 22%] 2025-12-04T12:25:43.1771942Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_floor_divide_cuda_int64 PASSED [0.0187s] [ 22%] 2025-12-04T12:25:43.1772052Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmax_cuda_float32 PASSED [0.0089s] [ 22%] 2025-12-04T12:25:43.1772161Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmax_cuda_int32 PASSED [0.0085s] [ 22%] 2025-12-04T12:25:43.1772271Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmax_cuda_uint8 PASSED [0.0084s] [ 22%] 2025-12-04T12:25:43.1772381Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_bfloat16 PASSED [0.0087s] [ 22%] 2025-12-04T12:25:43.1772489Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_bool PASSED [0.0083s] [ 22%] 2025-12-04T12:25:43.1772599Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmin_cuda_uint8 PASSED [0.0084s] [ 22%] 2025-12-04T12:25:43.1772718Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_float32 PASSED [0.0091s] [ 22%] 2025-12-04T12:25:43.1772836Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_int16 PASSED [0.0089s] [ 22%] 2025-12-04T12:25:43.1772945Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_int8 PASSED [0.0089s] [ 22%] 2025-12-04T12:25:43.1773052Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_fmod_cuda_uint8 PASSED [0.0090s] [ 22%] 2025-12-04T12:25:43.1773167Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frac_cuda_bfloat16 PASSED [0.0032s] [ 22%] 2025-12-04T12:25:43.1773279Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frexp_cuda_float16 PASSED [1.0201s] [ 22%] 2025-12-04T12:25:43.1773393Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_frexp_cuda_float64 PASSED [0.0055s] [ 22%] 2025-12-04T12:25:43.1773506Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_bfloat16 PASSED [0.0043s] [ 22%] 2025-12-04T12:25:43.1773617Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_bool PASSED [1.0190s] [ 22%] 2025-12-04T12:25:43.1773734Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_complex128 PASSED [0.0060s] [ 22%] 2025-12-04T12:25:43.1773847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_float32 PASSED [0.0041s] [ 22%] 2025-12-04T12:25:43.1773953Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_int16 PASSED [1.0174s] [ 22%] 2025-12-04T12:25:43.1774063Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_cuda_uint8 PASSED [0.0060s] [ 22%] 2025-12-04T12:25:43.1774180Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_float32 PASSED [0.0068s] [ 22%] 2025-12-04T12:25:43.1774297Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_float64 PASSED [0.0063s] [ 22%] 2025-12-04T12:25:43.1774420Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_int16 PASSED [0.0062s] [ 22%] 2025-12-04T12:25:43.1774533Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_int64 PASSED [0.0061s] [ 22%] 2025-12-04T12:25:43.1774651Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_full_like_cuda_uint16 PASSED [0.0066s] [ 22%] 2025-12-04T12:25:43.1774766Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_float16 PASSED [0.0072s] [ 22%] 2025-12-04T12:25:43.1774880Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gather_cuda_int16 PASSED [0.0067s] [ 22%] 2025-12-04T12:25:43.1774988Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gcd_cuda_int16 PASSED [0.0087s] [ 22%] 2025-12-04T12:25:43.1775097Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gcd_cuda_int32 PASSED [0.0104s] [ 22%] 2025-12-04T12:25:43.1775204Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gcd_cuda_uint8 PASSED [0.2926s] [ 22%] 2025-12-04T12:25:43.1775315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_float16 PASSED [0.0094s] [ 22%] 2025-12-04T12:25:43.1775431Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ge_cuda_int16 PASSED [0.0087s] [ 22%] 2025-12-04T12:25:43.1775561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geometric_cuda_bfloat16 PASSED [1.0267s] [ 22%] 2025-12-04T12:25:43.1775678Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geometric_cuda_float32 PASSED [0.0068s] [ 22%] 2025-12-04T12:25:43.1775797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geometric_cuda_int32 PASSED [0.0049s] [ 22%] 2025-12-04T12:25:43.1775910Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geometric_cuda_int8 PASSED [1.0260s] [ 22%] 2025-12-04T12:25:43.1776024Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_geqrf_cuda_float32 PASSED [0.0774s] [ 22%] 2025-12-04T12:25:43.1776141Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_bfloat16 PASSED [0.1726s] [ 22%] 2025-12-04T12:25:43.1776262Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gradient_cuda_float64 PASSED [0.1690s] [ 22%] 2025-12-04T12:25:43.1776390Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_grid_sampler_2d_cuda_bfloat16 PASSED [0.5967s] [ 22%] 2025-12-04T12:25:43.1776546Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_grid_sampler_3d_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 22%] 2025-12-04T12:25:43.1776687Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_grid_sampler_3d_cuda_float64 SKIPPED [0.0001s] (Skipped!) [ 22%] 2025-12-04T12:25:43.1776802Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_bfloat16 PASSED [0.0095s] [ 22%] 2025-12-04T12:25:43.1776910Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_bool PASSED [0.0086s] [ 22%] 2025-12-04T12:25:43.1777022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_float32 PASSED [0.0086s] [ 22%] 2025-12-04T12:25:43.1777131Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_gt_cuda_float64 PASSED [0.0085s] [ 22%] 2025-12-04T12:25:43.1777248Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_complex128 PASSED [0.0030s] [ 22%] 2025-12-04T12:25:43.1777363Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_float32 PASSED [1.0269s] [ 22%] 2025-12-04T12:25:43.1777471Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_int16 PASSED [0.0041s] [ 22%] 2025-12-04T12:25:43.1777581Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_int64 PASSED [1.0199s] [ 22%] 2025-12-04T12:25:43.1777689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_int8 PASSED [0.0041s] [ 22%] 2025-12-04T12:25:43.1777799Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_half_cuda_uint8 PASSED [1.0246s] [ 22%] 2025-12-04T12:25:43.1777914Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_bool PASSED [0.0129s] [ 22%] 2025-12-04T12:25:43.1778032Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_int16 PASSED [0.0108s] [ 22%] 2025-12-04T12:25:43.1778198Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hash_tensor_cuda_uint8 PASSED [0.0107s] [ 22%] 2025-12-04T12:25:43.1778319Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_heaviside_cuda_float64 PASSED [0.0150s] [ 22%] 2025-12-04T12:25:43.1778433Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_heaviside_cuda_int16 PASSED [0.0145s] [ 22%] 2025-12-04T12:25:43.1778550Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_heaviside_cuda_int64 PASSED [0.0144s] [ 22%] 2025-12-04T12:25:43.1778661Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_int16 PASSED [1.0625s] [ 22%] 2025-12-04T12:25:43.1778775Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_int32 PASSED [0.0507s] [ 22%] 2025-12-04T12:25:43.1778884Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_histc_cuda_int8 PASSED [0.0490s] [ 22%] 2025-12-04T12:25:43.1779003Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_bfloat16 PASSED [0.0039s] [ 22%] 2025-12-04T12:25:43.1779122Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_complex128 PASSED [1.0393s] [ 22%] 2025-12-04T12:25:43.1779253Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_int32 PASSED [0.0052s] [ 22%] 2025-12-04T12:25:43.1779376Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_int8 PASSED [0.0036s] [ 22%] 2025-12-04T12:25:43.1779489Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hsplit_cuda_uint8 PASSED [1.0278s] [ 22%] 2025-12-04T12:25:43.1779603Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hstack_cuda_bfloat16 PASSED [0.0057s] [ 22%] 2025-12-04T12:25:43.1779720Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hstack_cuda_complex64 PASSED [0.0041s] [ 22%] 2025-12-04T12:25:43.1779830Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hstack_cuda_int32 PASSED [0.0039s] [ 22%] 2025-12-04T12:25:43.1779945Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_hypot_cuda_float16 PASSED [0.0089s] [ 22%] 2025-12-04T12:25:43.1780057Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_i0_cuda_float64 PASSED [0.0046s] [ 22%] 2025-12-04T12:25:43.1780177Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_i0_cuda_int16 PASSED [1.0206s] [ 22%] 2025-12-04T12:25:43.1780286Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_i0_cuda_int32 PASSED [0.0051s] [ 22%] 2025-12-04T12:25:43.1780393Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_i0_cuda_int64 PASSED [0.0036s] [ 22%] 2025-12-04T12:25:43.1780510Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_igamma_cuda_float32 PASSED [0.0089s] [ 22%] 2025-12-04T12:25:43.1780624Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_imag_cuda_complex128 PASSED [0.0038s] [ 22%] 2025-12-04T12:25:43.1780742Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_imag_cuda_complex32 PASSED [0.0036s] [ 22%] 2025-12-04T12:25:43.1780860Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_bfloat16 PASSED [0.0103s] [ 22%] 2025-12-04T12:25:43.1780976Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_int16 PASSED [1.0320s] [ 22%] 2025-12-04T12:25:43.1781090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_add_cuda_int32 PASSED [0.0118s] [ 22%] 2025-12-04T12:25:43.1781207Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_bool PASSED [0.0048s] [ 22%] 2025-12-04T12:25:43.1781331Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_complex128 PASSED [0.0045s] [ 22%] 2025-12-04T12:25:43.1781453Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_float16 PASSED [0.0044s] [ 22%] 2025-12-04T12:25:43.1781566Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_copy_cuda_int8 PASSED [0.0043s] [ 22%] 2025-12-04T12:25:43.1781681Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_bool PASSED [0.0068s] [ 22%] 2025-12-04T12:25:43.1781801Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_complex32 PASSED [0.0066s] [ 22%] 2025-12-04T12:25:43.1781944Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_float16 PASSED [0.0065s] [ 22%] 2025-12-04T12:25:43.1782064Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_float32 PASSED [0.0065s] [ 22%] 2025-12-04T12:25:43.1782181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_int16 PASSED [0.0065s] [ 22%] 2025-12-04T12:25:43.1782297Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_fill_cuda_int64 PASSED [0.0066s] [ 22%] 2025-12-04T12:25:43.1782422Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_complex128 PASSED [0.0057s] [ 22%] 2025-12-04T12:25:43.1782542Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_complex64 PASSED [0.0055s] [ 22%] 2025-12-04T12:25:43.1782657Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_float16 PASSED [0.0054s] [ 22%] 2025-12-04T12:25:43.1782775Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_float32 PASSED [0.0054s] [ 22%] 2025-12-04T12:25:43.1782899Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_int16 PASSED [0.0053s] [ 22%] 2025-12-04T12:25:43.1783027Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_put_cuda_uint8 PASSED [0.0054s] [ 22%] 2025-12-04T12:25:43.1783155Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_float16 PASSED [0.0077s] [ 22%] 2025-12-04T12:25:43.1783283Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_float32 PASSED [0.0075s] [ 22%] 2025-12-04T12:25:43.1783409Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_int64 PASSED [0.0075s] [ 22%] 2025-12-04T12:25:43.1783535Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amax_cuda_int8 PASSED [0.0074s] [ 22%] 2025-12-04T12:25:43.1783659Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_amin_cuda_int64 PASSED [1.0268s] [ 22%] 2025-12-04T12:25:43.1783790Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_bfloat16 PASSED [0.0097s] [ 22%] 2025-12-04T12:25:43.1783931Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_float32 PASSED [0.0083s] [ 22%] 2025-12-04T12:25:43.1784059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_int16 PASSED [0.0082s] [ 22%] 2025-12-04T12:25:43.1784183Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_int32 PASSED [0.0080s] [ 22%] 2025-12-04T12:25:43.1784309Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_mean_cuda_int64 PASSED [0.0080s] [ 22%] 2025-12-04T12:25:43.1784432Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_prod_cuda_int32 PASSED [1.0316s] [ 22%] 2025-12-04T12:25:43.1784559Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_prod_cuda_int8 PASSED [0.0099s] [ 22%] 2025-12-04T12:25:43.1784685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_reduce_prod_cuda_uint8 PASSED [0.0078s] [ 22%] 2025-12-04T12:25:43.1784813Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_complex128 PASSED [0.0046s] [ 22%] 2025-12-04T12:25:43.1784939Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_complex32 PASSED [0.0042s] [ 22%] 2025-12-04T12:25:43.1785060Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_float32 PASSED [0.0041s] [ 22%] 2025-12-04T12:25:43.1785180Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_int16 PASSED [1.0148s] [ 22%] 2025-12-04T12:25:43.1785299Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_index_select_cuda_uint8 PASSED [0.0064s] [ 22%] 2025-12-04T12:25:43.1785414Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_inner_cuda_bfloat16 PASSED [0.0059s] [ 22%] 2025-12-04T12:25:43.1785527Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_inner_cuda_float16 PASSED [0.0052s] [ 22%] 2025-12-04T12:25:43.1785652Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_inner_cuda_float32 PASSED [0.0049s] [ 22%] 2025-12-04T12:25:43.1785760Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_bool PASSED [1.0201s] [ 22%] 2025-12-04T12:25:43.1785876Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_complex128 PASSED [0.0042s] [ 22%] 2025-12-04T12:25:43.1785989Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_complex64 PASSED [1.0136s] [ 22%] 2025-12-04T12:25:43.1786100Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_int_cuda_int64 PASSED [0.0041s] [ 22%] 2025-12-04T12:25:43.1786216Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_bfloat16 PASSED [0.0955s] [ 22%] 2025-12-04T12:25:43.1786336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_complex64 PASSED [0.0931s] [ 22%] 2025-12-04T12:25:43.1786449Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_float16 PASSED [0.0935s] [ 22%] 2025-12-04T12:25:43.1786565Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_float64 PASSED [0.0915s] [ 22%] 2025-12-04T12:25:43.1786689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_int16 PASSED [0.0866s] [ 22%] 2025-12-04T12:25:43.1786814Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isclose_cuda_int32 PASSED [0.0867s] [ 22%] 2025-12-04T12:25:43.1786933Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_complex32 PASSED [0.0174s] [ 22%] 2025-12-04T12:25:43.1787048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isfinite_cuda_int16 PASSED [0.0041s] [ 23%] 2025-12-04T12:25:43.1787160Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isin_cuda_float32 PASSED [1.0450s] [ 23%] 2025-12-04T12:25:43.1787273Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isin_cuda_float64 PASSED [0.0061s] [ 23%] 2025-12-04T12:25:43.1787385Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isin_cuda_int32 PASSED [0.0043s] [ 23%] 2025-12-04T12:25:43.1787500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isinf_cuda_complex32 PASSED [1.0340s] [ 23%] 2025-12-04T12:25:43.1787614Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isinf_cuda_float64 PASSED [0.0051s] [ 23%] 2025-12-04T12:25:43.1787733Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isinf_cuda_int8 PASSED [1.0000s] [ 23%] 2025-12-04T12:25:43.1787845Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_bool PASSED [0.0046s] [ 23%] 2025-12-04T12:25:43.1787960Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_complex128 PASSED [1.0129s] [ 23%] 2025-12-04T12:25:43.1788183Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_complex64 PASSED [0.0046s] [ 23%] 2025-12-04T12:25:43.1788296Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isnan_cuda_float64 PASSED [1.0074s] [ 23%] 2025-12-04T12:25:43.1788410Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isneginf_cuda_bool PASSED [0.0047s] [ 23%] 2025-12-04T12:25:43.1788526Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isneginf_cuda_float64 PASSED [0.9918s] [ 23%] 2025-12-04T12:25:43.1788640Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isneginf_cuda_int64 PASSED [0.0046s] [ 23%] 2025-12-04T12:25:43.1788758Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_bfloat16 PASSED [0.9948s] [ 23%] 2025-12-04T12:25:43.1788874Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_float32 PASSED [0.0048s] [ 23%] 2025-12-04T12:25:43.1788988Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_float64 PASSED [1.0096s] [ 23%] 2025-12-04T12:25:43.1789104Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_int32 PASSED [0.0046s] [ 23%] 2025-12-04T12:25:43.1789217Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_int64 PASSED [0.9858s] [ 23%] 2025-12-04T12:25:43.1789332Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_int8 PASSED [0.0045s] [ 23%] 2025-12-04T12:25:43.1789459Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isposinf_cuda_uint8 PASSED [0.9829s] [ 23%] 2025-12-04T12:25:43.1789573Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_bool PASSED [0.0056s] [ 23%] 2025-12-04T12:25:43.1789689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_complex32 PASSED [0.0060s] [ 23%] 2025-12-04T12:25:43.1789807Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_complex64 PASSED [0.0054s] [ 23%] 2025-12-04T12:25:43.1789922Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_float16 PASSED [0.0037s] [ 23%] 2025-12-04T12:25:43.1790033Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_float32 PASSED [0.9934s] [ 23%] 2025-12-04T12:25:43.1790145Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_int16 PASSED [0.0057s] [ 23%] 2025-12-04T12:25:43.1790256Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_int32 PASSED [0.0041s] [ 23%] 2025-12-04T12:25:43.1790372Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_isreal_cuda_uint8 PASSED [0.9987s] [ 23%] 2025-12-04T12:25:43.1790500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_istft_cuda_complex128 PASSED [0.8783s] [ 23%] 2025-12-04T12:25:43.1790628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_istft_cuda_complex64 PASSED [0.8917s] [ 23%] 2025-12-04T12:25:43.1790737Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_bool PASSED [1.0234s] [ 23%] 2025-12-04T12:25:43.1790856Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_item_cuda_complex128 PASSED [0.0058s] [ 23%] 2025-12-04T12:25:43.1790998Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_bfloat16 PASSED [0.3349s] [ 23%] 2025-12-04T12:25:43.1791144Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_complex64 PASSED [0.2770s] [ 23%] 2025-12-04T12:25:43.1791288Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_float16 PASSED [0.3011s] [ 23%] 2025-12-04T12:25:43.1791427Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_int8 PASSED [0.0056s] [ 23%] 2025-12-04T12:25:43.1791578Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_2inputs_2outputs_cuda_uint8 PASSED [0.2639s] [ 23%] 2025-12-04T12:25:43.1791727Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_4inputs_with_extra_args_cuda_int32 PASSED [0.2607s] [ 23%] 2025-12-04T12:25:43.1791857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_bfloat16 PASSED [0.0056s] [ 23%] 2025-12-04T12:25:43.1791983Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_bool PASSED [0.2609s] [ 23%] 2025-12-04T12:25:43.1792113Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_complex128 PASSED [0.3006s] [ 23%] 2025-12-04T12:25:43.1792247Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_complex64 PASSED [0.3083s] [ 23%] 2025-12-04T12:25:43.1792375Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_float64 PASSED [0.2604s] [ 23%] 2025-12-04T12:25:43.1792501Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_int16 PASSED [0.2951s] [ 23%] 2025-12-04T12:25:43.1792624Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_int32 PASSED [0.2610s] [ 23%] 2025-12-04T12:25:43.1792746Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_cuda_int8 PASSED [0.2534s] [ 23%] 2025-12-04T12:25:43.1792896Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_return_by_ref_cuda_bfloat16 PASSED [0.3130s] [ 23%] 2025-12-04T12:25:43.1793048Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_return_by_ref_cuda_complex128 PASSED [0.2750s] [ 23%] 2025-12-04T12:25:43.1793192Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_binary_return_by_ref_cuda_int64 PASSED [0.2603s] [ 23%] 2025-12-04T12:25:43.1793330Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_bfloat16 PASSED [1.0083s] [ 23%] 2025-12-04T12:25:43.1793457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_int32 PASSED [0.1133s] [ 23%] 2025-12-04T12:25:43.1793579Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_int8 PASSED [1.1158s] [ 23%] 2025-12-04T12:25:43.1793704Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_jiterator_unary_cuda_uint8 PASSED [0.1257s] [ 23%] 2025-12-04T12:25:43.1793815Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_int16 PASSED [0.0048s] [ 23%] 2025-12-04T12:25:43.1793927Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_int32 PASSED [0.0037s] [ 23%] 2025-12-04T12:25:43.1794036Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_int8 PASSED [0.9880s] [ 23%] 2025-12-04T12:25:43.1794147Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kron_cuda_uint8 PASSED [0.0056s] [ 23%] 2025-12-04T12:25:43.1794272Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_float32 PASSED [0.0204s] [ 23%] 2025-12-04T12:25:43.1794396Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_int32 PASSED [0.0059s] [ 23%] 2025-12-04T12:25:43.1794509Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_int64 PASSED [0.0055s] [ 23%] 2025-12-04T12:25:43.1794623Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_kthvalue_cuda_uint8 PASSED [0.9985s] [ 23%] 2025-12-04T12:25:43.1794734Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lcm_cuda_int32 PASSED [0.0179s] [ 23%] 2025-12-04T12:25:43.1794842Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lcm_cuda_uint8 PASSED [0.0134s] [ 23%] 2025-12-04T12:25:43.1794957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ldexp_cuda_bfloat16 PASSED [0.0134s] [ 23%] 2025-12-04T12:25:43.1795072Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ldexp_cuda_float16 PASSED [0.0132s] [ 23%] 2025-12-04T12:25:43.1795185Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ldexp_cuda_int32 PASSED [0.0119s] [ 23%] 2025-12-04T12:25:43.1795311Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ldexp_cuda_uint8 PASSED [0.0118s] [ 23%] 2025-12-04T12:25:43.1795424Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_bfloat16 PASSED [0.0086s] [ 23%] 2025-12-04T12:25:43.1795532Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_float16 PASSED [0.0085s] [ 23%] 2025-12-04T12:25:43.1795642Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_le_cuda_float64 PASSED [0.0084s] [ 23%] 2025-12-04T12:25:43.1795754Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lerp_cuda_bfloat16 PASSED [0.0123s] [ 23%] 2025-12-04T12:25:43.1795871Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lerp_cuda_complex32 PASSED [0.7616s] [ 23%] 2025-12-04T12:25:43.1795984Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lerp_cuda_float32 PASSED [0.0124s] [ 23%] 2025-12-04T12:25:43.1796097Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lgamma_cuda_bool PASSED [0.0040s] [ 23%] 2025-12-04T12:25:43.1796212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lgamma_cuda_float16 PASSED [0.9946s] [ 23%] 2025-12-04T12:25:43.1796342Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_cuda_complex128 PASSED [0.0198s] [ 23%] 2025-12-04T12:25:43.1796468Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_cuda_float32 PASSED [0.0170s] [ 23%] 2025-12-04T12:25:43.1796601Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_ex_cuda_complex64 PASSED [1.0057s] [ 23%] 2025-12-04T12:25:43.1796730Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cholesky_ex_cuda_float32 PASSED [0.0107s] [ 23%] 2025-12-04T12:25:43.1796854Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cond_cuda_complex64 PASSED [0.9980s] [ 23%] 2025-12-04T12:25:43.1796987Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cond_cuda_float64 PASSED [1.0178s] [ 23%] 2025-12-04T12:25:43.1797116Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_complex128 PASSED [0.0059s] [ 23%] 2025-12-04T12:25:43.1797241Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_float64 PASSED [0.0042s] [ 23%] 2025-12-04T12:25:43.1797360Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_cross_cuda_int64 PASSED [0.0040s] [ 23%] 2025-12-04T12:25:43.1797483Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_det_cuda_complex64 PASSED [0.1468s] [ 23%] 2025-12-04T12:25:43.1797602Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_det_cuda_float64 PASSED [0.0489s] [ 23%] 2025-12-04T12:25:43.1797729Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_float16 PASSED [0.9954s] [ 23%] 2025-12-04T12:25:43.1797857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_float64 PASSED [0.0084s] [ 23%] 2025-12-04T12:25:43.1797992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_int16 PASSED [0.0064s] [ 23%] 2025-12-04T12:25:43.1798198Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_int8 PASSED [0.0063s] [ 23%] 2025-12-04T12:25:43.1798323Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_diagonal_cuda_uint8 PASSED [0.0062s] [ 23%] 2025-12-04T12:25:43.1798440Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eig_cuda_float32 PASSED [0.0490s] [ 23%] 2025-12-04T12:25:43.1798566Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigh_cuda_complex128 PASSED [0.0305s] [ 23%] 2025-12-04T12:25:43.1798688Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigh_cuda_complex64 PASSED [0.0074s] [ 23%] 2025-12-04T12:25:43.1798817Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigvals_cuda_complex128 PASSED [0.0390s] [ 23%] 2025-12-04T12:25:43.1798948Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigvalsh_cuda_complex128 PASSED [0.0110s] [ 23%] 2025-12-04T12:25:43.1799097Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigvalsh_cuda_complex64 PASSED [0.0118s] [ 23%] 2025-12-04T12:25:43.1799223Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigvalsh_cuda_float32 PASSED [0.0318s] [ 23%] 2025-12-04T12:25:43.1799349Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_eigvalsh_cuda_float64 PASSED [0.0104s] [ 23%] 2025-12-04T12:25:43.1799591Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_householder_product_cuda_complex64 SKIPPED [0.0007s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 23%] 2025-12-04T12:25:43.1799818Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_householder_product_cuda_float64 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 23%] 2025-12-04T12:25:43.1799942Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_inv_cuda_float64 PASSED [0.0154s] [ 23%] 2025-12-04T12:25:43.1800070Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_inv_ex_cuda_complex128 PASSED [0.0831s] [ 23%] 2025-12-04T12:25:43.1800200Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_inv_ex_cuda_complex64 PASSED [1.0506s] [ 23%] 2025-12-04T12:25:43.1800332Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_factor_cuda_complex128 PASSED [1.1748s] [ 23%] 2025-12-04T12:25:43.1800461Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_factor_cuda_float32 PASSED [0.0069s] [ 23%] 2025-12-04T12:25:43.1800591Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_ldl_factor_ex_cuda_float32 PASSED [0.0043s] [ 23%] 2025-12-04T12:25:43.1800715Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lstsq_cuda_float64 PASSED [0.1123s] [ 23%] 2025-12-04T12:25:43.1800856Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lstsq_grad_oriented_cuda_float32 PASSED [1.1460s] [ 23%] 2025-12-04T12:25:43.1800996Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_cuda_complex64 PASSED [0.1440s] [ 23%] 2025-12-04T12:25:43.1801116Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_cuda_float64 PASSED [0.0179s] [ 23%] 2025-12-04T12:25:43.1801250Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_factor_cuda_complex128 PASSED [0.1114s] [ 23%] 2025-12-04T12:25:43.1801380Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_factor_cuda_complex64 PASSED [0.0424s] [ 23%] 2025-12-04T12:25:43.1801508Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_factor_cuda_float32 PASSED [0.0729s] [ 23%] 2025-12-04T12:25:43.1801640Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_factor_ex_cuda_complex64 PASSED [0.0169s] [ 23%] 2025-12-04T12:25:43.1801772Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_factor_ex_cuda_float64 PASSED [0.0159s] [ 23%] 2025-12-04T12:25:43.1801904Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_lu_solve_cuda_complex64 PASSED [0.0872s] [ 23%] 2025-12-04T12:25:43.1802047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_norm_cuda_bfloat16 PASSED [0.0521s] [ 23%] 2025-12-04T12:25:43.1802193Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_norm_cuda_float16 PASSED [0.0514s] [ 23%] 2025-12-04T12:25:43.1802322Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_norm_cuda_float64 PASSED [0.1036s] [ 23%] 2025-12-04T12:25:43.1802461Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_power_cuda_complex128 PASSED [0.1022s] [ 23%] 2025-12-04T12:25:43.1802591Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_rank_cuda_complex64 PASSED [0.1493s] [ 23%] 2025-12-04T12:25:43.1802742Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_rank_hermitian_cuda_complex128 PASSED [0.0194s] [ 23%] 2025-12-04T12:25:43.1802886Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_rank_hermitian_cuda_float32 PASSED [0.0191s] [ 23%] 2025-12-04T12:25:43.1803041Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_matrix_rank_hermitian_cuda_float64 PASSED [0.0187s] [ 23%] 2025-12-04T12:25:43.1803169Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_multi_dot_cuda_complex64 PASSED [0.0112s] [ 23%] 2025-12-04T12:25:43.1803298Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_norm_cuda_complex128 PASSED [0.1261s] [ 23%] 2025-12-04T12:25:43.1803417Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_norm_cuda_float16 PASSED [0.0845s] [ 23%] 2025-12-04T12:25:43.1803552Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_hermitian_cuda_float32 PASSED [0.0147s] [ 23%] 2025-12-04T12:25:43.1803767Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_pinv_singular_cuda_complex128 SKIPPED [0.0006s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 23%] 2025-12-04T12:25:43.1803892Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_qr_cuda_complex64 PASSED [0.0220s] [ 23%] 2025-12-04T12:25:43.1804012Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_qr_cuda_float64 PASSED [0.0208s] [ 23%] 2025-12-04T12:25:43.1804138Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_cuda_complex128 PASSED [0.0446s] [ 23%] 2025-12-04T12:25:43.1804262Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_cuda_float64 PASSED [0.0437s] [ 23%] 2025-12-04T12:25:43.1804404Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_solve_triangular_cuda_complex128 PASSED [0.2789s] [ 23%] 2025-12-04T12:25:43.1804527Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_svdvals_cuda_float64 PASSED [0.0301s] [ 23%] 2025-12-04T12:25:43.1804654Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_tensorinv_cuda_float32 PASSED [0.0074s] [ 24%] 2025-12-04T12:25:43.1804791Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_complex64 PASSED [0.0251s] [ 24%] 2025-12-04T12:25:43.1804912Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_float64 PASSED [0.0245s] [ 24%] 2025-12-04T12:25:43.1805037Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vander_cuda_uint8 PASSED [0.0246s] [ 24%] 2025-12-04T12:25:43.1805162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vecdot_cuda_float32 PASSED [0.0430s] [ 24%] 2025-12-04T12:25:43.1805297Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vector_norm_cuda_bfloat16 PASSED [0.1339s] [ 24%] 2025-12-04T12:25:43.1805425Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vector_norm_cuda_float16 PASSED [0.1326s] [ 24%] 2025-12-04T12:25:43.1805558Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vector_norm_cuda_float32 PASSED [0.1254s] [ 24%] 2025-12-04T12:25:43.1805685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linalg_vector_norm_cuda_float64 PASSED [0.1270s] [ 24%] 2025-12-04T12:25:43.1805816Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_bfloat16 PASSED [0.0212s] [ 24%] 2025-12-04T12:25:43.1805944Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_float16 PASSED [0.0205s] [ 24%] 2025-12-04T12:25:43.1806062Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_float32 PASSED [0.0204s] [ 24%] 2025-12-04T12:25:43.1806177Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_cuda_int8 PASSED [0.0204s] [ 24%] 2025-12-04T12:25:43.1806320Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_tensor_overload_cuda_float64 PASSED [0.1071s] [ 24%] 2025-12-04T12:25:43.1806458Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_linspace_tensor_overload_cuda_int8 PASSED [0.1063s] [ 24%] 2025-12-04T12:25:43.1806569Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_bool PASSED [0.0045s] [ 24%] 2025-12-04T12:25:43.1806688Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_complex128 PASSED [1.0336s] [ 24%] 2025-12-04T12:25:43.1806803Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_complex64 PASSED [0.1398s] [ 24%] 2025-12-04T12:25:43.1806929Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_float32 PASSED [0.0042s] [ 24%] 2025-12-04T12:25:43.1807042Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_float64 PASSED [1.0416s] [ 24%] 2025-12-04T12:25:43.1807155Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log10_cuda_uint8 PASSED [0.0061s] [ 24%] 2025-12-04T12:25:43.1807269Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_bfloat16 PASSED [0.0033s] [ 24%] 2025-12-04T12:25:43.1807382Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_bool PASSED [1.0280s] [ 24%] 2025-12-04T12:25:43.1807497Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_complex128 PASSED [0.0044s] [ 24%] 2025-12-04T12:25:43.1807611Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_int16 PASSED [1.0534s] [ 24%] 2025-12-04T12:25:43.1807721Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log1p_cuda_int64 PASSED [0.0045s] [ 24%] 2025-12-04T12:25:43.1807837Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_bfloat16 PASSED [0.0043s] [ 24%] 2025-12-04T12:25:43.1807948Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_float32 PASSED [1.0453s] [ 24%] 2025-12-04T12:25:43.1808059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log2_cuda_uint8 PASSED [0.0059s] [ 24%] 2025-12-04T12:25:43.1808287Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_bfloat16 PASSED [0.0042s] [ 24%] 2025-12-04T12:25:43.1808395Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_float16 PASSED [1.0334s] [ 24%] 2025-12-04T12:25:43.1808503Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_float64 PASSED [0.0056s] [ 24%] 2025-12-04T12:25:43.1808632Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_int32 PASSED [0.0041s] [ 24%] 2025-12-04T12:25:43.1808741Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_cuda_int64 PASSED [1.0472s] [ 24%] 2025-12-04T12:25:43.1808866Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_normal_cuda_bfloat16 PASSED [0.0070s] [ 24%] 2025-12-04T12:25:43.1808990Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_normal_cuda_float16 PASSED [0.0048s] [ 24%] 2025-12-04T12:25:43.1809108Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_normal_cuda_float32 PASSED [0.0045s] [ 24%] 2025-12-04T12:25:43.1809230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_cuda_bfloat16 PASSED [0.0094s] [ 24%] 2025-12-04T12:25:43.1809348Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_cuda_float32 PASSED [0.0089s] [ 24%] 2025-12-04T12:25:43.1809470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_cuda_float64 PASSED [0.0089s] [ 24%] 2025-12-04T12:25:43.1809608Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_bfloat16 PASSED [0.0091s] [ 24%] 2025-12-04T12:25:43.1809764Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_complex128 PASSED [1.0507s] [ 24%] 2025-12-04T12:25:43.1809920Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_complex32 PASSED [1.0456s] [ 24%] 2025-12-04T12:25:43.1810060Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_float32 PASSED [1.0384s] [ 24%] 2025-12-04T12:25:43.1810194Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_float64 PASSED [1.0458s] [ 24%] 2025-12-04T12:25:43.1810326Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_log_softmax_with_dtype_cuda_uint8 PASSED [1.0500s] [ 24%] 2025-12-04T12:25:43.1810450Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logaddexp_cuda_complex128 PASSED [0.5429s] [ 24%] 2025-12-04T12:25:43.1810579Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logcumsumexp_cuda_bfloat16 PASSED [0.0175s] [ 24%] 2025-12-04T12:25:43.1810709Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logcumsumexp_cuda_complex128 PASSED [0.0054s] [ 24%] 2025-12-04T12:25:43.1810850Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logcumsumexp_cuda_complex64 PASSED [0.0051s] [ 24%] 2025-12-04T12:25:43.1810972Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logcumsumexp_cuda_float32 PASSED [0.0050s] [ 24%] 2025-12-04T12:25:43.1811091Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logdet_cuda_complex128 PASSED [0.1819s] [ 24%] 2025-12-04T12:25:43.1811209Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_bool PASSED [0.0096s] [ 24%] 2025-12-04T12:25:43.1811329Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_float16 PASSED [0.0111s] [ 24%] 2025-12-04T12:25:43.1811447Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_int32 PASSED [0.0109s] [ 24%] 2025-12-04T12:25:43.1811562Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_and_cuda_int8 PASSED [0.0109s] [ 24%] 2025-12-04T12:25:43.1811686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_bfloat16 PASSED [1.0416s] [ 24%] 2025-12-04T12:25:43.1811804Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_float16 PASSED [0.0063s] [ 24%] 2025-12-04T12:25:43.1811926Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_float64 PASSED [0.0043s] [ 24%] 2025-12-04T12:25:43.1812041Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_int16 PASSED [0.0041s] [ 24%] 2025-12-04T12:25:43.1812159Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_not_cuda_int8 PASSED [1.0369s] [ 24%] 2025-12-04T12:25:43.1812279Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_complex64 PASSED [0.4018s] [ 24%] 2025-12-04T12:25:43.1812398Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_or_cuda_int16 PASSED [0.0112s] [ 24%] 2025-12-04T12:25:43.1812527Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_bool PASSED [0.0093s] [ 24%] 2025-12-04T12:25:43.1812656Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_complex128 PASSED [0.0134s] [ 24%] 2025-12-04T12:25:43.1812780Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_complex64 PASSED [0.0130s] [ 24%] 2025-12-04T12:25:43.1812904Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_float32 PASSED [0.0109s] [ 24%] 2025-12-04T12:25:43.1813023Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_float64 PASSED [0.0109s] [ 24%] 2025-12-04T12:25:43.1813142Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_int16 PASSED [0.0108s] [ 24%] 2025-12-04T12:25:43.1813258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logical_xor_cuda_int32 PASSED [0.0109s] [ 24%] 2025-12-04T12:25:43.1813374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_bfloat16 PASSED [0.0074s] [ 24%] 2025-12-04T12:25:43.1813493Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_bool PASSED [0.0071s] [ 24%] 2025-12-04T12:25:43.1813618Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_float32 PASSED [0.0070s] [ 24%] 2025-12-04T12:25:43.1813730Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_int32 PASSED [0.0071s] [ 24%] 2025-12-04T12:25:43.1813840Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logit_cuda_int64 PASSED [0.0071s] [ 24%] 2025-12-04T12:25:43.1813952Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_int16 PASSED [0.1117s] [ 24%] 2025-12-04T12:25:43.1814064Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_int32 PASSED [0.1116s] [ 24%] 2025-12-04T12:25:43.1814177Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_cuda_uint8 PASSED [0.0376s] [ 24%] 2025-12-04T12:25:43.1814317Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_bfloat16 PASSED [0.6993s] [ 24%] 2025-12-04T12:25:43.1814464Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_complex128 PASSED [0.6917s] [ 24%] 2025-12-04T12:25:43.1814612Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_float32 PASSED [0.6933s] [ 24%] 2025-12-04T12:25:43.1814751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_int32 PASSED [0.6536s] [ 24%] 2025-12-04T12:25:43.1814887Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logspace_tensor_overload_cuda_uint8 PASSED [0.2120s] [ 24%] 2025-12-04T12:25:43.1815010Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_bfloat16 PASSED [0.0102s] [ 24%] 2025-12-04T12:25:43.1815127Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_float64 PASSED [0.0229s] [ 24%] 2025-12-04T12:25:43.1815245Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_int32 PASSED [1.0585s] [ 24%] 2025-12-04T12:25:43.1815362Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_int64 PASSED [0.0115s] [ 24%] 2025-12-04T12:25:43.1815480Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_logsumexp_cuda_int8 PASSED [0.0095s] [ 24%] 2025-12-04T12:25:43.1815592Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_bfloat16 PASSED [0.0029s] [ 24%] 2025-12-04T12:25:43.1815702Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_bool PASSED [1.0411s] [ 24%] 2025-12-04T12:25:43.1815815Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_complex64 PASSED [0.0042s] [ 24%] 2025-12-04T12:25:43.1815925Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_long_cuda_int64 PASSED [1.0392s] [ 24%] 2025-12-04T12:25:43.1816031Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lt_cuda_bool PASSED [0.0107s] [ 24%] 2025-12-04T12:25:43.1816142Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lt_cuda_float16 PASSED [0.0111s] [ 24%] 2025-12-04T12:25:43.1816261Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lt_cuda_int64 PASSED [0.0107s] [ 24%] 2025-12-04T12:25:43.1816374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_cuda_float64 PASSED [0.0267s] [ 24%] 2025-12-04T12:25:43.1816495Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_lu_solve_cuda_complex128 PASSED [0.0282s] [ 24%] 2025-12-04T12:25:43.1816605Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_bfloat16 PASSED [1.0380s] [ 24%] 2025-12-04T12:25:43.1816714Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_bool PASSED [0.0056s] [ 24%] 2025-12-04T12:25:43.1816826Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_complex128 PASSED [0.0051s] [ 24%] 2025-12-04T12:25:43.1816936Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_float16 PASSED [1.0214s] [ 24%] 2025-12-04T12:25:43.1817042Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mH_cuda_int64 PASSED [0.0057s] [ 24%] 2025-12-04T12:25:43.1817152Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_bool PASSED [0.0039s] [ 24%] 2025-12-04T12:25:43.1817276Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_complex32 PASSED [1.0289s] [ 24%] 2025-12-04T12:25:43.1817393Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_int32 PASSED [0.0055s] [ 24%] 2025-12-04T12:25:43.1817500Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_int64 PASSED [0.0039s] [ 24%] 2025-12-04T12:25:43.1817608Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mT_cuda_uint8 PASSED [1.0366s] [ 24%] 2025-12-04T12:25:43.1817732Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amax_cuda_bfloat16 PASSED [0.1441s] [ 24%] 2025-12-04T12:25:43.1817854Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amax_cuda_float64 PASSED [0.1349s] [ 24%] 2025-12-04T12:25:43.1817974Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_float16 PASSED [0.1409s] [ 24%] 2025-12-04T12:25:43.1818144Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_float32 PASSED [0.1352s] [ 24%] 2025-12-04T12:25:43.1818277Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_float64 PASSED [0.1351s] [ 24%] 2025-12-04T12:25:43.1818396Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_int16 PASSED [0.1145s] [ 24%] 2025-12-04T12:25:43.1818512Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_int32 PASSED [0.1149s] [ 24%] 2025-12-04T12:25:43.1818631Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_int64 PASSED [0.1146s] [ 24%] 2025-12-04T12:25:43.1818749Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_amin_cuda_uint8 PASSED [0.1150s] [ 24%] 2025-12-04T12:25:43.1818870Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_float16 PASSED [0.0802s] [ 24%] 2025-12-04T12:25:43.1818997Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_float32 PASSED [0.0812s] [ 24%] 2025-12-04T12:25:43.1819116Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_int16 PASSED [0.0684s] [ 24%] 2025-12-04T12:25:43.1819241Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmax_cuda_int8 PASSED [0.0676s] [ 24%] 2025-12-04T12:25:43.1819363Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_float32 PASSED [0.0798s] [ 24%] 2025-12-04T12:25:43.1819488Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_int16 PASSED [0.0676s] [ 24%] 2025-12-04T12:25:43.1819606Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_argmin_cuda_int32 PASSED [0.0677s] [ 24%] 2025-12-04T12:25:43.1819732Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_float32 PASSED [0.0380s] [ 24%] 2025-12-04T12:25:43.1819855Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_cumprod_cuda_int16 PASSED [0.0375s] [ 24%] 2025-12-04T12:25:43.1819989Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_bool PASSED [0.0100s] [ 24%] 2025-12-04T12:25:43.1820115Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_complex128 PASSED [0.0099s] [ 24%] 2025-12-04T12:25:43.1820239Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_complex32 PASSED [1.0682s] [ 24%] 2025-12-04T12:25:43.1820361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_complex64 PASSED [0.0124s] [ 24%] 2025-12-04T12:25:43.1820482Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_float16 PASSED [0.0101s] [ 24%] 2025-12-04T12:25:43.1820598Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_fill_cuda_int32 PASSED [0.0097s] [ 24%] 2025-12-04T12:25:43.1820729Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_log_softmax_cuda_float32 PASSED [0.0464s] [ 24%] 2025-12-04T12:25:43.1820855Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logaddexp_cuda_float16 PASSED [0.0480s] [ 24%] 2025-12-04T12:25:43.1820996Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logaddexp_cuda_float32 PASSED [0.0475s] [ 24%] 2025-12-04T12:25:43.1821136Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logaddexp_cuda_float64 PASSED [0.0476s] [ 24%] 2025-12-04T12:25:43.1821267Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_complex128 PASSED [0.1897s] [ 24%] 2025-12-04T12:25:43.1821398Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_complex64 PASSED [0.1897s] [ 24%] 2025-12-04T12:25:43.1821523Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_float16 PASSED [0.2205s] [ 25%] 2025-12-04T12:25:43.1821647Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_logsumexp_cuda_int64 PASSED [0.1921s] [ 25%] 2025-12-04T12:25:43.1821766Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_mean_cuda_float16 PASSED [0.2379s] [ 25%] 2025-12-04T12:25:43.1821890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_median_cuda_float16 PASSED [0.0261s] [ 25%] 2025-12-04T12:25:43.1822022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_median_cuda_float32 PASSED [0.0258s] [ 25%] 2025-12-04T12:25:43.1822144Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_norm_cuda_bfloat16 PASSED [0.8021s] [ 25%] 2025-12-04T12:25:43.1822263Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_norm_cuda_float64 PASSED [0.7701s] [ 25%] 2025-12-04T12:25:43.1822392Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_normalize_cuda_bfloat16 PASSED [0.0736s] [ 25%] 2025-12-04T12:25:43.1822519Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_normalize_cuda_float16 PASSED [0.0731s] [ 25%] 2025-12-04T12:25:43.1822636Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_bool PASSED [0.1401s] [ 25%] 2025-12-04T12:25:43.1822758Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_complex128 PASSED [0.1467s] [ 25%] 2025-12-04T12:25:43.1822881Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_float16 PASSED [0.4996s] [ 25%] 2025-12-04T12:25:43.1823001Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_float32 PASSED [0.1616s] [ 25%] 2025-12-04T12:25:43.1823122Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_prod_cuda_float64 PASSED [0.4439s] [ 25%] 2025-12-04T12:25:43.1823248Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_complex64 PASSED [0.0061s] [ 25%] 2025-12-04T12:25:43.1823373Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_float64 PASSED [0.0055s] [ 25%] 2025-12-04T12:25:43.1823494Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_scatter_cuda_uint8 PASSED [0.0054s] [ 25%] 2025-12-04T12:25:43.1823617Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_bfloat16 PASSED [0.0069s] [ 25%] 2025-12-04T12:25:43.1823753Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_complex64 PASSED [0.0067s] [ 25%] 2025-12-04T12:25:43.1823874Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_int32 PASSED [0.0067s] [ 25%] 2025-12-04T12:25:43.1823995Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_select_cuda_int64 PASSED [0.0067s] [ 25%] 2025-12-04T12:25:43.1824117Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_softmax_cuda_float32 PASSED [0.0361s] [ 25%] 2025-12-04T12:25:43.1824241Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_softmin_cuda_float16 PASSED [0.0445s] [ 25%] 2025-12-04T12:25:43.1824359Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_bfloat16 PASSED [0.4046s] [ 25%] 2025-12-04T12:25:43.1824480Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_complex64 PASSED [0.4486s] [ 25%] 2025-12-04T12:25:43.1824595Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_int16 PASSED [0.3963s] [ 25%] 2025-12-04T12:25:43.1824726Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_int32 PASSED [0.3984s] [ 25%] 2025-12-04T12:25:43.1824850Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_std_cuda_int8 PASSED [0.3999s] [ 25%] 2025-12-04T12:25:43.1824974Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_complex128 PASSED [0.1163s] [ 25%] 2025-12-04T12:25:43.1825093Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_float64 PASSED [0.1365s] [ 25%] 2025-12-04T12:25:43.1825209Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_int32 PASSED [0.1192s] [ 25%] 2025-12-04T12:25:43.1825323Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_int8 PASSED [0.1188s] [ 25%] 2025-12-04T12:25:43.1825438Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_sum_cuda_uint8 PASSED [0.1177s] [ 25%] 2025-12-04T12:25:43.1825553Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_int64 PASSED [0.3789s] [ 25%] 2025-12-04T12:25:43.1825668Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_masked_var_cuda_int8 PASSED [0.3786s] [ 25%] 2025-12-04T12:25:43.1825799Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matrix_exp_cuda_float32 PASSED [1.0609s] [ 25%] 2025-12-04T12:25:43.1825918Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_matrix_exp_cuda_float64 PASSED [0.0078s] [ 25%] 2025-12-04T12:25:43.1826031Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_bool PASSED [0.0090s] [ 25%] 2025-12-04T12:25:43.1826143Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_int32 PASSED [0.0086s] [ 25%] 2025-12-04T12:25:43.1826257Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_int8 PASSED [0.0086s] [ 25%] 2025-12-04T12:25:43.1826369Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_binary_cuda_uint8 PASSED [0.0085s] [ 25%] 2025-12-04T12:25:43.1826521Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_pool2d_with_indices_backward_cuda_bfloat16 PASSED [2.1979s] [ 25%] 2025-12-04T12:25:43.1826652Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_float64 PASSED [0.9718s] [ 25%] 2025-12-04T12:25:43.1826784Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_int32 PASSED [0.0048s] [ 25%] 2025-12-04T12:25:43.1826911Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_int64 PASSED [0.0034s] [ 25%] 2025-12-04T12:25:43.1827039Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_no_dim_cuda_int8 PASSED [0.9701s] [ 25%] 2025-12-04T12:25:43.1827173Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_float64 PASSED [0.0056s] [ 25%] 2025-12-04T12:25:43.1827305Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_max_reduction_with_dim_cuda_int64 PASSED [0.0039s] [ 25%] 2025-12-04T12:25:43.1827427Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_maximum_cuda_bool PASSED [0.0089s] [ 25%] 2025-12-04T12:25:43.1827544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_maximum_cuda_float16 PASSED [0.0087s] [ 25%] 2025-12-04T12:25:43.1827660Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_maximum_cuda_int32 PASSED [0.0085s] [ 25%] 2025-12-04T12:25:43.1827773Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_maximum_cuda_uint8 PASSED [0.0085s] [ 25%] 2025-12-04T12:25:43.1827885Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_median_cuda_float32 PASSED [0.0100s] [ 25%] 2025-12-04T12:25:43.1827997Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_median_cuda_int16 PASSED [0.0097s] [ 25%] 2025-12-04T12:25:43.1828161Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_median_cuda_uint8 PASSED [0.0097s] [ 25%] 2025-12-04T12:25:43.1828299Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_bfloat16 PASSED [0.0106s] [ 25%] 2025-12-04T12:25:43.1828442Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_complex128 PASSED [0.0105s] [ 25%] 2025-12-04T12:25:43.1828593Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_list_of_tensors_cuda_uint8 PASSED [0.0102s] [ 25%] 2025-12-04T12:25:43.1828747Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_bfloat16 PASSED [0.0103s] [ 25%] 2025-12-04T12:25:43.1828888Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_complex64 PASSED [0.0103s] [ 25%] 2025-12-04T12:25:43.1829029Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_float32 PASSED [0.0103s] [ 25%] 2025-12-04T12:25:43.1829164Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_meshgrid_variadic_tensors_cuda_int32 PASSED [0.0103s] [ 25%] 2025-12-04T12:25:43.1829284Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_binary_cuda_bfloat16 PASSED [0.0088s] [ 25%] 2025-12-04T12:25:43.1829399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_binary_cuda_bool PASSED [0.0085s] [ 25%] 2025-12-04T12:25:43.1829515Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_binary_cuda_int32 PASSED [0.0086s] [ 25%] 2025-12-04T12:25:43.1829641Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_binary_cuda_int64 PASSED [0.0085s] [ 25%] 2025-12-04T12:25:43.1829770Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_bool PASSED [0.9603s] [ 25%] 2025-12-04T12:25:43.1829900Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_float16 PASSED [0.0073s] [ 25%] 2025-12-04T12:25:43.1830032Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_float32 PASSED [0.0057s] [ 25%] 2025-12-04T12:25:43.1830159Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_int32 PASSED [0.9592s] [ 25%] 2025-12-04T12:25:43.1830287Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_no_dim_cuda_int8 PASSED [0.0047s] [ 25%] 2025-12-04T12:25:43.1830424Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_float32 PASSED [0.0039s] [ 25%] 2025-12-04T12:25:43.1830555Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_int16 PASSED [1.0403s] [ 25%] 2025-12-04T12:25:43.1830686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_min_reduction_with_dim_cuda_uint8 PASSED [0.0058s] [ 25%] 2025-12-04T12:25:43.1830800Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_float16 PASSED [0.0096s] [ 25%] 2025-12-04T12:25:43.1830915Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_minimum_cuda_float64 PASSED [0.0086s] [ 25%] 2025-12-04T12:25:43.1831027Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_bfloat16 PASSED [0.2091s] [ 25%] 2025-12-04T12:25:43.1831136Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_bool PASSED [0.0051s] [ 25%] 2025-12-04T12:25:43.1831258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_float16 PASSED [1.0573s] [ 25%] 2025-12-04T12:25:43.1831370Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_float32 PASSED [0.0068s] [ 25%] 2025-12-04T12:25:43.1831481Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mode_cuda_int64 PASSED [0.0050s] [ 25%] 2025-12-04T12:25:43.1831597Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_bfloat16 PASSED [1.0428s] [ 25%] 2025-12-04T12:25:43.1831707Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_bool PASSED [0.0046s] [ 25%] 2025-12-04T12:25:43.1831826Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_complex32 PASSED [1.0474s] [ 25%] 2025-12-04T12:25:43.1831938Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_movedim_cuda_float64 PASSED [0.0045s] [ 25%] 2025-12-04T12:25:43.1832050Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_float16 PASSED [0.0044s] [ 25%] 2025-12-04T12:25:43.1832162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_float32 PASSED [1.0393s] [ 25%] 2025-12-04T12:25:43.1832284Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_float64 PASSED [0.0059s] [ 25%] 2025-12-04T12:25:43.1832405Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_int16 PASSED [0.0043s] [ 25%] 2025-12-04T12:25:43.1832516Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_int64 PASSED [1.0448s] [ 25%] 2025-12-04T12:25:43.1832625Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_msort_cuda_int8 PASSED [0.0057s] [ 25%] 2025-12-04T12:25:43.1832735Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_bfloat16 PASSED [0.0092s] [ 25%] 2025-12-04T12:25:43.1832841Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_bool PASSED [0.0085s] [ 25%] 2025-12-04T12:25:43.1832953Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_complex32 PASSED [0.3338s] [ 25%] 2025-12-04T12:25:43.1833065Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_float32 PASSED [0.0090s] [ 25%] 2025-12-04T12:25:43.1833173Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_float64 PASSED [0.0086s] [ 25%] 2025-12-04T12:25:43.1833293Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_int64 PASSED [0.0086s] [ 25%] 2025-12-04T12:25:43.1833399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mul_cuda_int8 PASSED [0.0085s] [ 25%] 2025-12-04T12:25:43.1833522Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_multinomial_cuda_float16 PASSED [1.0668s] [ 25%] 2025-12-04T12:25:43.1833630Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mv_cuda_float32 PASSED [0.0052s] [ 25%] 2025-12-04T12:25:43.1833768Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float16 PASSED [0.0149s] [ 25%] 2025-12-04T12:25:43.1833900Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float64 PASSED [0.0140s] [ 25%] 2025-12-04T12:25:43.1834033Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_uint8 PASSED [1.0673s] [ 25%] 2025-12-04T12:25:43.1834169Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_float16 PASSED [0.0165s] [ 25%] 2025-12-04T12:25:43.1834304Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_float32 PASSED [0.0147s] [ 25%] 2025-12-04T12:25:43.1834435Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int32 PASSED [0.0148s] [ 25%] 2025-12-04T12:25:43.1834564Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int8 PASSED [0.0141s] [ 25%] 2025-12-04T12:25:43.1834692Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int64 PASSED [0.0140s] [ 25%] 2025-12-04T12:25:43.1834823Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_uint8 PASSED [0.0139s] [ 25%] 2025-12-04T12:25:43.1834951Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nan_to_num_cuda_bool PASSED [1.3482s] [ 25%] 2025-12-04T12:25:43.1835069Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nan_to_num_cuda_int16 PASSED [0.0061s] [ 25%] 2025-12-04T12:25:43.1835185Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmean_cuda_bfloat16 PASSED [0.0714s] [ 25%] 2025-12-04T12:25:43.1835299Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmean_cuda_float64 PASSED [0.0692s] [ 25%] 2025-12-04T12:25:43.1835417Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_float16 PASSED [0.0104s] [ 25%] 2025-12-04T12:25:43.1835533Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_float32 PASSED [0.0101s] [ 25%] 2025-12-04T12:25:43.1835651Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_float64 PASSED [0.0100s] [ 25%] 2025-12-04T12:25:43.1835764Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanmedian_cuda_int64 PASSED [0.0100s] [ 25%] 2025-12-04T12:25:43.1835887Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nanquantile_cuda_float32 PASSED [0.4089s] [ 25%] 2025-12-04T12:25:43.1836009Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_complex32 PASSED [1.4963s] [ 25%] 2025-12-04T12:25:43.1836133Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_int32 PASSED [0.0165s] [ 25%] 2025-12-04T12:25:43.1838627Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_int8 PASSED [1.4721s] [ 25%] 2025-12-04T12:25:43.1838746Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nansum_cuda_uint8 PASSED [0.0164s] [ 25%] 2025-12-04T12:25:43.1838864Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_copy_cuda_bool XFAIL [0.0035s] [ 25%] 2025-12-04T12:25:43.1838977Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_complex32 PASSED [1.4215s] [ 25%] 2025-12-04T12:25:43.1839090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_float32 PASSED [1.4366s] [ 25%] 2025-12-04T12:25:43.1839203Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_float64 PASSED [0.0077s] [ 25%] 2025-12-04T12:25:43.1839316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_int16 PASSED [0.0057s] [ 25%] 2025-12-04T12:25:43.1839454Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_int32 PASSED [0.0055s] [ 25%] 2025-12-04T12:25:43.1839564Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_narrow_cuda_int64 PASSED [0.0054s] [ 25%] 2025-12-04T12:25:43.1839693Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_batch_norm_cuda_bfloat16 PASSED [0.0143s] [ 25%] 2025-12-04T12:25:43.1839823Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_batch_norm_cuda_float16 PASSED [0.0135s] [ 25%] 2025-12-04T12:25:43.1839950Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_batch_norm_cuda_float64 PASSED [0.0133s] [ 25%] 2025-12-04T12:25:43.1840090Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_dropout_backward_cuda_bfloat16 PASSED [0.0221s] [ 25%] 2025-12-04T12:25:43.1840228Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_dropout_backward_cuda_float16 PASSED [0.0100s] [ 25%] 2025-12-04T12:25:43.1840361Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_layer_norm_cuda_float32 PASSED [0.0397s] [ 25%] 2025-12-04T12:25:43.1840487Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_native_layer_norm_cuda_float64 PASSED [0.0328s] [ 25%] 2025-12-04T12:25:43.1840599Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_float32 PASSED [0.0087s] [ 25%] 2025-12-04T12:25:43.1840708Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ne_cuda_int64 PASSED [0.0085s] [ 26%] 2025-12-04T12:25:43.1840822Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_complex64 PASSED [1.4185s] [ 26%] 2025-12-04T12:25:43.1840935Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_float16 PASSED [0.0045s] [ 26%] 2025-12-04T12:25:43.1841068Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_int64 PASSED [1.4220s] [ 26%] 2025-12-04T12:25:43.1841179Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_neg_cuda_uint8 PASSED [0.0045s] [ 26%] 2025-12-04T12:25:43.1841300Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_complex64 PASSED [0.0066s] [ 26%] 2025-12-04T12:25:43.1841418Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_float16 PASSED [0.0061s] [ 26%] 2025-12-04T12:25:43.1841533Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_float32 PASSED [0.0061s] [ 26%] 2025-12-04T12:25:43.1841647Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_cuda_int8 PASSED [0.0061s] [ 26%] 2025-12-04T12:25:43.1841778Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_complex32 PASSED [0.0064s] [ 26%] 2025-12-04T12:25:43.1841911Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_complex64 PASSED [0.0062s] [ 26%] 2025-12-04T12:25:43.1842038Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_empty_strided_cuda_float32 PASSED [0.0062s] [ 26%] 2025-12-04T12:25:43.1842174Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_complex64 PASSED [0.0065s] [ 26%] 2025-12-04T12:25:43.1842302Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_float32 PASSED [0.0063s] [ 26%] 2025-12-04T12:25:43.1842419Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_float64 PASSED [0.0064s] [ 26%] 2025-12-04T12:25:43.1842532Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_full_cuda_int8 PASSED [0.0065s] [ 26%] 2025-12-04T12:25:43.1842643Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_bool PASSED [0.0062s] [ 26%] 2025-12-04T12:25:43.1842756Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_float16 PASSED [0.0062s] [ 26%] 2025-12-04T12:25:43.1842869Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_float32 PASSED [0.0062s] [ 26%] 2025-12-04T12:25:43.1842981Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_ones_cuda_float64 PASSED [0.0062s] [ 26%] 2025-12-04T12:25:43.1843115Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_complex128 PASSED [0.0063s] [ 26%] 2025-12-04T12:25:43.1843231Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_float16 PASSED [0.0062s] [ 26%] 2025-12-04T12:25:43.1843346Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_float32 PASSED [0.0062s] [ 26%] 2025-12-04T12:25:43.1843460Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_int16 PASSED [0.0062s] [ 26%] 2025-12-04T12:25:43.1843573Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_int32 PASSED [0.0062s] [ 26%] 2025-12-04T12:25:43.1843686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_new_zeros_cuda_int64 PASSED [0.0062s] [ 26%] 2025-12-04T12:25:43.1843803Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nextafter_cuda_bfloat16 PASSED [0.0087s] [ 26%] 2025-12-04T12:25:43.1843957Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float16 PASSED [1.4571s] [ 26%] 2025-12-04T12:25:43.1844107Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float64 PASSED [0.0151s] [ 26%] 2025-12-04T12:25:43.1844257Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float16 PASSED [0.0175s] [ 26%] 2025-12-04T12:25:43.1844405Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_avg_pool3d_cuda_float32 PASSED [1.4344s] [ 26%] 2025-12-04T12:25:43.1844553Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float16 PASSED [0.0195s] [ 26%] 2025-12-04T12:25:43.1844699Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float32 PASSED [1.4198s] [ 26%] 2025-12-04T12:25:43.1844856Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float16 PASSED [0.0226s] [ 26%] 2025-12-04T12:25:43.1845004Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float64 PASSED [1.4349s] [ 26%] 2025-12-04T12:25:43.1845149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_alpha_dropout_cuda_float64 PASSED [0.0183s] [ 26%] 2025-12-04T12:25:43.1845286Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool1d_cuda_float16 PASSED [0.0168s] [ 26%] 2025-12-04T12:25:43.1845421Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool2d_cuda_float16 PASSED [0.0058s] [ 26%] 2025-12-04T12:25:43.1845558Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_avg_pool3d_cuda_float32 PASSED [0.0119s] [ 26%] 2025-12-04T12:25:43.1845695Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_batch_norm_cuda_bfloat16 PASSED [0.0248s] [ 26%] 2025-12-04T12:25:43.1845855Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float32 PASSED [0.0243s] [ 26%] 2025-12-04T12:25:43.1846005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_bilinear_cuda_bfloat16 PASSED [0.0401s] [ 26%] 2025-12-04T12:25:43.1846149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_bilinear_cuda_float16 PASSED [0.0395s] [ 26%] 2025-12-04T12:25:43.1846282Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_bilinear_cuda_float32 PASSED [0.0387s] [ 26%] 2025-12-04T12:25:43.1846450Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float32 PASSED [0.0609s] [ 26%] 2025-12-04T12:25:43.1846614Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float64 PASSED [0.0510s] [ 26%] 2025-12-04T12:25:43.1846745Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_celu_cuda_bfloat16 PASSED [0.0048s] [ 26%] 2025-12-04T12:25:43.1846875Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_celu_cuda_float16 PASSED [0.0046s] [ 26%] 2025-12-04T12:25:43.1847014Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_celu_cuda_float32 PASSED [0.0046s] [ 26%] 2025-12-04T12:25:43.1847142Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_celu_cuda_float64 PASSED [0.0045s] [ 26%] 2025-12-04T12:25:43.1847293Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_complex128 PASSED [1.4889s] [ 26%] 2025-12-04T12:25:43.1847437Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_float32 PASSED [0.0061s] [ 26%] 2025-12-04T12:25:43.1847578Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_int32 PASSED [0.0042s] [ 26%] 2025-12-04T12:25:43.1847720Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_channel_shuffle_cuda_uint8 PASSED [1.4479s] [ 26%] 2025-12-04T12:25:43.1847858Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv1d_cuda_complex128 PASSED [0.1039s] [ 26%] 2025-12-04T12:25:43.1847991Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv1d_cuda_float64 PASSED [0.0122s] [ 26%] 2025-12-04T12:25:43.1848208Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv2d_cuda_complex128 PASSED [0.2652s] [ 26%] 2025-12-04T12:25:43.1848526Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv2d_cuda_float32 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 2400, provided ptr: 0x7da788200c00 size: 1024 2025-12-04T12:25:43.1848716Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 2400, provided ptr: 0x7da788200c00 size: 1024 2025-12-04T12:25:43.1848762Z PASSED [2.4228s] [ 26%] 2025-12-04T12:25:43.1848909Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv2d_cuda_float64 PASSED [0.0327s] [ 26%] 2025-12-04T12:25:43.1849047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv3d_cuda_complex128 PASSED [0.1814s] [ 26%] 2025-12-04T12:25:43.1849197Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose1d_cuda_complex32 PASSED [0.8915s] [ 26%] 2025-12-04T12:25:43.1849345Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose1d_cuda_float16 PASSED [0.9793s] [ 26%] 2025-12-04T12:25:43.1849488Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose1d_cuda_float64 PASSED [0.0139s] [ 26%] 2025-12-04T12:25:43.1849637Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose2d_cuda_bfloat16 PASSED [1.5319s] [ 26%] 2025-12-04T12:25:43.1849786Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose2d_cuda_complex64 PASSED [0.1063s] [ 26%] 2025-12-04T12:25:43.1849935Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_conv_transpose3d_cuda_complex32 PASSED [0.1219s] [ 26%] 2025-12-04T12:25:43.1850156Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_embedding_loss_cuda_bool PASSED [0.0572s] [ 26%] 2025-12-04T12:25:43.1850319Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cosine_similarity_cuda_bfloat16 PASSED [0.0524s] [ 26%] 2025-12-04T12:25:43.1850461Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_cross_entropy_cuda_float32 PASSED [0.0738s] [ 26%] 2025-12-04T12:25:43.1853145Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_ctc_loss_cuda_float64 PASSED [0.0354s] [ 26%] 2025-12-04T12:25:43.1853858Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_dropout2d_cuda_float16 PASSED [0.9709s] [ 26%] 2025-12-04T12:25:43.1853995Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_dropout2d_cuda_float64 PASSED [0.0125s] [ 26%] 2025-12-04T12:25:43.1854134Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_dropout_cuda_bfloat16 PASSED [0.0132s] [ 26%] 2025-12-04T12:25:43.1854279Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_elu_cuda_bfloat16 PASSED [0.9948s] [ 26%] 2025-12-04T12:25:43.1854410Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_elu_cuda_float16 PASSED [0.0074s] [ 26%] 2025-12-04T12:25:43.1854551Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_embedding_bag_cuda_float32 PASSED [0.0774s] [ 26%] 2025-12-04T12:25:43.1854695Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_embedding_bag_cuda_float64 PASSED [0.0688s] [ 26%] 2025-12-04T12:25:43.1854851Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_embedding_cuda_float16 PASSED [0.0097s] [ 26%] 2025-12-04T12:25:43.1855020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_bfloat16 PASSED [0.0104s] [ 26%] 2025-12-04T12:25:43.1855187Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_float32 PASSED [0.9878s] [ 26%] 2025-12-04T12:25:43.1855357Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_bfloat16 PASSED [0.0058s] [ 26%] 2025-12-04T12:25:43.1855530Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex128 PASSED [0.9767s] [ 26%] 2025-12-04T12:25:43.1855701Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex64 PASSED [0.0058s] [ 26%] 2025-12-04T12:25:43.1855872Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_float64 PASSED [0.9806s] [ 26%] 2025-12-04T12:25:43.1856042Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int32 PASSED [0.0056s] [ 26%] 2025-12-04T12:25:43.1856219Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_uint8 PASSED [0.9853s] [ 26%] 2025-12-04T12:25:43.1856377Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_fractional_max_pool2d_cuda_bfloat16 PASSED [0.0457s] [ 26%] 2025-12-04T12:25:43.1856530Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float32 PASSED [0.0384s] [ 26%] 2025-12-04T12:25:43.1856677Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gaussian_nll_loss_cuda_bfloat16 PASSED [1.7591s] [ 26%] 2025-12-04T12:25:43.1856824Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float32 PASSED [1.7317s] [ 26%] 2025-12-04T12:25:43.1856969Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float64 PASSED [1.7356s] [ 26%] 2025-12-04T12:25:43.1857099Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gelu_cuda_float16 PASSED [0.0259s] [ 26%] 2025-12-04T12:25:43.1857230Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_gelu_cuda_float64 PASSED [0.0121s] [ 26%] 2025-12-04T12:25:43.1857359Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_glu_cuda_float64 PASSED [0.0428s] [ 26%] 2025-12-04T12:25:43.1857499Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_group_norm_cuda_bfloat16 PASSED [0.0672s] [ 26%] 2025-12-04T12:25:43.1857635Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_group_norm_cuda_float16 PASSED [0.0427s] [ 26%] 2025-12-04T12:25:43.1857839Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_group_norm_cuda_float64 PASSED [0.0419s] [ 26%] 2025-12-04T12:25:43.1857995Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardshrink_cuda_bfloat16 PASSED [0.0138s] [ 26%] 2025-12-04T12:25:43.1858170Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardshrink_cuda_float32 PASSED [0.0069s] [ 26%] 2025-12-04T12:25:43.1858314Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardsigmoid_cuda_bfloat16 PASSED [0.0066s] [ 26%] 2025-12-04T12:25:43.1858468Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardswish_cuda_bfloat16 PASSED [0.0157s] [ 26%] 2025-12-04T12:25:43.1858604Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardtanh_cuda_float32 PASSED [0.0090s] [ 26%] 2025-12-04T12:25:43.1858734Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hardtanh_cuda_int64 PASSED [0.0088s] [ 26%] 2025-12-04T12:25:43.1858888Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float16 PASSED [0.0472s] [ 26%] 2025-12-04T12:25:43.1859024Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_huber_loss_cuda_float16 PASSED [0.0161s] [ 26%] 2025-12-04T12:25:43.1859162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_huber_loss_cuda_float32 PASSED [0.9860s] [ 26%] 2025-12-04T12:25:43.1859301Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_instance_norm_cuda_float64 PASSED [0.0585s] [ 26%] 2025-12-04T12:25:43.1859448Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_area_cuda_bfloat16 PASSED [0.0288s] [ 26%] 2025-12-04T12:25:43.1859593Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_area_cuda_float16 PASSED [0.0282s] [ 26%] 2025-12-04T12:25:43.1859739Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_area_cuda_float32 PASSED [1.0253s] [ 26%] 2025-12-04T12:25:43.1859884Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_area_cuda_float64 PASSED [0.0310s] [ 26%] 2025-12-04T12:25:43.1860035Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_bicubic_cuda_bfloat16 PASSED [0.8431s] [ 26%] 2025-12-04T12:25:43.1860200Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_bicubic_cuda_float16 PASSED [0.8323s] [ 26%] 2025-12-04T12:25:43.1860351Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_bicubic_cuda_float32 PASSED [0.8344s] [ 26%] 2025-12-04T12:25:43.1860503Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_bilinear_cuda_float16 PASSED [0.1462s] [ 26%] 2025-12-04T12:25:43.1860651Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_linear_cuda_bfloat16 PASSED [0.0594s] [ 26%] 2025-12-04T12:25:43.1860799Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_linear_cuda_float16 PASSED [0.0539s] [ 26%] 2025-12-04T12:25:43.1860960Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16 PASSED [0.0360s] [ 26%] 2025-12-04T12:25:43.1861120Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_float32 PASSED [0.0246s] [ 26%] 2025-12-04T12:25:43.1861266Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_nearest_cuda_uint8 PASSED [0.0207s] [ 26%] 2025-12-04T12:25:43.1861419Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_interpolate_trilinear_cuda_float64 PASSED [0.2390s] [ 26%] 2025-12-04T12:25:43.1861551Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_kl_div_cuda_bfloat16 PASSED [0.0402s] [ 26%] 2025-12-04T12:25:43.1861687Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_l1_loss_cuda_complex64 PASSED [0.0143s] [ 26%] 2025-12-04T12:25:43.1861841Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_l1_loss_cuda_float64 PASSED [0.0108s] [ 26%] 2025-12-04T12:25:43.1861999Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_layer_norm_cuda_bfloat16 PASSED [0.0126s] [ 26%] 2025-12-04T12:25:43.1862136Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_layer_norm_cuda_float16 PASSED [0.9927s] [ 26%] 2025-12-04T12:25:43.1862273Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_layer_norm_cuda_float32 PASSED [0.0150s] [ 26%] 2025-12-04T12:25:43.1862421Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_leaky_relu_cuda_bfloat16 PASSED [0.0107s] [ 26%] 2025-12-04T12:25:43.1862556Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_leaky_relu_cuda_float16 PASSED [0.0102s] [ 26%] 2025-12-04T12:25:43.1862687Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_linear_cuda_float16 PASSED [1.0156s] [ 26%] 2025-12-04T12:25:43.1862826Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_logsigmoid_cuda_bfloat16 PASSED [0.0088s] [ 26%] 2025-12-04T12:25:43.1862972Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_margin_ranking_loss_cuda_int8 PASSED [0.0663s] [ 26%] 2025-12-04T12:25:43.1863109Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool1d_cuda_bfloat16 PASSED [0.9117s] [ 26%] 2025-12-04T12:25:43.1863244Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool1d_cuda_float16 PASSED [0.9026s] [ 26%] 2025-12-04T12:25:43.1863380Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool2d_cuda_bfloat16 PASSED [0.8485s] [ 26%] 2025-12-04T12:25:43.1863517Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool2d_cuda_float16 PASSED [0.8495s] [ 26%] 2025-12-04T12:25:43.1863650Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool2d_cuda_float64 PASSED [0.8479s] [ 26%] 2025-12-04T12:25:43.1863788Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_pool3d_cuda_bfloat16 PASSED [0.3885s] [ 27%] 2025-12-04T12:25:43.1863928Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool1d_cuda_bfloat16 PASSED [0.2609s] [ 27%] 2025-12-04T12:25:43.1864079Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool1d_cuda_float16 PASSED [0.2572s] [ 27%] 2025-12-04T12:25:43.1864218Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool1d_cuda_float32 PASSED [0.2562s] [ 27%] 2025-12-04T12:25:43.1864364Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_max_unpool3d_grad_cuda_float64 PASSED [0.0324s] [ 27%] 2025-12-04T12:25:43.1864497Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mse_loss_cuda_bfloat16 PASSED [0.0089s] [ 27%] 2025-12-04T12:25:43.1864630Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_mse_loss_cuda_float32 PASSED [0.9927s] [ 27%] 2025-12-04T12:25:43.1864793Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_head_attention_forward_cuda_float16 PASSED [2.8902s] [ 27%] 2025-12-04T12:25:43.1864940Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_margin_loss_cuda_float16 PASSED [1.0043s] [ 27%] 2025-12-04T12:25:43.1865088Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multi_margin_loss_cuda_float32 PASSED [0.0256s] [ 27%] 2025-12-04T12:25:43.1865247Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float32 PASSED [0.0271s] [ 27%] 2025-12-04T12:25:43.1865406Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float64 PASSED [0.0239s] [ 27%] 2025-12-04T12:25:43.1865540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_nll_loss_cuda_float16 PASSED [0.1001s] [ 27%] 2025-12-04T12:25:43.1865689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_normalize_cuda_bfloat16 PASSED [0.0241s] [ 27%] 2025-12-04T12:25:43.1865840Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_normalize_cuda_complex128 PASSED [1.0431s] [ 27%] 2025-12-04T12:25:43.1865976Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_normalize_cuda_float64 PASSED [0.0255s] [ 27%] 2025-12-04T12:25:43.1866112Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_circular_cuda_int32 PASSED [0.0203s] [ 27%] 2025-12-04T12:25:43.1866258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_circular_cuda_uint8 PASSED [0.0200s] [ 27%] 2025-12-04T12:25:43.1866392Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_constant_cuda_bool PASSED [0.0275s] [ 27%] 2025-12-04T12:25:43.1866533Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_reflect_cuda_complex64 PASSED [0.0163s] [ 27%] 2025-12-04T12:25:43.1866676Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_bfloat16 PASSED [0.0142s] [ 27%] 2025-12-04T12:25:43.1866816Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_cuda_float64 PASSED [0.0081s] [ 27%] 2025-12-04T12:25:43.1866972Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_bfloat16 PASSED [1.0108s] [ 27%] 2025-12-04T12:25:43.1867129Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_complex64 PASSED [0.0079s] [ 27%] 2025-12-04T12:25:43.1867283Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_float16 PASSED [0.0058s] [ 27%] 2025-12-04T12:25:43.1867434Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_float32 PASSED [0.0056s] [ 27%] 2025-12-04T12:25:43.1867585Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_float64 PASSED [0.0055s] [ 27%] 2025-12-04T12:25:43.1867733Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_int32 PASSED [0.0054s] [ 27%] 2025-12-04T12:25:43.1867883Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pad_replicate_negative_cuda_int8 PASSED [0.0053s] [ 27%] 2025-12-04T12:25:43.1868044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_complex128 PASSED [0.0165s] [ 27%] 2025-12-04T12:25:43.1874004Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pairwise_distance_cuda_int16 PASSED [1.0289s] [ 27%] 2025-12-04T12:25:43.1874154Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_bool PASSED [0.0066s] [ 27%] 2025-12-04T12:25:43.1874300Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_shuffle_cuda_float64 PASSED [0.0050s] [ 27%] 2025-12-04T12:25:43.1874445Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_bool PASSED [0.0050s] [ 27%] 2025-12-04T12:25:43.1874594Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_complex64 PASSED [0.0048s] [ 27%] 2025-12-04T12:25:43.1874740Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_float16 PASSED [0.0047s] [ 27%] 2025-12-04T12:25:43.1874884Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_float64 PASSED [0.0047s] [ 27%] 2025-12-04T12:25:43.1875026Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_int16 PASSED [0.0047s] [ 27%] 2025-12-04T12:25:43.1875166Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_pixel_unshuffle_cuda_int64 PASSED [0.0047s] [ 27%] 2025-12-04T12:25:43.1875311Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_poisson_nll_loss_cuda_int32 PASSED [0.3072s] [ 27%] 2025-12-04T12:25:43.1875519Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_poisson_nll_loss_cuda_int8 PASSED [0.3075s] [ 27%] 2025-12-04T12:25:43.1875680Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_poisson_nll_loss_cuda_uint8 PASSED [0.3093s] [ 27%] 2025-12-04T12:25:43.1875813Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_prelu_cuda_float16 PASSED [0.0282s] [ 27%] 2025-12-04T12:25:43.1875946Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_float16 PASSED [0.9867s] [ 27%] 2025-12-04T12:25:43.1876102Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_float32 PASSED [0.0081s] [ 27%] 2025-12-04T12:25:43.1876235Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_int32 PASSED [0.9841s] [ 27%] 2025-12-04T12:25:43.1876362Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu6_cuda_int64 PASSED [0.0085s] [ 27%] 2025-12-04T12:25:43.1876493Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu_cuda_bfloat16 PASSED [0.9779s] [ 27%] 2025-12-04T12:25:43.1876620Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu_cuda_int64 PASSED [0.0075s] [ 27%] 2025-12-04T12:25:43.1876747Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_relu_cuda_int8 PASSED [0.0055s] [ 27%] 2025-12-04T12:25:43.1876884Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rms_norm_cuda_float32 PASSED [0.0068s] [ 27%] 2025-12-04T12:25:43.1877020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rms_norm_cuda_float64 PASSED [0.0064s] [ 27%] 2025-12-04T12:25:43.1877153Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_rrelu_cuda_bfloat16 PASSED [0.0077s] [ 27%] 2025-12-04T12:25:43.1877282Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_selu_cuda_bfloat16 PASSED [0.9775s] [ 27%] 2025-12-04T12:25:43.1877412Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_selu_cuda_float16 PASSED [0.0077s] [ 27%] 2025-12-04T12:25:43.1877542Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_silu_cuda_float16 PASSED [0.9720s] [ 27%] 2025-12-04T12:25:43.1877685Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_smooth_l1_loss_cuda_float64 PASSED [0.0172s] [ 27%] 2025-12-04T12:25:43.1877836Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_cuda_float32 PASSED [0.0087s] [ 27%] 2025-12-04T12:25:43.1877974Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_cuda_float64 PASSED [0.0083s] [ 27%] 2025-12-04T12:25:43.1878169Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_complex64 PASSED [0.0086s] [ 27%] 2025-12-04T12:25:43.1878320Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softmin_with_dtype_cuda_float32 PASSED [0.0085s] [ 27%] 2025-12-04T12:25:43.1878456Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softplus_cuda_float64 PASSED [0.0134s] [ 27%] 2025-12-04T12:25:43.1878597Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softshrink_cuda_float16 PASSED [0.0178s] [ 27%] 2025-12-04T12:25:43.1878735Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softshrink_cuda_float64 PASSED [0.9769s] [ 27%] 2025-12-04T12:25:43.1878868Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_bool PASSED [0.0088s] [ 27%] 2025-12-04T12:25:43.1879007Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_complex64 PASSED [0.0070s] [ 27%] 2025-12-04T12:25:43.1879142Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_float32 PASSED [0.0066s] [ 27%] 2025-12-04T12:25:43.1879275Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_softsign_cuda_int32 PASSED [0.0066s] [ 27%] 2025-12-04T12:25:43.1879433Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_complex64 PASSED [0.9636s] [ 27%] 2025-12-04T12:25:43.1879583Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_float64 PASSED [0.0073s] [ 27%] 2025-12-04T12:25:43.1879718Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_int32 PASSED [0.9648s] [ 27%] 2025-12-04T12:25:43.1879854Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_tanhshrink_cuda_int64 PASSED [0.0073s] [ 27%] 2025-12-04T12:25:43.1880002Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_float32 PASSED [0.9703s] [ 27%] 2025-12-04T12:25:43.1880140Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_float64 PASSED [0.0078s] [ 27%] 2025-12-04T12:25:43.1880273Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_threshold_cuda_int64 PASSED [0.0057s] [ 27%] 2025-12-04T12:25:43.1880427Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_bfloat16 PASSED [0.0441s] [ 27%] 2025-12-04T12:25:43.1880582Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_complex128 PASSED [0.0440s] [ 27%] 2025-12-04T12:25:43.1880731Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_int32 PASSED [0.0428s] [ 27%] 2025-12-04T12:25:43.1880878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_loss_cuda_int8 PASSED [1.0070s] [ 27%] 2025-12-04T12:25:43.1881053Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex128 PASSED [0.0469s] [ 27%] 2025-12-04T12:25:43.1881221Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_float16 PASSED [0.0443s] [ 27%] 2025-12-04T12:25:43.1881385Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int16 PASSED [0.0431s] [ 27%] 2025-12-04T12:25:43.1881552Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int64 PASSED [0.0434s] [ 27%] 2025-12-04T12:25:43.1881691Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_unfold_cuda_complex128 PASSED [0.1803s] [ 27%] 2025-12-04T12:25:43.1881840Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_unfold_cuda_float16 PASSED [0.1741s] [ 27%] 2025-12-04T12:25:43.1881975Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_unfold_cuda_float64 PASSED [0.1731s] [ 27%] 2025-12-04T12:25:43.1882126Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_bilinear_cuda_float32 PASSED [0.9938s] [ 27%] 2025-12-04T12:25:43.1882274Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_bilinear_cuda_float64 PASSED [0.0271s] [ 27%] 2025-12-04T12:25:43.1882423Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nn_functional_upsample_nearest_cuda_float32 PASSED [0.0149s] [ 27%] 2025-12-04T12:25:43.1882546Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_complex128 PASSED [0.0139s] [ 27%] 2025-12-04T12:25:43.1882668Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_complex32 PASSED [0.0134s] [ 27%] 2025-12-04T12:25:43.1882784Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_cuda_int32 PASSED [0.0133s] [ 27%] 2025-12-04T12:25:43.1882935Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_bool SKIPPED [0.0006s] (Only runs on cpu) [ 27%] 2025-12-04T12:25:43.1883087Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_float16 SKIPPED [0.0005s] (Only runs on cpu) [ 27%] 2025-12-04T12:25:43.1883239Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_int64 SKIPPED [0.0005s] (Only runs on cpu) [ 27%] 2025-12-04T12:25:43.1883389Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_nonzero_static_cuda_uint8 SKIPPED [0.0005s] (Only runs on cpu) [ 27%] 2025-12-04T12:25:43.1883516Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_complex128 PASSED [0.9950s] [ 27%] 2025-12-04T12:25:43.1883643Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_float32 PASSED [0.0391s] [ 27%] 2025-12-04T12:25:43.1883757Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_cuda_float64 PASSED [0.0367s] [ 27%] 2025-12-04T12:25:43.1883878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_fro_cuda_bfloat16 PASSED [0.0050s] [ 27%] 2025-12-04T12:25:43.1884005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_fro_cuda_float32 PASSED [0.0048s] [ 27%] 2025-12-04T12:25:43.1884124Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_fro_cuda_float64 PASSED [0.0048s] [ 27%] 2025-12-04T12:25:43.1884244Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_inf_cuda_complex128 PASSED [0.0048s] [ 27%] 2025-12-04T12:25:43.1884366Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_nuc_cuda_complex64 PASSED [0.9742s] [ 27%] 2025-12-04T12:25:43.1884483Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_norm_nuc_cuda_float32 PASSED [0.0083s] [ 27%] 2025-12-04T12:25:43.1884610Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_in_place_cuda_float32 PASSED [0.9618s] [ 27%] 2025-12-04T12:25:43.1884742Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_normal_number_mean_cuda_float64 PASSED [0.0072s] [ 27%] 2025-12-04T12:25:43.1884860Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_complex64 PASSED [0.0031s] [ 27%] 2025-12-04T12:25:43.1884973Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_float16 PASSED [0.9602s] [ 27%] 2025-12-04T12:25:43.1885085Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_cuda_int32 PASSED [0.0043s] [ 27%] 2025-12-04T12:25:43.1885204Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_bfloat16 PASSED [0.0066s] [ 27%] 2025-12-04T12:25:43.1885328Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_complex64 PASSED [0.0063s] [ 27%] 2025-12-04T12:25:43.1885446Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_float32 PASSED [0.0062s] [ 27%] 2025-12-04T12:25:43.1885562Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_int16 PASSED [0.0061s] [ 27%] 2025-12-04T12:25:43.1885689Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ones_like_cuda_int32 PASSED [0.0062s] [ 27%] 2025-12-04T12:25:43.1885807Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ormqr_cuda_complex64 PASSED [0.1118s] [ 27%] 2025-12-04T12:25:43.1885922Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_bfloat16 PASSED [0.9805s] [ 27%] 2025-12-04T12:25:43.1886037Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_complex64 PASSED [0.0053s] [ 27%] 2025-12-04T12:25:43.1886149Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_int32 PASSED [0.0038s] [ 27%] 2025-12-04T12:25:43.1886261Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_outer_cuda_uint8 PASSED [0.9654s] [ 27%] 2025-12-04T12:25:43.1886387Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pca_lowrank_cuda_complex128 PASSED [0.2362s] [ 27%] 2025-12-04T12:25:43.1886507Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pca_lowrank_cuda_float64 PASSED [0.3167s] [ 27%] 2025-12-04T12:25:43.1886636Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_bfloat16 PASSED [0.9831s] [ 27%] 2025-12-04T12:25:43.1886755Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_bool PASSED [0.0063s] [ 27%] 2025-12-04T12:25:43.1886885Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_complex128 PASSED [0.0046s] [ 27%] 2025-12-04T12:25:43.1887010Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_complex32 PASSED [0.0043s] [ 27%] 2025-12-04T12:25:43.1887135Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_float16 PASSED [0.0042s] [ 27%] 2025-12-04T12:25:43.1887265Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_copy_cuda_int8 PASSED [0.9772s] [ 27%] 2025-12-04T12:25:43.1887395Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_bfloat16 PASSED [0.0053s] [ 27%] 2025-12-04T12:25:43.1887509Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_bool PASSED [0.9775s] [ 27%] 2025-12-04T12:25:43.1887632Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_complex128 PASSED [0.0054s] [ 27%] 2025-12-04T12:25:43.1887758Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_float64 PASSED [0.9661s] [ 27%] 2025-12-04T12:25:43.1887872Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_permute_cuda_int8 PASSED [0.0052s] [ 27%] 2025-12-04T12:25:43.1887987Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pinverse_cuda_float64 PASSED [0.0202s] [ 27%] 2025-12-04T12:25:43.1888167Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_float16 PASSED [0.0074s] [ 27%] 2025-12-04T12:25:43.1888307Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_float32 PASSED [0.0071s] [ 27%] 2025-12-04T12:25:43.1888445Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_float64 PASSED [0.0070s] [ 27%] 2025-12-04T12:25:43.1888584Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_int16 PASSED [0.0093s] [ 27%] 2025-12-04T12:25:43.1888720Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_int64 PASSED [0.9874s] [ 28%] 2025-12-04T12:25:43.1888856Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_0_cuda_int8 PASSED [0.0096s] [ 28%] 2025-12-04T12:25:43.1888994Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_bfloat16 PASSED [0.0074s] [ 28%] 2025-12-04T12:25:43.1889136Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_float32 PASSED [0.0071s] [ 28%] 2025-12-04T12:25:43.1889274Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_float64 PASSED [0.0071s] [ 28%] 2025-12-04T12:25:43.1889410Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_1_cuda_int16 PASSED [0.0070s] [ 28%] 2025-12-04T12:25:43.1889561Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_bfloat16 PASSED [0.9967s] [ 28%] 2025-12-04T12:25:43.1889697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_bool PASSED [0.0087s] [ 28%] 2025-12-04T12:25:43.1889833Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_float32 PASSED [0.0073s] [ 28%] 2025-12-04T12:25:43.1889971Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_2_cuda_float64 PASSED [0.0071s] [ 28%] 2025-12-04T12:25:43.1890107Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_bool PASSED [0.0070s] [ 28%] 2025-12-04T12:25:43.1890248Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_float16 PASSED [0.0070s] [ 28%] 2025-12-04T12:25:43.1890383Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_3_cuda_uint8 PASSED [0.9741s] [ 28%] 2025-12-04T12:25:43.1890522Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_float16 PASSED [0.0094s] [ 28%] 2025-12-04T12:25:43.1890661Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_polygamma_polygamma_n_4_cuda_float64 PASSED [0.0074s] [ 28%] 2025-12-04T12:25:43.1890779Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_bfloat16 PASSED [0.9726s] [ 28%] 2025-12-04T12:25:43.1890901Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_complex64 PASSED [0.0032s] [ 28%] 2025-12-04T12:25:43.1891017Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_float32 PASSED [0.9827s] [ 28%] 2025-12-04T12:25:43.1891146Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_int16 PASSED [0.0033s] [ 28%] 2025-12-04T12:25:43.1891280Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_positive_cuda_int32 PASSED [0.9805s] [ 28%] 2025-12-04T12:25:43.1891399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_complex32 PASSED [0.0107s] [ 28%] 2025-12-04T12:25:43.1891510Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_float16 PASSED [0.0090s] [ 28%] 2025-12-04T12:25:43.1891635Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_float32 PASSED [0.0087s] [ 28%] 2025-12-04T12:25:43.1891746Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_float64 PASSED [0.0086s] [ 28%] 2025-12-04T12:25:43.1891857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_int64 PASSED [0.0086s] [ 28%] 2025-12-04T12:25:43.1891965Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_pow_cuda_int8 PASSED [0.0085s] [ 28%] 2025-12-04T12:25:43.1892082Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_complex64 PASSED [0.0233s] [ 28%] 2025-12-04T12:25:43.1892195Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_prod_cuda_float64 PASSED [0.0226s] [ 28%] 2025-12-04T12:25:43.1892307Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_int64 PASSED [0.0195s] [ 28%] 2025-12-04T12:25:43.1892416Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_put_cuda_int8 PASSED [0.0192s] [ 28%] 2025-12-04T12:25:43.1892538Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_quantile_cuda_float64 PASSED [0.3903s] [ 28%] 2025-12-04T12:25:43.1892651Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_bool PASSED [0.9884s] [ 28%] 2025-12-04T12:25:43.1892768Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_float16 PASSED [0.0047s] [ 28%] 2025-12-04T12:25:43.1892882Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_float64 PASSED [0.9768s] [ 28%] 2025-12-04T12:25:43.1892998Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_int64 PASSED [0.0048s] [ 28%] 2025-12-04T12:25:43.1893119Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rad2deg_cuda_uint8 PASSED [0.9803s] [ 28%] 2025-12-04T12:25:43.1893238Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rand_like_cuda_float32 PASSED [0.0124s] [ 28%] 2025-12-04T12:25:43.1893368Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_cuda_bfloat16 PASSED [0.0106s] [ 28%] 2025-12-04T12:25:43.1893483Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_cuda_float32 PASSED [0.0100s] [ 28%] 2025-12-04T12:25:43.1893609Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_float32 PASSED [0.0129s] [ 28%] 2025-12-04T12:25:43.1893729Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_int32 PASSED [0.0125s] [ 28%] 2025-12-04T12:25:43.1893849Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randint_like_cuda_int64 PASSED [0.0125s] [ 28%] 2025-12-04T12:25:43.1893967Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_cuda_complex128 PASSED [0.9752s] [ 28%] 2025-12-04T12:25:43.1894083Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_cuda_float32 PASSED [0.0057s] [ 28%] 2025-12-04T12:25:43.1894198Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_cuda_float64 PASSED [0.0042s] [ 28%] 2025-12-04T12:25:43.1894322Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_like_cuda_complex128 PASSED [0.0101s] [ 28%] 2025-12-04T12:25:43.1894445Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_like_cuda_float16 PASSED [0.0099s] [ 28%] 2025-12-04T12:25:43.1894564Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_randn_like_cuda_float64 PASSED [0.9791s] [ 28%] 2025-12-04T12:25:43.1894681Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_bfloat16 PASSED [0.0057s] [ 28%] 2025-12-04T12:25:43.1894795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_complex64 PASSED [0.0040s] [ 28%] 2025-12-04T12:25:43.1894920Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_float16 PASSED [0.9765s] [ 28%] 2025-12-04T12:25:43.1895041Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_float32 PASSED [0.0058s] [ 28%] 2025-12-04T12:25:43.1895155Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_int64 PASSED [0.0040s] [ 28%] 2025-12-04T12:25:43.1895266Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_ravel_cuda_uint8 PASSED [1.3676s] [ 28%] 2025-12-04T12:25:43.1895387Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_bool PASSED [0.0038s] [ 28%] 2025-12-04T12:25:43.1895501Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_complex64 PASSED [0.0048s] [ 28%] 2025-12-04T12:25:43.1895613Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_float16 PASSED [1.4259s] [ 28%] 2025-12-04T12:25:43.1895723Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_int16 PASSED [0.0037s] [ 28%] 2025-12-04T12:25:43.1895836Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_real_cuda_uint8 PASSED [1.4492s] [ 28%] 2025-12-04T12:25:43.1895958Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_bfloat16 PASSED [0.0062s] [ 28%] 2025-12-04T12:25:43.1896076Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_bool PASSED [0.0042s] [ 28%] 2025-12-04T12:25:43.1896194Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reciprocal_cuda_float16 PASSED [1.4570s] [ 28%] 2025-12-04T12:25:43.1896316Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_float16 PASSED [0.0125s] [ 28%] 2025-12-04T12:25:43.1896431Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_int64 PASSED [0.0095s] [ 28%] 2025-12-04T12:25:43.1896549Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_remainder_cuda_uint8 PASSED [0.0093s] [ 28%] 2025-12-04T12:25:43.1896661Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_bool PASSED [0.0234s] [ 28%] 2025-12-04T12:25:43.1896782Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_complex64 PASSED [0.0232s] [ 28%] 2025-12-04T12:25:43.1896897Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_cuda_int8 PASSED [0.0230s] [ 28%] 2025-12-04T12:25:43.1897040Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_interleave_cuda_bfloat16 PASSED [0.0139s] [ 28%] 2025-12-04T12:25:43.1897171Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_repeat_interleave_cuda_float32 PASSED [1.4720s] [ 28%] 2025-12-04T12:25:43.1897295Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_complex32 PASSED [0.0058s] [ 28%] 2025-12-04T12:25:43.1897416Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_float16 PASSED [0.0038s] [ 28%] 2025-12-04T12:25:43.1897536Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_as_cuda_float64 PASSED [1.4685s] [ 28%] 2025-12-04T12:25:43.1897659Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_complex64 PASSED [0.0066s] [ 28%] 2025-12-04T12:25:43.1897774Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_float16 PASSED [0.0045s] [ 28%] 2025-12-04T12:25:43.1897890Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_float64 PASSED [1.4485s] [ 28%] 2025-12-04T12:25:43.1898005Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_reshape_cuda_int16 PASSED [0.0058s] [ 28%] 2025-12-04T12:25:43.1898213Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_complex128 PASSED [0.0046s] [ 28%] 2025-12-04T12:25:43.1898327Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_float32 PASSED [0.0042s] [ 28%] 2025-12-04T12:25:43.1898443Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_int16 PASSED [0.0041s] [ 28%] 2025-12-04T12:25:43.1898555Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize__cuda_uint8 PASSED [0.0042s] [ 28%] 2025-12-04T12:25:43.1898693Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_bfloat16 PASSED [0.0043s] [ 28%] 2025-12-04T12:25:43.1898819Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_bool PASSED [0.0054s] [ 28%] 2025-12-04T12:25:43.1898943Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_complex128 PASSED [0.0050s] [ 28%] 2025-12-04T12:25:43.1899065Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_float16 PASSED [0.0044s] [ 28%] 2025-12-04T12:25:43.1899196Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resize_as__cuda_int64 PASSED [0.0043s] [ 28%] 2025-12-04T12:25:43.1899317Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_bool PASSED [1.4407s] [ 28%] 2025-12-04T12:25:43.1899442Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_complex128 PASSED [0.0036s] [ 28%] 2025-12-04T12:25:43.1899565Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_float16 PASSED [1.4458s] [ 28%] 2025-12-04T12:25:43.1899688Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_float64 PASSED [0.0034s] [ 28%] 2025-12-04T12:25:43.1899809Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_int32 PASSED [1.4472s] [ 28%] 2025-12-04T12:25:43.1899927Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_int8 PASSED [0.0035s] [ 28%] 2025-12-04T12:25:43.1900047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_conj_cuda_uint8 PASSED [1.4399s] [ 28%] 2025-12-04T12:25:43.1900169Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_bfloat16 PASSED [0.0036s] [ 28%] 2025-12-04T12:25:43.1900286Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_bool PASSED [1.4508s] [ 28%] 2025-12-04T12:25:43.1900406Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_float32 PASSED [0.0035s] [ 28%] 2025-12-04T12:25:43.1900524Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_resolve_neg_cuda_int16 PASSED [1.5029s] [ 28%] 2025-12-04T12:25:43.1900638Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_bfloat16 PASSED [0.0174s] [ 28%] 2025-12-04T12:25:43.1900751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_float16 PASSED [1.4831s] [ 28%] 2025-12-04T12:25:43.1900877Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_float32 PASSED [0.0162s] [ 28%] 2025-12-04T12:25:43.1900990Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_float64 PASSED [1.4452s] [ 28%] 2025-12-04T12:25:43.1901103Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_roll_cuda_int16 PASSED [0.0161s] [ 28%] 2025-12-04T12:25:43.1901219Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_float16 PASSED [1.4470s] [ 28%] 2025-12-04T12:25:43.1901332Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_float32 PASSED [0.0225s] [ 28%] 2025-12-04T12:25:43.1901446Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_int16 PASSED [0.0206s] [ 28%] 2025-12-04T12:25:43.1901557Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rot90_cuda_uint8 PASSED [0.0204s] [ 28%] 2025-12-04T12:25:43.1901673Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_bfloat16 PASSED [1.4212s] [ 28%] 2025-12-04T12:25:43.1901786Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_cuda_int64 PASSED [0.0046s] [ 28%] 2025-12-04T12:25:43.1901915Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_0_cuda_float32 PASSED [0.0045s] [ 28%] 2025-12-04T12:25:43.1902044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_0_cuda_float64 PASSED [1.4332s] [ 28%] 2025-12-04T12:25:43.1902176Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_round_decimals_neg_3_cuda_float64 PASSED [0.0065s] [ 28%] 2025-12-04T12:25:43.1902294Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_complex64 PASSED [0.2414s] [ 28%] 2025-12-04T12:25:43.1902419Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_float64 PASSED [1.4766s] [ 28%] 2025-12-04T12:25:43.1902540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_int16 PASSED [0.0060s] [ 28%] 2025-12-04T12:25:43.1902649Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsqrt_cuda_int64 PASSED [0.0041s] [ 28%] 2025-12-04T12:25:43.1902766Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsub_cuda_bfloat16 PASSED [0.0111s] [ 28%] 2025-12-04T12:25:43.1902889Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsub_cuda_float32 PASSED [0.0105s] [ 28%] 2025-12-04T12:25:43.1903000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_rsub_cuda_int8 PASSED [1.4385s] [ 28%] 2025-12-04T12:25:43.1903127Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scalar_tensor_cuda_bfloat16 PASSED [0.0050s] [ 28%] 2025-12-04T12:25:43.1903252Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scalar_tensor_cuda_bool PASSED [0.0035s] [ 28%] 2025-12-04T12:25:43.1903376Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_bfloat16 PASSED [0.0092s] [ 28%] 2025-12-04T12:25:43.1903503Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_complex128 PASSED [0.0082s] [ 28%] 2025-12-04T12:25:43.1903626Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_complex64 PASSED [1.4295s] [ 28%] 2025-12-04T12:25:43.1903748Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_float32 PASSED [0.0109s] [ 28%] 2025-12-04T12:25:43.1903868Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_float64 PASSED [0.0086s] [ 28%] 2025-12-04T12:25:43.1903989Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_add_cuda_int16 PASSED [0.0083s] [ 28%] 2025-12-04T12:25:43.1904103Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_bfloat16 PASSED [0.0208s] [ 28%] 2025-12-04T12:25:43.1904222Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_float16 PASSED [0.0202s] [ 28%] 2025-12-04T12:25:43.1904339Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_float32 PASSED [0.0201s] [ 28%] 2025-12-04T12:25:43.1904454Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_float64 PASSED [0.0201s] [ 28%] 2025-12-04T12:25:43.1904580Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_int32 PASSED [0.0146s] [ 28%] 2025-12-04T12:25:43.1904692Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_cuda_int8 PASSED [0.0147s] [ 28%] 2025-12-04T12:25:43.1904827Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_float16 PASSED [0.0180s] [ 28%] 2025-12-04T12:25:43.1904958Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_float32 PASSED [0.0184s] [ 28%] 2025-12-04T12:25:43.1905087Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_int8 PASSED [0.0187s] [ 28%] 2025-12-04T12:25:43.1905215Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amax_cuda_uint8 PASSED [0.0180s] [ 28%] 2025-12-04T12:25:43.1905346Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_int16 PASSED [0.0179s] [ 28%] 2025-12-04T12:25:43.1905474Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_int32 PASSED [0.0178s] [ 29%] 2025-12-04T12:25:43.1905605Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_int64 PASSED [0.0178s] [ 29%] 2025-12-04T12:25:43.1905735Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_int8 PASSED [0.0177s] [ 29%] 2025-12-04T12:25:43.1905864Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_amin_cuda_uint8 PASSED [0.0178s] [ 29%] 2025-12-04T12:25:43.1905998Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_mean_cuda_bfloat16 PASSED [0.0195s] [ 29%] 2025-12-04T12:25:43.1906130Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_mean_cuda_float16 PASSED [0.0196s] [ 29%] 2025-12-04T12:25:43.1906277Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_prod_cuda_bfloat16 PASSED [0.0178s] [ 29%] 2025-12-04T12:25:43.1906416Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_prod_cuda_int8 PASSED [0.0177s] [ 29%] 2025-12-04T12:25:43.1906546Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_bfloat16 PASSED [0.0177s] [ 29%] 2025-12-04T12:25:43.1906691Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_scatter_reduce_sum_cuda_float16 PASSED [0.0179s] [ 29%] 2025-12-04T12:25:43.1906815Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_float32 PASSED [0.1936s] [ 29%] 2025-12-04T12:25:43.1906938Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_float64 PASSED [0.1924s] [ 29%] 2025-12-04T12:25:43.1907062Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_int16 PASSED [0.1925s] [ 29%] 2025-12-04T12:25:43.1907182Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_int32 PASSED [0.1945s] [ 29%] 2025-12-04T12:25:43.1907304Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_searchsorted_cuda_int8 PASSED [0.1958s] [ 29%] 2025-12-04T12:25:43.1907416Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_cuda_int32 PASSED [1.4718s] [ 29%] 2025-12-04T12:25:43.1907530Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_cuda_uint8 PASSED [0.0056s] [ 29%] 2025-12-04T12:25:43.1907656Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_float16 PASSED [0.0070s] [ 29%] 2025-12-04T12:25:43.1907783Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_float64 PASSED [0.0064s] [ 29%] 2025-12-04T12:25:43.1907906Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_select_scatter_cuda_int64 PASSED [0.0063s] [ 29%] 2025-12-04T12:25:43.1908022Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_complex128 PASSED [0.0033s] [ 29%] 2025-12-04T12:25:43.1908160Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_float16 PASSED [1.4315s] [ 29%] 2025-12-04T12:25:43.1908270Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_int64 PASSED [0.0046s] [ 29%] 2025-12-04T12:25:43.1908378Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sgn_cuda_int8 PASSED [1.4385s] [ 29%] 2025-12-04T12:25:43.1908509Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_bool PASSED [0.0045s] [ 29%] 2025-12-04T12:25:43.1908623Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_float32 PASSED [1.4230s] [ 29%] 2025-12-04T12:25:43.1908734Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_int32 PASSED [0.0044s] [ 29%] 2025-12-04T12:25:43.1908844Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_short_cuda_uint8 PASSED [1.4001s] [ 29%] 2025-12-04T12:25:43.1908963Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_bfloat16 PASSED [0.0057s] [ 29%] 2025-12-04T12:25:43.1909085Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_complex128 PASSED [0.2132s] [ 29%] 2025-12-04T12:25:43.1909205Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_complex64 PASSED [1.6654s] [ 29%] 2025-12-04T12:25:43.1909322Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_float16 PASSED [0.0057s] [ 29%] 2025-12-04T12:25:43.1909439Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sigmoid_cuda_float64 PASSED [0.0040s] [ 29%] 2025-12-04T12:25:43.1909554Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_float16 PASSED [1.4344s] [ 29%] 2025-12-04T12:25:43.1909665Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_float32 PASSED [0.0047s] [ 29%] 2025-12-04T12:25:43.1909776Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_float64 PASSED [1.4606s] [ 29%] 2025-12-04T12:25:43.1909886Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sign_cuda_uint8 PASSED [0.0044s] [ 29%] 2025-12-04T12:25:43.1910046Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_bartlett_cuda_float32 PASSED [0.0130s] [ 29%] 2025-12-04T12:25:43.1910201Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_exponential_cuda_float32 PASSED [0.0163s] [ 29%] 2025-12-04T12:25:43.1910341Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signal_windows_gaussian_cuda_float64 PASSED [0.0176s] [ 29%] 2025-12-04T12:25:43.1910457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_signbit_cuda_bfloat16 PASSED [0.0029s] [ 29%] 2025-12-04T12:25:43.1910581Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_bool PASSED [1.4413s] [ 29%] 2025-12-04T12:25:43.1910694Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_complex32 PASSED [0.2193s] [ 29%] 2025-12-04T12:25:43.1910805Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_float16 PASSED [1.4197s] [ 29%] 2025-12-04T12:25:43.1910914Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_float64 PASSED [0.0045s] [ 29%] 2025-12-04T12:25:43.1911026Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sin_cuda_int16 PASSED [1.4157s] [ 29%] 2025-12-04T12:25:43.1911138Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinc_cuda_float32 PASSED [0.1553s] [ 29%] 2025-12-04T12:25:43.1911257Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_complex128 PASSED [1.4279s] [ 29%] 2025-12-04T12:25:43.1911365Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_float16 PASSED [0.0046s] [ 29%] 2025-12-04T12:25:43.1911479Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sinh_cuda_int64 PASSED [1.4214s] [ 29%] 2025-12-04T12:25:43.1911596Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_bfloat16 PASSED [0.0054s] [ 29%] 2025-12-04T12:25:43.1911704Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_bool PASSED [1.4198s] [ 29%] 2025-12-04T12:25:43.1911813Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_int8 PASSED [0.0051s] [ 29%] 2025-12-04T12:25:43.1911923Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_cuda_uint8 PASSED [1.4432s] [ 29%] 2025-12-04T12:25:43.1912051Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_bfloat16 PASSED [0.0187s] [ 29%] 2025-12-04T12:25:43.1912184Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_int16 PASSED [0.0154s] [ 29%] 2025-12-04T12:25:43.1912306Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_int8 PASSED [0.0152s] [ 29%] 2025-12-04T12:25:43.1912426Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_slice_scatter_cuda_uint8 PASSED [0.0151s] [ 29%] 2025-12-04T12:25:43.1912545Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_cuda_float16 PASSED [0.0058s] [ 29%] 2025-12-04T12:25:43.1912679Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_with_dtype_cuda_complex64 PASSED [0.0059s] [ 29%] 2025-12-04T12:25:43.1912809Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_with_dtype_cuda_int64 PASSED [0.0058s] [ 29%] 2025-12-04T12:25:43.1912937Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_softmax_with_dtype_cuda_uint8 PASSED [0.0058s] [ 29%] 2025-12-04T12:25:43.1913047Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_bool PASSED [0.0059s] [ 29%] 2025-12-04T12:25:43.1913158Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_float32 PASSED [0.0146s] [ 29%] 2025-12-04T12:25:43.1913272Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_float64 PASSED [0.0146s] [ 29%] 2025-12-04T12:25:43.1913380Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_int32 PASSED [0.0145s] [ 29%] 2025-12-04T12:25:43.1913488Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_int64 PASSED [0.0162s] [ 29%] 2025-12-04T12:25:43.1913596Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sort_cuda_uint8 PASSED [0.0147s] [ 29%] 2025-12-04T12:25:43.1913758Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sparse_mm_reduce_cuda_bfloat16 SKIPPED [0.0001s] (Skipped!) [ 29%] 2025-12-04T12:25:43.1913912Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sparse_mm_reduce_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 29%] 2025-12-04T12:25:43.1914057Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sparse_mm_reduce_cuda_float64 SKIPPED [0.0001s] (Skipped!) [ 29%] 2025-12-04T12:25:43.1914213Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sparse_sampled_addmm_cuda_complex128 SKIPPED [0.0001s] (Skipped!) [ 29%] 2025-12-04T12:25:43.1914371Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sparse_sampled_addmm_cuda_float64 SKIPPED [0.0001s] (Skipped!) [ 29%] 2025-12-04T12:25:43.1914498Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_bool PASSED [0.0054s] [ 29%] 2025-12-04T12:25:43.1914625Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_float32 PASSED [0.0045s] [ 29%] 2025-12-04T12:25:43.1914751Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_int16 PASSED [1.4380s] [ 29%] 2025-12-04T12:25:43.1914876Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_int64 PASSED [0.0057s] [ 29%] 2025-12-04T12:25:43.1915000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_airy_ai_cuda_int8 PASSED [0.0040s] [ 29%] 2025-12-04T12:25:43.1915127Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j0_cuda_int64 PASSED [0.0056s] [ 29%] 2025-12-04T12:25:43.1915255Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j0_cuda_int8 PASSED [1.4359s] [ 29%] 2025-12-04T12:25:43.1915382Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_j1_cuda_int16 PASSED [0.0076s] [ 29%] 2025-12-04T12:25:43.1915508Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y0_cuda_int16 PASSED [0.0052s] [ 29%] 2025-12-04T12:25:43.1915632Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y0_cuda_int8 PASSED [0.0038s] [ 29%] 2025-12-04T12:25:43.1915761Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y0_cuda_uint8 PASSED [1.4556s] [ 29%] 2025-12-04T12:25:43.1915891Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_float32 PASSED [0.0075s] [ 29%] 2025-12-04T12:25:43.1916029Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_bessel_y1_cuda_int32 PASSED [0.2278s] [ 29%] 2025-12-04T12:25:43.1916179Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_float32 PASSED [0.0105s] [ 29%] 2025-12-04T12:25:43.1916326Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_int64 PASSED [0.0122s] [ 29%] 2025-12-04T12:25:43.1916470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_t_cuda_int8 PASSED [0.0087s] [ 29%] 2025-12-04T12:25:43.1916612Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_int64 PASSED [0.0099s] [ 29%] 2025-12-04T12:25:43.1916757Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_int8 PASSED [0.0083s] [ 29%] 2025-12-04T12:25:43.1916901Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_u_cuda_uint8 PASSED [0.0083s] [ 29%] 2025-12-04T12:25:43.1917044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_bool PASSED [0.0094s] [ 29%] 2025-12-04T12:25:43.1917192Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_float64 PASSED [0.3652s] [ 29%] 2025-12-04T12:25:43.1917335Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_int8 PASSED [0.0088s] [ 29%] 2025-12-04T12:25:43.1917477Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_chebyshev_polynomial_w_cuda_uint8 PASSED [0.0089s] [ 29%] 2025-12-04T12:25:43.1917612Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_entr_cuda_float64 PASSED [0.0068s] [ 29%] 2025-12-04T12:25:43.1917744Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_entr_cuda_int32 PASSED [0.0056s] [ 29%] 2025-12-04T12:25:43.1917869Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_float64 PASSED [0.1935s] [ 29%] 2025-12-04T12:25:43.1917991Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_int16 PASSED [1.4479s] [ 29%] 2025-12-04T12:25:43.1918170Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_erfcx_cuda_int8 PASSED [0.0057s] [ 29%] 2025-12-04T12:25:43.1918311Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_h_cuda_int8 PASSED [0.0113s] [ 29%] 2025-12-04T12:25:43.1918457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_he_cuda_float32 PASSED [0.2832s] [ 29%] 2025-12-04T12:25:43.1918600Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_hermite_polynomial_he_cuda_int64 PASSED [0.0108s] [ 29%] 2025-12-04T12:25:43.1918722Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i0e_cuda_float32 PASSED [1.4573s] [ 29%] 2025-12-04T12:25:43.1918847Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i0e_cuda_float64 PASSED [0.0062s] [ 29%] 2025-12-04T12:25:43.1918966Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i0e_cuda_int32 PASSED [0.0045s] [ 29%] 2025-12-04T12:25:43.1919084Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i0e_cuda_int8 PASSED [1.4311s] [ 29%] 2025-12-04T12:25:43.1919199Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_bool PASSED [0.0073s] [ 29%] 2025-12-04T12:25:43.1919318Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_float64 PASSED [0.0047s] [ 29%] 2025-12-04T12:25:43.1919434Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_int32 PASSED [1.4331s] [ 29%] 2025-12-04T12:25:43.1919551Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1_cuda_int8 PASSED [0.0054s] [ 29%] 2025-12-04T12:25:43.1919672Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_bfloat16 PASSED [0.1542s] [ 29%] 2025-12-04T12:25:43.1919795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_float16 PASSED [1.4192s] [ 29%] 2025-12-04T12:25:43.1919939Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_float64 PASSED [0.0065s] [ 29%] 2025-12-04T12:25:43.1920058Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_int32 PASSED [0.0047s] [ 29%] 2025-12-04T12:25:43.1920175Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_i1e_cuda_int8 PASSED [1.4429s] [ 29%] 2025-12-04T12:25:43.1920323Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_laguerre_polynomial_l_cuda_float32 PASSED [0.0134s] [ 29%] 2025-12-04T12:25:43.1920467Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_laguerre_polynomial_l_cuda_float64 PASSED [0.0100s] [ 29%] 2025-12-04T12:25:43.1920611Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_laguerre_polynomial_l_cuda_int32 PASSED [0.0102s] [ 29%] 2025-12-04T12:25:43.1920752Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_laguerre_polynomial_l_cuda_int64 PASSED [0.0083s] [ 29%] 2025-12-04T12:25:43.1920897Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_legendre_polynomial_p_cuda_float32 PASSED [0.3017s] [ 29%] 2025-12-04T12:25:43.1921044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_legendre_polynomial_p_cuda_float64 PASSED [0.2924s] [ 29%] 2025-12-04T12:25:43.1921172Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_log_ndtr_cuda_float64 PASSED [0.0080s] [ 29%] 2025-12-04T12:25:43.1921317Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i0_cuda_float32 PASSED [1.4415s] [ 29%] 2025-12-04T12:25:43.1921453Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i1_cuda_bool PASSED [0.0080s] [ 29%] 2025-12-04T12:25:43.1921608Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_i1_cuda_int16 PASSED [0.0041s] [ 29%] 2025-12-04T12:25:43.1921757Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k0_cuda_bool PASSED [0.0053s] [ 29%] 2025-12-04T12:25:43.1921895Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k0_cuda_int32 PASSED [1.4413s] [ 29%] 2025-12-04T12:25:43.1922041Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k0_cuda_int8 PASSED [0.0059s] [ 29%] 2025-12-04T12:25:43.1922177Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_bool PASSED [0.0058s] [ 29%] 2025-12-04T12:25:43.1922318Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_float64 PASSED [0.1886s] [ 29%] 2025-12-04T12:25:43.1922457Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_int32 PASSED [1.4392s] [ 29%] 2025-12-04T12:25:43.1922596Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_modified_bessel_k1_cuda_int64 PASSED [0.0054s] [ 29%] 2025-12-04T12:25:43.1922722Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtr_cuda_float64 PASSED [0.0076s] [ 29%] 2025-12-04T12:25:43.1922844Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtr_cuda_int16 PASSED [0.0070s] [ 29%] 2025-12-04T12:25:43.1922967Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtr_cuda_int64 PASSED [0.0070s] [ 29%] 2025-12-04T12:25:43.1923088Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtri_cuda_bool PASSED [1.4425s] [ 30%] 2025-12-04T12:25:43.1923211Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtri_cuda_int32 PASSED [0.0062s] [ 30%] 2025-12-04T12:25:43.1923331Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_ndtri_cuda_int64 PASSED [0.0063s] [ 30%] 2025-12-04T12:25:43.1923494Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_bfloat16 PASSED [0.0083s] [ 30%] 2025-12-04T12:25:43.1923652Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int8 PASSED [0.0073s] [ 30%] 2025-12-04T12:25:43.1923825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_uint8 PASSED [0.0071s] [ 30%] 2025-12-04T12:25:43.1923979Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k0_cuda_float32 PASSED [1.4287s] [ 30%] 2025-12-04T12:25:43.1924126Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k0_cuda_int16 PASSED [0.0076s] [ 30%] 2025-12-04T12:25:43.1924278Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k1_cuda_float64 PASSED [0.1909s] [ 30%] 2025-12-04T12:25:43.1924425Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k1_cuda_int16 PASSED [0.0054s] [ 30%] 2025-12-04T12:25:43.1924575Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k1_cuda_int8 PASSED [1.4371s] [ 30%] 2025-12-04T12:25:43.1924723Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_scaled_modified_bessel_k1_cuda_uint8 PASSED [0.0057s] [ 30%] 2025-12-04T12:25:43.1924881Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_bool PASSED [0.0102s] [ 30%] 2025-12-04T12:25:43.1925040Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_float32 PASSED [0.0097s] [ 30%] 2025-12-04T12:25:43.1925200Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_float64 PASSED [0.0095s] [ 30%] 2025-12-04T12:25:43.1925355Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int8 PASSED [0.0083s] [ 30%] 2025-12-04T12:25:43.1925533Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_float32 PASSED [0.0096s] [ 30%] 2025-12-04T12:25:43.1925702Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int16 PASSED [0.0100s] [ 30%] 2025-12-04T12:25:43.1925857Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int32 PASSED [0.0082s] [ 30%] 2025-12-04T12:25:43.1926012Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int64 PASSED [0.0082s] [ 30%] 2025-12-04T12:25:43.1926182Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_float64 PASSED [0.3700s] [ 30%] 2025-12-04T12:25:43.1926334Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int8 PASSED [0.0106s] [ 30%] 2025-12-04T12:25:43.1926488Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_uint8 PASSED [0.0082s] [ 30%] 2025-12-04T12:25:43.1926643Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_int32 PASSED [0.0101s] [ 30%] 2025-12-04T12:25:43.1926795Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_uint8 PASSED [0.0082s] [ 30%] 2025-12-04T12:25:43.1926940Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_spherical_bessel_j0_cuda_bool PASSED [1.6214s] [ 30%] 2025-12-04T12:25:43.1927083Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_spherical_bessel_j0_cuda_int32 PASSED [0.0055s] [ 30%] 2025-12-04T12:25:43.1927225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_spherical_bessel_j0_cuda_int64 PASSED [0.0040s] [ 30%] 2025-12-04T12:25:43.1927354Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_bfloat16 PASSED [0.0138s] [ 30%] 2025-12-04T12:25:43.1927483Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_bool PASSED [0.0133s] [ 30%] 2025-12-04T12:25:43.1927613Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_float16 PASSED [0.0133s] [ 30%] 2025-12-04T12:25:43.1927738Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_int32 PASSED [0.0133s] [ 30%] 2025-12-04T12:25:43.1927874Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_xlog1py_cuda_int64 PASSED [0.0134s] [ 30%] 2025-12-04T12:25:43.1927994Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_int16 PASSED [0.0113s] [ 30%] 2025-12-04T12:25:43.1928156Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_int64 PASSED [0.0086s] [ 30%] 2025-12-04T12:25:43.1928274Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_int8 PASSED [0.0086s] [ 30%] 2025-12-04T12:25:43.1928395Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_special_zeta_cuda_uint8 PASSED [0.0085s] [ 30%] 2025-12-04T12:25:43.1928510Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_complex64 PASSED [0.0028s] [ 30%] 2025-12-04T12:25:43.1928626Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_float16 PASSED [1.4713s] [ 30%] 2025-12-04T12:25:43.1928738Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_cuda_float32 PASSED [0.0048s] [ 30%] 2025-12-04T12:25:43.1928865Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_bfloat16 PASSED [1.4454s] [ 30%] 2025-12-04T12:25:43.1928990Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_float32 PASSED [0.0051s] [ 30%] 2025-12-04T12:25:43.1929115Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_int32 PASSED [1.4358s] [ 30%] 2025-12-04T12:25:43.1929237Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_int64 PASSED [0.0053s] [ 30%] 2025-12-04T12:25:43.1929360Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_list_args_cuda_int8 PASSED [1.4466s] [ 30%] 2025-12-04T12:25:43.1929511Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_bfloat16 PASSED [0.0062s] [ 30%] 2025-12-04T12:25:43.1929662Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_complex32 PASSED [0.0040s] [ 30%] 2025-12-04T12:25:43.1929797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_float32 PASSED [1.4823s] [ 30%] 2025-12-04T12:25:43.1929933Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_float64 PASSED [0.0055s] [ 30%] 2025-12-04T12:25:43.1930079Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_int16 PASSED [0.0039s] [ 30%] 2025-12-04T12:25:43.1930207Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_split_with_sizes_copy_cuda_int64 PASSED [1.4461s] [ 30%] 2025-12-04T12:25:43.1930321Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_bool PASSED [0.0046s] [ 30%] 2025-12-04T12:25:43.1930435Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_float64 PASSED [1.4536s] [ 30%] 2025-12-04T12:25:43.1930548Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_int16 PASSED [0.0046s] [ 30%] 2025-12-04T12:25:43.1930659Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_int32 PASSED [1.4514s] [ 30%] 2025-12-04T12:25:43.1930769Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sqrt_cuda_uint8 PASSED [0.0048s] [ 30%] 2025-12-04T12:25:43.1930885Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_bfloat16 PASSED [0.0052s] [ 30%] 2025-12-04T12:25:43.1931003Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_float64 PASSED [0.0043s] [ 30%] 2025-12-04T12:25:43.1931115Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_int32 PASSED [1.4460s] [ 30%] 2025-12-04T12:25:43.1931227Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_square_cuda_int8 PASSED [0.0066s] [ 30%] 2025-12-04T12:25:43.1931351Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_bfloat16 PASSED [0.0070s] [ 30%] 2025-12-04T12:25:43.1931474Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_bool PASSED [0.0062s] [ 30%] 2025-12-04T12:25:43.1931593Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_int32 PASSED [0.0061s] [ 30%] 2025-12-04T12:25:43.1931727Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_copy_cuda_int64 PASSED [0.0061s] [ 30%] 2025-12-04T12:25:43.1931845Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_complex32 PASSED [0.0045s] [ 30%] 2025-12-04T12:25:43.1931961Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_float16 PASSED [1.4279s] [ 30%] 2025-12-04T12:25:43.1932077Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_float32 PASSED [0.0065s] [ 30%] 2025-12-04T12:25:43.1932193Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_float64 PASSED [0.0048s] [ 30%] 2025-12-04T12:25:43.1932306Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_int16 PASSED [0.0045s] [ 30%] 2025-12-04T12:25:43.1932421Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_cuda_int8 PASSED [1.4521s] [ 30%] 2025-12-04T12:25:43.1932553Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_bfloat16 PASSED [0.0057s] [ 30%] 2025-12-04T12:25:43.1932680Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_float16 PASSED [0.0041s] [ 30%] 2025-12-04T12:25:43.1932808Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_squeeze_multiple_cuda_int64 PASSED [1.4363s] [ 30%] 2025-12-04T12:25:43.1932920Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_float32 PASSED [0.0118s] [ 30%] 2025-12-04T12:25:43.1933033Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_int32 PASSED [0.0090s] [ 30%] 2025-12-04T12:25:43.1933142Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_stack_cuda_int8 PASSED [0.0087s] [ 30%] 2025-12-04T12:25:43.1933271Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_cuda_complex128 PASSED [0.0120s] [ 30%] 2025-12-04T12:25:43.1933394Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_cuda_complex64 PASSED [0.0115s] [ 30%] 2025-12-04T12:25:43.1933506Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_cuda_float32 PASSED [0.0107s] [ 30%] 2025-12-04T12:25:43.1933614Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_cuda_float64 PASSED [0.0108s] [ 30%] 2025-12-04T12:25:43.1933752Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_cuda_complex64 PASSED [0.0159s] [ 30%] 2025-12-04T12:25:43.1933867Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_cuda_float64 PASSED [0.0147s] [ 30%] 2025-12-04T12:25:43.1933995Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_unbiased_cuda_bfloat16 PASSED [0.0039s] [ 30%] 2025-12-04T12:25:43.1934122Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_mean_unbiased_cuda_float32 PASSED [1.4520s] [ 30%] 2025-12-04T12:25:43.1934247Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_unbiased_cuda_float16 PASSED [0.0061s] [ 30%] 2025-12-04T12:25:43.1934369Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_std_unbiased_cuda_float32 PASSED [0.0040s] [ 30%] 2025-12-04T12:25:43.1934480Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_int16 PASSED [0.0104s] [ 30%] 2025-12-04T12:25:43.1934587Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_int32 PASSED [0.0101s] [ 30%] 2025-12-04T12:25:43.1934697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sub_cuda_int8 PASSED [0.0107s] [ 30%] 2025-12-04T12:25:43.1934808Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_bfloat16 PASSED [0.0147s] [ 30%] 2025-12-04T12:25:43.1934918Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_bool PASSED [0.0132s] [ 30%] 2025-12-04T12:25:43.1935032Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_complex32 PASSED [0.0171s] [ 30%] 2025-12-04T12:25:43.1935143Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_float64 PASSED [0.0126s] [ 30%] 2025-12-04T12:25:43.1935255Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_int16 PASSED [0.0128s] [ 30%] 2025-12-04T12:25:43.1935374Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_int32 PASSED [1.4486s] [ 30%] 2025-12-04T12:25:43.1935482Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_cuda_int8 PASSED [0.0151s] [ 30%] 2025-12-04T12:25:43.1935603Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_bfloat16 PASSED [0.0111s] [ 30%] 2025-12-04T12:25:43.1935728Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_complex128 PASSED [0.0100s] [ 30%] 2025-12-04T12:25:43.1935850Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_complex64 PASSED [0.0099s] [ 30%] 2025-12-04T12:25:43.1935969Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_int64 PASSED [1.4552s] [ 30%] 2025-12-04T12:25:43.1936085Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_sum_to_size_cuda_int8 PASSED [0.0129s] [ 30%] 2025-12-04T12:25:43.1936200Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_svd_cuda_complex64 PASSED [0.1576s] [ 30%] 2025-12-04T12:25:43.1936315Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_bfloat16 PASSED [0.0043s] [ 30%] 2025-12-04T12:25:43.1936435Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_complex128 PASSED [1.4319s] [ 30%] 2025-12-04T12:25:43.1936549Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_copy_cuda_float64 PASSED [0.0058s] [ 30%] 2025-12-04T12:25:43.1936662Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_bfloat16 PASSED [0.0033s] [ 30%] 2025-12-04T12:25:43.1936770Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_float16 PASSED [1.4255s] [ 30%] 2025-12-04T12:25:43.1936878Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_int32 PASSED [0.0052s] [ 30%] 2025-12-04T12:25:43.1936995Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_int64 PASSED [1.4737s] [ 30%] 2025-12-04T12:25:43.1937113Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_t_cuda_int8 PASSED [0.0049s] [ 30%] 2025-12-04T12:25:43.1937243Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_along_dim_cuda_complex64 PASSED [0.0122s] [ 30%] 2025-12-04T12:25:43.1937370Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_along_dim_cuda_float16 PASSED [0.0110s] [ 30%] 2025-12-04T12:25:43.1937506Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_along_dim_cuda_int16 PASSED [0.0109s] [ 30%] 2025-12-04T12:25:43.1937628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_along_dim_cuda_int32 PASSED [0.0109s] [ 30%] 2025-12-04T12:25:43.1937742Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_int64 PASSED [0.0079s] [ 30%] 2025-12-04T12:25:43.1937853Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_take_cuda_uint8 PASSED [0.0076s] [ 30%] 2025-12-04T12:25:43.1937970Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_complex128 PASSED [1.4316s] [ 30%] 2025-12-04T12:25:43.1938158Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_float32 PASSED [0.0046s] [ 30%] 2025-12-04T12:25:43.1938270Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_int32 PASSED [1.4758s] [ 30%] 2025-12-04T12:25:43.1938380Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_int64 PASSED [0.0049s] [ 30%] 2025-12-04T12:25:43.1938491Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tan_cuda_uint8 PASSED [1.4649s] [ 30%] 2025-12-04T12:25:43.1938603Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_bfloat16 PASSED [0.0049s] [ 30%] 2025-12-04T12:25:43.1938714Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_bool PASSED [1.4505s] [ 30%] 2025-12-04T12:25:43.1938825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_float16 PASSED [0.0048s] [ 30%] 2025-12-04T12:25:43.1938936Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_int32 PASSED [1.4526s] [ 30%] 2025-12-04T12:25:43.1939044Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tanh_cuda_int8 PASSED [0.0048s] [ 30%] 2025-12-04T12:25:43.1939179Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_bool PASSED [0.0082s] [ 30%] 2025-12-04T12:25:43.1939304Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_complex128 PASSED [1.4630s] [ 30%] 2025-12-04T12:25:43.1939429Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tensor_split_cuda_int16 PASSED [0.0098s] [ 30%] 2025-12-04T12:25:43.1939542Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_bfloat16 PASSED [0.0341s] [ 30%] 2025-12-04T12:25:43.1939658Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_complex64 PASSED [0.0305s] [ 30%] 2025-12-04T12:25:43.1939769Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_float16 PASSED [0.0301s] [ 30%] 2025-12-04T12:25:43.1939882Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_float32 PASSED [0.0301s] [ 30%] 2025-12-04T12:25:43.1939994Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_float64 PASSED [0.0320s] [ 30%] 2025-12-04T12:25:43.1940103Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_int16 PASSED [0.0309s] [ 30%] 2025-12-04T12:25:43.1940214Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_int32 PASSED [0.0300s] [ 30%] 2025-12-04T12:25:43.1940321Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_int8 PASSED [0.0301s] [ 30%] 2025-12-04T12:25:43.1940432Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tile_cuda_uint8 PASSED [0.0303s] [ 31%] 2025-12-04T12:25:43.1940540Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_float16 PASSED [1.4321s] [ 31%] 2025-12-04T12:25:43.1940649Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_float32 PASSED [0.0072s] [ 31%] 2025-12-04T12:25:43.1940768Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_float64 PASSED [1.4397s] [ 31%] 2025-12-04T12:25:43.1940887Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_int64 PASSED [0.0074s] [ 31%] 2025-12-04T12:25:43.1940992Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_cuda_int8 PASSED [1.4450s] [ 31%] 2025-12-04T12:25:43.1941112Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_bfloat16 PASSED [0.0614s] [ 31%] 2025-12-04T12:25:43.1941248Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_complex128 PASSED [1.4354s] [ 31%] 2025-12-04T12:25:43.1941369Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_complex64 PASSED [0.0057s] [ 31%] 2025-12-04T12:25:43.1941485Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_float16 PASSED [1.4497s] [ 31%] 2025-12-04T12:25:43.1941603Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_float32 PASSED [0.0058s] [ 31%] 2025-12-04T12:25:43.1941719Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_int32 PASSED [1.4755s] [ 31%] 2025-12-04T12:25:43.1941835Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_to_sparse_cuda_int64 PASSED [0.0057s] [ 31%] 2025-12-04T12:25:43.1941948Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_topk_cuda_bfloat16 PASSED [0.0074s] [ 31%] 2025-12-04T12:25:43.1942059Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_topk_cuda_float16 PASSED [0.0067s] [ 31%] 2025-12-04T12:25:43.1942168Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_topk_cuda_int32 PASSED [0.0066s] [ 31%] 2025-12-04T12:25:43.1942321Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_int64 PASSED [0.0085s] [ 31%] 2025-12-04T12:25:43.1942434Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_bfloat16 PASSED [0.0029s] [ 31%] 2025-12-04T12:25:43.1942550Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_float16 PASSED [1.4590s] [ 31%] 2025-12-04T12:25:43.1942666Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_float64 PASSED [0.0048s] [ 31%] 2025-12-04T12:25:43.1942777Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trace_cuda_uint8 PASSED [1.4362s] [ 31%] 2025-12-04T12:25:43.1942911Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_bool PASSED [0.0085s] [ 31%] 2025-12-04T12:25:43.1943036Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_copy_cuda_int16 PASSED [1.4412s] [ 31%] 2025-12-04T12:25:43.1943162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_bfloat16 PASSED [0.0061s] [ 31%] 2025-12-04T12:25:43.1943276Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_bool PASSED [0.0046s] [ 31%] 2025-12-04T12:25:43.1943399Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_complex64 PASSED [0.0045s] [ 31%] 2025-12-04T12:25:43.1943518Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_float16 PASSED [1.4498s] [ 31%] 2025-12-04T12:25:43.1943636Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_float32 PASSED [0.0066s] [ 31%] 2025-12-04T12:25:43.1943750Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_int32 PASSED [0.0047s] [ 31%] 2025-12-04T12:25:43.1943869Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_int8 PASSED [0.0044s] [ 31%] 2025-12-04T12:25:43.1943984Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_transpose_cuda_uint8 PASSED [1.4302s] [ 31%] 2025-12-04T12:25:43.1944106Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_complex64 PASSED [0.0340s] [ 31%] 2025-12-04T12:25:43.1944222Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_float32 PASSED [0.0312s] [ 31%] 2025-12-04T12:25:43.1944339Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapezoid_cuda_float64 PASSED [0.0310s] [ 31%] 2025-12-04T12:25:43.1944463Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_float32 PASSED [0.0309s] [ 31%] 2025-12-04T12:25:43.1944588Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_float64 PASSED [1.4716s] [ 31%] 2025-12-04T12:25:43.1944699Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_int16 PASSED [0.0344s] [ 31%] 2025-12-04T12:25:43.1944813Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trapz_cuda_int64 PASSED [0.0314s] [ 31%] 2025-12-04T12:25:43.1944956Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triangular_solve_cuda_complex64 PASSED [0.0134s] [ 31%] 2025-12-04T12:25:43.1945086Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triangular_solve_cuda_float32 PASSED [0.0149s] [ 31%] 2025-12-04T12:25:43.1945201Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_bfloat16 PASSED [0.0110s] [ 31%] 2025-12-04T12:25:43.1945312Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_float16 PASSED [0.0102s] [ 31%] 2025-12-04T12:25:43.1945425Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_cuda_float32 PASSED [0.0101s] [ 31%] 2025-12-04T12:25:43.1945544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_tril_indices_cuda_int64 PASSED [0.0164s] [ 31%] 2025-12-04T12:25:43.1945657Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_complex64 PASSED [0.0102s] [ 31%] 2025-12-04T12:25:43.1945768Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_float32 PASSED [0.0100s] [ 31%] 2025-12-04T12:25:43.1945879Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_cuda_int16 PASSED [1.4283s] [ 31%] 2025-12-04T12:25:43.1946000Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_triu_indices_cuda_int32 PASSED [0.0136s] [ 31%] 2025-12-04T12:25:43.1946117Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_bool PASSED [0.0099s] [ 31%] 2025-12-04T12:25:43.1946241Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_complex128 PASSED [0.0094s] [ 31%] 2025-12-04T12:25:43.1946366Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_float16 PASSED [0.0093s] [ 31%] 2025-12-04T12:25:43.1946485Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_float32 PASSED [0.0091s] [ 31%] 2025-12-04T12:25:43.1946614Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_true_divide_cuda_uint8 PASSED [0.0092s] [ 31%] 2025-12-04T12:25:43.1946725Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trunc_cuda_float32 PASSED [1.4444s] [ 31%] 2025-12-04T12:25:43.1946836Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_trunc_cuda_int32 PASSED [0.0045s] [ 31%] 2025-12-04T12:25:43.1946960Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_bfloat16 PASSED [0.0052s] [ 31%] 2025-12-04T12:25:43.1947088Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_complex32 PASSED [1.4354s] [ 31%] 2025-12-04T12:25:43.1947212Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_complex64 PASSED [0.0069s] [ 31%] 2025-12-04T12:25:43.1947337Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_float32 PASSED [0.0048s] [ 31%] 2025-12-04T12:25:43.1947458Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_float64 PASSED [1.4356s] [ 31%] 2025-12-04T12:25:43.1947577Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_int32 PASSED [0.0065s] [ 31%] 2025-12-04T12:25:43.1947697Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_int8 PASSED [0.0047s] [ 31%] 2025-12-04T12:25:43.1947816Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_copy_cuda_uint8 PASSED [1.4311s] [ 31%] 2025-12-04T12:25:43.1947929Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_cuda_bool PASSED [0.0058s] [ 31%] 2025-12-04T12:25:43.1948043Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_cuda_float64 PASSED [0.0044s] [ 31%] 2025-12-04T12:25:43.1948211Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unbind_cuda_int8 PASSED [1.4687s] [ 31%] 2025-12-04T12:25:43.1948352Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_complex64 PASSED [0.0068s] [ 31%] 2025-12-04T12:25:43.1948470Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_float16 PASSED [0.0051s] [ 31%] 2025-12-04T12:25:43.1948588Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_float32 PASSED [0.0050s] [ 31%] 2025-12-04T12:25:43.1948736Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unflatten_cuda_float64 PASSED [0.0049s] [ 31%] 2025-12-04T12:25:43.1948858Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_bfloat16 PASSED [0.0132s] [ 31%] 2025-12-04T12:25:43.1948984Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_complex64 PASSED [0.0129s] [ 31%] 2025-12-04T12:25:43.1949104Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_float32 PASSED [0.0128s] [ 31%] 2025-12-04T12:25:43.1949225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_copy_cuda_uint8 PASSED [0.0128s] [ 31%] 2025-12-04T12:25:43.1949336Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_bool PASSED [0.0080s] [ 31%] 2025-12-04T12:25:43.1949456Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_complex32 PASSED [0.0081s] [ 31%] 2025-12-04T12:25:43.1949571Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_float16 PASSED [0.0080s] [ 31%] 2025-12-04T12:25:43.1949684Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_int32 PASSED [0.0079s] [ 31%] 2025-12-04T12:25:43.1949797Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unfold_cuda_int64 PASSED [0.0080s] [ 31%] 2025-12-04T12:25:43.1949920Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_uniform_cuda_complex64 PASSED [0.0036s] [ 31%] 2025-12-04T12:25:43.1950037Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_uniform_cuda_float32 PASSED [1.4557s] [ 31%] 2025-12-04T12:25:43.1950169Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_bfloat16 PASSED [0.1062s] [ 31%] 2025-12-04T12:25:43.1950303Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_consecutive_cuda_float32 PASSED [0.1023s] [ 31%] 2025-12-04T12:25:43.1950428Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unique_cuda_int16 PASSED [0.2051s] [ 31%] 2025-12-04T12:25:43.1950553Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unravel_index_cuda_int32 PASSED [0.0436s] [ 31%] 2025-12-04T12:25:43.1950673Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unravel_index_cuda_int8 PASSED [0.0424s] [ 31%] 2025-12-04T12:25:43.1950796Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unravel_index_cuda_uint8 PASSED [1.4973s] [ 31%] 2025-12-04T12:25:43.1950912Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_bool PASSED [0.0053s] [ 31%] 2025-12-04T12:25:43.1951038Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_float16 PASSED [1.4598s] [ 31%] 2025-12-04T12:25:43.1951162Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_chunk_cuda_float32 PASSED [0.0054s] [ 31%] 2025-12-04T12:25:43.1951280Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_bool PASSED [1.4443s] [ 31%] 2025-12-04T12:25:43.1951409Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_complex32 PASSED [0.0047s] [ 31%] 2025-12-04T12:25:43.1951534Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_float32 PASSED [1.4956s] [ 31%] 2025-12-04T12:25:43.1951656Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_float64 PASSED [0.0046s] [ 31%] 2025-12-04T12:25:43.1951781Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsafe_split_cuda_int32 PASSED [1.4665s] [ 31%] 2025-12-04T12:25:43.1951906Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_float16 PASSED [0.0091s] [ 31%] 2025-12-04T12:25:43.1952043Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_float32 PASSED [0.0068s] [ 31%] 2025-12-04T12:25:43.1952181Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_float64 PASSED [0.0065s] [ 31%] 2025-12-04T12:25:43.1952305Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_copy_cuda_int32 PASSED [0.0065s] [ 31%] 2025-12-04T12:25:43.1952420Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_bool PASSED [0.0046s] [ 31%] 2025-12-04T12:25:43.1952552Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_complex32 PASSED [0.0046s] [ 31%] 2025-12-04T12:25:43.1952674Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_complex64 PASSED [1.4678s] [ 31%] 2025-12-04T12:25:43.1952789Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_int64 PASSED [0.0066s] [ 31%] 2025-12-04T12:25:43.1952906Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_unsqueeze_cuda_int8 PASSED [0.0048s] [ 31%] 2025-12-04T12:25:43.1953020Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_cuda_complex128 PASSED [0.0109s] [ 31%] 2025-12-04T12:25:43.1953142Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_cuda_complex128 PASSED [1.4544s] [ 31%] 2025-12-04T12:25:43.1953258Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_cuda_float64 PASSED [0.0166s] [ 31%] 2025-12-04T12:25:43.1953391Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_mean_unbiased_cuda_float32 PASSED [0.0040s] [ 31%] 2025-12-04T12:25:43.1953513Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_var_unbiased_cuda_bfloat16 PASSED [1.4462s] [ 31%] 2025-12-04T12:25:43.1953628Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vdot_cuda_bfloat16 PASSED [0.0052s] [ 31%] 2025-12-04T12:25:43.1953744Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vdot_cuda_complex128 PASSED [0.0047s] [ 31%] 2025-12-04T12:25:43.1953868Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_complex_cuda_float16 PASSED [1.4437s] [ 31%] 2025-12-04T12:25:43.1953985Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_complex32 PASSED [0.0056s] [ 31%] 2025-12-04T12:25:43.1954100Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_float16 PASSED [0.0038s] [ 31%] 2025-12-04T12:25:43.1954225Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_float32 PASSED [1.4576s] [ 31%] 2025-12-04T12:25:43.1954341Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_float64 PASSED [0.0056s] [ 31%] 2025-12-04T12:25:43.1954454Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_cuda_uint8 PASSED [0.0040s] [ 31%] 2025-12-04T12:25:43.1954580Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_as_real_cuda_complex128 PASSED [1.4624s] [ 31%] 2025-12-04T12:25:43.1954696Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_bfloat16 PASSED [0.0082s] [ 31%] 2025-12-04T12:25:43.1954811Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_bool PASSED [1.4345s] [ 31%] 2025-12-04T12:25:43.1954937Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_complex128 PASSED [0.0077s] [ 31%] 2025-12-04T12:25:43.1955053Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_copy_cuda_float16 PASSED [1.4469s] [ 31%] 2025-12-04T12:25:43.1955169Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_complex128 PASSED [0.0062s] [ 31%] 2025-12-04T12:25:43.1955282Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_complex64 PASSED [0.0045s] [ 31%] 2025-12-04T12:25:43.1955397Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_float64 PASSED [1.4424s] [ 31%] 2025-12-04T12:25:43.1955506Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_view_cuda_int64 PASSED [0.0066s] [ 31%] 2025-12-04T12:25:43.1955626Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_complex128 PASSED [0.0038s] [ 31%] 2025-12-04T12:25:43.1955753Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_complex32 PASSED [1.4478s] [ 31%] 2025-12-04T12:25:43.1955882Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_float16 PASSED [0.0056s] [ 31%] 2025-12-04T12:25:43.1955995Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_float64 PASSED [0.0039s] [ 31%] 2025-12-04T12:25:43.1956108Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vsplit_cuda_int32 PASSED [1.4401s] [ 31%] 2025-12-04T12:25:43.1956233Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_bfloat16 PASSED [0.0073s] [ 31%] 2025-12-04T12:25:43.1956348Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_bool PASSED [1.4594s] [ 31%] 2025-12-04T12:25:43.1956460Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_float16 PASSED [0.0068s] [ 31%] 2025-12-04T12:25:43.1956575Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_float32 PASSED [1.4472s] [ 31%] 2025-12-04T12:25:43.1956686Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_int16 PASSED [0.0072s] [ 31%] 2025-12-04T12:25:43.1956800Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_int32 PASSED [1.4486s] [ 31%] 2025-12-04T12:25:43.1956911Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_vstack_cuda_int8 PASSED [0.0072s] [ 32%] 2025-12-04T12:25:43.1957025Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_bfloat16 PASSED [1.4813s] [ 32%] 2025-12-04T12:25:43.1957140Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_complex128 PASSED [0.0098s] [ 32%] 2025-12-04T12:25:43.1957257Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_complex32 PASSED [0.0077s] [ 32%] 2025-12-04T12:25:43.1957373Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_complex64 PASSED [0.0075s] [ 32%] 2025-12-04T12:25:43.1957483Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_int16 PASSED [1.4414s] [ 32%] 2025-12-04T12:25:43.1957595Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_where_cuda_int32 PASSED [0.0097s] [ 32%] 2025-12-04T12:25:43.1957707Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_bfloat16 PASSED [0.0140s] [ 32%] 2025-12-04T12:25:43.1957818Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_bool PASSED [0.0133s] [ 32%] 2025-12-04T12:25:43.1957942Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_float16 PASSED [0.0133s] [ 32%] 2025-12-04T12:25:43.1958056Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_float32 PASSED [0.0131s] [ 32%] 2025-12-04T12:25:43.1958205Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_float64 PASSED [0.0133s] [ 32%] 2025-12-04T12:25:43.1958317Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_int16 PASSED [0.0133s] [ 32%] 2025-12-04T12:25:43.1958426Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_xlogy_cuda_int8 PASSED [0.0133s] [ 32%] 2025-12-04T12:25:43.1958544Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_complex128 PASSED [0.0043s] [ 32%] 2025-12-04T12:25:43.1958659Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_complex64 PASSED [0.0042s] [ 32%] 2025-12-04T12:25:43.1958772Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_int8 PASSED [0.0044s] [ 32%] 2025-12-04T12:25:43.1958884Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zero__cuda_uint8 PASSED [0.0041s] [ 32%] 2025-12-04T12:25:43.1958996Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_bool PASSED [1.4407s] [ 32%] 2025-12-04T12:25:43.1959113Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_complex128 PASSED [0.0048s] [ 32%] 2025-12-04T12:25:43.1959231Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_complex32 PASSED [1.4559s] [ 32%] 2025-12-04T12:25:43.1959343Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_float16 PASSED [0.0046s] [ 32%] 2025-12-04T12:25:43.1959468Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_int16 PASSED [1.4559s] [ 32%] 2025-12-04T12:25:43.1959587Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_cuda_int32 PASSED [0.0048s] [ 32%] 2025-12-04T12:25:43.1959709Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_bfloat16 PASSED [0.0072s] [ 32%] 2025-12-04T12:25:43.1959825Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_bool PASSED [0.0064s] [ 32%] 2025-12-04T12:25:43.1959963Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_complex128 PASSED [0.0064s] [ 32%] 2025-12-04T12:25:43.1960089Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_complex32 PASSED [0.0063s] [ 32%] 2025-12-04T12:25:43.1960208Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_float32 PASSED [0.0062s] [ 32%] 2025-12-04T12:25:43.1960329Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_float64 PASSED [0.0062s] [ 32%] 2025-12-04T12:25:43.1960447Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_int16 PASSED [0.0063s] [ 32%] 2025-12-04T12:25:43.1960564Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_int64 PASSED [0.0063s] [ 32%] 2025-12-04T12:25:43.1960681Z test_meta.py::TestMetaCUDA::test_dispatch_meta_outplace_zeros_like_cuda_uint8 PASSED [0.0062s] [ 32%] 2025-12-04T12:25:43.1960854Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1961016Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1961177Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1961336Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_H_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1961504Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1961663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1961845Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1962010Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1962174Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1962338Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1962498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1962658Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_T_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1962837Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1963019Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1963197Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1963377Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1963567Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___getitem___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1963744Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1963919Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1964097Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1964266Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1964431Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___radd___cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1964601Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rand___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1964765Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rand___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1964940Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1965109Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1965277Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1965441Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1965609Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rdiv___cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1965784Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmatmul___cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1965961Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1966126Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmod___cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1966298Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1966471Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1966634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rmul___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1966801Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___ror___cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1966972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1967147Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1967318Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1967499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1967676Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1967844Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1968020Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1968225Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rpow___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1968396Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rsub___cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1968563Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rsub___cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1968729Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rsub___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1968895Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rsub___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1969067Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rsub___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1969232Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rxor___cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1969398Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace___rxor___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1969589Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__batch_norm_with_update_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1969781Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__batch_norm_with_update_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1969985Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__batch_norm_with_update_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1970162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1970342Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1970519Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1970697Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1970868Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1971040Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__chunk_cat_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.1971178Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_float64 PASSED [1.4664s] [ 32%] 2025-12-04T12:25:43.1971311Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_abs_cuda_int8 PASSED [0.0235s] [ 32%] 2025-12-04T12:25:43.1971447Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_bfloat16 PASSED [0.0287s] [ 32%] 2025-12-04T12:25:43.1971593Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_bool XFAIL [0.0076s] [ 32%] 2025-12-04T12:25:43.1971742Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_float16 PASSED [1.4787s] [ 32%] 2025-12-04T12:25:43.1971875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_int16 XFAIL [0.0079s] [ 32%] 2025-12-04T12:25:43.1972008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_acos_cuda_uint8 XFAIL [1.4499s] [ 32%] 2025-12-04T12:25:43.1972158Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_bfloat16 PASSED [1.5929s] [ 32%] 2025-12-04T12:25:43.1972287Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_bool XFAIL [0.0138s] [ 32%] 2025-12-04T12:25:43.1972419Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_float16 PASSED [1.5635s] [ 32%] 2025-12-04T12:25:43.1972554Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_float64 PASSED [0.1131s] [ 32%] 2025-12-04T12:25:43.1972683Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_add_cuda_int8 PASSED [0.0831s] [ 32%] 2025-12-04T12:25:43.1972830Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcdiv_cuda_complex128 PASSED [0.2572s] [ 32%] 2025-12-04T12:25:43.1972970Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcdiv_cuda_float16 PASSED [0.1937s] [ 32%] 2025-12-04T12:25:43.1973107Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcdiv_cuda_int64 XFAIL [0.0087s] [ 32%] 2025-12-04T12:25:43.1973249Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_complex128 PASSED [1.7068s] [ 32%] 2025-12-04T12:25:43.1973387Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_float16 PASSED [0.1931s] [ 32%] 2025-12-04T12:25:43.1973525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_float32 PASSED [0.1928s] [ 32%] 2025-12-04T12:25:43.1973663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_float64 PASSED [0.1951s] [ 32%] 2025-12-04T12:25:43.1973799Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_int16 PASSED [0.1321s] [ 32%] 2025-12-04T12:25:43.1973952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_int32 PASSED [0.1319s] [ 32%] 2025-12-04T12:25:43.1974087Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_addcmul_cuda_int64 PASSED [0.1320s] [ 32%] 2025-12-04T12:25:43.1974217Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_asin_cuda_int16 XFAIL [0.0075s] [ 32%] 2025-12-04T12:25:43.1974356Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_bfloat16 PASSED [1.4758s] [ 32%] 2025-12-04T12:25:43.1974486Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_bool XFAIL [0.0079s] [ 32%] 2025-12-04T12:25:43.1974630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_complex128 PASSED [1.4804s] [ 32%] 2025-12-04T12:25:43.1974764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_float16 PASSED [0.0285s] [ 32%] 2025-12-04T12:25:43.1974900Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_float32 PASSED [0.0210s] [ 32%] 2025-12-04T12:25:43.1975030Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_atan_cuda_uint8 XFAIL [0.0076s] [ 32%] 2025-12-04T12:25:43.1975168Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_complex64 XFAIL [1.4700s] [ 32%] 2025-12-04T12:25:43.1975299Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_int32 PASSED [1.4658s] [ 32%] 2025-12-04T12:25:43.1975430Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_ceil_cuda_int8 PASSED [0.0213s] [ 32%] 2025-12-04T12:25:43.1975585Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_complex128 XFAIL [0.0087s] [ 32%] 2025-12-04T12:25:43.1975740Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_complex64 XFAIL [1.4717s] [ 32%] 2025-12-04T12:25:43.1975884Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_float16 PASSED [1.7098s] [ 32%] 2025-12-04T12:25:43.1976033Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_int32 PASSED [0.1324s] [ 32%] 2025-12-04T12:25:43.1976170Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_max_cuda_int8 PASSED [0.1323s] [ 32%] 2025-12-04T12:25:43.1976315Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_complex128 XFAIL [0.0086s] [ 32%] 2025-12-04T12:25:43.1976458Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_complex64 XFAIL [1.4465s] [ 32%] 2025-12-04T12:25:43.1976804Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_float64 PASSED [1.6341s] [ 32%] 2025-12-04T12:25:43.1976943Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_clamp_min_cuda_int64 PASSED [0.1361s] [ 32%] 2025-12-04T12:25:43.1977077Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_bool PASSED [0.0217s] [ 32%] 2025-12-04T12:25:43.1977219Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_complex128 PASSED [0.0215s] [ 32%] 2025-12-04T12:25:43.1977356Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_complex64 PASSED [0.0215s] [ 32%] 2025-12-04T12:25:43.1977493Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_float32 PASSED [0.0212s] [ 33%] 2025-12-04T12:25:43.1977626Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_copy_cuda_int64 PASSED [0.0212s] [ 33%] 2025-12-04T12:25:43.1977755Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_bool XFAIL [0.0075s] [ 33%] 2025-12-04T12:25:43.1977890Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_complex64 PASSED [1.4864s] [ 33%] 2025-12-04T12:25:43.1978044Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cos_cuda_float16 PASSED [0.0284s] [ 33%] 2025-12-04T12:25:43.1978221Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_float32 PASSED [0.0212s] [ 33%] 2025-12-04T12:25:43.1978356Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_float64 PASSED [0.0210s] [ 33%] 2025-12-04T12:25:43.1978484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_cosh_cuda_int64 XFAIL [0.0075s] [ 33%] 2025-12-04T12:25:43.1978620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_bfloat16 PASSED [1.5756s] [ 33%] 2025-12-04T12:25:43.1978758Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_complex64 PASSED [0.1446s] [ 33%] 2025-12-04T12:25:43.1978886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_int16 XFAIL [0.0089s] [ 33%] 2025-12-04T12:25:43.1979015Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_int64 XFAIL [1.4676s] [ 33%] 2025-12-04T12:25:43.1979141Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_div_cuda_int8 XFAIL [1.4584s] [ 33%] 2025-12-04T12:25:43.1979277Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_complex128 XFAIL [1.4681s] [ 33%] 2025-12-04T12:25:43.1979411Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_complex64 XFAIL [1.4766s] [ 33%] 2025-12-04T12:25:43.1979539Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erf_cuda_int32 XFAIL [1.4602s] [ 33%] 2025-12-04T12:25:43.1979688Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_complex64 XFAIL [1.4591s] [ 33%] 2025-12-04T12:25:43.1979835Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_float16 PASSED [1.5050s] [ 33%] 2025-12-04T12:25:43.1979965Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_int64 XFAIL [0.0079s] [ 33%] 2025-12-04T12:25:43.1980095Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_erfc_cuda_int8 XFAIL [1.4872s] [ 33%] 2025-12-04T12:25:43.1980237Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_bool XFAIL [1.4593s] [ 33%] 2025-12-04T12:25:43.1980375Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_complex64 PASSED [1.4576s] [ 33%] 2025-12-04T12:25:43.1980508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_float32 PASSED [0.0215s] [ 33%] 2025-12-04T12:25:43.1980637Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_int64 XFAIL [0.0076s] [ 33%] 2025-12-04T12:25:43.1980765Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_exp_cuda_uint8 XFAIL [0.0074s] [ 33%] 2025-12-04T12:25:43.1980902Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_expm1_cuda_float16 PASSED [1.4725s] [ 33%] 2025-12-04T12:25:43.1981036Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_expm1_cuda_float32 PASSED [0.0214s] [ 33%] 2025-12-04T12:25:43.1981170Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_float16 PASSED [0.0283s] [ 33%] 2025-12-04T12:25:43.1981305Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_int64 PASSED [0.0211s] [ 33%] 2025-12-04T12:25:43.1981434Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_int8 PASSED [0.0208s] [ 33%] 2025-12-04T12:25:43.1981638Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_floor_cuda_uint8 PASSED [0.0209s] [ 33%] 2025-12-04T12:25:43.1981771Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_float16 PASSED [0.0423s] [ 33%] 2025-12-04T12:25:43.1981904Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_float64 PASSED [0.0339s] [ 33%] 2025-12-04T12:25:43.1982050Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_frac_cuda_int32 XFAIL [0.0076s] [ 33%] 2025-12-04T12:25:43.1982187Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_bfloat16 PASSED [1.5821s] [ 33%] 2025-12-04T12:25:43.1982315Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_bool XFAIL [0.0096s] [ 33%] 2025-12-04T12:25:43.1982453Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_complex128 PASSED [1.6535s] [ 33%] 2025-12-04T12:25:43.1982587Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_float64 PASSED [0.1385s] [ 33%] 2025-12-04T12:25:43.1982718Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lerp_cuda_int64 XFAIL [0.0094s] [ 33%] 2025-12-04T12:25:43.1982859Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_complex128 XFAIL [1.4651s] [ 33%] 2025-12-04T12:25:43.1983069Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_int32 SKIPPED [0.0002s] (In-place lgamma not supported for integral tensors) [ 33%] 2025-12-04T12:25:43.1983275Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_lgamma_cuda_int64 SKIPPED [0.0001s] (In-place lgamma not supported for integral tensors) [ 33%] 2025-12-04T12:25:43.1983412Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_complex64 PASSED [1.4640s] [ 33%] 2025-12-04T12:25:43.1983545Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log10_cuda_int16 XFAIL [0.0079s] [ 33%] 2025-12-04T12:25:43.1983683Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_bool XFAIL [0.0075s] [ 33%] 2025-12-04T12:25:43.1983832Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_complex64 PASSED [1.4575s] [ 33%] 2025-12-04T12:25:43.1983968Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_float16 PASSED [0.0287s] [ 33%] 2025-12-04T12:25:43.1984111Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_int32 XFAIL [0.0077s] [ 33%] 2025-12-04T12:25:43.1984241Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_int8 XFAIL [1.4447s] [ 33%] 2025-12-04T12:25:43.1984373Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log1p_cuda_uint8 XFAIL [1.4461s] [ 33%] 2025-12-04T12:25:43.1984507Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_bfloat16 PASSED [1.4799s] [ 33%] 2025-12-04T12:25:43.1984639Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_bool XFAIL [0.0079s] [ 33%] 2025-12-04T12:25:43.1984778Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_complex128 PASSED [1.4782s] [ 33%] 2025-12-04T12:25:43.1984917Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_complex64 PASSED [0.0215s] [ 33%] 2025-12-04T12:25:43.1985052Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_float32 PASSED [0.0212s] [ 33%] 2025-12-04T12:25:43.1985182Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_int64 XFAIL [0.0075s] [ 33%] 2025-12-04T12:25:43.1985311Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log2_cuda_uint8 XFAIL [0.0075s] [ 33%] 2025-12-04T12:25:43.1985448Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_complex64 PASSED [1.4536s] [ 33%] 2025-12-04T12:25:43.1985583Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_float64 PASSED [0.0215s] [ 33%] 2025-12-04T12:25:43.1985710Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_log_cuda_int8 XFAIL [0.0076s] [ 33%] 2025-12-04T12:25:43.1985903Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_max_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1986080Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_max_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1986224Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_complex128 XFAIL [0.0084s] [ 33%] 2025-12-04T12:25:43.1986359Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_maximum_cuda_int8 PASSED [1.5671s] [ 33%] 2025-12-04T12:25:43.1986502Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_complex128 XFAIL [0.0088s] [ 33%] 2025-12-04T12:25:43.1986640Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_float16 PASSED [1.7012s] [ 33%] 2025-12-04T12:25:43.1986776Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_int16 PASSED [0.1322s] [ 33%] 2025-12-04T12:25:43.1986909Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_minimum_cuda_int8 PASSED [0.1321s] [ 33%] 2025-12-04T12:25:43.1987050Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_complex128 PASSED [0.1434s] [ 33%] 2025-12-04T12:25:43.1987183Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_mul_cuda_uint8 PASSED [0.0829s] [ 33%] 2025-12-04T12:25:43.1987312Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_bool XFAIL [0.0074s] [ 33%] 2025-12-04T12:25:43.1987448Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_complex64 PASSED [1.4567s] [ 33%] 2025-12-04T12:25:43.1987595Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_float32 PASSED [0.0214s] [ 33%] 2025-12-04T12:25:43.1987737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_int16 PASSED [0.0210s] [ 33%] 2025-12-04T12:25:43.1987865Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_int64 PASSED [0.0211s] [ 33%] 2025-12-04T12:25:43.1988007Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_neg_cuda_int8 PASSED [0.0210s] [ 33%] 2025-12-04T12:25:43.1988225Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1988408Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1988585Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1988761Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_norm_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1988896Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_bfloat16 PASSED [0.1809s] [ 33%] 2025-12-04T12:25:43.1989026Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_bool XFAIL [0.0084s] [ 33%] 2025-12-04T12:25:43.1989162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_complex128 PASSED [1.6266s] [ 33%] 2025-12-04T12:25:43.1989300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_complex64 PASSED [0.1684s] [ 33%] 2025-12-04T12:25:43.1989433Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_float16 PASSED [0.1816s] [ 33%] 2025-12-04T12:25:43.1989568Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_float32 PASSED [0.1276s] [ 33%] 2025-12-04T12:25:43.1989698Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_int32 PASSED [0.0888s] [ 33%] 2025-12-04T12:25:43.1989829Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_pow_cuda_uint8 PASSED [0.0886s] [ 33%] 2025-12-04T12:25:43.1989992Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_bfloat16 PASSED [0.0280s] [ 33%] 2025-12-04T12:25:43.1990141Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_complex128 PASSED [0.0210s] [ 33%] 2025-12-04T12:25:43.1990289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_complex64 PASSED [0.0211s] [ 33%] 2025-12-04T12:25:43.1990429Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_float32 PASSED [0.0209s] [ 33%] 2025-12-04T12:25:43.1990569Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_reciprocal_cuda_int32 XFAIL [0.0074s] [ 33%] 2025-12-04T12:25:43.1990706Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_float32 PASSED [1.5157s] [ 33%] 2025-12-04T12:25:43.1990841Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_round_cuda_int8 PASSED [0.0210s] [ 33%] 2025-12-04T12:25:43.1990973Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_rsqrt_cuda_int64 XFAIL [0.0075s] [ 33%] 2025-12-04T12:25:43.1991118Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_complex128 PASSED [0.0306s] [ 33%] 2025-12-04T12:25:43.1991254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_float32 PASSED [0.0303s] [ 33%] 2025-12-04T12:25:43.1991389Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sigmoid_cuda_int64 XFAIL [0.0074s] [ 33%] 2025-12-04T12:25:43.1991540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_bool PASSED [1.4720s] [ 33%] 2025-12-04T12:25:43.1991692Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_complex128 XFAIL [0.0080s] [ 33%] 2025-12-04T12:25:43.1991829Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_complex64 XFAIL [0.0076s] [ 33%] 2025-12-04T12:25:43.1991961Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_float16 PASSED [1.4683s] [ 33%] 2025-12-04T12:25:43.1992117Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_int32 PASSED [0.0214s] [ 33%] 2025-12-04T12:25:43.1992247Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sign_cuda_uint8 PASSED [0.0211s] [ 33%] 2025-12-04T12:25:43.1992381Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_float16 PASSED [0.0281s] [ 33%] 2025-12-04T12:25:43.1992512Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_float64 PASSED [0.0209s] [ 33%] 2025-12-04T12:25:43.1992643Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sin_cuda_int8 XFAIL [0.0075s] [ 33%] 2025-12-04T12:25:43.1992782Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_complex128 PASSED [0.0211s] [ 33%] 2025-12-04T12:25:43.1992921Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_float16 PASSED [0.0280s] [ 33%] 2025-12-04T12:25:43.1993051Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sinh_cuda_int8 XFAIL [0.0074s] [ 33%] 2025-12-04T12:25:43.1993189Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_bfloat16 PASSED [0.0281s] [ 33%] 2025-12-04T12:25:43.1993316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_bool XFAIL [0.0073s] [ 33%] 2025-12-04T12:25:43.1993455Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_complex128 PASSED [0.0212s] [ 33%] 2025-12-04T12:25:43.1993589Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_float64 PASSED [0.0209s] [ 33%] 2025-12-04T12:25:43.1993718Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_int16 XFAIL [0.0073s] [ 33%] 2025-12-04T12:25:43.1993858Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sqrt_cuda_int64 XFAIL [1.4465s] [ 33%] 2025-12-04T12:25:43.1993992Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_bfloat16 XFAIL [1.4481s] [ 33%] 2025-12-04T12:25:43.1994123Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_float16 XFAIL [1.4457s] [ 33%] 2025-12-04T12:25:43.1994251Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_sub_cuda_int8 XFAIL [1.4597s] [ 33%] 2025-12-04T12:25:43.1994387Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_bfloat16 PASSED [1.5010s] [ 33%] 2025-12-04T12:25:43.1994524Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_complex128 PASSED [0.0217s] [ 33%] 2025-12-04T12:25:43.1994662Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tan_cuda_complex64 PASSED [0.0213s] [ 33%] 2025-12-04T12:25:43.1994798Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_float16 PASSED [0.0282s] [ 33%] 2025-12-04T12:25:43.1994936Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_float32 PASSED [0.0210s] [ 33%] 2025-12-04T12:25:43.1995065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_tanh_cuda_int64 XFAIL [0.0075s] [ 33%] 2025-12-04T12:25:43.1995203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_float16 PASSED [0.0280s] [ 33%] 2025-12-04T12:25:43.1995338Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_trunc_cuda_float32 PASSED [0.0209s] [ 33%] 2025-12-04T12:25:43.1995486Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__foreach_zero_cuda_bfloat16 PASSED [0.0179s] [ 33%] 2025-12-04T12:25:43.1995690Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__native_batch_norm_legit_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1995887Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__segment_reduce_lengths_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1996092Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__segment_reduce_lengths_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1996288Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__segment_reduce_offsets_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1996482Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1996667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1996871Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1997084Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.1997290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.1997498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.1997704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__unsafe_masked_index_put_accumulate_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.1997897Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__upsample_bilinear2d_aa_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.1998159Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace__upsample_bilinear2d_aa_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.1998280Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_bool PASSED [1.5246s] [ 34%] 2025-12-04T12:25:43.1998407Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_complex64 XFAIL [0.0044s] [ 34%] 2025-12-04T12:25:43.1998533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_float16 PASSED [1.4994s] [ 34%] 2025-12-04T12:25:43.1998653Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_float32 PASSED [0.0038s] [ 34%] 2025-12-04T12:25:43.1998774Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_float64 PASSED [1.4525s] [ 34%] 2025-12-04T12:25:43.1998893Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_int16 PASSED [0.0050s] [ 34%] 2025-12-04T12:25:43.1999011Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_abs_cuda_int32 PASSED [0.0034s] [ 34%] 2025-12-04T12:25:43.1999241Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_complex128 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T12:25:43.1999465Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T12:25:43.1999697Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T12:25:43.1999929Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acos_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T12:25:43.2000146Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_acosh_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T12:25:43.2000285Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_float16 PASSED [0.0074s] [ 34%] 2025-12-04T12:25:43.2000408Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_float64 PASSED [0.0070s] [ 34%] 2025-12-04T12:25:43.2000526Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_int16 PASSED [0.0070s] [ 34%] 2025-12-04T12:25:43.2000646Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_add_cuda_int32 PASSED [0.0070s] [ 34%] 2025-12-04T12:25:43.2000777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addbmm_cuda_complex128 PASSED [0.0057s] [ 34%] 2025-12-04T12:25:43.2000907Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addbmm_cuda_complex64 PASSED [0.0056s] [ 34%] 2025-12-04T12:25:43.2001032Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addbmm_cuda_float32 PASSED [1.4520s] [ 34%] 2025-12-04T12:25:43.2001159Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addbmm_cuda_float64 PASSED [0.0068s] [ 34%] 2025-12-04T12:25:43.2001289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcdiv_cuda_complex128 PASSED [0.3715s] [ 34%] 2025-12-04T12:25:43.2001419Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_float64 PASSED [1.4582s] [ 34%] 2025-12-04T12:25:43.2001542Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_int32 PASSED [0.0108s] [ 34%] 2025-12-04T12:25:43.2001670Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_int64 PASSED [0.0089s] [ 34%] 2025-12-04T12:25:43.2001794Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addcmul_cuda_int8 PASSED [0.0086s] [ 34%] 2025-12-04T12:25:43.2001934Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_cuda_float64 PASSED [0.0061s] [ 34%] 2025-12-04T12:25:43.2002076Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_decomposed_cuda_complex128 PASSED [0.0070s] [ 34%] 2025-12-04T12:25:43.2002220Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmm_decomposed_cuda_float32 PASSED [0.0058s] [ 34%] 2025-12-04T12:25:43.2002345Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addmv_cuda_complex64 PASSED [0.0050s] [ 34%] 2025-12-04T12:25:43.2002471Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_complex64 PASSED [0.0051s] [ 34%] 2025-12-04T12:25:43.2002594Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_float64 PASSED [0.0051s] [ 34%] 2025-12-04T12:25:43.2002714Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_int8 PASSED [0.0048s] [ 34%] 2025-12-04T12:25:43.2002838Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_addr_cuda_uint8 PASSED [0.0048s] [ 34%] 2025-12-04T12:25:43.2003019Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2003202Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2003381Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_alias_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2003557Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2003732Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2003910Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_T_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2004106Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___rdiv___cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2004292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___rpow___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2004470Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides___rxor___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2004679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__batch_norm_with_update_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2004869Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__chunk_cat_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2005020Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_abs_cuda_float32 PASSED [0.0213s] [ 34%] 2025-12-04T12:25:43.2005178Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_addcdiv_cuda_float32 PASSED [0.1953s] [ 34%] 2025-12-04T12:25:43.2005328Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_copy_cuda_float32 PASSED [0.0213s] [ 34%] 2025-12-04T12:25:43.2005482Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_erfc_cuda_float32 PASSED [0.0210s] [ 34%] 2025-12-04T12:25:43.2005630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_exp_cuda_float32 PASSED [0.0210s] [ 34%] 2025-12-04T12:25:43.2005784Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_log2_cuda_float32 PASSED [0.0211s] [ 34%] 2025-12-04T12:25:43.2005937Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_minimum_cuda_float32 PASSED [0.1948s] [ 34%] 2025-12-04T12:25:43.2006095Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_neg_cuda_float32 PASSED [0.0211s] [ 34%] 2025-12-04T12:25:43.2006291Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2006451Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_reciprocal_cuda_float32 PASSED [0.0211s] [ 34%] 2025-12-04T12:25:43.2006600Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_sqrt_cuda_float32 PASSED [0.0210s] [ 34%] 2025-12-04T12:25:43.2006750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__foreach_tan_cuda_float32 PASSED [0.0212s] [ 34%] 2025-12-04T12:25:43.2006960Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__softmax_backward_data_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2007185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides__unsafe_masked_index_put_accumulate_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2007329Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_acosh_cuda_float32 PASSED [1.4633s] [ 34%] 2025-12-04T12:25:43.2007466Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_addmm_cuda_float32 PASSED [0.5721s] [ 34%] 2025-12-04T12:25:43.2007658Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_all_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2007861Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_allclose_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2008044Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_amax_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2008273Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_aminmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2008458Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_angle_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2008655Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_as_strided_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2008795Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_atan2_cuda_float32 PASSED [0.0186s] [ 34%] 2025-12-04T12:25:43.2008956Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bitwise_right_shift_cuda_int64 PASSED [0.0185s] [ 34%] 2025-12-04T12:25:43.2009100Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bitwise_xor_cuda_int64 PASSED [0.0183s] [ 34%] 2025-12-04T12:25:43.2009293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_block_diag_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2009473Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bool_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2009672Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_broadcast_shapes_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2009868Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_broadcast_to_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2010060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_bucketize_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2010254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_byte_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2010398Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cauchy_cuda_float32 PASSED [0.0095s] [ 34%] 2025-12-04T12:25:43.2010581Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_char_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2010775Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cholesky_inverse_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2010917Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_clamp_cuda_float32 PASSED [0.1335s] [ 34%] 2025-12-04T12:25:43.2011099Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_clone_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2011295Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_constant_pad_nd_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2011434Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cosh_cuda_float32 PASSED [1.4330s] [ 34%] 2025-12-04T12:25:43.2011618Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cummin_cuda_float32 SKIPPED [0.0016s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2011809Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cumprod_cuda_float32 SKIPPED [0.0012s] (Function is in dispatch early skips) [ 34%] 2025-12-04T12:25:43.2012017Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_cumsum_cuda_float32 SKIPPED [0.0010s] (Function is in dispatch early skips) [ 34%] 2025-12-04T12:25:43.2012214Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_diag_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2012410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_diagflat_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2012599Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_diagonal_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2012757Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_div_no_rounding_mode_cuda_float32 PASSED [0.0173s] [ 34%] 2025-12-04T12:25:43.2012944Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_double_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2013130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_dsplit_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2013314Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_equal_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2013454Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_erfc_cuda_float32 PASSED [0.0052s] [ 34%] 2025-12-04T12:25:43.2013594Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_exp_cuda_float32 PASSED [0.0050s] [ 34%] 2025-12-04T12:25:43.2013732Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_expm1_cuda_float32 PASSED [0.0039s] [ 34%] 2025-12-04T12:25:43.2013915Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_eye_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2014099Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_fft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2014299Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_fftn_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2014493Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_fftshift_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2014679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_hfft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2014863Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_hfft_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2015051Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_hfftn_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2015245Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_ifftshift_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2015432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_ihfft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2015622Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_ihfftn_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2015806Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_irfft_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2016005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_irfftn_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2016203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_rfft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2016389Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_rfft_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2016589Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fft_rfftn_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2016778Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_flatten_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2016919Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_floor_cuda_float32 PASSED [1.4698s] [ 34%] 2025-12-04T12:25:43.2017068Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_floor_divide_cuda_float32 PASSED [0.0590s] [ 34%] 2025-12-04T12:25:43.2017251Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fmin_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2017390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_fmod_cuda_float32 PASSED [0.0192s] [ 34%] 2025-12-04T12:25:43.2017575Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_frexp_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2017756Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_full_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2017941Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_gather_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2018151Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_geometric_cuda_float32 PASSED [0.0092s] [ 34%] 2025-12-04T12:25:43.2018336Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_geqrf_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2018517Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_grid_sampler_3d_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 34%] 2025-12-04T12:25:43.2018700Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_histc_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2018884Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_hstack_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2019021Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_i0_cuda_float32 PASSED [1.4531s] [ 34%] 2025-12-04T12:25:43.2019168Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_igamma_cuda_float32 PASSED [0.0214s] [ 34%] 2025-12-04T12:25:43.2019317Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_fill_cuda_float32 PASSED [0.0163s] [ 34%] 2025-12-04T12:25:43.2019475Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_reduce_amax_cuda_float32 PASSED [0.0515s] [ 34%] 2025-12-04T12:25:43.2019630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_reduce_amin_cuda_float32 PASSED [0.0513s] [ 34%] 2025-12-04T12:25:43.2019786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_index_reduce_mean_cuda_float32 PASSED [0.0553s] [ 34%] 2025-12-04T12:25:43.2019971Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_inner_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2020166Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_item_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2020392Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_jiterator_2inputs_2outputs_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.2020606Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_jiterator_binary_return_by_ref_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2020876Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_kron_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2021063Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_kthvalue_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2021206Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_ldexp_cuda_float32 PASSED [0.0245s] [ 35%] 2025-12-04T12:25:43.2021344Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_le_cuda_float32 PASSED [1.4732s] [ 35%] 2025-12-04T12:25:43.2021485Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_lerp_cuda_float32 PASSED [0.0626s] [ 35%] 2025-12-04T12:25:43.2021626Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_lgamma_cuda_float32 PASSED [0.0054s] [ 35%] 2025-12-04T12:25:43.2021820Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_eigh_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2022015Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_eigvals_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2022204Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_inv_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2022395Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_inv_ex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2022593Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_ldl_factor_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2022793Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_lu_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2022991Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_lu_solve_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2023192Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_matrix_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2023396Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_matrix_power_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2023612Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_matrix_rank_hermitian_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2023850Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_pinv_singular_cuda_float32 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 35%] 2025-12-04T12:25:43.2024060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_solve_triangular_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2024250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_svd_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2024468Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_vander_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2024679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linalg_vector_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2024887Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_linspace_tensor_overload_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2025041Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_log1p_cuda_float32 PASSED [1.4732s] [ 35%] 2025-12-04T12:25:43.2025186Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_log_normal_cuda_float32 PASSED [0.0125s] [ 35%] 2025-12-04T12:25:43.2025378Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_log_softmax_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2025567Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logaddexp_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2025755Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logdet_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2025894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_logit_cuda_float32 PASSED [0.0106s] [ 35%] 2025-12-04T12:25:43.2026078Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_long_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2026266Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_lu_solve_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2026443Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mH_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2026630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mT_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2026823Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_argmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2027033Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_cumsum_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2027232Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_logaddexp_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2027426Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_median_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2027625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_normalize_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2027818Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_prod_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2027973Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_masked_scatter_cuda_float32 PASSED [0.0338s] [ 35%] 2025-12-04T12:25:43.2028240Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_max_pool2d_with_indices_backward_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2028423Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mean_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2028610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_min_binary_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2028824Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_min_reduction_no_dim_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2028977Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mul_cuda_float32 PASSED [1.4983s] [ 35%] 2025-12-04T12:25:43.2029143Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mvlgamma_mvlgamma_p_1_cuda_float32 PASSED [0.0275s] [ 35%] 2025-12-04T12:25:43.2029316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mvlgamma_mvlgamma_p_3_cuda_float32 PASSED [0.0237s] [ 35%] 2025-12-04T12:25:43.2029479Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_mvlgamma_mvlgamma_p_5_cuda_float32 PASSED [1.5024s] [ 35%] 2025-12-04T12:25:43.2029664Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nanmean_cuda_float32 SKIPPED [0.0017s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2029859Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nanquantile_cuda_float32 SKIPPED [0.0014s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2030052Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_narrow_copy_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2030251Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_native_layer_norm_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2030391Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_ne_cuda_float32 PASSED [0.0226s] [ 35%] 2025-12-04T12:25:43.2030576Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_new_ones_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2030787Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_avg_pool3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2030991Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_bilinear_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2031245Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_binary_cross_entropy_with_logits_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2031448Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_conv2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2031651Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_conv3d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2031868Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_conv_transpose1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2032082Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_conv_transpose2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2032298Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_cross_entropy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2032501Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_ctc_loss_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2032669Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_dropout2d_cuda_float32 PASSED [0.0337s] [ 35%] 2025-12-04T12:25:43.2032827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_dropout_cuda_float32 PASSED [0.0248s] [ 35%] 2025-12-04T12:25:43.2032995Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_elu_cuda_float32 PASSED [1.4665s] [ 35%] 2025-12-04T12:25:43.2033220Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_embedding_bag_cuda_float32 SKIPPED [0.0016s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2033416Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_feature_alpha_dropout_with_train_cuda_float32 PASSED [0.0296s] [ 35%] 2025-12-04T12:25:43.2033655Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_fractional_max_pool2d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2033864Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_grid_sample_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2034074Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_hardshrink_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2034279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_hardswish_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2034489Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_huber_loss_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2034707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_interpolate_linear_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2034933Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_interpolate_trilinear_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2037530Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_kl_div_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2037710Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_leaky_relu_cuda_float32 PASSED [0.0229s] [ 35%] 2025-12-04T12:25:43.2037959Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_local_response_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2038202Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_logsigmoid_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2038425Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_margin_ranking_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2038633Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_pool3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2038963Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_unpool1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2039181Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_unpool1d_grad_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2039395Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_unpool3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2039610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_max_unpool3d_grad_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2039798Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_mish_cuda_float32 PASSED [1.4565s] [ 35%] 2025-12-04T12:25:43.2040023Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_mse_loss_cuda_float32 SKIPPED [0.0016s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2040257Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_multi_head_attention_forward_cuda_float32 SKIPPED [0.0013s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2040504Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_multilabel_soft_margin_loss_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2040729Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pad_replicate_negative_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2040957Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_pairwise_distance_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2041159Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_prelu_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2041323Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_rrelu_cuda_float32 PASSED [0.0109s] [ 35%] 2025-12-04T12:25:43.2041496Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_silu_complex_cuda_complex64 PASSED [0.0050s] [ 35%] 2025-12-04T12:25:43.2041713Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_smooth_l1_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2041932Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_soft_margin_loss_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2042151Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_softmin_with_dtype_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2042389Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_softsign_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2042553Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_nn_functional_threshold_cuda_float32 PASSED [1.4676s] [ 35%] 2025-12-04T12:25:43.2042743Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_norm_fro_cuda_float32 SKIPPED [0.0016s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2042924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_ones_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2043118Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_ones_like_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2043304Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_outer_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2043501Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_pca_lowrank_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2043693Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_permute_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2043881Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_pinverse_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2044061Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_polygamma_polygamma_n_1_cuda_float32 PASSED [0.0146s] [ 35%] 2025-12-04T12:25:43.2044260Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_positive_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2044407Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_pow_cuda_float32 PASSED [0.0188s] [ 35%] 2025-12-04T12:25:43.2044592Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_prod_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2044792Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_qr_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2044978Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_randint_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2045172Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_randint_like_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2045357Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_randn_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2045510Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_remainder_cuda_float32 PASSED [0.0193s] [ 35%] 2025-12-04T12:25:43.2045706Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_reshape_as_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2045900Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_resolve_conj_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2046094Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_resolve_neg_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2046281Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_rsub_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2046437Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_add_cuda_float32 PASSED [0.0933s] [ 35%] 2025-12-04T12:25:43.2046612Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_reduce_amax_cuda_float32 PASSED [0.2786s] [ 35%] 2025-12-04T12:25:43.2046777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_scatter_reduce_sum_cuda_float32 PASSED [0.2778s] [ 35%] 2025-12-04T12:25:43.2046972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_searchsorted_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2047172Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_select_scatter_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2047312Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sgn_cuda_float32 PASSED [1.4716s] [ 35%] 2025-12-04T12:25:43.2047532Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_general_hamming_cuda_float32 SKIPPED [0.0017s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2047737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signal_windows_hann_cuda_float32 SKIPPED [0.0013s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2047924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_signbit_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2048113Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sin_cuda_float32 PASSED [1.4623s] [ 35%] 2025-12-04T12:25:43.2048252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sinh_cuda_float32 PASSED [0.0065s] [ 35%] 2025-12-04T12:25:43.2048471Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_bessel_y0_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2048702Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_chebyshev_polynomial_t_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2048936Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_chebyshev_polynomial_w_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2049130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_entr_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2049355Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_hermite_polynomial_he_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2049554Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_i0e_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2049752Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_log_ndtr_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.2049966Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_modified_bessel_i1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2050187Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_scaled_modified_bessel_k0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2050417Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_shifted_chebyshev_polynomial_t_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2050644Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_shifted_chebyshev_polynomial_w_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2050861Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_spherical_bessel_j0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2051073Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_special_xlog1py_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2051284Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_split_with_sizes_copy_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2051483Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_split_with_sizes_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2051630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_squeeze_cuda_float32 PASSED [0.0149s] [ 36%] 2025-12-04T12:25:43.2051816Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_stack_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2051999Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_std_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2052190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_std_mean_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2052379Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_std_unbiased_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2052566Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_stft_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2052768Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_sum_to_size_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2052972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_svd_lowrank_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2053165Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_take_along_dim_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2053368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_to_sparse_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2053520Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_transpose_cuda_float32 PASSED [0.0122s] [ 36%] 2025-12-04T12:25:43.2053706Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_trapz_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2053910Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_triangular_solve_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2054102Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_tril_indices_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2054249Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_triu_cuda_float32 PASSED [0.0257s] [ 36%] 2025-12-04T12:25:43.2054390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_trunc_cuda_float32 PASSED [0.0040s] [ 36%] 2025-12-04T12:25:43.2054597Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unique_consecutive_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2054794Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_unsqueeze_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2054987Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_var_unbiased_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2055188Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_view_as_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2055376Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_vsplit_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2055568Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_vstack_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2055750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_where_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2055894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_all_strides_zero__cuda_float32 PASSED [0.0050s] [ 36%] 2025-12-04T12:25:43.2056072Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_allclose_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2056264Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_allclose_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2056443Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_allclose_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2056620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amax_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2056791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2056983Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2057172Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2057339Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2057528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2057696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2057866Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2058031Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_amin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2058256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2058430Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_aminmax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2058604Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2058774Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2058948Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_angle_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2059122Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2059296Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2059483Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2059650Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_any_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2059829Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2060000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2060177Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2060349Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_arange_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2060528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2060696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argmin_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2060875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argsort_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2061048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argsort_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2061238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argsort_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2061423Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argsort_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2061603Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2061798Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2061972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2062150Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_argwhere_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2062332Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2062526Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2062714Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2062905Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2063087Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2063272Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2063457Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2063588Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_bool PASSED [1.4808s] [ 36%] 2025-12-04T12:25:43.2063737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_float32 PASSED [0.0070s] [ 36%] 2025-12-04T12:25:43.2063875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_float64 PASSED [0.0050s] [ 36%] 2025-12-04T12:25:43.2064013Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_int64 PASSED [0.0048s] [ 36%] 2025-12-04T12:25:43.2064142Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_cuda_int8 PASSED [0.0047s] [ 36%] 2025-12-04T12:25:43.2064295Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_bool XFAIL [0.0042s] [ 36%] 2025-12-04T12:25:43.2064448Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_float16 XFAIL [1.4907s] [ 36%] 2025-12-04T12:25:43.2064605Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_float32 XFAIL [1.4813s] [ 36%] 2025-12-04T12:25:43.2064754Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_float64 XFAIL [1.4713s] [ 36%] 2025-12-04T12:25:43.2064907Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_partial_views_cuda_int64 XFAIL [1.5008s] [ 36%] 2025-12-04T12:25:43.2065096Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_bool SKIPPED [1.4827s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2065293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_complex64 SKIPPED [0.0017s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2065489Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_int16 SKIPPED [0.0012s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2065683Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_as_strided_scatter_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2065820Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_bfloat16 PASSED [1.4977s] [ 36%] 2025-12-04T12:25:43.2066060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_complex64 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.2066193Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_float16 PASSED [1.4517s] [ 36%] 2025-12-04T12:25:43.2066319Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_float32 PASSED [0.0048s] [ 36%] 2025-12-04T12:25:43.2066450Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_float64 PASSED [0.0034s] [ 36%] 2025-12-04T12:25:43.2066671Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.2066894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asin_cuda_int64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.2067023Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_bfloat16 PASSED [1.4568s] [ 36%] 2025-12-04T12:25:43.2067249Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_bool SKIPPED [0.0017s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.2067379Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_float32 PASSED [1.4313s] [ 36%] 2025-12-04T12:25:43.2067600Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_asinh_cuda_int64 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.2067731Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_bfloat16 PASSED [0.0093s] [ 36%] 2025-12-04T12:25:43.2067958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_bool SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.2068123Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_float16 PASSED [0.0074s] [ 36%] 2025-12-04T12:25:43.2068248Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_float64 PASSED [0.0065s] [ 36%] 2025-12-04T12:25:43.2068473Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.2068695Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan2_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.2068921Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.2069150Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.2069283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_float16 PASSED [1.4968s] [ 36%] 2025-12-04T12:25:43.2069413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_float32 PASSED [0.0048s] [ 36%] 2025-12-04T12:25:43.2069648Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.2069885Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_int32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.2070104Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atan_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.2070252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atanh_cuda_float16 PASSED [0.0034s] [ 36%] 2025-12-04T12:25:43.2070434Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2070626Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_1d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2070806Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2070993Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2071170Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2071351Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2071530Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_2d_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2071707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2071890Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2072068Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2072265Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2072438Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_atleast_3d_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2072583Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_baddbmm_cuda_complex128 PASSED [0.0078s] [ 36%] 2025-12-04T12:25:43.2072712Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_baddbmm_cuda_float32 PASSED [0.0046s] [ 36%] 2025-12-04T12:25:43.2072896Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bernoulli_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2073069Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2073252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2073430Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2073601Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2073780Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bfloat16_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2073963Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bincount_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2074160Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bincount_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.2074295Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_and_cuda_int64 PASSED [0.0067s] [ 36%] 2025-12-04T12:25:43.2074446Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_and_cuda_uint8 PASSED [0.0064s] [ 36%] 2025-12-04T12:25:43.2074589Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_left_shift_cuda_int16 PASSED [0.0065s] [ 36%] 2025-12-04T12:25:43.2074737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_left_shift_cuda_int32 PASSED [0.0064s] [ 37%] 2025-12-04T12:25:43.2074882Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_left_shift_cuda_uint8 PASSED [0.0064s] [ 37%] 2025-12-04T12:25:43.2075020Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_or_cuda_int8 PASSED [0.0064s] [ 37%] 2025-12-04T12:25:43.2075151Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_or_cuda_uint8 PASSED [0.0064s] [ 37%] 2025-12-04T12:25:43.2075289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bitwise_xor_cuda_int8 PASSED [0.0064s] [ 37%] 2025-12-04T12:25:43.2075469Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2075653Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2075833Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2076006Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_block_diag_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2076184Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bmm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2076366Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2076546Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2076714Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2076887Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2077058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2077231Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2077399Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bool_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2077591Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2077778Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_tensors_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2077960Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2078190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2078383Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2078565Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2078754Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2078937Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_broadcast_to_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2079113Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2079293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2079467Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_bucketize_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2079640Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2079821Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2079996Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2080175Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2080341Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2080514Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_byte_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2080714Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2080902Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2081086Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2081272Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cartesian_prod_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2081443Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2081619Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2081792Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2081956Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cat_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2082090Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cauchy_cuda_float32 PASSED [0.0052s] [ 37%] 2025-12-04T12:25:43.2082222Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cauchy_cuda_float64 PASSED [0.0050s] [ 37%] 2025-12-04T12:25:43.2082421Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2082607Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2082785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2082966Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cdouble_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2083099Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ceil_cuda_int32 PASSED [1.5192s] [ 37%] 2025-12-04T12:25:43.2083223Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ceil_cuda_int64 PASSED [0.0049s] [ 37%] 2025-12-04T12:25:43.2083350Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ceil_cuda_int8 PASSED [0.0035s] [ 37%] 2025-12-04T12:25:43.2083474Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ceil_cuda_uint8 PASSED [1.5142s] [ 37%] 2025-12-04T12:25:43.2083654Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_bfloat16 SKIPPED [0.0016s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2083836Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_complex32 SKIPPED [0.0012s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2084009Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2084186Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2084357Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2084535Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2084705Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2084890Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cfloat_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2085056Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2085234Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2085403Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2085577Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2085744Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chalf_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2085921Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2086094Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2086267Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2086446Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2086627Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_char_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2086825Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2087006Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cholesky_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2087191Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chunk_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2087365Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_chunk_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2087498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_cuda_bfloat16 PASSED [0.0133s] [ 37%] 2025-12-04T12:25:43.2087632Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_cuda_float64 PASSED [0.0105s] [ 37%] 2025-12-04T12:25:43.2087758Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_cuda_int16 PASSED [0.0104s] [ 37%] 2025-12-04T12:25:43.2087886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_cuda_int32 PASSED [0.0100s] [ 37%] 2025-12-04T12:25:43.2088008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_cuda_uint8 PASSED [1.4791s] [ 37%] 2025-12-04T12:25:43.2088167Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_max_cuda_bool PASSED [0.0099s] [ 37%] 2025-12-04T12:25:43.2088302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_max_cuda_float16 PASSED [0.0086s] [ 37%] 2025-12-04T12:25:43.2088434Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_min_cuda_float64 PASSED [1.4805s] [ 37%] 2025-12-04T12:25:43.2088568Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clamp_min_cuda_int16 PASSED [0.0097s] [ 37%] 2025-12-04T12:25:43.2088741Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2088911Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2089098Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2089273Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2089447Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2089617Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2089786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_clone_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2089963Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2090142Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2090320Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2090498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_column_stack_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2090696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2090900Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2091081Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2091279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2091455Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2091637Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_combinations_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2091809Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_complex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2091984Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2092151Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2092320Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2092461Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_float32 PASSED [1.4843s] [ 37%] 2025-12-04T12:25:43.2092597Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_conj_physical_cuda_float64 PASSED [0.0047s] [ 37%] 2025-12-04T12:25:43.2092789Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2092974Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2093167Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2093344Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2093524Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_constant_pad_nd_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2093697Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_contiguous_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2093834Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_copysign_cuda_bfloat16 PASSED [0.0102s] [ 37%] 2025-12-04T12:25:43.2093966Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_copysign_cuda_float32 PASSED [0.0085s] [ 37%] 2025-12-04T12:25:43.2094143Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2094320Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2094492Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2094666Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_corrcoef_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2094895Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_bool SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T12:25:43.2095136Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T12:25:43.2095262Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_float16 PASSED [0.0039s] [ 37%] 2025-12-04T12:25:43.2095389Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_float64 PASSED [1.4867s] [ 37%] 2025-12-04T12:25:43.2095619Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cos_cuda_int32 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T12:25:43.2095840Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_bool SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T12:25:43.2095967Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_float32 PASSED [0.0048s] [ 37%] 2025-12-04T12:25:43.2096098Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_float64 PASSED [1.4806s] [ 37%] 2025-12-04T12:25:43.2096319Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cosh_cuda_int32 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 37%] 2025-12-04T12:25:43.2096505Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2096686Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2096863Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_count_nonzero_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2097036Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2097198Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cov_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2097379Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cross_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2097546Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cross_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2097712Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cross_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2097882Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2098051Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.2098262Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2098429Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummax_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2098595Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2098758Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cummin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2098937Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_complex64 SKIPPED [0.0010s] (Function is in dispatch early skips) [ 38%] 2025-12-04T12:25:43.2099131Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_int32 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 38%] 2025-12-04T12:25:43.2099316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumprod_cuda_int64 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 38%] 2025-12-04T12:25:43.2099495Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_complex128 SKIPPED [0.0010s] (Function is in dispatch early skips) [ 38%] 2025-12-04T12:25:43.2099679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_float32 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 38%] 2025-12-04T12:25:43.2099853Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_float64 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 38%] 2025-12-04T12:25:43.2100024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumsum_cuda_int32 SKIPPED [0.0010s] (Function is in dispatch early skips) [ 38%] 2025-12-04T12:25:43.2100216Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2100403Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2100593Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_cumulative_trapezoid_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2100815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_bool SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2100943Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_float64 PASSED [1.4618s] [ 38%] 2025-12-04T12:25:43.2101164Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_deg2rad_cuda_uint8 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2101341Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_complex128 SKIPPED [0.0013s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2101526Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_complex32 SKIPPED [0.0011s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2101690Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_int32 SKIPPED [0.0012s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2101855Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2102016Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2102196Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2102373Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2102548Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diag_embed_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2102716Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2102892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2103060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2103241Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2103415Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2103585Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagflat_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2103780Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2103959Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2104136Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2104305Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2104481Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2104652Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2104830Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2105018Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2105205Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2105386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2105570Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2105766Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2105945Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diagonal_scatter_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2106107Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diff_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2106272Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diff_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2106436Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diff_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2106599Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_diff_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2106730Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_bfloat16 PASSED [1.4860s] [ 38%] 2025-12-04T12:25:43.2106858Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_float64 PASSED [1.5171s] [ 38%] 2025-12-04T12:25:43.2107084Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_int16 SKIPPED [0.0017s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2107316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_digamma_cuda_int8 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2107471Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_int16 PASSED [0.0143s] [ 38%] 2025-12-04T12:25:43.2107612Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_int32 PASSED [0.0106s] [ 38%] 2025-12-04T12:25:43.2107749Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_floor_rounding_cuda_uint8 PASSED [0.0071s] [ 38%] 2025-12-04T12:25:43.2107996Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2108272Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2108516Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_complex32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2108658Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_float64 PASSED [0.0064s] [ 38%] 2025-12-04T12:25:43.2108898Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_no_rounding_mode_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2109040Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_float16 PASSED [0.0080s] [ 38%] 2025-12-04T12:25:43.2109182Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_float64 PASSED [1.4916s] [ 38%] 2025-12-04T12:25:43.2109317Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_int64 PASSED [0.0103s] [ 38%] 2025-12-04T12:25:43.2109457Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_div_trunc_rounding_cuda_uint8 PASSED [1.4616s] [ 38%] 2025-12-04T12:25:43.2109625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dot_cuda_bfloat16 SKIPPED [0.0017s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2109811Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dot_cuda_float64 SKIPPED [0.0014s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2109986Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2110154Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2110321Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2110487Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_double_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2110660Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dsplit_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2110827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dsplit_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2111002Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2111173Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2111342Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2111522Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2111701Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_dstack_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2111872Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_einsum_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2112055Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2112221Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2112402Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2112573Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_like_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2112758Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2112937Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_permuted_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2113110Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2113288Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2113464Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_empty_strided_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2113588Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_complex32 PASSED [0.0098s] [ 38%] 2025-12-04T12:25:43.2113715Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_complex64 PASSED [0.0077s] [ 38%] 2025-12-04T12:25:43.2113847Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_float16 PASSED [0.0083s] [ 38%] 2025-12-04T12:25:43.2113965Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eq_cuda_float64 PASSED [0.0076s] [ 38%] 2025-12-04T12:25:43.2114130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_equal_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2114296Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_equal_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2114460Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_equal_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2114585Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_bfloat16 PASSED [0.0031s] [ 38%] 2025-12-04T12:25:43.2114705Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_float64 PASSED [1.5027s] [ 38%] 2025-12-04T12:25:43.2114926Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_int64 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2115144Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erf_cuda_uint8 SKIPPED [0.0014s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2115359Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_bool SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2115484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_float32 PASSED [1.4526s] [ 38%] 2025-12-04T12:25:43.2115616Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_float64 PASSED [0.0061s] [ 38%] 2025-12-04T12:25:43.2115844Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfc_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2115972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfinv_cuda_bfloat16 PASSED [1.4888s] [ 38%] 2025-12-04T12:25:43.2116107Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfinv_cuda_float16 PASSED [0.0069s] [ 38%] 2025-12-04T12:25:43.2116326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_erfinv_cuda_int32 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2116540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_bool SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2116763Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2116979Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2117199Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_int32 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2117416Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2117631Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp2_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2117753Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_bfloat16 PASSED [1.4689s] [ 38%] 2025-12-04T12:25:43.2117976Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_complex128 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2118250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_complex32 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2118471Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_complex64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2118592Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_float64 PASSED [1.4696s] [ 38%] 2025-12-04T12:25:43.2118809Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_int16 SKIPPED [0.0017s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2119024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exp_cuda_int64 SKIPPED [0.0014s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2119204Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_complex64 SKIPPED [0.0012s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2119378Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_as_cuda_int32 SKIPPED [0.0013s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2119556Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_copy_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2119748Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_copy_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2119930Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_copy_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2120105Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2120292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2120460Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2120625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2120795Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expand_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2121020Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_complex64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2121146Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_float64 PASSED [1.4541s] [ 38%] 2025-12-04T12:25:43.2121365Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_expm1_cuda_int16 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.2121500Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_exponential_cuda_float64 PASSED [1.4839s] [ 38%] 2025-12-04T12:25:43.2121663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_bool SKIPPED [0.0017s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2121829Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_float16 SKIPPED [0.0013s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2121996Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2122179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_float8_e5m2 SKIPPED [0.0011s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2122357Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_float8_e5m2fnuz SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.2122517Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_eye_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2122694Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2122865Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2123038Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2123206Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2123376Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2123543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2123709Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft2_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2123908Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2124095Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2124266Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2124443Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fft_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2124617Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2124785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2124954Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2125122Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftn_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2125293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2125477Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2125655Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2125828Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_fftshift_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2125995Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2126168Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2126353Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2126522Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft2_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2126686Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2126862Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2127037Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2127210Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2127382Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2127550Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfft_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2127724Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2127906Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2128166Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2128333Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_hfftn_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2128500Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2128690Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2128863Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2129038Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2129210Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2129383Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2129550Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifft_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2129718Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2129892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2130069Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2130240Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftn_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2130436Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2130617Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2130796Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2130973Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2131148Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ifftshift_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2131324Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2131492Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft2_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2131663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2131830Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2132000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2132180Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfft_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2132367Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2132540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2132723Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2132896Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2133062Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_ihfftn_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2133243Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2133420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2133595Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2133767Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2133938Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2134107Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft2_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2134285Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2134454Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2134636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2134806Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfft_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2134983Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2135161Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2135331Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2135503Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_irfftn_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2135671Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2135840Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft2_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2136009Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2136188Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2136368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfft_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2136537Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfftn_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2136718Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fft_rfftn_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2136839Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_bool PASSED [1.4872s] [ 39%] 2025-12-04T12:25:43.2136966Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_complex32 PASSED [0.0058s] [ 39%] 2025-12-04T12:25:43.2137088Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_float64 PASSED [0.0040s] [ 39%] 2025-12-04T12:25:43.2137209Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_int16 PASSED [1.4612s] [ 39%] 2025-12-04T12:25:43.2137327Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_int8 PASSED [0.0056s] [ 39%] 2025-12-04T12:25:43.2137449Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fill_cuda_uint8 PASSED [0.0039s] [ 39%] 2025-12-04T12:25:43.2137621Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2137790Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2137964Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2138230Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2138397Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flatten_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2138563Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2138741Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2138904Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2139067Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flip_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2139235Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2139402Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fliplr_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2139565Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2139738Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2139905Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2140074Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2140252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_flipud_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2140438Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2140603Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2140773Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2140950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2141114Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2141279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2141444Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2141632Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_bfloat16 SKIPPED [0.0009s] (Function is in dispatch early skips) [ 39%] 2025-12-04T12:25:43.2141871Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_float_power_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 39%] 2025-12-04T12:25:43.2142000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_bfloat16 PASSED [1.4616s] [ 39%] 2025-12-04T12:25:43.2142124Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_float32 PASSED [0.0050s] [ 39%] 2025-12-04T12:25:43.2142246Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_cuda_float64 PASSED [0.0036s] [ 39%] 2025-12-04T12:25:43.2142382Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_bfloat16 PASSED [0.0160s] [ 39%] 2025-12-04T12:25:43.2142515Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_float32 PASSED [0.0141s] [ 39%] 2025-12-04T12:25:43.2142657Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_float64 PASSED [0.0139s] [ 39%] 2025-12-04T12:25:43.2142788Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_floor_divide_cuda_int64 PASSED [1.4891s] [ 39%] 2025-12-04T12:25:43.2142954Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_bool SKIPPED [0.0016s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2143122Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_float64 SKIPPED [0.0014s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2143289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_int16 SKIPPED [0.0012s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2143454Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmax_cuda_uint8 SKIPPED [0.0012s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2143625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2143786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2143958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2144121Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2144296Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_fmin_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2144429Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frac_cuda_float16 PASSED [1.4825s] [ 39%] 2025-12-04T12:25:43.2144598Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_frexp_cuda_float32 SKIPPED [0.0017s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2144769Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_complex128 SKIPPED [0.0014s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2144950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_complex64 SKIPPED [0.0012s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2145112Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_cuda_int32 SKIPPED [0.0012s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2145287Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_complex64 SKIPPED [0.0021s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2145460Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2145627Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 39%] 2025-12-04T12:25:43.2145801Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_uint16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2145968Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_full_like_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2146143Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2146310Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2146481Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2146647Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2146824Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gather_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2146946Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gcd_cuda_int64 PASSED [0.0081s] [ 40%] 2025-12-04T12:25:43.2147065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_float16 PASSED [0.0075s] [ 40%] 2025-12-04T12:25:43.2147185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_float32 PASSED [0.0068s] [ 40%] 2025-12-04T12:25:43.2147302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_int32 PASSED [0.0068s] [ 40%] 2025-12-04T12:25:43.2147423Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_int64 PASSED [0.0067s] [ 40%] 2025-12-04T12:25:43.2147538Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ge_cuda_uint8 PASSED [0.0068s] [ 40%] 2025-12-04T12:25:43.2147667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geometric_cuda_int64 PASSED [1.4680s] [ 40%] 2025-12-04T12:25:43.2147836Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_geqrf_cuda_float32 SKIPPED [0.0016s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2148014Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_bfloat16 SKIPPED [0.0014s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2148219Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2148405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gradient_cuda_int8 SKIPPED [0.0012s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2148598Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_2d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2148779Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_2d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2148946Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_3d_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 40%] 2025-12-04T12:25:43.2149099Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_grid_sampler_3d_cuda_float64 SKIPPED [0.0002s] (Skipped!) [ 40%] 2025-12-04T12:25:43.2149222Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_bfloat16 PASSED [0.0087s] [ 40%] 2025-12-04T12:25:43.2149340Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_float16 PASSED [1.5049s] [ 40%] 2025-12-04T12:25:43.2149459Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_int16 PASSED [0.0101s] [ 40%] 2025-12-04T12:25:43.2149577Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_gt_cuda_int8 PASSED [0.0073s] [ 40%] 2025-12-04T12:25:43.2149742Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2149909Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2150078Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2150244Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2150411Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_half_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2150582Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hash_tensor_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2150727Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_heaviside_cuda_float32 PASSED [0.0089s] [ 40%] 2025-12-04T12:25:43.2150897Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2151063Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2151227Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2151390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_histc_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2151562Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2151729Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2151894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2152058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2152223Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2152407Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hsplit_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2152595Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hstack_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2152757Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hstack_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2152885Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hypot_cuda_bfloat16 PASSED [0.0074s] [ 40%] 2025-12-04T12:25:43.2153019Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hypot_cuda_float16 PASSED [0.0073s] [ 40%] 2025-12-04T12:25:43.2153144Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hypot_cuda_float32 PASSED [1.4810s] [ 40%] 2025-12-04T12:25:43.2153271Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_hypot_cuda_float64 PASSED [0.0090s] [ 40%] 2025-12-04T12:25:43.2153390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_i0_cuda_float16 PASSED [1.6374s] [ 40%] 2025-12-04T12:25:43.2153516Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_igamma_cuda_float32 PASSED [0.0094s] [ 40%] 2025-12-04T12:25:43.2153642Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_igammac_cuda_float32 PASSED [1.4983s] [ 40%] 2025-12-04T12:25:43.2153814Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_imag_cuda_complex128 SKIPPED [0.0016s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2153938Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_bool PASSED [0.0085s] [ 40%] 2025-12-04T12:25:43.2154074Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_complex128 PASSED [0.0093s] [ 40%] 2025-12-04T12:25:43.2154205Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_complex32 PASSED [1.4716s] [ 40%] 2025-12-04T12:25:43.2154345Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_complex64 PASSED [0.0122s] [ 40%] 2025-12-04T12:25:43.2154475Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_float16 PASSED [0.0101s] [ 40%] 2025-12-04T12:25:43.2154600Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_add_cuda_int64 PASSED [0.0088s] [ 40%] 2025-12-04T12:25:43.2154744Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_complex128 PASSED [0.0043s] [ 40%] 2025-12-04T12:25:43.2154876Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_float64 PASSED [0.0041s] [ 40%] 2025-12-04T12:25:43.2155002Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_int16 PASSED [1.3594s] [ 40%] 2025-12-04T12:25:43.2155130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_int32 PASSED [0.0064s] [ 40%] 2025-12-04T12:25:43.2155259Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_copy_cuda_uint8 PASSED [0.0045s] [ 40%] 2025-12-04T12:25:43.2155391Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_fill_cuda_complex128 PASSED [0.0064s] [ 40%] 2025-12-04T12:25:43.2155522Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_fill_cuda_float32 PASSED [0.0062s] [ 40%] 2025-12-04T12:25:43.2155651Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_fill_cuda_float64 PASSED [0.0061s] [ 40%] 2025-12-04T12:25:43.2155777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_fill_cuda_int8 PASSED [0.0061s] [ 40%] 2025-12-04T12:25:43.2155904Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_float64 PASSED [0.0052s] [ 40%] 2025-12-04T12:25:43.2156028Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_int32 PASSED [0.0050s] [ 40%] 2025-12-04T12:25:43.2156151Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_put_cuda_int8 PASSED [0.0050s] [ 40%] 2025-12-04T12:25:43.2156301Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amax_cuda_float64 PASSED [0.0071s] [ 40%] 2025-12-04T12:25:43.2156452Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amin_cuda_bfloat16 PASSED [0.0071s] [ 40%] 2025-12-04T12:25:43.2156594Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amin_cuda_float32 PASSED [1.1075s] [ 40%] 2025-12-04T12:25:43.2156740Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amin_cuda_int32 PASSED [0.0099s] [ 40%] 2025-12-04T12:25:43.2156879Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_amin_cuda_uint8 PASSED [0.0075s] [ 40%] 2025-12-04T12:25:43.2157017Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_float64 PASSED [0.0077s] [ 40%] 2025-12-04T12:25:43.2157154Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_int16 PASSED [1.0427s] [ 40%] 2025-12-04T12:25:43.2157290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_mean_cuda_int32 PASSED [0.0101s] [ 40%] 2025-12-04T12:25:43.2157428Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_prod_cuda_int16 PASSED [0.0074s] [ 40%] 2025-12-04T12:25:43.2157564Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_prod_cuda_int32 PASSED [0.0072s] [ 40%] 2025-12-04T12:25:43.2157698Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_prod_cuda_int64 PASSED [1.0499s] [ 40%] 2025-12-04T12:25:43.2157835Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_prod_cuda_int8 PASSED [0.0095s] [ 40%] 2025-12-04T12:25:43.2157968Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_reduce_prod_cuda_uint8 PASSED [0.0074s] [ 40%] 2025-12-04T12:25:43.2158196Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2158374Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2158552Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2158743Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_index_select_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2158915Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_inner_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2159085Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_inner_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2159256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2159420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2159584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2159750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_int_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2159920Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2160092Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2160269Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isclose_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2160447Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2160616Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2160788Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2160969Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isfinite_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2161136Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2161300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2161464Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2161625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2161796Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isinf_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2161969Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2162135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2162300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2162465Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isnan_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2162635Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2162812Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2162982Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2163150Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isneginf_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2163326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2163495Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2163666Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2163840Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2164008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2164178Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isposinf_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2164355Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isreal_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2164532Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isreal_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2164694Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_isreal_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2164881Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_istft_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2165047Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2165212Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2165376Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_item_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2165578Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2165772Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2165969Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2166164Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_2inputs_2outputs_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2166368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2166581Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2166782Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2167004Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2167203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2167383Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 40%] 2025-12-04T12:25:43.2167564Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2167766Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2167970Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2168200Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_binary_return_by_ref_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2168384Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2168573Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2168769Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2168949Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2169142Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_jiterator_unary_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2169310Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2169472Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2169637Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kron_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2169807Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_kthvalue_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2169930Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lcm_cuda_int64 PASSED [0.0107s] [ 41%] 2025-12-04T12:25:43.2170056Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_bfloat16 PASSED [0.0097s] [ 41%] 2025-12-04T12:25:43.2170276Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2170500Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2170626Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_float64 PASSED [0.0080s] [ 41%] 2025-12-04T12:25:43.2170844Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2171077Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ldexp_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2171209Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_complex128 PASSED [1.8220s] [ 41%] 2025-12-04T12:25:43.2171334Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_float32 PASSED [0.0115s] [ 41%] 2025-12-04T12:25:43.2171458Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lerp_cuda_float64 PASSED [0.0093s] [ 41%] 2025-12-04T12:25:43.2171584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_bfloat16 PASSED [0.0046s] [ 41%] 2025-12-04T12:25:43.2171713Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_float64 PASSED [1.0375s] [ 41%] 2025-12-04T12:25:43.2171935Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lgamma_cuda_int64 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2172125Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_cuda_complex64 SKIPPED [0.0013s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2172307Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2172502Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_ex_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2172702Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_ex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2172897Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cholesky_ex_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2173075Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cond_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2173264Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2173442Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2173618Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_cross_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2173804Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2173986Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2174168Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2174346Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_diagonal_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2174526Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eig_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2174705Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigh_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2174884Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigh_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2175060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigh_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2175256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_eigvalsh_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2175501Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_householder_product_cuda_float32 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 41%] 2025-12-04T12:25:43.2175738Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_householder_product_cuda_float64 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 41%] 2025-12-04T12:25:43.2175918Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2176099Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_ex_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2176278Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_inv_ex_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2176466Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_factor_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2176655Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_factor_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2176845Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_factor_ex_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2177077Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_ldl_solve_cuda_float32 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 41%] 2025-12-04T12:25:43.2177283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lstsq_grad_oriented_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2177477Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_factor_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2177658Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_factor_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2177839Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_factor_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2178026Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_factor_ex_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2178239Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_lu_solve_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2178424Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2178616Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_power_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2178804Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_power_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2178992Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_power_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2179179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2179377Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_matrix_rank_hermitian_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2179576Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_multi_dot_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2179763Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_multi_dot_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2179944Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_multi_dot_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2180125Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2180301Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2180511Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_subgradients_at_zero_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2180719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_norm_subgradients_at_zero_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2180897Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2181133Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_pinv_singular_cuda_float32 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 41%] 2025-12-04T12:25:43.2181317Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_qr_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2181498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_slogdet_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2181695Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2181877Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2182056Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_ex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2182253Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_triangular_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2182451Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_solve_triangular_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2182634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_svdvals_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2182817Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_svdvals_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2182994Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_svdvals_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2183182Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorinv_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2183363Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorinv_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2183556Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_tensorsolve_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2183748Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vander_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2183925Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vander_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2184103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vecdot_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2184283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vecdot_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2184463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vecdot_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2184648Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vector_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2184836Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vector_norm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2185022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vector_norm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2185208Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linalg_vector_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2185387Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2185568Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2185738Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2185916Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2186106Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2186300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2186492Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_linspace_tensor_overload_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2186715Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2186843Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_float16 PASSED [1.0180s] [ 41%] 2025-12-04T12:25:43.2187059Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_int16 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2187278Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_int32 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2187496Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log10_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2187624Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_bfloat16 PASSED [1.0197s] [ 41%] 2025-12-04T12:25:43.2187863Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_complex128 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2188125Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_int32 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2188341Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log1p_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2188465Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_bfloat16 PASSED [1.0295s] [ 41%] 2025-12-04T12:25:43.2188592Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_float16 PASSED [0.0062s] [ 41%] 2025-12-04T12:25:43.2188714Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_float32 PASSED [0.0042s] [ 41%] 2025-12-04T12:25:43.2188933Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_int32 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2189148Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log2_cuda_int64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2189361Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 41%] 2025-12-04T12:25:43.2189554Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2189761Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2189955Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2190158Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2190349Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2190537Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2190723Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2190906Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_log_softmax_with_dtype_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2191082Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2191255Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp2_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2191435Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2191612Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2191786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logaddexp_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2191983Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logcumsumexp_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2192156Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logdet_cuda_complex64 SKIPPED [0.0012s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2192326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logdet_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 41%] 2025-12-04T12:25:43.2192462Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_complex128 PASSED [1.0357s] [ 41%] 2025-12-04T12:25:43.2192599Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_complex64 PASSED [0.2962s] [ 41%] 2025-12-04T12:25:43.2192731Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_float16 PASSED [1.0399s] [ 41%] 2025-12-04T12:25:43.2192861Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_and_cuda_uint8 PASSED [0.0081s] [ 42%] 2025-12-04T12:25:43.2192996Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_complex64 PASSED [1.0372s] [ 42%] 2025-12-04T12:25:43.2193130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_float16 PASSED [0.0055s] [ 42%] 2025-12-04T12:25:43.2193259Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_int16 PASSED [0.0039s] [ 42%] 2025-12-04T12:25:43.2193389Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_int64 PASSED [1.0258s] [ 42%] 2025-12-04T12:25:43.2193525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_not_cuda_int8 PASSED [0.0055s] [ 42%] 2025-12-04T12:25:43.2193668Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_complex64 PASSED [0.2953s] [ 42%] 2025-12-04T12:25:43.2193795Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_float64 PASSED [0.0061s] [ 42%] 2025-12-04T12:25:43.2193924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_or_cuda_int64 PASSED [0.0057s] [ 42%] 2025-12-04T12:25:43.2194067Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_int64 PASSED [0.0057s] [ 42%] 2025-12-04T12:25:43.2194194Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logical_xor_cuda_uint8 PASSED [0.0057s] [ 42%] 2025-12-04T12:25:43.2194413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 42%] 2025-12-04T12:25:43.2194535Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_float32 PASSED [1.0569s] [ 42%] 2025-12-04T12:25:43.2194756Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_int16 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 42%] 2025-12-04T12:25:43.2194974Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_int32 SKIPPED [0.0014s] (Op promotes to float, which is impossible for inplace with non-float input) [ 42%] 2025-12-04T12:25:43.2195194Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_int8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 42%] 2025-12-04T12:25:43.2195410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logit_cuda_uint8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 42%] 2025-12-04T12:25:43.2195583Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2195778Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2195979Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logspace_tensor_overload_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2196160Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2196337Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2196517Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2196692Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2196866Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_logsumexp_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2197035Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2197204Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2197366Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_long_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2197492Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_bfloat16 PASSED [0.0091s] [ 42%] 2025-12-04T12:25:43.2197622Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_bool PASSED [0.0066s] [ 42%] 2025-12-04T12:25:43.2197760Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_float32 PASSED [0.0067s] [ 42%] 2025-12-04T12:25:43.2197879Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lt_cuda_uint8 PASSED [0.0067s] [ 42%] 2025-12-04T12:25:43.2198048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2198268Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2198445Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_lu_unpack_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2198607Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2198772Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2198935Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mH_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2199094Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2199262Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2199427Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2199587Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mT_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2199763Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2199940Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2200127Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amax_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2200302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2200478Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2200648Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2200819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_amin_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2200998Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2201179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2201358Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2201535Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2201709Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2201894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2202086Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2202265Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2202451Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2202626Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_argmin_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2202810Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2202992Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2203171Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2203351Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2203529Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2203706Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumprod_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2203884Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumsum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2204061Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumsum_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2204244Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_cumsum_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2204377Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_bool PASSED [0.0077s] [ 42%] 2025-12-04T12:25:43.2204515Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_complex128 PASSED [0.0074s] [ 42%] 2025-12-04T12:25:43.2204653Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_complex64 PASSED [1.0522s] [ 42%] 2025-12-04T12:25:43.2204785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_float32 PASSED [0.0103s] [ 42%] 2025-12-04T12:25:43.2204916Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_fill_cuda_int16 PASSED [0.0079s] [ 42%] 2025-12-04T12:25:43.2205102Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_log_softmax_cuda_bfloat16 SKIPPED [0.0014s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2205291Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logaddexp_cuda_bfloat16 SKIPPED [0.0016s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2205473Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logaddexp_cuda_float16 SKIPPED [0.0013s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2205661Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logaddexp_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2205860Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logsumexp_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2206056Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logsumexp_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2206238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logsumexp_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2206429Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_logsumexp_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2206605Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_mean_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2206779Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2206956Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_norm_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2207132Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2207316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_normalize_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2207497Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_normalize_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2207679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_normalize_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2207858Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2208033Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2208328Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_prod_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2208482Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_bfloat16 PASSED [0.0055s] [ 42%] 2025-12-04T12:25:43.2208623Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_complex128 PASSED [0.0045s] [ 42%] 2025-12-04T12:25:43.2208761Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_complex64 PASSED [0.0045s] [ 42%] 2025-12-04T12:25:43.2208897Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_scatter_cuda_float16 PASSED [0.0044s] [ 42%] 2025-12-04T12:25:43.2209075Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_select_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2209257Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_softmax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2209437Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_softmax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2209617Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_softmin_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2209792Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_std_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2209965Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_std_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2210157Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_std_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2210347Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2210522Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_sum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2210707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2210886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2211064Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2211239Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_masked_var_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2211410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matmul_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2211580Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matmul_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2211758Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matrix_exp_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2211934Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matrix_exp_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2212109Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matrix_exp_cuda_float16 SKIPPED [0.0008s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2212283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_matrix_exp_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2212461Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2212646Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2212819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2212989Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_binary_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2213194Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_pool2d_with_indices_backward_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2213380Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2213564Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2213748Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2213929Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_no_dim_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2214118Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_max_reduction_with_dim_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2214302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2214482Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_maximum_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2214653Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2214838Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mean_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2215010Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2215177Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_median_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2215372Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_meshgrid_list_of_tensors_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2215543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_binary_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2215715Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_binary_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2215902Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2216090Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2216272Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2216456Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_no_dim_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 42%] 2025-12-04T12:25:43.2216644Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2216844Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_min_reduction_with_dim_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2217012Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2217181Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_minimum_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2217349Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2217515Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2217683Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mode_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2217854Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2218022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2218214Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_movedim_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2218397Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2218572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2218741Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2218918Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_msort_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2219048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_complex128 PASSED [0.0059s] [ 43%] 2025-12-04T12:25:43.2219172Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_complex32 PASSED [0.0058s] [ 43%] 2025-12-04T12:25:43.2219297Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_complex64 PASSED [0.0057s] [ 43%] 2025-12-04T12:25:43.2219421Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_int16 PASSED [0.0057s] [ 43%] 2025-12-04T12:25:43.2219540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_int32 PASSED [0.0056s] [ 43%] 2025-12-04T12:25:43.2219661Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mul_cuda_int8 PASSED [0.0057s] [ 43%] 2025-12-04T12:25:43.2219841Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_multinomial_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2220009Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mv_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2220176Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mv_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2220324Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_float16 PASSED [0.0144s] [ 43%] 2025-12-04T12:25:43.2220563Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 43%] 2025-12-04T12:25:43.2220815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 43%] 2025-12-04T12:25:43.2221051Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 43%] 2025-12-04T12:25:43.2221198Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_float32 PASSED [0.0126s] [ 43%] 2025-12-04T12:25:43.2221435Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 43%] 2025-12-04T12:25:43.2221568Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nan_to_num_cuda_float16 PASSED [0.0049s] [ 43%] 2025-12-04T12:25:43.2221701Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nan_to_num_cuda_float32 PASSED [0.0045s] [ 43%] 2025-12-04T12:25:43.2221831Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nan_to_num_cuda_float64 PASSED [0.0045s] [ 43%] 2025-12-04T12:25:43.2221961Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nan_to_num_cuda_uint8 PASSED [1.0471s] [ 43%] 2025-12-04T12:25:43.2222135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmean_cuda_complex32 SKIPPED [0.0016s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2222312Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_float64 SKIPPED [0.0013s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2222493Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2222675Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanmedian_cuda_uint8 SKIPPED [0.0012s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2222852Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nanquantile_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2223037Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2223205Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2223373Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2223540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nansum_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2223717Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2223898Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2224079Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2224256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2224430Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2224607Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2224775Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_narrow_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2224971Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_batch_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2225156Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_batch_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2225336Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_batch_norm_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2225528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_dropout_backward_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2225712Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_native_layer_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2225840Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_complex128 PASSED [1.0383s] [ 43%] 2025-12-04T12:25:43.2225964Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_complex64 PASSED [0.0097s] [ 43%] 2025-12-04T12:25:43.2226085Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ne_cuda_int8 PASSED [0.0072s] [ 43%] 2025-12-04T12:25:43.2226209Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_bfloat16 PASSED [0.0032s] [ 43%] 2025-12-04T12:25:43.2226337Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_complex128 PASSED [1.0391s] [ 43%] 2025-12-04T12:25:43.2226461Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_complex32 PASSED [0.0059s] [ 43%] 2025-12-04T12:25:43.2226592Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_neg_cuda_uint8 PASSED [0.0035s] [ 43%] 2025-12-04T12:25:43.2226777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2226957Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2227140Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2227310Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2227477Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2227645Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2227830Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2228017Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2228242Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2228423Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_empty_strided_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2228593Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2228762Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_full_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2228929Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2229126Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2229298Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2229468Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2229636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_ones_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2229811Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2229979Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2230149Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_new_zeros_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2230279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nextafter_cuda_float16 PASSED [0.0069s] [ 43%] 2025-12-04T12:25:43.2230409Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nextafter_cuda_float64 PASSED [0.0083s] [ 43%] 2025-12-04T12:25:43.2230627Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2230849Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2231054Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2231271Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2231474Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2231676Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2231879Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2232035Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_alpha_dropout_cuda_bfloat16 PASSED [0.0155s] [ 43%] 2025-12-04T12:25:43.2232191Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_alpha_dropout_cuda_float32 PASSED [0.0157s] [ 43%] 2025-12-04T12:25:43.2232383Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool1d_cuda_float16 SKIPPED [0.0018s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2232576Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool2d_cuda_float16 SKIPPED [0.0018s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2232766Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_avg_pool3d_cuda_float16 SKIPPED [0.0013s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2232960Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_bilinear_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2233162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_bilinear_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2233368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2233592Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2233813Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2234033Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2234177Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_celu_cuda_float16 PASSED [1.0624s] [ 43%] 2025-12-04T12:25:43.2234321Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_celu_cuda_float32 PASSED [0.0067s] [ 43%] 2025-12-04T12:25:43.2234517Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2234720Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2234928Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2235134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2235330Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2235537Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_channel_shuffle_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2235728Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv1d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2235920Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv1d_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2236112Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv1d_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2236300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv2d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2236491Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv2d_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2236694Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose1d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2236893Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose1d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2237203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2237403Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose2d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2237619Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2237825Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose3d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2238029Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose3d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2238250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_conv_transpose3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2238458Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2238662Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2238868Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2239072Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2239286Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2239495Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_cross_entropy_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2239683Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_ctc_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2239846Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout2d_cuda_bfloat16 PASSED [0.0120s] [ 43%] 2025-12-04T12:25:43.2239998Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout3d_cuda_bfloat16 PASSED [0.0133s] [ 43%] 2025-12-04T12:25:43.2240151Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout3d_cuda_float16 PASSED [0.0132s] [ 43%] 2025-12-04T12:25:43.2240298Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout3d_cuda_float64 PASSED [1.0632s] [ 43%] 2025-12-04T12:25:43.2240445Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_dropout_cuda_float16 PASSED [0.0169s] [ 43%] 2025-12-04T12:25:43.2240586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_elu_cuda_bfloat16 PASSED [1.0418s] [ 43%] 2025-12-04T12:25:43.2240725Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_elu_cuda_float16 PASSED [0.0075s] [ 43%] 2025-12-04T12:25:43.2240924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_embedding_bag_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 43%] 2025-12-04T12:25:43.2241102Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_bfloat16 PASSED [0.0098s] [ 44%] 2025-12-04T12:25:43.2241286Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex64 PASSED [0.0081s] [ 44%] 2025-12-04T12:25:43.2241465Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int16 PASSED [0.0068s] [ 44%] 2025-12-04T12:25:43.2241663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int64 PASSED [0.0068s] [ 44%] 2025-12-04T12:25:43.2241870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2242077Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2242276Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_gaussian_nll_loss_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2242462Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_gelu_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2242656Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_grid_sample_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2242848Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_group_norm_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2243038Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_group_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2243192Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardsigmoid_cuda_bfloat16 PASSED [0.0057s] [ 44%] 2025-12-04T12:25:43.2243355Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardsigmoid_cuda_float16 PASSED [0.0054s] [ 44%] 2025-12-04T12:25:43.2243516Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardsigmoid_cuda_float64 PASSED [0.0052s] [ 44%] 2025-12-04T12:25:43.2243708Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardswish_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2243916Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardswish_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2244109Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2244298Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2244486Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2244673Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hardtanh_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2244880Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_hinge_embedding_loss_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2245072Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_huber_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2245262Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_huber_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2245469Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bicubic_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2245674Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bicubic_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2245894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_bilinear_cuda_bfloat16 SKIPPED [0.0008s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2246096Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_linear_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2246315Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2246531Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2246743Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2246950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2247154Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_nearest_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2247363Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_trilinear_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2247582Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_interpolate_trilinear_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2247779Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_kl_div_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2247963Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_kl_div_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2248199Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_layer_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2248350Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_leaky_relu_cuda_float32 PASSED [0.0096s] [ 44%] 2025-12-04T12:25:43.2248538Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_linear_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2248745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_local_response_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2248937Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_logsigmoid_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2249140Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2249339Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2249539Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_margin_ranking_loss_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2249731Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool2d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2249925Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_pool3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2250135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool1d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2250328Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2250527Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool1d_grad_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2250722Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool2d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2250917Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool2d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2251117Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool2d_grad_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2251316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2251508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2251710Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_grad_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2251926Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2252137Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2252281Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mish_cuda_bfloat16 PASSED [0.0048s] [ 44%] 2025-12-04T12:25:43.2252432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mish_cuda_float64 PASSED [0.0045s] [ 44%] 2025-12-04T12:25:43.2252624Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mse_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2252811Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mse_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2253000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mse_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2253189Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_mse_loss_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2253401Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multilabel_margin_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2253608Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multilabel_margin_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2253822Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2254012Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_nll_loss_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2254206Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_normalize_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2254413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_normalize_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2254605Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_normalize_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2254791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_one_hot_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2254986Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2255181Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_circular_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2255381Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2255576Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2255769Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2255959Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2256164Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_constant_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2256368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_reflect_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2256578Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_reflect_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2256776Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2256974Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2257168Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2257381Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2257590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2257794Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pad_replicate_negative_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2257992Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pairwise_distance_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2258214Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2258413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2258619Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2258813Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_shuffle_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2259010Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2259210Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2259408Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_pixel_unshuffle_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2259607Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2259804Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2259998Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2260195Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2260405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_poisson_nll_loss_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2260610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_prelu_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2260795Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_prelu_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2260998Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu6_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2261186Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu6_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2261369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_relu_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2261560Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rms_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2261747Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rms_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2261893Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rrelu_cuda_bfloat16 PASSED [0.0061s] [ 44%] 2025-12-04T12:25:43.2262034Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_rrelu_cuda_float32 PASSED [0.0059s] [ 44%] 2025-12-04T12:25:43.2262254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2262471Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2262614Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_selu_cuda_float32 PASSED [1.0732s] [ 44%] 2025-12-04T12:25:43.2262766Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_selu_cuda_float64 PASSED [0.0076s] [ 44%] 2025-12-04T12:25:43.2262924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_silu_complex_cuda_complex64 PASSED [0.0042s] [ 44%] 2025-12-04T12:25:43.2263122Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_smooth_l1_loss_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2263312Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2263502Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2263708Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2263911Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2264109Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softmin_with_dtype_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2264300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softplus_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2264499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softsign_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2264699Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_softsign_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2264897Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_tanhshrink_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2265097Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_tanhshrink_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2265287Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_tanhshrink_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2265496Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2265707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2265912Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2266120Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2266322Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_loss_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2266550Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2266779Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2267008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2267231Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2267419Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_unfold_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2267624Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_bilinear_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2267828Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_bilinear_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 44%] 2025-12-04T12:25:43.2268035Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_bilinear_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2268290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_nearest_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2268490Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_nearest_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2268688Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nn_functional_upsample_nearest_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2268877Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2269060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2269228Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2269412Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2269578Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2269741Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_bfloat16 SKIPPED [0.0007s] (Only runs on cpu) [ 45%] 2025-12-04T12:25:43.2269904Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_complex128 SKIPPED [0.0005s] (Only runs on cpu) [ 45%] 2025-12-04T12:25:43.2270069Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_complex32 SKIPPED [0.0005s] (Only runs on cpu) [ 45%] 2025-12-04T12:25:43.2270230Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_complex64 SKIPPED [0.0005s] (Only runs on cpu) [ 45%] 2025-12-04T12:25:43.2270393Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_float32 SKIPPED [0.0005s] (Only runs on cpu) [ 45%] 2025-12-04T12:25:43.2270552Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_nonzero_static_cuda_uint8 SKIPPED [0.0005s] (Only runs on cpu) [ 45%] 2025-12-04T12:25:43.2270718Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2270889Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2271053Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2271244Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_fro_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2271417Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_fro_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2271588Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_fro_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2271759Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_inf_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2271939Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_inf_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2272109Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_inf_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2274021Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_norm_nuc_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2274197Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2274367Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2274511Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_in_place_cuda_complex64 PASSED [0.0041s] [ 45%] 2025-12-04T12:25:43.2274670Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_in_place_cuda_float16 PASSED [0.0045s] [ 45%] 2025-12-04T12:25:43.2274819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_in_place_cuda_float64 PASSED [0.0038s] [ 45%] 2025-12-04T12:25:43.2275011Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_normal_number_mean_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2275191Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2275362Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2275525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2275705Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2275873Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2276058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2276235Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ones_like_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2276404Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ormqr_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2276572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2276744Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2276920Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2277098Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2277267Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2277432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_outer_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2277614Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pca_lowrank_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2277790Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pca_lowrank_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2277968Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2278186Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2278368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2278549Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2278723Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_copy_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2278927Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2279109Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2279285Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2279467Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2279637Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2279806Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_permute_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2279988Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pinverse_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2280236Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_0_cuda_int64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2280389Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_1_cuda_float16 PASSED [0.0077s] [ 45%] 2025-12-04T12:25:43.2280543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_1_cuda_float64 PASSED [0.0075s] [ 45%] 2025-12-04T12:25:43.2280783Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_1_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2281024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_bool SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2281173Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_float16 PASSED [0.0076s] [ 45%] 2025-12-04T12:25:43.2281340Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_float64 PASSED [0.0075s] [ 45%] 2025-12-04T12:25:43.2281581Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2281821Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2282061Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_2_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2282298Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2282454Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_float16 PASSED [1.0997s] [ 45%] 2025-12-04T12:25:43.2282695Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_int16 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2282936Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_3_cuda_uint8 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2283095Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_float32 PASSED [0.0106s] [ 45%] 2025-12-04T12:25:43.2283346Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_int16 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2283597Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2283841Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2284080Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_polygamma_polygamma_n_4_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2284255Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2284436Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2284610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2284785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2284954Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2285127Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_positive_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2285256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_complex64 PASSED [0.0068s] [ 45%] 2025-12-04T12:25:43.2285382Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_int32 PASSED [0.0065s] [ 45%] 2025-12-04T12:25:43.2285513Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_int64 PASSED [0.0064s] [ 45%] 2025-12-04T12:25:43.2285638Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_pow_cuda_int8 PASSED [0.0064s] [ 45%] 2025-12-04T12:25:43.2285813Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2285981Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2286148Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_prod_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2286271Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_float32 PASSED [0.0172s] [ 45%] 2025-12-04T12:25:43.2286390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_int16 PASSED [0.0206s] [ 45%] 2025-12-04T12:25:43.2286509Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_put_cuda_int64 PASSED [1.1294s] [ 45%] 2025-12-04T12:25:43.2286677Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_qr_cuda_float64 SKIPPED [0.0016s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2286849Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_quantile_cuda_float32 SKIPPED [0.0013s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2286978Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rad2deg_cuda_float16 PASSED [0.0048s] [ 45%] 2025-12-04T12:25:43.2287107Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rad2deg_cuda_float64 PASSED [1.0991s] [ 45%] 2025-12-04T12:25:43.2287296Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rand_like_cuda_complex32 SKIPPED [0.0016s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2287478Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rand_like_cuda_float64 SKIPPED [0.0013s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2287650Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2287833Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_cuda_int64 SKIPPED [0.0012s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2288007Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_like_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2288220Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randint_like_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2288394Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2288566Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2288745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_randn_like_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2288920Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2289091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2289263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2289431Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2289614Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_ravel_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2289786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2289952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2290119Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2290283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_real_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2290420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_bfloat16 PASSED [1.0931s] [ 45%] 2025-12-04T12:25:43.2290553Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_float16 PASSED [0.0060s] [ 45%] 2025-12-04T12:25:43.2290787Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2291014Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reciprocal_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 45%] 2025-12-04T12:25:43.2291149Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_bfloat16 PASSED [0.0083s] [ 45%] 2025-12-04T12:25:43.2291293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_float16 PASSED [0.0078s] [ 45%] 2025-12-04T12:25:43.2291439Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_float32 PASSED [0.0069s] [ 45%] 2025-12-04T12:25:43.2291569Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_remainder_cuda_float64 PASSED [0.0069s] [ 45%] 2025-12-04T12:25:43.2291699Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_renorm_cuda_float16 PASSED [0.0096s] [ 45%] 2025-12-04T12:25:43.2291883Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2292058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2292232Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2292401Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2292573Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2292760Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2292950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2293133Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2293318Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2293502Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2293686Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_repeat_interleave_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2293883Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2294052Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 45%] 2025-12-04T12:25:43.2294226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_as_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2294401Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2294576Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2294747Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2294921Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2295090Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2295263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_reshape_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2295393Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_bfloat16 PASSED [1.1111s] [ 46%] 2025-12-04T12:25:43.2295539Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_complex128 PASSED [0.0059s] [ 46%] 2025-12-04T12:25:43.2295676Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_float32 PASSED [0.0041s] [ 46%] 2025-12-04T12:25:43.2295806Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_int64 PASSED [1.0893s] [ 46%] 2025-12-04T12:25:43.2295935Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_int8 PASSED [0.0059s] [ 46%] 2025-12-04T12:25:43.2296069Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize__cuda_uint8 PASSED [0.0042s] [ 46%] 2025-12-04T12:25:43.2296205Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_float32 PASSED [0.0041s] [ 46%] 2025-12-04T12:25:43.2296335Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_float64 PASSED [1.0938s] [ 46%] 2025-12-04T12:25:43.2296466Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resize_as__cuda_int8 PASSED [0.0060s] [ 46%] 2025-12-04T12:25:43.2296647Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_bfloat16 SKIPPED [0.0012s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2296833Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2297013Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2297190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2297364Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2297542Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_conj_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2297722Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2297905Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2298133Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2298314Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2298493Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2298667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2298842Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_resolve_neg_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2299007Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2299178Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2299344Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_roll_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2299518Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rot90_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2299701Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rot90_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2299891Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rot90_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2300061Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rot90_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2300237Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rot90_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2300369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_cuda_float16 PASSED [1.0844s] [ 46%] 2025-12-04T12:25:43.2300497Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_cuda_float64 PASSED [0.0049s] [ 46%] 2025-12-04T12:25:43.2300625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_cuda_uint8 PASSED [1.0916s] [ 46%] 2025-12-04T12:25:43.2300767Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_0_cuda_bfloat16 PASSED [0.0062s] [ 46%] 2025-12-04T12:25:43.2300912Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_0_cuda_float32 PASSED [0.0053s] [ 46%] 2025-12-04T12:25:43.2301053Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_0_cuda_float64 PASSED [1.0964s] [ 46%] 2025-12-04T12:25:43.2301194Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_3_cuda_float16 PASSED [0.0062s] [ 46%] 2025-12-04T12:25:43.2301329Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_3_cuda_float64 PASSED [0.0043s] [ 46%] 2025-12-04T12:25:43.2301477Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_neg_3_cuda_bfloat16 PASSED [1.0869s] [ 46%] 2025-12-04T12:25:43.2301624Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_round_decimals_neg_3_cuda_float64 PASSED [0.0062s] [ 46%] 2025-12-04T12:25:43.2301846Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_bool SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2302089Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_complex32 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2302310Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsqrt_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2302482Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2302650Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2302817Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_rsub_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2303001Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2303183Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2303362Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2303543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2303728Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scalar_tensor_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2303880Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_complex128 PASSED [0.0087s] [ 46%] 2025-12-04T12:25:43.2304017Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_complex64 PASSED [0.0081s] [ 46%] 2025-12-04T12:25:43.2304149Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_int16 PASSED [1.1085s] [ 46%] 2025-12-04T12:25:43.2304292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_add_cuda_int8 PASSED [0.0112s] [ 46%] 2025-12-04T12:25:43.2304422Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_bfloat16 PASSED [0.0202s] [ 46%] 2025-12-04T12:25:43.2304549Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_int32 PASSED [0.0143s] [ 46%] 2025-12-04T12:25:43.2304673Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_cuda_int64 PASSED [0.0142s] [ 46%] 2025-12-04T12:25:43.2304815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amax_cuda_int8 PASSED [0.0166s] [ 46%] 2025-12-04T12:25:43.2304956Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_amax_cuda_uint8 PASSED [0.0166s] [ 46%] 2025-12-04T12:25:43.2305106Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_mean_cuda_float64 PASSED [0.0176s] [ 46%] 2025-12-04T12:25:43.2305251Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_float16 PASSED [0.0167s] [ 46%] 2025-12-04T12:25:43.2305395Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_float32 PASSED [0.0166s] [ 46%] 2025-12-04T12:25:43.2305536Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_int16 PASSED [0.0167s] [ 46%] 2025-12-04T12:25:43.2305677Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_prod_cuda_int8 PASSED [0.0166s] [ 46%] 2025-12-04T12:25:43.2305821Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_sum_cuda_bfloat16 PASSED [0.0166s] [ 46%] 2025-12-04T12:25:43.2305963Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_sum_cuda_bool PASSED [0.0165s] [ 46%] 2025-12-04T12:25:43.2306116Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_sum_cuda_int64 PASSED [0.0165s] [ 46%] 2025-12-04T12:25:43.2306262Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_scatter_reduce_sum_cuda_uint8 PASSED [0.0165s] [ 46%] 2025-12-04T12:25:43.2306451Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2306628Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2306807Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2306982Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2307162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_searchsorted_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2307334Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2307503Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2307672Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2307855Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2308030Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2308240Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2308449Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2308627Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2308815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2308997Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_select_scatter_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2309120Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_bool PASSED [0.0030s] [ 46%] 2025-12-04T12:25:43.2309248Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_complex64 PASSED [1.1024s] [ 46%] 2025-12-04T12:25:43.2309374Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sgn_cuda_int32 PASSED [0.0050s] [ 46%] 2025-12-04T12:25:43.2309549Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_short_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2309718Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_short_cuda_float16 SKIPPED [0.0012s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2309888Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_short_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2310058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_short_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2310187Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_bfloat16 PASSED [1.1172s] [ 46%] 2025-12-04T12:25:43.2310432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_complex32 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2310660Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_int32 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2310882Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2311106Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sigmoid_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2311232Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_bfloat16 PASSED [1.1052s] [ 46%] 2025-12-04T12:25:43.2311357Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_int64 PASSED [0.0049s] [ 46%] 2025-12-04T12:25:43.2311476Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_int8 PASSED [0.0037s] [ 46%] 2025-12-04T12:25:43.2311599Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sign_cuda_uint8 PASSED [1.0946s] [ 46%] 2025-12-04T12:25:43.2311793Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_blackman_cuda_float32 SKIPPED [0.0016s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2312009Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_exponential_cuda_float32 SKIPPED [0.0013s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2312224Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_general_hamming_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2312429Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_general_hamming_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2312633Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_hamming_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2312820Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_hann_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2313010Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signal_windows_kaiser_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2313182Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2313358Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2313528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_signbit_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2313657Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_bfloat16 PASSED [1.1107s] [ 46%] 2025-12-04T12:25:43.2313879Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_complex32 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2314004Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_float16 PASSED [1.0912s] [ 46%] 2025-12-04T12:25:43.2314128Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_float64 PASSED [0.0050s] [ 46%] 2025-12-04T12:25:43.2314350Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_int8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2314581Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sin_cuda_uint8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2314707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_bfloat16 PASSED [0.0057s] [ 46%] 2025-12-04T12:25:43.2314933Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_complex64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2315148Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2315366Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_int32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2315581Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2315800Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinc_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2316027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_complex128 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2316260Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2316494Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2316620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_float16 PASSED [1.1125s] [ 46%] 2025-12-04T12:25:43.2316758Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_float32 PASSED [0.0051s] [ 46%] 2025-12-04T12:25:43.2316976Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_int16 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2317195Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sinh_cuda_uint8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 46%] 2025-12-04T12:25:43.2317363Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 46%] 2025-12-04T12:25:43.2317537Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2317704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2317889Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2318106Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2318279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_slice_scatter_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2318452Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2318635Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2318837Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2319024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_softmax_with_dtype_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2319193Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2319359Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2319526Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2319691Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2319858Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sort_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2320023Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sparse_mm_reduce_cuda_float64 SKIPPED [0.0005s] (Only runs on cpu) [ 47%] 2025-12-04T12:25:43.2320216Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sparse_sampled_addmm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2320409Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_airy_ai_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2320601Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_airy_ai_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2320784Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_airy_ai_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2320981Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2321166Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2321343Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2321525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j0_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2321705Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_j1_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2321890Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y0_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2322071Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y0_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2322252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y0_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2322431Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y0_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2322614Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2322796Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_bessel_y1_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2323008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_t_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2323209Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_t_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2323408Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_t_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2323612Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_u_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2323812Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_u_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2324013Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_u_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2324220Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_v_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2324420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_v_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2324635Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_w_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2324852Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_chebyshev_polynomial_w_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2325032Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2325223Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2325400Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2325574Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2325750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_entr_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2325932Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_erfcx_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2326110Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_erfcx_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2326307Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_h_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2326504Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_h_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2326704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_h_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2326904Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2327115Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2327310Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2327510Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_hermite_polynomial_he_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2327684Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i0e_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2327857Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i0e_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2328029Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2328235Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2328408Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2328576Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2328753Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2328941Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2329129Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2329301Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2329490Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_i1e_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2329691Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_laguerre_polynomial_l_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2329892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_laguerre_polynomial_l_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2330093Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_laguerre_polynomial_l_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2330290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2330489Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2330687Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_legendre_polynomial_p_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2330868Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2331048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_log_ndtr_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2331246Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i0_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2331453Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i0_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2331645Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i0_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2331842Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i1_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2332035Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i1_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2332228Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_i1_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2332418Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k0_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2332611Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k1_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2332802Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k1_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2332994Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_modified_bessel_k1_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2333181Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtr_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2333367Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtr_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2333545Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtr_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2333733Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_ndtri_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2333942Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2334158Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2334368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2334577Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2334777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k0_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2334976Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k1_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2335178Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_scaled_modified_bessel_k1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2335386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2335606Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2335814Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2336022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2336233Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2336439Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2336654Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2336863Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2337070Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2337287Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2337502Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2337696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_spherical_bessel_j0_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2337884Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_xlog1py_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2338121Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_special_xlog1py_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2338285Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2338455Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2338630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2338817Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2339000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2339176Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2339350Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_list_args_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2339538Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2339722Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2339922Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2340109Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2340288Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2340472Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2340651Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2340832Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_split_with_sizes_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2341050Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 47%] 2025-12-04T12:25:43.2341177Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_float64 PASSED [1.0994s] [ 47%] 2025-12-04T12:25:43.2341396Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_int32 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 47%] 2025-12-04T12:25:43.2341629Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_int8 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 47%] 2025-12-04T12:25:43.2341860Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sqrt_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 47%] 2025-12-04T12:25:43.2341987Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_float32 PASSED [1.1105s] [ 47%] 2025-12-04T12:25:43.2342129Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_float64 PASSED [0.0065s] [ 47%] 2025-12-04T12:25:43.2342251Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_int16 PASSED [0.0045s] [ 47%] 2025-12-04T12:25:43.2342374Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_int32 PASSED [1.1080s] [ 47%] 2025-12-04T12:25:43.2342498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_square_cuda_int64 PASSED [0.0065s] [ 47%] 2025-12-04T12:25:43.2342680Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_copy_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2342857Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_copy_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 47%] 2025-12-04T12:25:43.2342980Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_int64 PASSED [0.0067s] [ 47%] 2025-12-04T12:25:43.2343104Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_cuda_int8 PASSED [0.0063s] [ 47%] 2025-12-04T12:25:43.2343244Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_bfloat16 PASSED [0.0054s] [ 48%] 2025-12-04T12:25:43.2343390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_complex128 PASSED [0.0054s] [ 48%] 2025-12-04T12:25:43.2343533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_complex64 PASSED [0.0054s] [ 48%] 2025-12-04T12:25:43.2343672Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_int16 PASSED [0.0053s] [ 48%] 2025-12-04T12:25:43.2343818Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_squeeze_multiple_cuda_int8 PASSED [0.0053s] [ 48%] 2025-12-04T12:25:43.2343994Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2344161Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2344328Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2344493Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2344658Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stack_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2344826Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2344991Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2345169Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2345355Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2345552Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2345748Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_mean_unbiased_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2345927Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_unbiased_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2346117Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_unbiased_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2346293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_std_unbiased_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2346462Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_stft_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2346586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_bfloat16 PASSED [0.0071s] [ 48%] 2025-12-04T12:25:43.2346712Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_complex128 PASSED [0.0072s] [ 48%] 2025-12-04T12:25:43.2346837Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_complex64 PASSED [0.0071s] [ 48%] 2025-12-04T12:25:43.2346958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_float16 PASSED [0.0070s] [ 48%] 2025-12-04T12:25:43.2347080Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_float64 PASSED [1.1212s] [ 48%] 2025-12-04T12:25:43.2347198Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sub_cuda_int8 PASSED [0.0096s] [ 48%] 2025-12-04T12:25:43.2347364Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2347533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2347701Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2347881Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2348043Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2348237Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2348399Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2348568Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2348744Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2348914Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2349082Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2349252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_sum_to_size_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2349420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2349595Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2349726Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_float64 PASSED [0.0038s] [ 48%] 2025-12-04T12:25:43.2349842Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_int16 PASSED [1.1057s] [ 48%] 2025-12-04T12:25:43.2349959Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_int32 PASSED [0.0059s] [ 48%] 2025-12-04T12:25:43.2350087Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_t_cuda_uint8 PASSED [0.0042s] [ 48%] 2025-12-04T12:25:43.2350271Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2350452Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2350629Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2350807Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2350982Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_along_dim_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2351147Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_take_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2351362Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 48%] 2025-12-04T12:25:43.2351586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 48%] 2025-12-04T12:25:43.2351710Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_float64 PASSED [1.1149s] [ 48%] 2025-12-04T12:25:43.2351924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tan_cuda_int8 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 48%] 2025-12-04T12:25:43.2352157Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_complex32 SKIPPED [0.0014s] (Op promotes to float, which is impossible for inplace with non-float input) [ 48%] 2025-12-04T12:25:43.2352282Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_float16 PASSED [0.0046s] [ 48%] 2025-12-04T12:25:43.2352405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_float64 PASSED [1.1014s] [ 48%] 2025-12-04T12:25:43.2352624Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tanh_cuda_int64 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 48%] 2025-12-04T12:25:43.2352799Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_bool SKIPPED [0.0013s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2352981Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2353156Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_int16 SKIPPED [0.0012s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2353328Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tensor_split_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2353491Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tile_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2353666Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2353839Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2354001Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2354173Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2354334Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2354492Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2354654Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2354814Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2354990Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2355163Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2355332Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_to_sparse_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2355497Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_topk_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2355663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_topk_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2355828Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_topk_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2356008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_topk_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2356272Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__efficient_attention_forward_cuda_bfloat16 SKIPPED [0.0005s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 48%] 2025-12-04T12:25:43.2356485Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__flash_attention_forward_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2356692Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2356893Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2357064Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2357238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2357405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2357571Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2357749Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2357925Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2358130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trace_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2358330Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2358510Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2358689Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2358865Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2359043Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_copy_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2359171Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_cuda_int16 PASSED [0.0078s] [ 48%] 2025-12-04T12:25:43.2359300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_transpose_cuda_uint8 PASSED [0.0063s] [ 48%] 2025-12-04T12:25:43.2359477Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2359657Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2359826Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2359997Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2360180Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2360349Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapezoid_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2360519Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapz_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2360686Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapz_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2360851Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trapz_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2361037Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triangular_solve_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2361219Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triangular_solve_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2361340Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_bool PASSED [0.0111s] [ 48%] 2025-12-04T12:25:43.2361468Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_complex128 PASSED [0.0103s] [ 48%] 2025-12-04T12:25:43.2361597Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_complex64 PASSED [0.0102s] [ 48%] 2025-12-04T12:25:43.2361721Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_float16 PASSED [1.0991s] [ 48%] 2025-12-04T12:25:43.2361856Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_float64 PASSED [0.0135s] [ 48%] 2025-12-04T12:25:43.2361990Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_int64 PASSED [0.0106s] [ 48%] 2025-12-04T12:25:43.2362107Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_tril_cuda_int8 PASSED [0.0102s] [ 48%] 2025-12-04T12:25:43.2362229Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_float64 PASSED [1.1221s] [ 48%] 2025-12-04T12:25:43.2362358Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_int16 PASSED [0.0134s] [ 48%] 2025-12-04T12:25:43.2362475Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_cuda_int64 PASSED [0.0106s] [ 48%] 2025-12-04T12:25:43.2362649Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_triu_indices_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2362781Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_float16 PASSED [0.0066s] [ 48%] 2025-12-04T12:25:43.2362913Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_float64 PASSED [0.0063s] [ 48%] 2025-12-04T12:25:43.2363140Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 48%] 2025-12-04T12:25:43.2363369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 48%] 2025-12-04T12:25:43.2363592Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_true_divide_cuda_int64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 48%] 2025-12-04T12:25:43.2363716Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_float16 PASSED [1.1004s] [ 48%] 2025-12-04T12:25:43.2363838Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_int16 PASSED [0.0049s] [ 48%] 2025-12-04T12:25:43.2363959Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_int8 PASSED [0.0036s] [ 48%] 2025-12-04T12:25:43.2364079Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_trunc_cuda_uint8 PASSED [1.1010s] [ 48%] 2025-12-04T12:25:43.2364263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_bool SKIPPED [0.0016s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2364444Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_complex128 SKIPPED [0.0013s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2364622Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2364800Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2364972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_copy_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2365146Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2365317Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_complex32 SKIPPED [0.0011s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2365486Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2365651Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unbind_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2365836Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 48%] 2025-12-04T12:25:43.2366019Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2366191Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2366374Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2366542Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unflatten_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2366713Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2366889Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2367058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2367226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2367392Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2367556Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2367722Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unfold_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2367852Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_uniform_cuda_bfloat16 PASSED [1.1107s] [ 49%] 2025-12-04T12:25:43.2367983Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_uniform_cuda_complex64 PASSED [0.0064s] [ 49%] 2025-12-04T12:25:43.2368155Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_uniform_cuda_float64 PASSED [0.0044s] [ 49%] 2025-12-04T12:25:43.2368358Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2368542Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2368724Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2368907Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_consecutive_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2369076Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2369242Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2369407Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2369573Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2369738Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unique_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2369926Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unravel_index_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2370113Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2370288Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2370479Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_chunk_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2370656Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2370830Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2371008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2371181Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2371352Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2371526Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsafe_split_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2371700Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2371876Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2372052Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_copy_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2372180Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_bool PASSED [0.0068s] [ 49%] 2025-12-04T12:25:43.2372329Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_complex128 PASSED [0.0067s] [ 49%] 2025-12-04T12:25:43.2372458Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_float32 PASSED [0.0067s] [ 49%] 2025-12-04T12:25:43.2372585Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_int32 PASSED [1.1056s] [ 49%] 2025-12-04T12:25:43.2372709Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_unsqueeze_cuda_uint8 PASSED [0.0093s] [ 49%] 2025-12-04T12:25:43.2372877Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2373043Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2373210Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2373386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2373571Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_mean_unbiased_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2373746Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_var_unbiased_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2373915Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vdot_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2374103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_complex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2374374Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_as_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2374548Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2374736Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2374913Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2375084Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2375256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2375425Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2375596Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2375763Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2375925Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2376095Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2376258Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2376421Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_view_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2376596Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2376769Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2376934Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2377100Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vsplit_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2377265Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_vstack_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2377433Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2377603Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2377769Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2377931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_where_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2378057Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_xlogy_cuda_bfloat16 PASSED [0.0095s] [ 49%] 2025-12-04T12:25:43.2378235Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_xlogy_cuda_float32 PASSED [1.1185s] [ 49%] 2025-12-04T12:25:43.2378375Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_bfloat16 PASSED [0.0058s] [ 49%] 2025-12-04T12:25:43.2378502Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_complex64 PASSED [0.0041s] [ 49%] 2025-12-04T12:25:43.2378623Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_int64 PASSED [1.1089s] [ 49%] 2025-12-04T12:25:43.2378757Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zero__cuda_int8 PASSED [0.0058s] [ 49%] 2025-12-04T12:25:43.2378928Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_complex64 SKIPPED [0.0012s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2379096Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2379264Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2379432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2379595Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2379760Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2379936Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2380105Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2380279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2380446Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2380630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_inplace_zeros_like_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 49%] 2025-12-04T12:25:43.2380750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_H_cuda_int8 PASSED [1.1253s] [ 49%] 2025-12-04T12:25:43.2380876Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_complex32 PASSED [0.0047s] [ 49%] 2025-12-04T12:25:43.2380994Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_float16 PASSED [1.1181s] [ 49%] 2025-12-04T12:25:43.2381115Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_float64 PASSED [0.0048s] [ 49%] 2025-12-04T12:25:43.2381231Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_int32 PASSED [1.1139s] [ 49%] 2025-12-04T12:25:43.2381348Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_int64 PASSED [0.0047s] [ 49%] 2025-12-04T12:25:43.2381465Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_T_cuda_int8 PASSED [1.1024s] [ 49%] 2025-12-04T12:25:43.2381607Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_complex128 PASSED [0.0117s] [ 49%] 2025-12-04T12:25:43.2381736Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_int32 PASSED [0.0091s] [ 49%] 2025-12-04T12:25:43.2381865Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___getitem___cuda_int64 PASSED [0.0088s] [ 49%] 2025-12-04T12:25:43.2381991Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_bfloat16 PASSED [0.0139s] [ 49%] 2025-12-04T12:25:43.2382133Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_float16 PASSED [0.0132s] [ 49%] 2025-12-04T12:25:43.2382271Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_float32 PASSED [0.0099s] [ 49%] 2025-12-04T12:25:43.2382396Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___radd___cuda_int8 PASSED [0.0099s] [ 49%] 2025-12-04T12:25:43.2382521Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rand___cuda_int16 PASSED [0.0097s] [ 49%] 2025-12-04T12:25:43.2382662Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rand___cuda_int32 PASSED [0.0097s] [ 49%] 2025-12-04T12:25:43.2382786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rand___cuda_int64 PASSED [0.0097s] [ 49%] 2025-12-04T12:25:43.2382909Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rand___cuda_uint8 PASSED [0.0097s] [ 49%] 2025-12-04T12:25:43.2383033Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_bool PASSED [0.0165s] [ 49%] 2025-12-04T12:25:43.2383163Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_complex64 PASSED [0.0139s] [ 49%] 2025-12-04T12:25:43.2383290Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_float64 PASSED [0.0137s] [ 49%] 2025-12-04T12:25:43.2383413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rdiv___cuda_int32 PASSED [0.0164s] [ 49%] 2025-12-04T12:25:43.2383548Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmatmul___cuda_float64 PASSED [0.0253s] [ 49%] 2025-12-04T12:25:43.2383673Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_float32 PASSED [0.0099s] [ 49%] 2025-12-04T12:25:43.2383797Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_int16 PASSED [0.0097s] [ 49%] 2025-12-04T12:25:43.2383920Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_int32 PASSED [0.0098s] [ 49%] 2025-12-04T12:25:43.2384047Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_int64 PASSED [0.0097s] [ 49%] 2025-12-04T12:25:43.2384171Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_int8 PASSED [0.0097s] [ 49%] 2025-12-04T12:25:43.2384308Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmod___cuda_uint8 PASSED [0.0098s] [ 49%] 2025-12-04T12:25:43.2384431Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rmul___cuda_bool PASSED [0.0097s] [ 49%] 2025-12-04T12:25:43.2384554Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___ror___cuda_int64 PASSED [0.0098s] [ 49%] 2025-12-04T12:25:43.2384680Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_float16 PASSED [0.0133s] [ 49%] 2025-12-04T12:25:43.2384804Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_int32 PASSED [0.0099s] [ 49%] 2025-12-04T12:25:43.2384929Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rpow___cuda_uint8 PASSED [0.0100s] [ 49%] 2025-12-04T12:25:43.2385056Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rsub___cuda_float16 PASSED [0.0133s] [ 49%] 2025-12-04T12:25:43.2385182Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rsub___cuda_float64 PASSED [0.0099s] [ 49%] 2025-12-04T12:25:43.2385305Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rsub___cuda_int8 PASSED [0.0099s] [ 49%] 2025-12-04T12:25:43.2385430Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace___rxor___cuda_uint8 PASSED [0.0097s] [ 49%] 2025-12-04T12:25:43.2385581Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__batch_norm_with_update_cuda_bfloat16 PASSED [0.0224s] [ 49%] 2025-12-04T12:25:43.2385731Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__batch_norm_with_update_cuda_float16 PASSED [0.0219s] [ 49%] 2025-12-04T12:25:43.2385887Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__batch_norm_with_update_cuda_float32 PASSED [0.0207s] [ 49%] 2025-12-04T12:25:43.2386050Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__batch_norm_with_update_cuda_float64 PASSED [1.1299s] [ 49%] 2025-12-04T12:25:43.2386188Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_complex128 PASSED [0.0176s] [ 49%] 2025-12-04T12:25:43.2386319Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_float16 PASSED [1.1152s] [ 49%] 2025-12-04T12:25:43.2386457Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__chunk_cat_cuda_uint8 PASSED [0.0176s] [ 49%] 2025-12-04T12:25:43.2386594Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_abs_cuda_bfloat16 PASSED [1.1136s] [ 49%] 2025-12-04T12:25:43.2386727Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_abs_cuda_float16 PASSED [0.0186s] [ 49%] 2025-12-04T12:25:43.2386865Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_abs_cuda_float64 PASSED [0.0092s] [ 49%] 2025-12-04T12:25:43.2386999Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_bool PASSED [0.0129s] [ 49%] 2025-12-04T12:25:43.2387135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_float64 PASSED [0.0088s] [ 50%] 2025-12-04T12:25:43.2387269Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_int16 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2387402Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_int32 PASSED [0.0128s] [ 50%] 2025-12-04T12:25:43.2387533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_int8 PASSED [0.0128s] [ 50%] 2025-12-04T12:25:43.2387665Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_acos_cuda_uint8 PASSED [0.0126s] [ 50%] 2025-12-04T12:25:43.2387805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_complex128 PASSED [0.0665s] [ 50%] 2025-12-04T12:25:43.2387943Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_complex64 PASSED [0.0660s] [ 50%] 2025-12-04T12:25:43.2388111Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_float16 PASSED [0.0639s] [ 50%] 2025-12-04T12:25:43.2388256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_add_cuda_int8 PASSED [0.0408s] [ 50%] 2025-12-04T12:25:43.2388401Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_complex128 PASSED [0.1352s] [ 50%] 2025-12-04T12:25:43.2388544Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_complex64 PASSED [0.1345s] [ 50%] 2025-12-04T12:25:43.2388686Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_float16 PASSED [0.1019s] [ 50%] 2025-12-04T12:25:43.2388825Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_int16 XFAIL [0.0052s] [ 50%] 2025-12-04T12:25:43.2388962Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcdiv_cuda_int32 XFAIL [1.1127s] [ 50%] 2025-12-04T12:25:43.2389097Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_bool XFAIL [1.1151s] [ 50%] 2025-12-04T12:25:43.2389243Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_complex64 PASSED [1.2592s] [ 50%] 2025-12-04T12:25:43.2389380Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_addcmul_cuda_int8 PASSED [0.0709s] [ 50%] 2025-12-04T12:25:43.2389519Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_complex128 PASSED [0.0090s] [ 50%] 2025-12-04T12:25:43.2389657Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_complex64 PASSED [0.0088s] [ 50%] 2025-12-04T12:25:43.2389819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_float16 PASSED [0.0158s] [ 50%] 2025-12-04T12:25:43.2389972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_float64 PASSED [0.0087s] [ 50%] 2025-12-04T12:25:43.2390104Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_int64 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2390237Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_asin_cuda_uint8 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2390390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_bfloat16 PASSED [0.0157s] [ 50%] 2025-12-04T12:25:43.2390525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_float16 PASSED [0.0157s] [ 50%] 2025-12-04T12:25:43.2390658Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_float32 PASSED [0.0087s] [ 50%] 2025-12-04T12:25:43.2390793Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_int32 PASSED [0.0126s] [ 50%] 2025-12-04T12:25:43.2390925Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_atan_cuda_int8 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2391057Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_bool XFAIL [0.0037s] [ 50%] 2025-12-04T12:25:43.2391195Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_complex128 XFAIL [1.1278s] [ 50%] 2025-12-04T12:25:43.2391330Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_int16 PASSED [1.1055s] [ 50%] 2025-12-04T12:25:43.2391462Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_int32 PASSED [0.0096s] [ 50%] 2025-12-04T12:25:43.2391591Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_ceil_cuda_uint8 PASSED [0.0091s] [ 50%] 2025-12-04T12:25:43.2391737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_bfloat16 PASSED [0.1728s] [ 50%] 2025-12-04T12:25:43.2391881Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_complex128 XFAIL [0.0047s] [ 50%] 2025-12-04T12:25:43.2392027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_float16 PASSED [1.2821s] [ 50%] 2025-12-04T12:25:43.2392178Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_int16 PASSED [0.0823s] [ 50%] 2025-12-04T12:25:43.2392317Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_int64 PASSED [0.0820s] [ 50%] 2025-12-04T12:25:43.2392453Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_max_cuda_int8 PASSED [0.0817s] [ 50%] 2025-12-04T12:25:43.2392600Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_complex64 XFAIL [0.0046s] [ 50%] 2025-12-04T12:25:43.2392742Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_float16 PASSED [1.2783s] [ 50%] 2025-12-04T12:25:43.2392883Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_clamp_min_cuda_int64 PASSED [0.0824s] [ 50%] 2025-12-04T12:25:43.2393015Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_bool PASSED [0.0095s] [ 50%] 2025-12-04T12:25:43.2393155Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_complex64 PASSED [0.0095s] [ 50%] 2025-12-04T12:25:43.2393369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_float32 PASSED [0.0093s] [ 50%] 2025-12-04T12:25:43.2393502Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_int32 PASSED [0.0093s] [ 50%] 2025-12-04T12:25:43.2393633Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_int64 PASSED [0.0093s] [ 50%] 2025-12-04T12:25:43.2393779Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_int8 PASSED [0.0092s] [ 50%] 2025-12-04T12:25:43.2393924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_copy_cuda_uint8 PASSED [0.0092s] [ 50%] 2025-12-04T12:25:43.2394061Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_complex64 PASSED [0.0088s] [ 50%] 2025-12-04T12:25:43.2394196Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_float64 PASSED [0.0087s] [ 50%] 2025-12-04T12:25:43.2394338Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_int16 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2394469Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cos_cuda_int32 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2394606Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_complex64 PASSED [0.0088s] [ 50%] 2025-12-04T12:25:43.2394741Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_float16 PASSED [0.0156s] [ 50%] 2025-12-04T12:25:43.2394873Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_int64 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2395006Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_cosh_cuda_uint8 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2395142Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_bfloat16 PASSED [0.1156s] [ 50%] 2025-12-04T12:25:43.2395280Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_complex128 PASSED [0.0758s] [ 50%] 2025-12-04T12:25:43.2395409Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_div_cuda_int8 PASSED [0.0714s] [ 50%] 2025-12-04T12:25:43.2395540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_bool PASSED [0.0126s] [ 50%] 2025-12-04T12:25:43.2395676Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_complex128 XFAIL [0.0037s] [ 50%] 2025-12-04T12:25:43.2395809Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_int32 PASSED [1.1269s] [ 50%] 2025-12-04T12:25:43.2395940Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_int64 PASSED [0.0135s] [ 50%] 2025-12-04T12:25:43.2396085Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_int8 PASSED [0.0131s] [ 50%] 2025-12-04T12:25:43.2396217Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erf_cuda_uint8 PASSED [0.0130s] [ 50%] 2025-12-04T12:25:43.2396353Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_bfloat16 PASSED [0.0159s] [ 50%] 2025-12-04T12:25:43.2396492Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_complex128 XFAIL [0.0038s] [ 50%] 2025-12-04T12:25:43.2396628Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_complex64 XFAIL [1.1226s] [ 50%] 2025-12-04T12:25:43.2396764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_float32 PASSED [1.1256s] [ 50%] 2025-12-04T12:25:43.2396897Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_int16 PASSED [0.0139s] [ 50%] 2025-12-04T12:25:43.2397032Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_int32 PASSED [0.0131s] [ 50%] 2025-12-04T12:25:43.2397162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_erfc_cuda_int64 PASSED [0.0129s] [ 50%] 2025-12-04T12:25:43.2397293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_bool PASSED [0.0128s] [ 50%] 2025-12-04T12:25:43.2397424Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_int16 PASSED [0.0128s] [ 50%] 2025-12-04T12:25:43.2397560Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_int32 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2397701Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_exp_cuda_int8 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2397850Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_expm1_cuda_complex64 PASSED [0.0089s] [ 50%] 2025-12-04T12:25:43.2397989Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_expm1_cuda_float64 PASSED [0.0087s] [ 50%] 2025-12-04T12:25:43.2398158Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_bool XFAIL [0.0036s] [ 50%] 2025-12-04T12:25:43.2398319Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_complex128 XFAIL [1.1219s] [ 50%] 2025-12-04T12:25:43.2398455Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_float32 PASSED [1.1298s] [ 50%] 2025-12-04T12:25:43.2398591Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_floor_cuda_uint8 PASSED [0.0094s] [ 50%] 2025-12-04T12:25:43.2398720Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_frac_cuda_bool XFAIL [0.0038s] [ 50%] 2025-12-04T12:25:43.2398857Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_frac_cuda_complex64 XFAIL [1.0953s] [ 50%] 2025-12-04T12:25:43.2398993Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_frac_cuda_float64 PASSED [1.1158s] [ 50%] 2025-12-04T12:25:43.2399122Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_frac_cuda_int8 XFAIL [0.0042s] [ 50%] 2025-12-04T12:25:43.2399261Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_complex64 PASSED [1.1897s] [ 50%] 2025-12-04T12:25:43.2399392Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_int32 XFAIL [0.0058s] [ 50%] 2025-12-04T12:25:43.2399522Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lerp_cuda_int64 XFAIL [1.0985s] [ 50%] 2025-12-04T12:25:43.2399663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lgamma_cuda_float16 PASSED [1.1096s] [ 50%] 2025-12-04T12:25:43.2399801Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lgamma_cuda_int32 PASSED [0.0133s] [ 50%] 2025-12-04T12:25:43.2399938Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_lgamma_cuda_int64 PASSED [0.0130s] [ 50%] 2025-12-04T12:25:43.2400094Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log10_cuda_complex128 PASSED [0.0090s] [ 50%] 2025-12-04T12:25:43.2400231Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log10_cuda_float16 PASSED [0.0157s] [ 50%] 2025-12-04T12:25:43.2400365Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log10_cuda_int16 PASSED [0.0128s] [ 50%] 2025-12-04T12:25:43.2400498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_bool PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2400642Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_complex128 PASSED [0.0088s] [ 50%] 2025-12-04T12:25:43.2400778Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_float16 PASSED [0.0155s] [ 50%] 2025-12-04T12:25:43.2400915Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_float32 PASSED [0.0086s] [ 50%] 2025-12-04T12:25:43.2401051Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_float64 PASSED [0.0086s] [ 50%] 2025-12-04T12:25:43.2401188Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log1p_cuda_int32 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2401324Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_float32 PASSED [0.0087s] [ 50%] 2025-12-04T12:25:43.2401463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_float64 PASSED [0.0085s] [ 50%] 2025-12-04T12:25:43.2401615Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_int16 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2401762Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log2_cuda_uint8 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2401892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_log_cuda_int16 PASSED [0.0127s] [ 50%] 2025-12-04T12:25:43.2402029Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_bfloat16 PASSED [0.0053s] [ 50%] 2025-12-04T12:25:43.2402169Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_bool PASSED [0.0052s] [ 50%] 2025-12-04T12:25:43.2402308Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_complex128 XFAIL [0.0036s] [ 50%] 2025-12-04T12:25:43.2402442Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_float64 PASSED [1.1101s] [ 50%] 2025-12-04T12:25:43.2402576Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_int64 PASSED [1.1036s] [ 50%] 2025-12-04T12:25:43.2402706Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_int8 PASSED [0.0074s] [ 50%] 2025-12-04T12:25:43.2402838Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_max_cuda_uint8 PASSED [0.0058s] [ 50%] 2025-12-04T12:25:43.2402983Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_complex128 XFAIL [0.0048s] [ 50%] 2025-12-04T12:25:43.2403123Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_float64 PASSED [1.2237s] [ 50%] 2025-12-04T12:25:43.2403259Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_int32 PASSED [0.0819s] [ 50%] 2025-12-04T12:25:43.2403394Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_int64 PASSED [0.0818s] [ 50%] 2025-12-04T12:25:43.2403533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_int8 PASSED [0.0815s] [ 50%] 2025-12-04T12:25:43.2403669Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_maximum_cuda_uint8 PASSED [0.0814s] [ 50%] 2025-12-04T12:25:43.2403813Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_complex128 XFAIL [0.0046s] [ 50%] 2025-12-04T12:25:43.2403963Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_float16 PASSED [1.2855s] [ 50%] 2025-12-04T12:25:43.2404100Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_int64 PASSED [0.0821s] [ 50%] 2025-12-04T12:25:43.2404236Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_minimum_cuda_uint8 PASSED [0.0818s] [ 50%] 2025-12-04T12:25:43.2404374Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_bfloat16 PASSED [0.1156s] [ 50%] 2025-12-04T12:25:43.2404505Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_bool PASSED [0.0537s] [ 50%] 2025-12-04T12:25:43.2404643Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_complex64 PASSED [0.0752s] [ 50%] 2025-12-04T12:25:43.2404774Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_mul_cuda_uint8 PASSED [0.0452s] [ 50%] 2025-12-04T12:25:43.2404916Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_complex128 PASSED [0.0089s] [ 50%] 2025-12-04T12:25:43.2405051Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_float32 PASSED [0.0086s] [ 50%] 2025-12-04T12:25:43.2405185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_float64 PASSED [0.0087s] [ 50%] 2025-12-04T12:25:43.2405315Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_int16 PASSED [0.0085s] [ 50%] 2025-12-04T12:25:43.2405446Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_int32 PASSED [0.0086s] [ 50%] 2025-12-04T12:25:43.2405591Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_neg_cuda_int8 PASSED [0.0085s] [ 50%] 2025-12-04T12:25:43.2405732Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_bool XFAIL [0.0036s] [ 50%] 2025-12-04T12:25:43.2405872Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_complex128 PASSED [1.3241s] [ 50%] 2025-12-04T12:25:43.2406016Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_float32 PASSED [0.1138s] [ 50%] 2025-12-04T12:25:43.2406151Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_float64 PASSED [0.0931s] [ 51%] 2025-12-04T12:25:43.2406280Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_norm_cuda_int32 XFAIL [0.0037s] [ 51%] 2025-12-04T12:25:43.2406410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_bool XFAIL [1.1121s] [ 51%] 2025-12-04T12:25:43.2406547Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_complex64 PASSED [0.0778s] [ 51%] 2025-12-04T12:25:43.2406683Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_pow_cuda_int64 PASSED [0.0434s] [ 51%] 2025-12-04T12:25:43.2406831Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_bfloat16 PASSED [0.0157s] [ 51%] 2025-12-04T12:25:43.2406983Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_complex128 PASSED [0.0088s] [ 51%] 2025-12-04T12:25:43.2407132Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_complex64 PASSED [0.0087s] [ 51%] 2025-12-04T12:25:43.2407272Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_reciprocal_cuda_int32 PASSED [0.0126s] [ 51%] 2025-12-04T12:25:43.2407410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_bfloat16 PASSED [0.0086s] [ 51%] 2025-12-04T12:25:43.2407541Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_bool XFAIL [0.0036s] [ 51%] 2025-12-04T12:25:43.2407682Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_complex128 XFAIL [1.1195s] [ 51%] 2025-12-04T12:25:43.2407827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_float16 PASSED [1.1207s] [ 51%] 2025-12-04T12:25:43.2407963Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_float64 PASSED [1.1058s] [ 51%] 2025-12-04T12:25:43.2408123Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_int16 PASSED [0.0111s] [ 51%] 2025-12-04T12:25:43.2408258Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_round_cuda_int8 PASSED [0.0091s] [ 51%] 2025-12-04T12:25:43.2408390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_bool PASSED [0.0133s] [ 51%] 2025-12-04T12:25:43.2408533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_complex128 PASSED [0.0089s] [ 51%] 2025-12-04T12:25:43.2408671Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_complex64 PASSED [0.0088s] [ 51%] 2025-12-04T12:25:43.2408814Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_float64 PASSED [0.0087s] [ 51%] 2025-12-04T12:25:43.2408948Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_rsqrt_cuda_int8 PASSED [0.0127s] [ 51%] 2025-12-04T12:25:43.2409084Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sigmoid_cuda_bool PASSED [0.0080s] [ 51%] 2025-12-04T12:25:43.2409220Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sigmoid_cuda_int8 PASSED [0.0079s] [ 51%] 2025-12-04T12:25:43.2409356Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sigmoid_cuda_uint8 PASSED [0.0080s] [ 51%] 2025-12-04T12:25:43.2409508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_complex128 XFAIL [0.0036s] [ 51%] 2025-12-04T12:25:43.2409655Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_float16 PASSED [1.1371s] [ 51%] 2025-12-04T12:25:43.2409791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sign_cuda_int64 PASSED [0.0093s] [ 51%] 2025-12-04T12:25:43.2409939Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_complex64 PASSED [0.0090s] [ 51%] 2025-12-04T12:25:43.2410073Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_float16 PASSED [0.0158s] [ 51%] 2025-12-04T12:25:43.2410203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_int32 PASSED [0.0128s] [ 51%] 2025-12-04T12:25:43.2410335Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sin_cuda_uint8 PASSED [0.0129s] [ 51%] 2025-12-04T12:25:43.2410549Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_bool PASSED [0.0128s] [ 51%] 2025-12-04T12:25:43.2410691Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_complex128 PASSED [0.0089s] [ 51%] 2025-12-04T12:25:43.2410830Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_complex64 PASSED [0.0088s] [ 51%] 2025-12-04T12:25:43.2410962Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_int16 PASSED [0.0127s] [ 51%] 2025-12-04T12:25:43.2411095Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sinh_cuda_uint8 PASSED [0.0127s] [ 51%] 2025-12-04T12:25:43.2411232Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_complex64 PASSED [0.0088s] [ 51%] 2025-12-04T12:25:43.2411368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_float16 PASSED [0.0155s] [ 51%] 2025-12-04T12:25:43.2411504Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sqrt_cuda_float64 PASSED [0.0087s] [ 51%] 2025-12-04T12:25:43.2411639Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sub_cuda_bfloat16 XFAIL [0.0151s] [ 51%] 2025-12-04T12:25:43.2411785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sub_cuda_int32 XFAIL [1.1438s] [ 51%] 2025-12-04T12:25:43.2411916Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_sub_cuda_int64 XFAIL [1.1233s] [ 51%] 2025-12-04T12:25:43.2412047Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_bool PASSED [1.1186s] [ 51%] 2025-12-04T12:25:43.2412185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_complex64 PASSED [0.0097s] [ 51%] 2025-12-04T12:25:43.2412319Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_float32 PASSED [0.0089s] [ 51%] 2025-12-04T12:25:43.2412454Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_float64 PASSED [0.0088s] [ 51%] 2025-12-04T12:25:43.2412584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tan_cuda_uint8 PASSED [0.0129s] [ 51%] 2025-12-04T12:25:43.2412717Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_bool PASSED [0.0128s] [ 51%] 2025-12-04T12:25:43.2412850Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_float16 PASSED [0.0158s] [ 51%] 2025-12-04T12:25:43.2412986Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_float64 PASSED [0.0088s] [ 51%] 2025-12-04T12:25:43.2413118Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_int64 PASSED [0.0128s] [ 51%] 2025-12-04T12:25:43.2413251Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_tanh_cuda_int8 PASSED [0.0128s] [ 51%] 2025-12-04T12:25:43.2413397Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_bool XFAIL [0.0036s] [ 51%] 2025-12-04T12:25:43.2413547Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_complex64 XFAIL [1.1066s] [ 51%] 2025-12-04T12:25:43.2413683Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_float16 PASSED [1.1314s] [ 51%] 2025-12-04T12:25:43.2413817Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_trunc_cuda_int8 PASSED [0.0095s] [ 51%] 2025-12-04T12:25:43.2413958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_int32 PASSED [0.0061s] [ 51%] 2025-12-04T12:25:43.2414091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__foreach_zero_cuda_int64 PASSED [0.0061s] [ 51%] 2025-12-04T12:25:43.2414242Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__native_batch_norm_legit_cuda_bfloat16 PASSED [0.0210s] [ 51%] 2025-12-04T12:25:43.2414392Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__native_batch_norm_legit_cuda_float16 PASSED [0.0203s] [ 51%] 2025-12-04T12:25:43.2414543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__native_batch_norm_legit_cuda_float64 PASSED [0.0162s] [ 51%] 2025-12-04T12:25:43.2414693Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__segment_reduce_lengths_cuda_bfloat16 PASSED [0.0744s] [ 51%] 2025-12-04T12:25:43.2414840Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__segment_reduce_lengths_cuda_float32 PASSED [0.0739s] [ 51%] 2025-12-04T12:25:43.2414988Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__segment_reduce_offsets_cuda_bfloat16 PASSED [0.0649s] [ 51%] 2025-12-04T12:25:43.2415134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_bfloat16 PASSED [0.0149s] [ 51%] 2025-12-04T12:25:43.2415283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_complex128 PASSED [0.0145s] [ 51%] 2025-12-04T12:25:43.2415433Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_complex64 PASSED [0.0144s] [ 51%] 2025-12-04T12:25:43.2415580Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_float32 PASSED [0.0144s] [ 51%] 2025-12-04T12:25:43.2415744Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_float64 PASSED [0.0144s] [ 51%] 2025-12-04T12:25:43.2415886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_int16 PASSED [0.0144s] [ 51%] 2025-12-04T12:25:43.2416026Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_cuda_int32 PASSED [0.0144s] [ 51%] 2025-12-04T12:25:43.2416185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_bool PASSED [0.0149s] [ 51%] 2025-12-04T12:25:43.2416352Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_complex128 PASSED [1.1222s] [ 51%] 2025-12-04T12:25:43.2416512Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int8 PASSED [0.0185s] [ 51%] 2025-12-04T12:25:43.2416661Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__upsample_bilinear2d_aa_cuda_float16 PASSED [0.0067s] [ 51%] 2025-12-04T12:25:43.2416814Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace__upsample_bilinear2d_aa_cuda_float32 PASSED [0.0060s] [ 51%] 2025-12-04T12:25:43.2416941Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_complex32 PASSED [0.0050s] [ 51%] 2025-12-04T12:25:43.2417065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_float32 PASSED [1.1000s] [ 51%] 2025-12-04T12:25:43.2417186Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_int32 PASSED [0.0050s] [ 51%] 2025-12-04T12:25:43.2417314Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_abs_cuda_int8 PASSED [1.0942s] [ 51%] 2025-12-04T12:25:43.2417449Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_bfloat16 PASSED [0.0068s] [ 51%] 2025-12-04T12:25:43.2417574Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_float64 PASSED [0.0044s] [ 51%] 2025-12-04T12:25:43.2417695Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_int64 PASSED [1.1247s] [ 51%] 2025-12-04T12:25:43.2417830Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acos_cuda_uint8 PASSED [0.0067s] [ 51%] 2025-12-04T12:25:43.2417958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_complex128 PASSED [0.0046s] [ 51%] 2025-12-04T12:25:43.2418139Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_complex32 PASSED [1.1314s] [ 51%] 2025-12-04T12:25:43.2418262Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_float16 PASSED [0.0065s] [ 51%] 2025-12-04T12:25:43.2418388Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_acosh_cuda_uint8 PASSED [0.0045s] [ 51%] 2025-12-04T12:25:43.2418508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_bool PASSED [0.0117s] [ 51%] 2025-12-04T12:25:43.2418632Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_complex32 PASSED [0.0162s] [ 51%] 2025-12-04T12:25:43.2418753Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_add_cuda_float64 PASSED [0.0114s] [ 51%] 2025-12-04T12:25:43.2418881Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addbmm_cuda_bfloat16 PASSED [1.1237s] [ 51%] 2025-12-04T12:25:43.2419012Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addbmm_cuda_complex64 PASSED [0.0165s] [ 51%] 2025-12-04T12:25:43.2419136Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addbmm_cuda_float16 PASSED [0.0086s] [ 51%] 2025-12-04T12:25:43.2419265Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcdiv_cuda_bfloat16 PASSED [0.0148s] [ 51%] 2025-12-04T12:25:43.2419396Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcdiv_cuda_complex64 PASSED [0.0144s] [ 51%] 2025-12-04T12:25:43.2419527Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_complex128 PASSED [0.0144s] [ 51%] 2025-12-04T12:25:43.2419671Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_complex64 PASSED [0.0144s] [ 51%] 2025-12-04T12:25:43.2419801Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_float32 PASSED [0.0141s] [ 51%] 2025-12-04T12:25:43.2419927Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_float64 PASSED [0.0141s] [ 51%] 2025-12-04T12:25:43.2420053Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_int32 PASSED [0.0146s] [ 51%] 2025-12-04T12:25:43.2420175Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addcmul_cuda_int8 PASSED [0.0141s] [ 51%] 2025-12-04T12:25:43.2420300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_cuda_float16 PASSED [0.0135s] [ 51%] 2025-12-04T12:25:43.2420422Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_cuda_float64 PASSED [0.0101s] [ 51%] 2025-12-04T12:25:43.2420565Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addmm_decomposed_cuda_float32 PASSED [0.0101s] [ 51%] 2025-12-04T12:25:43.2420685Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_bool PASSED [0.0074s] [ 51%] 2025-12-04T12:25:43.2420812Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_complex64 PASSED [0.0082s] [ 51%] 2025-12-04T12:25:43.2420936Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_float16 PASSED [1.1382s] [ 51%] 2025-12-04T12:25:43.2421058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_float32 PASSED [0.0135s] [ 51%] 2025-12-04T12:25:43.2421194Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_int64 PASSED [0.0085s] [ 51%] 2025-12-04T12:25:43.2421326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_addr_cuda_int8 PASSED [0.0082s] [ 51%] 2025-12-04T12:25:43.2421461Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_complex32 PASSED [1.1156s] [ 51%] 2025-12-04T12:25:43.2421594Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_float64 PASSED [0.0055s] [ 51%] 2025-12-04T12:25:43.2421737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_int16 PASSED [0.0039s] [ 51%] 2025-12-04T12:25:43.2421864Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_int8 PASSED [1.1115s] [ 51%] 2025-12-04T12:25:43.2421993Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_alias_copy_cuda_uint8 PASSED [0.0055s] [ 51%] 2025-12-04T12:25:43.2422111Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_cuda_bool PASSED [0.0250s] [ 51%] 2025-12-04T12:25:43.2422260Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___radd___cuda_float32 PASSED [0.0495s] [ 51%] 2025-12-04T12:25:43.2422404Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rand___cuda_int64 PASSED [0.0480s] [ 51%] 2025-12-04T12:25:43.2422549Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rdiv___cuda_float32 PASSED [0.0730s] [ 51%] 2025-12-04T12:25:43.2422696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rmatmul___cuda_float32 PASSED [0.2043s] [ 51%] 2025-12-04T12:25:43.2422836Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___ror___cuda_int64 PASSED [0.0482s] [ 51%] 2025-12-04T12:25:43.2422976Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides___rpow___cuda_float32 PASSED [0.0490s] [ 51%] 2025-12-04T12:25:43.2423128Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_acos_cuda_float32 PASSED [0.0107s] [ 51%] 2025-12-04T12:25:43.2423279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_asin_cuda_float32 PASSED [0.0097s] [ 51%] 2025-12-04T12:25:43.2423436Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_clamp_max_cuda_float32 PASSED [0.1197s] [ 51%] 2025-12-04T12:25:43.2423598Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_cosh_cuda_float32 PASSED [0.0089s] [ 51%] 2025-12-04T12:25:43.2423747Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_erf_cuda_float32 PASSED [0.0088s] [ 51%] 2025-12-04T12:25:43.2423898Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_expm1_cuda_float32 PASSED [0.0087s] [ 51%] 2025-12-04T12:25:43.2424048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_floor_cuda_float32 PASSED [0.0087s] [ 51%] 2025-12-04T12:25:43.2424199Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_lerp_cuda_float32 PASSED [0.0641s] [ 51%] 2025-12-04T12:25:43.2424355Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_lgamma_cuda_float32 PASSED [0.0086s] [ 51%] 2025-12-04T12:25:43.2424508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_log2_cuda_float32 PASSED [0.0087s] [ 51%] 2025-12-04T12:25:43.2424668Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_reciprocal_cuda_float32 PASSED [0.0087s] [ 51%] 2025-12-04T12:25:43.2424818Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_sinh_cuda_float32 PASSED [0.0087s] [ 51%] 2025-12-04T12:25:43.2424966Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_tanh_cuda_float32 PASSED [0.0086s] [ 51%] 2025-12-04T12:25:43.2425127Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_trunc_cuda_float32 PASSED [0.0087s] [ 51%] 2025-12-04T12:25:43.2425289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__foreach_zero_cuda_float32 PASSED [0.0059s] [ 52%] 2025-12-04T12:25:43.2425451Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__native_batch_norm_legit_cuda_float32 PASSED [0.6242s] [ 52%] 2025-12-04T12:25:43.2425626Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__softmax_backward_data_cuda_float32 PASSED [0.0300s] [ 52%] 2025-12-04T12:25:43.2425787Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides__unsafe_masked_index_cuda_float32 PASSED [0.1028s] [ 52%] 2025-12-04T12:25:43.2425926Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_abs_cuda_float32 PASSED [0.0040s] [ 52%] 2025-12-04T12:25:43.2426065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addmm_cuda_float32 PASSED [0.1624s] [ 52%] 2025-12-04T12:25:43.2426222Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addmm_decomposed_cuda_float32 PASSED [0.1598s] [ 52%] 2025-12-04T12:25:43.2426361Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_addmv_cuda_float32 PASSED [0.0857s] [ 52%] 2025-12-04T12:25:43.2426511Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_alias_copy_cuda_float32 PASSED [0.0039s] [ 52%] 2025-12-04T12:25:43.2426650Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_all_cuda_float32 PASSED [0.0638s] [ 52%] 2025-12-04T12:25:43.2426793Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_allclose_cuda_float32 PASSED [0.1723s] [ 52%] 2025-12-04T12:25:43.2426933Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_angle_cuda_float32 PASSED [1.1332s] [ 52%] 2025-12-04T12:25:43.2427073Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_any_cuda_float32 PASSED [0.0588s] [ 52%] 2025-12-04T12:25:43.2427215Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_argmax_cuda_float32 PASSED [0.0176s] [ 52%] 2025-12-04T12:25:43.2427352Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_asin_cuda_float32 PASSED [0.0041s] [ 52%] 2025-12-04T12:25:43.2427514Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_atleast_1d_cuda_float32 PASSED [1.1123s] [ 52%] 2025-12-04T12:25:43.2427659Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_atleast_3d_cuda_float32 PASSED [0.0078s] [ 52%] 2025-12-04T12:25:43.2427801Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_baddbmm_cuda_float32 PASSED [0.0912s] [ 52%] 2025-12-04T12:25:43.2427942Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bfloat16_cuda_float32 PASSED [1.1259s] [ 52%] 2025-12-04T12:25:43.2428129Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bitwise_and_cuda_int64 PASSED [0.0499s] [ 52%] 2025-12-04T12:25:43.2428285Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bitwise_left_shift_cuda_int64 PASSED [0.0474s] [ 52%] 2025-12-04T12:25:43.2428432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_bitwise_xor_cuda_int64 PASSED [0.0470s] [ 52%] 2025-12-04T12:25:43.2428578Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_block_diag_cuda_float32 PASSED [0.0923s] [ 52%] 2025-12-04T12:25:43.2428729Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_broadcast_to_cuda_float32 PASSED [0.0069s] [ 52%] 2025-12-04T12:25:43.2428868Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_byte_cuda_float32 PASSED [1.1201s] [ 52%] 2025-12-04T12:25:43.2429022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cartesian_prod_cuda_float32 PASSED [0.0452s] [ 52%] 2025-12-04T12:25:43.2429178Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cauchy_cuda_float32 PASSED [0.0096s] [ 52%] 2025-12-04T12:25:43.2429337Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_clamp_max_cuda_float32 PASSED [0.0770s] [ 52%] 2025-12-04T12:25:43.2429484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_clamp_min_cuda_float32 PASSED [0.0761s] [ 52%] 2025-12-04T12:25:43.2429649Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_clone_cuda_float32 PASSED [0.0051s] [ 52%] 2025-12-04T12:25:43.2429788Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_conj_cuda_float32 PASSED [1.0934s] [ 52%] 2025-12-04T12:25:43.2429924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cos_cuda_float32 PASSED [0.0083s] [ 52%] 2025-12-04T12:25:43.2430085Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_cumulative_trapezoid_cuda_float32 PASSED [0.2334s] [ 52%] 2025-12-04T12:25:43.2430226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_deg2rad_cuda_float32 PASSED [0.0045s] [ 52%] 2025-12-04T12:25:43.2430370Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_diagflat_cuda_float32 PASSED [0.0207s] [ 52%] 2025-12-04T12:25:43.2430508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_dot_cuda_float32 PASSED [0.0056s] [ 52%] 2025-12-04T12:25:43.2430649Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_double_cuda_float32 PASSED [1.1126s] [ 52%] 2025-12-04T12:25:43.2430784Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_eq_cuda_float32 PASSED [0.0569s] [ 52%] 2025-12-04T12:25:43.2430923Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_equal_cuda_float32 PASSED [0.0119s] [ 52%] 2025-12-04T12:25:43.2431063Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_erfinv_cuda_float32 PASSED [0.1871s] [ 52%] 2025-12-04T12:25:43.2431212Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_expand_as_cuda_float32 PASSED [0.0063s] [ 52%] 2025-12-04T12:25:43.2431353Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_fft2_cuda_float32 PASSED [0.3180s] [ 52%] 2025-12-04T12:25:43.2431511Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_fftn_cuda_float32 PASSED [0.0302s] [ 52%] 2025-12-04T12:25:43.2431661Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_fftshift_cuda_float32 PASSED [0.0174s] [ 52%] 2025-12-04T12:25:43.2431804Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_hfft2_cuda_float32 PASSED [0.0572s] [ 52%] 2025-12-04T12:25:43.2431948Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_hfftn_cuda_float32 PASSED [0.0471s] [ 52%] 2025-12-04T12:25:43.2432090Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ifft_cuda_float32 PASSED [0.1112s] [ 52%] 2025-12-04T12:25:43.2432241Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ifftshift_cuda_float32 PASSED [0.0176s] [ 52%] 2025-12-04T12:25:43.2432386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fft_ihfft2_cuda_float32 PASSED [0.1082s] [ 52%] 2025-12-04T12:25:43.2432525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fill_cuda_float32 PASSED [0.0060s] [ 52%] 2025-12-04T12:25:43.2432674Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_floor_divide_cuda_float32 PASSED [0.2233s] [ 52%] 2025-12-04T12:25:43.2432814Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fmax_cuda_float32 PASSED [0.0469s] [ 52%] 2025-12-04T12:25:43.2432953Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_fmin_cuda_float32 PASSED [0.0468s] [ 52%] 2025-12-04T12:25:43.2433111Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_geometric_cuda_float32 PASSED [0.0085s] [ 52%] 2025-12-04T12:25:43.2433270Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_hash_tensor_cuda_float32 PASSED [0.0254s] [ 52%] 2025-12-04T12:25:43.2433415Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_hsplit_cuda_float32 PASSED [1.0752s] [ 52%] 2025-12-04T12:25:43.2433580Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_index_reduce_prod_cuda_float32 PASSED [0.0667s] [ 52%] 2025-12-04T12:25:43.2433728Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_index_select_cuda_float32 PASSED [1.0763s] [ 52%] 2025-12-04T12:25:43.2433870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_inner_cuda_float32 PASSED [0.0183s] [ 52%] 2025-12-04T12:25:43.2434009Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isclose_cuda_float32 PASSED [0.7887s] [ 52%] 2025-12-04T12:25:43.2434152Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isfinite_cuda_float32 PASSED [0.0154s] [ 52%] 2025-12-04T12:25:43.2434293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isnan_cuda_float32 PASSED [0.0041s] [ 52%] 2025-12-04T12:25:43.2434438Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isneginf_cuda_float32 PASSED [0.0043s] [ 52%] 2025-12-04T12:25:43.2434581Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isposinf_cuda_float32 PASSED [0.0043s] [ 52%] 2025-12-04T12:25:43.2434722Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_isreal_cuda_float32 PASSED [0.0056s] [ 52%] 2025-12-04T12:25:43.2434889Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_jiterator_2inputs_2outputs_cuda_float32 PASSED [0.0162s] [ 52%] 2025-12-04T12:25:43.2435068Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_jiterator_4inputs_with_extra_args_cuda_float32 PASSED [0.0482s] [ 52%] 2025-12-04T12:25:43.2435205Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_kron_cuda_float32 XFAIL [0.0055s] [ 52%] 2025-12-04T12:25:43.2435356Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_lcm_cuda_int64 PASSED [1.1391s] [ 52%] 2025-12-04T12:25:43.2435497Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ldexp_cuda_float32 PASSED [0.0715s] [ 52%] 2025-12-04T12:25:43.2435645Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_eig_cuda_float32 PASSED [0.2330s] [ 52%] 2025-12-04T12:25:43.2435805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_ldl_factor_ex_cuda_float32 PASSED [0.0075s] [ 52%] 2025-12-04T12:25:43.2435952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_lstsq_cuda_float32 PASSED [1.4862s] [ 52%] 2025-12-04T12:25:43.2436107Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_lu_factor_cuda_float32 PASSED [0.0831s] [ 52%] 2025-12-04T12:25:43.2436277Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_matrix_rank_hermitian_cuda_float32 PASSED [0.0603s] [ 52%] 2025-12-04T12:25:43.2436428Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_norm_cuda_float32 PASSED [0.4019s] [ 52%] 2025-12-04T12:25:43.2436604Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_norm_subgradients_at_zero_cuda_float32 PASSED [1.8436s] [ 52%] 2025-12-04T12:25:43.2436845Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_pinv_singular_cuda_float32 SKIPPED [0.0012s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 52%] 2025-12-04T12:25:43.2436991Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_qr_cuda_float32 PASSED [0.0509s] [ 52%] 2025-12-04T12:25:43.2437154Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_solve_cuda_float32 PASSED [0.4355s] [ 52%] 2025-12-04T12:25:43.2437320Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linalg_vector_norm_cuda_float32 PASSED [0.4090s] [ 52%] 2025-12-04T12:25:43.2437464Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_linspace_cuda_float32 PASSED [0.0223s] [ 52%] 2025-12-04T12:25:43.2437621Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_log_softmax_cuda_float32 PASSED [0.0239s] [ 52%] 2025-12-04T12:25:43.2437785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_log_softmax_with_dtype_cuda_float32 PASSED [0.0267s] [ 52%] 2025-12-04T12:25:43.2437931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_logaddexp_cuda_float32 PASSED [0.1516s] [ 52%] 2025-12-04T12:25:43.2438120Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_logcumsumexp_cuda_float32 PASSED [0.0094s] [ 52%] 2025-12-04T12:25:43.2438262Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_logit_cuda_float32 PASSED [0.0133s] [ 52%] 2025-12-04T12:25:43.2438406Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_logspace_cuda_float32 PASSED [0.1311s] [ 52%] 2025-12-04T12:25:43.2438547Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_long_cuda_float32 PASSED [1.6135s] [ 52%] 2025-12-04T12:25:43.2438683Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_lt_cuda_float32 PASSED [0.0498s] [ 52%] 2025-12-04T12:25:43.2438824Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_lu_cuda_float32 PASSED [0.0554s] [ 52%] 2025-12-04T12:25:43.2438966Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_lu_solve_cuda_float32 PASSED [0.4205s] [ 52%] 2025-12-04T12:25:43.2439103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mT_cuda_float32 PASSED [0.0065s] [ 52%] 2025-12-04T12:25:43.2439252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_amin_cuda_float32 PASSED [0.5164s] [ 52%] 2025-12-04T12:25:43.2439418Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_argmax_cuda_float32 PASSED [0.2889s] [ 52%] 2025-12-04T12:25:43.2439569Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_argmin_cuda_float32 PASSED [0.2888s] [ 52%] 2025-12-04T12:25:43.2439719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_cumsum_cuda_float32 PASSED [0.1265s] [ 52%] 2025-12-04T12:25:43.2439874Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_logsumexp_cuda_float32 PASSED [0.8634s] [ 52%] 2025-12-04T12:25:43.2440024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_median_cuda_float32 PASSED [0.0731s] [ 52%] 2025-12-04T12:25:43.2440179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_normalize_cuda_float32 PASSED [0.2028s] [ 52%] 2025-12-04T12:25:43.2440327Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_prod_cuda_float32 PASSED [0.6228s] [ 52%] 2025-12-04T12:25:43.2440476Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_std_cuda_float32 PASSED [1.3564s] [ 52%] 2025-12-04T12:25:43.2440622Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_masked_sum_cuda_float32 PASSED [0.5330s] [ 52%] 2025-12-04T12:25:43.2440764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_matmul_cuda_float32 PASSED [0.2597s] [ 52%] 2025-12-04T12:25:43.2440908Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_max_binary_cuda_float32 PASSED [0.0476s] [ 52%] 2025-12-04T12:25:43.2441085Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_max_reduction_with_dim_cuda_float32 PASSED [0.0049s] [ 52%] 2025-12-04T12:25:43.2441237Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mean_cuda_float32 PASSED [0.0384s] [ 52%] 2025-12-04T12:25:43.2441382Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_minimum_cuda_float32 PASSED [0.0470s] [ 52%] 2025-12-04T12:25:43.2441518Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mm_cuda_float32 PASSED [0.0182s] [ 52%] 2025-12-04T12:25:43.2441671Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_mv_cuda_float32 PASSED [0.0078s] [ 52%] 2025-12-04T12:25:43.2441817Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nanmedian_cuda_float32 PASSED [0.0239s] [ 52%] 2025-12-04T12:25:43.2441956Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nansum_cuda_float32 PASSED [0.0464s] [ 52%] 2025-12-04T12:25:43.2442103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_narrow_copy_cuda_float32 XFAIL [0.0033s] [ 52%] 2025-12-04T12:25:43.2442263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_native_batch_norm_cuda_float32 PASSED [2.2610s] [ 52%] 2025-12-04T12:25:43.2442423Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_native_layer_norm_cuda_float32 PASSED [0.3408s] [ 52%] 2025-12-04T12:25:43.2442561Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_neg_cuda_float32 PASSED [0.0047s] [ 52%] 2025-12-04T12:25:43.2442709Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nextafter_cuda_float32 PASSED [0.0471s] [ 52%] 2025-12-04T12:25:43.2442886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_adaptive_avg_pool1d_cuda_float32 PASSED [0.0419s] [ 52%] 2025-12-04T12:25:43.2443067Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_adaptive_avg_pool2d_cuda_float32 PASSED [0.0731s] [ 52%] 2025-12-04T12:25:43.2443246Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_adaptive_max_pool1d_cuda_float32 PASSED [0.0291s] [ 52%] 2025-12-04T12:25:43.2443422Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_avg_pool2d_cuda_float32 PASSED [0.0156s] [ 52%] 2025-12-04T12:25:43.2443588Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_batch_norm_cuda_float32 PASSED [1.1038s] [ 52%] 2025-12-04T12:25:43.2443751Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_bilinear_cuda_float32 PASSED [3.2502s] [ 52%] 2025-12-04T12:25:43.2443926Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_binary_cross_entropy_cuda_float32 XFAIL [0.1078s] [ 52%] 2025-12-04T12:25:43.2444106Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_cosine_embedding_loss_cuda_float32 PASSED [0.6616s] [ 52%] 2025-12-04T12:25:43.2444268Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_ctc_loss_cuda_float32 PASSED [0.9057s] [ 52%] 2025-12-04T12:25:43.2444432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_dropout2d_cuda_float32 PASSED [0.0344s] [ 52%] 2025-12-04T12:25:43.2444590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_elu_cuda_float32 PASSED [0.0094s] [ 52%] 2025-12-04T12:25:43.2444770Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_fractional_max_pool2d_cuda_float32 PASSED [0.1488s] [ 52%] 2025-12-04T12:25:43.2444926Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_gelu_cuda_float32 PASSED [1.1389s] [ 52%] 2025-12-04T12:25:43.2445089Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_glu_cuda_float32 PASSED [0.1250s] [ 52%] 2025-12-04T12:25:43.2445268Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_group_norm_cuda_float32 PASSED [0.1763s] [ 52%] 2025-12-04T12:25:43.2445431Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_hardshrink_cuda_float32 PASSED [0.0149s] [ 52%] 2025-12-04T12:25:43.2445596Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_hardswish_cuda_float32 PASSED [0.0249s] [ 52%] 2025-12-04T12:25:43.2445783Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_hinge_embedding_loss_cuda_float32 PASSED [0.3414s] [ 52%] 2025-12-04T12:25:43.2445958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_instance_norm_cuda_float32 PASSED [0.4307s] [ 52%] 2025-12-04T12:25:43.2446139Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_interpolate_bilinear_cuda_float32 PASSED [0.7450s] [ 52%] 2025-12-04T12:25:43.2446298Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_kl_div_cuda_float32 PASSED [0.2647s] [ 53%] 2025-12-04T12:25:43.2446463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_leaky_relu_cuda_float32 PASSED [0.0275s] [ 53%] 2025-12-04T12:25:43.2446636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_max_unpool1d_grad_cuda_float32 PASSED [0.2704s] [ 53%] 2025-12-04T12:25:43.2446793Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_mish_cuda_float32 PASSED [0.0097s] [ 53%] 2025-12-04T12:25:43.2446981Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_multi_head_attention_forward_cuda_float32 PASSED [3.3631s] [ 53%] 2025-12-04T12:25:43.2447162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_multilabel_margin_loss_cuda_float32 PASSED [0.1774s] [ 53%] 2025-12-04T12:25:43.2447319Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_one_hot_cuda_int64 PASSED [0.0150s] [ 53%] 2025-12-04T12:25:43.2447484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pad_circular_cuda_float32 PASSED [0.0696s] [ 53%] 2025-12-04T12:25:43.2447662Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pad_constant_cuda_float32 PASSED [0.0920s] [ 53%] 2025-12-04T12:25:43.2447828Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_pad_reflect_cuda_float32 PASSED [0.0229s] [ 53%] 2025-12-04T12:25:43.2447985Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_prelu_cuda_float32 PASSED [0.0499s] [ 53%] 2025-12-04T12:25:43.2448213Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_relu_cuda_float32 PASSED [0.0096s] [ 53%] 2025-12-04T12:25:43.2448378Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_rms_norm_cuda_float32 PASSED [0.0311s] [ 53%] 2025-12-04T12:25:43.2448565Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_scaled_dot_product_attention_cuda_float32 PASSED [10.7922s] [ 53%] 2025-12-04T12:25:43.2448721Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_selu_cuda_float32 PASSED [0.0099s] [ 53%] 2025-12-04T12:25:43.2448890Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_smooth_l1_loss_cuda_float32 PASSED [0.0889s] [ 53%] 2025-12-04T12:25:43.2449051Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_softplus_cuda_float32 PASSED [0.0091s] [ 53%] 2025-12-04T12:25:43.2449226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_nn_functional_softsign_cuda_float32 PASSED [0.0120s] [ 53%] 2025-12-04T12:25:43.2449383Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_norm_cuda_float32 PASSED [0.1046s] [ 53%] 2025-12-04T12:25:43.2449542Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_normal_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 53%] 2025-12-04T12:25:43.2449699Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_normal_number_mean_cuda_float32 PASSED [0.0078s] [ 53%] 2025-12-04T12:25:43.2449859Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ones_like_cuda_float32 PASSED [0.0114s] [ 53%] 2025-12-04T12:25:43.2450004Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ormqr_cuda_float32 PASSED [2.4120s] [ 53%] 2025-12-04T12:25:43.2450146Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_outer_cuda_float32 PASSED [0.0078s] [ 53%] 2025-12-04T12:25:43.2450291Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_pinverse_cuda_float32 PASSED [0.0801s] [ 53%] 2025-12-04T12:25:43.2450431Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_polar_cuda_float32 PASSED [0.0991s] [ 53%] 2025-12-04T12:25:43.2450596Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_polygamma_polygamma_n_0_cuda_float32 PASSED [0.4206s] [ 53%] 2025-12-04T12:25:43.2450763Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_polygamma_polygamma_n_1_cuda_float32 PASSED [0.0131s] [ 53%] 2025-12-04T12:25:43.2450926Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_polygamma_polygamma_n_3_cuda_float32 PASSED [0.0129s] [ 53%] 2025-12-04T12:25:43.2451091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_polygamma_polygamma_n_4_cuda_float32 PASSED [0.0127s] [ 53%] 2025-12-04T12:25:43.2451236Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_quantile_cuda_float32 PASSED [1.7589s] [ 53%] 2025-12-04T12:25:43.2451387Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_randint_like_cuda_float32 PASSED [0.0272s] [ 53%] 2025-12-04T12:25:43.2451534Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_randn_like_cuda_float32 PASSED [0.0204s] [ 53%] 2025-12-04T12:25:43.2451687Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_ravel_cuda_float32 PASSED [0.0051s] [ 53%] 2025-12-04T12:25:43.2451827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_renorm_cuda_float32 PASSED [0.0245s] [ 53%] 2025-12-04T12:25:43.2451966Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_repeat_cuda_float32 PASSED [0.0594s] [ 53%] 2025-12-04T12:25:43.2452109Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_reshape_cuda_float32 PASSED [0.0082s] [ 53%] 2025-12-04T12:25:43.2452261Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_resolve_neg_cuda_float32 PASSED [1.0955s] [ 53%] 2025-12-04T12:25:43.2452416Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_round_decimals_0_cuda_float32 PASSED [0.0084s] [ 53%] 2025-12-04T12:25:43.2452570Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_round_decimals_3_cuda_float32 PASSED [1.1041s] [ 53%] 2025-12-04T12:25:43.2452728Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_round_decimals_neg_3_cuda_float32 PASSED [0.0087s] [ 53%] 2025-12-04T12:25:43.2452877Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_scatter_add_cuda_float32 PASSED [0.1128s] [ 53%] 2025-12-04T12:25:43.2453021Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_scatter_cuda_float32 PASSED [0.3901s] [ 53%] 2025-12-04T12:25:43.2453179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_scatter_reduce_prod_cuda_float32 PASSED [0.3649s] [ 53%] 2025-12-04T12:25:43.2453347Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_scatter_reduce_sum_cuda_float32 PASSED [0.3664s] [ 53%] 2025-12-04T12:25:43.2453497Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sgn_cuda_float32 PASSED [0.0046s] [ 53%] 2025-12-04T12:25:43.2453640Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_short_cuda_float32 PASSED [1.1003s] [ 53%] 2025-12-04T12:25:43.2453814Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_bartlett_cuda_float32 PASSED [0.0164s] [ 53%] 2025-12-04T12:25:43.2453977Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_cosine_cuda_float32 PASSED [0.0108s] [ 53%] 2025-12-04T12:25:43.2454146Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_exponential_cuda_float32 PASSED [1.1058s] [ 53%] 2025-12-04T12:25:43.2454311Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_gaussian_cuda_float32 PASSED [0.0224s] [ 53%] 2025-12-04T12:25:43.2454482Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_general_cosine_cuda_float32 PASSED [0.0333s] [ 53%] 2025-12-04T12:25:43.2454640Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_signal_windows_hann_cuda_float32 PASSED [0.0321s] [ 53%] 2025-12-04T12:25:43.2454783Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sinh_cuda_float32 PASSED [0.0042s] [ 53%] 2025-12-04T12:25:43.2454924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_softmax_cuda_float32 PASSED [0.0116s] [ 53%] 2025-12-04T12:25:43.2455082Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_softmax_with_dtype_cuda_float32 PASSED [0.0133s] [ 53%] 2025-12-04T12:25:43.2455257Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sparse_sampled_addmm_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 53%] 2025-12-04T12:25:43.2455414Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_bessel_j0_cuda_float32 PASSED [1.2803s] [ 53%] 2025-12-04T12:25:43.2455568Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_bessel_y0_cuda_float32 PASSED [0.2582s] [ 53%] 2025-12-04T12:25:43.2455753Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_chebyshev_polynomial_u_cuda_float32 PASSED [0.0466s] [ 53%] 2025-12-04T12:25:43.2455902Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_entr_cuda_float32 PASSED [1.2418s] [ 53%] 2025-12-04T12:25:43.2456052Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_i1e_cuda_float32 PASSED [0.1669s] [ 53%] 2025-12-04T12:25:43.2456226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_laguerre_polynomial_l_cuda_float32 PASSED [0.0455s] [ 53%] 2025-12-04T12:25:43.2456394Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_modified_bessel_k0_cuda_float32 PASSED [0.1859s] [ 53%] 2025-12-04T12:25:43.2456572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_scaled_modified_bessel_k1_cuda_float32 PASSED [0.1796s] [ 53%] 2025-12-04T12:25:43.2456756Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_shifted_chebyshev_polynomial_t_cuda_float32 PASSED [0.0453s] [ 53%] 2025-12-04T12:25:43.2456942Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_shifted_chebyshev_polynomial_v_cuda_float32 PASSED [0.5357s] [ 53%] 2025-12-04T12:25:43.2457124Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_shifted_chebyshev_polynomial_w_cuda_float32 PASSED [0.0464s] [ 53%] 2025-12-04T12:25:43.2457306Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_special_spherical_bessel_j0_cuda_float32 PASSED [0.3319s] [ 53%] 2025-12-04T12:25:43.2457457Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_split_cuda_float32 PASSED [0.0046s] [ 53%] 2025-12-04T12:25:43.2457619Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_split_with_sizes_copy_cuda_float32 PASSED [0.0068s] [ 53%] 2025-12-04T12:25:43.2457769Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_squeeze_copy_cuda_float32 PASSED [0.0176s] [ 53%] 2025-12-04T12:25:43.2457931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_std_unbiased_cuda_float32 PASSED [0.0059s] [ 53%] 2025-12-04T12:25:43.2458202Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_stft_cuda_float32 PASSED [0.4391s] [ 53%] 2025-12-04T12:25:43.2458344Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_sub_cuda_float32 PASSED [0.0615s] [ 53%] 2025-12-04T12:25:43.2458499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_svd_lowrank_cuda_float32 PASSED [3.0613s] [ 53%] 2025-12-04T12:25:43.2458638Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_tile_cuda_float32 PASSED [0.0854s] [ 53%] 2025-12-04T12:25:43.2458837Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_torch__scaled_mm_v2_cuda_float8_e4m3fn SKIPPED [0.0006s] (Requires CUDA SM >= 8.9) [ 53%] 2025-12-04T12:25:43.2458984Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_trapezoid_cuda_float32 PASSED [0.1996s] [ 53%] 2025-12-04T12:25:43.2459141Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_triangular_solve_cuda_float32 PASSED [0.0641s] [ 53%] 2025-12-04T12:25:43.2459281Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_tril_cuda_float32 PASSED [0.0366s] [ 53%] 2025-12-04T12:25:43.2459433Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_tril_indices_cuda_int64 PASSED [0.0135s] [ 53%] 2025-12-04T12:25:43.2459571Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_triu_cuda_float32 PASSED [0.0326s] [ 53%] 2025-12-04T12:25:43.2459719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_true_divide_cuda_float32 PASSED [0.0474s] [ 53%] 2025-12-04T12:25:43.2459873Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_trunc_cuda_float32 PASSED [0.0040s] [ 53%] 2025-12-04T12:25:43.2460021Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unbind_copy_cuda_float32 PASSED [0.0102s] [ 53%] 2025-12-04T12:25:43.2460161Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unbind_cuda_float32 PASSED [0.0085s] [ 53%] 2025-12-04T12:25:43.2460320Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_unique_consecutive_cuda_float32 PASSED [0.2434s] [ 53%] 2025-12-04T12:25:43.2460476Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_var_mean_unbiased_cuda_float32 PASSED [0.0066s] [ 53%] 2025-12-04T12:25:43.2460627Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_var_unbiased_cuda_float32 PASSED [0.0051s] [ 53%] 2025-12-04T12:25:43.2460778Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_view_as_complex_cuda_float32 XFAIL [0.0029s] [ 53%] 2025-12-04T12:25:43.2460918Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_where_cuda_float32 PASSED [0.0707s] [ 53%] 2025-12-04T12:25:43.2461059Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_all_strides_zeros_cuda_float32 PASSED [1.6270s] [ 53%] 2025-12-04T12:25:43.2461190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_allclose_cuda_float64 PASSED [0.0430s] [ 53%] 2025-12-04T12:25:43.2461314Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_int64 PASSED [1.6259s] [ 53%] 2025-12-04T12:25:43.2461456Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amax_cuda_int8 PASSED [0.0153s] [ 53%] 2025-12-04T12:25:43.2461598Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amin_cuda_bfloat16 PASSED [0.0177s] [ 53%] 2025-12-04T12:25:43.2461717Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amin_cuda_bool PASSED [0.0129s] [ 53%] 2025-12-04T12:25:43.2461843Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amin_cuda_float32 PASSED [0.0128s] [ 53%] 2025-12-04T12:25:43.2461979Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amin_cuda_float64 PASSED [0.0127s] [ 53%] 2025-12-04T12:25:43.2462104Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_amin_cuda_int16 PASSED [0.0127s] [ 53%] 2025-12-04T12:25:43.2462226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_bool PASSED [0.0047s] [ 53%] 2025-12-04T12:25:43.2462353Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_float64 PASSED [0.0048s] [ 53%] 2025-12-04T12:25:43.2462477Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_int16 PASSED [0.0048s] [ 53%] 2025-12-04T12:25:43.2462601Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_int32 PASSED [0.0048s] [ 53%] 2025-12-04T12:25:43.2462725Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_aminmax_cuda_uint8 PASSED [0.0047s] [ 53%] 2025-12-04T12:25:43.2462848Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_float64 PASSED [1.6240s] [ 53%] 2025-12-04T12:25:43.2462973Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_angle_cuda_int32 PASSED [0.0049s] [ 53%] 2025-12-04T12:25:43.2463095Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_bfloat16 PASSED [0.0226s] [ 53%] 2025-12-04T12:25:43.2463214Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_bool PASSED [0.0191s] [ 53%] 2025-12-04T12:25:43.2463337Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_float64 PASSED [0.0208s] [ 53%] 2025-12-04T12:25:43.2463455Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_int16 PASSED [0.0206s] [ 53%] 2025-12-04T12:25:43.2463572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_int64 PASSED [1.6053s] [ 53%] 2025-12-04T12:25:43.2463701Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_int8 PASSED [0.0239s] [ 53%] 2025-12-04T12:25:43.2463820Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_any_cuda_uint8 PASSED [0.0221s] [ 53%] 2025-12-04T12:25:43.2463950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_arange_cuda_bfloat16 PASSED [0.0154s] [ 53%] 2025-12-04T12:25:43.2464071Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_arange_cuda_int16 PASSED [0.0112s] [ 53%] 2025-12-04T12:25:43.2464195Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_arange_cuda_int32 PASSED [0.0112s] [ 53%] 2025-12-04T12:25:43.2464318Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_arange_cuda_int64 PASSED [0.0112s] [ 53%] 2025-12-04T12:25:43.2464446Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_bfloat16 PASSED [1.5937s] [ 53%] 2025-12-04T12:25:43.2464572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmax_cuda_float32 PASSED [0.0103s] [ 53%] 2025-12-04T12:25:43.2464699Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_bfloat16 PASSED [0.0084s] [ 53%] 2025-12-04T12:25:43.2464826Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_float16 PASSED [0.0082s] [ 53%] 2025-12-04T12:25:43.2464952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_float64 PASSED [1.4096s] [ 53%] 2025-12-04T12:25:43.2465077Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_int16 PASSED [0.0097s] [ 53%] 2025-12-04T12:25:43.2465210Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_int64 PASSED [0.0082s] [ 53%] 2025-12-04T12:25:43.2465344Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argmin_cuda_int8 PASSED [0.0083s] [ 53%] 2025-12-04T12:25:43.2465467Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argsort_cuda_bool PASSED [1.0502s] [ 53%] 2025-12-04T12:25:43.2465593Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argsort_cuda_int64 PASSED [0.0341s] [ 53%] 2025-12-04T12:25:43.2465724Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argsort_cuda_int8 PASSED [1.0583s] [ 53%] 2025-12-04T12:25:43.2465849Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argsort_cuda_uint8 PASSED [0.0304s] [ 53%] 2025-12-04T12:25:43.2465978Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_bfloat16 PASSED [1.0231s] [ 53%] 2025-12-04T12:25:43.2466103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_bool PASSED [0.0057s] [ 53%] 2025-12-04T12:25:43.2466235Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_complex64 PASSED [0.0044s] [ 53%] 2025-12-04T12:25:43.2466369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_float32 PASSED [1.0327s] [ 53%] 2025-12-04T12:25:43.2466498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_float64 PASSED [0.0061s] [ 53%] 2025-12-04T12:25:43.2466625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_argwhere_cuda_int32 PASSED [0.0044s] [ 53%] 2025-12-04T12:25:43.2466764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_float16 PASSED [0.0055s] [ 54%] 2025-12-04T12:25:43.2466905Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_float64 PASSED [0.0051s] [ 54%] 2025-12-04T12:25:43.2467042Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_int32 PASSED [0.0051s] [ 54%] 2025-12-04T12:25:43.2467181Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_int64 PASSED [0.0050s] [ 54%] 2025-12-04T12:25:43.2467316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_int8 PASSED [0.0050s] [ 54%] 2025-12-04T12:25:43.2467462Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_copy_cuda_uint8 PASSED [0.0050s] [ 54%] 2025-12-04T12:25:43.2467598Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_cuda_complex128 PASSED [1.0332s] [ 54%] 2025-12-04T12:25:43.2467733Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_cuda_complex64 PASSED [0.0050s] [ 54%] 2025-12-04T12:25:43.2467863Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_cuda_int64 PASSED [0.0039s] [ 54%] 2025-12-04T12:25:43.2468014Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_bfloat16 PASSED [1.0551s] [ 54%] 2025-12-04T12:25:43.2468215Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_complex128 PASSED [0.0049s] [ 54%] 2025-12-04T12:25:43.2468368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_complex32 PASSED [0.0035s] [ 54%] 2025-12-04T12:25:43.2468519Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_int64 PASSED [1.0308s] [ 54%] 2025-12-04T12:25:43.2468665Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_int8 PASSED [0.0049s] [ 54%] 2025-12-04T12:25:43.2468812Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_partial_views_cuda_uint8 PASSED [0.0035s] [ 54%] 2025-12-04T12:25:43.2468949Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_bool PASSED [0.0079s] [ 54%] 2025-12-04T12:25:43.2469101Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_int32 PASSED [0.0075s] [ 54%] 2025-12-04T12:25:43.2469250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_as_strided_scatter_cuda_int8 PASSED [0.0074s] [ 54%] 2025-12-04T12:25:43.2469375Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asin_cuda_int64 PASSED [0.0032s] [ 54%] 2025-12-04T12:25:43.2469507Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_complex128 PASSED [1.0318s] [ 54%] 2025-12-04T12:25:43.2469650Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_complex32 PASSED [0.0053s] [ 54%] 2025-12-04T12:25:43.2469777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_complex64 PASSED [0.0033s] [ 54%] 2025-12-04T12:25:43.2469900Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_float16 PASSED [1.0380s] [ 54%] 2025-12-04T12:25:43.2470023Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_int16 PASSED [0.0053s] [ 54%] 2025-12-04T12:25:43.2470145Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_asinh_cuda_uint8 PASSED [0.0037s] [ 54%] 2025-12-04T12:25:43.2470272Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_bfloat16 PASSED [0.0138s] [ 54%] 2025-12-04T12:25:43.2470396Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_float32 PASSED [0.0099s] [ 54%] 2025-12-04T12:25:43.2470517Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_int16 PASSED [0.0122s] [ 54%] 2025-12-04T12:25:43.2470639Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan2_cuda_int32 PASSED [0.0121s] [ 54%] 2025-12-04T12:25:43.2470764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_bfloat16 PASSED [1.0579s] [ 54%] 2025-12-04T12:25:43.2470884Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_float16 PASSED [0.0055s] [ 54%] 2025-12-04T12:25:43.2471008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atan_cuda_uint8 PASSED [0.0037s] [ 54%] 2025-12-04T12:25:43.2471135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_complex32 PASSED [1.2242s] [ 54%] 2025-12-04T12:25:43.2471259Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_float32 PASSED [0.0046s] [ 54%] 2025-12-04T12:25:43.2471395Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_float64 PASSED [1.0294s] [ 54%] 2025-12-04T12:25:43.2471521Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atanh_cuda_int8 PASSED [0.0050s] [ 54%] 2025-12-04T12:25:43.2471648Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_bool PASSED [0.0039s] [ 54%] 2025-12-04T12:25:43.2471786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_complex128 PASSED [1.0402s] [ 54%] 2025-12-04T12:25:43.2471914Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_int32 PASSED [0.0054s] [ 54%] 2025-12-04T12:25:43.2472041Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_1d_cuda_int8 PASSED [1.0440s] [ 54%] 2025-12-04T12:25:43.2472180Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_complex128 PASSED [0.0059s] [ 54%] 2025-12-04T12:25:43.2472311Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_float32 PASSED [0.0043s] [ 54%] 2025-12-04T12:25:43.2472438Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_int32 PASSED [1.0492s] [ 54%] 2025-12-04T12:25:43.2472565Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_2d_cuda_int8 PASSED [0.0056s] [ 54%] 2025-12-04T12:25:43.2472700Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_complex128 PASSED [0.0049s] [ 54%] 2025-12-04T12:25:43.2472830Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_float16 PASSED [0.0046s] [ 54%] 2025-12-04T12:25:43.2472967Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_atleast_3d_cuda_int8 PASSED [1.0460s] [ 54%] 2025-12-04T12:25:43.2473106Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_baddbmm_cuda_bfloat16 PASSED [0.0102s] [ 54%] 2025-12-04T12:25:43.2473232Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_baddbmm_cuda_float32 PASSED [0.0077s] [ 54%] 2025-12-04T12:25:43.2473363Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bernoulli_cuda_float16 PASSED [0.0056s] [ 54%] 2025-12-04T12:25:43.2473506Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bernoulli_cuda_float64 PASSED [0.0053s] [ 54%] 2025-12-04T12:25:43.2473636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_bfloat16 PASSED [1.0351s] [ 54%] 2025-12-04T12:25:43.2473771Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_complex32 PASSED [0.0042s] [ 54%] 2025-12-04T12:25:43.2473899Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_float16 PASSED [1.0319s] [ 54%] 2025-12-04T12:25:43.2474032Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_float32 PASSED [0.0042s] [ 54%] 2025-12-04T12:25:43.2474162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_float64 PASSED [1.0508s] [ 54%] 2025-12-04T12:25:43.2474289Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bfloat16_cuda_int8 PASSED [0.0041s] [ 54%] 2025-12-04T12:25:43.2474419Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bincount_cuda_int64 PASSED [0.0090s] [ 54%] 2025-12-04T12:25:43.2474542Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bincount_cuda_int8 PASSED [0.0079s] [ 54%] 2025-12-04T12:25:43.2474669Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bincount_cuda_uint8 PASSED [0.0073s] [ 54%] 2025-12-04T12:25:43.2474797Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_and_cuda_int16 PASSED [0.0102s] [ 54%] 2025-12-04T12:25:43.2474930Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_and_cuda_int32 PASSED [0.0097s] [ 54%] 2025-12-04T12:25:43.2475059Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_and_cuda_int8 PASSED [0.0097s] [ 54%] 2025-12-04T12:25:43.2475208Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_not_cuda_int16 PASSED [1.0571s] [ 54%] 2025-12-04T12:25:43.2475336Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_not_cuda_uint8 PASSED [0.0061s] [ 54%] 2025-12-04T12:25:43.2475480Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_right_shift_cuda_int64 PASSED [0.0104s] [ 54%] 2025-12-04T12:25:43.2475620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_right_shift_cuda_int8 PASSED [0.0098s] [ 54%] 2025-12-04T12:25:43.2475750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_xor_cuda_int64 PASSED [0.0098s] [ 54%] 2025-12-04T12:25:43.2475881Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_xor_cuda_int8 PASSED [0.0097s] [ 54%] 2025-12-04T12:25:43.2476014Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bitwise_xor_cuda_uint8 PASSED [0.0097s] [ 54%] 2025-12-04T12:25:43.2476151Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_complex128 PASSED [0.0080s] [ 54%] 2025-12-04T12:25:43.2476285Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_complex64 PASSED [0.0153s] [ 54%] 2025-12-04T12:25:43.2476417Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_float32 PASSED [1.0292s] [ 54%] 2025-12-04T12:25:43.2476548Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_int16 PASSED [0.0101s] [ 54%] 2025-12-04T12:25:43.2476675Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_int32 PASSED [1.0359s] [ 54%] 2025-12-04T12:25:43.2476814Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_int64 PASSED [0.0106s] [ 54%] 2025-12-04T12:25:43.2476951Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_block_diag_cuda_int8 PASSED [1.0410s] [ 54%] 2025-12-04T12:25:43.2477072Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bmm_cuda_float32 PASSED [0.0048s] [ 54%] 2025-12-04T12:25:43.2477195Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_bool PASSED [1.1820s] [ 54%] 2025-12-04T12:25:43.2477341Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_complex64 PASSED [0.0043s] [ 54%] 2025-12-04T12:25:43.2477466Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bool_cuda_float16 PASSED [1.6044s] [ 54%] 2025-12-04T12:25:43.2477608Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_float16 PASSED [0.0052s] [ 54%] 2025-12-04T12:25:43.2477750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_float32 PASSED [0.0036s] [ 54%] 2025-12-04T12:25:43.2477888Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_int16 PASSED [1.5602s] [ 54%] 2025-12-04T12:25:43.2478026Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_int64 PASSED [0.0052s] [ 54%] 2025-12-04T12:25:43.2478201Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_tensors_cuda_uint8 PASSED [0.0036s] [ 54%] 2025-12-04T12:25:43.2478333Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_bool PASSED [1.6100s] [ 54%] 2025-12-04T12:25:43.2478467Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_float16 PASSED [1.6083s] [ 54%] 2025-12-04T12:25:43.2478602Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_int32 PASSED [1.5667s] [ 54%] 2025-12-04T12:25:43.2478732Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_broadcast_to_cuda_int64 PASSED [1.6051s] [ 54%] 2025-12-04T12:25:43.2478865Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_float64 PASSED [0.0198s] [ 54%] 2025-12-04T12:25:43.2478992Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_bucketize_cuda_int16 PASSED [0.0174s] [ 54%] 2025-12-04T12:25:43.2479132Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_float64 PASSED [1.5554s] [ 54%] 2025-12-04T12:25:43.2479255Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_int16 PASSED [0.0044s] [ 54%] 2025-12-04T12:25:43.2479377Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_int32 PASSED [1.5408s] [ 54%] 2025-12-04T12:25:43.2479500Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_int64 PASSED [0.0042s] [ 54%] 2025-12-04T12:25:43.2479620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_int8 PASSED [1.5401s] [ 54%] 2025-12-04T12:25:43.2479743Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_byte_cuda_uint8 PASSED [0.0042s] [ 54%] 2025-12-04T12:25:43.2479881Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_bfloat16 PASSED [0.0107s] [ 54%] 2025-12-04T12:25:43.2480027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_complex128 PASSED [0.0090s] [ 54%] 2025-12-04T12:25:43.2480165Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_float32 PASSED [0.0088s] [ 54%] 2025-12-04T12:25:43.2480304Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_float64 PASSED [0.0088s] [ 54%] 2025-12-04T12:25:43.2480440Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cartesian_prod_cuda_uint8 PASSED [0.0087s] [ 54%] 2025-12-04T12:25:43.2480565Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_complex64 PASSED [0.0121s] [ 54%] 2025-12-04T12:25:43.2480701Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_float64 PASSED [0.0094s] [ 54%] 2025-12-04T12:25:43.2480835Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_int16 PASSED [1.5451s] [ 54%] 2025-12-04T12:25:43.2480953Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_int64 PASSED [0.0111s] [ 54%] 2025-12-04T12:25:43.2481073Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cat_cuda_int8 PASSED [0.0091s] [ 54%] 2025-12-04T12:25:43.2481214Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cauchy_cuda_bfloat16 PASSED [0.0057s] [ 54%] 2025-12-04T12:25:43.2481339Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cauchy_cuda_float16 PASSED [0.0053s] [ 54%] 2025-12-04T12:25:43.2481466Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cauchy_cuda_float32 PASSED [0.0048s] [ 54%] 2025-12-04T12:25:43.2481589Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdist_cuda_float32 PASSED [0.2017s] [ 54%] 2025-12-04T12:25:43.2481720Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_bfloat16 PASSED [0.0035s] [ 54%] 2025-12-04T12:25:43.2481844Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_bool PASSED [1.5369s] [ 54%] 2025-12-04T12:25:43.2481972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_float32 PASSED [0.0043s] [ 54%] 2025-12-04T12:25:43.2482094Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cdouble_cuda_int32 PASSED [1.5336s] [ 54%] 2025-12-04T12:25:43.2483763Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ceil_cuda_float32 PASSED [0.0050s] [ 54%] 2025-12-04T12:25:43.2483888Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ceil_cuda_int64 PASSED [1.5257s] [ 54%] 2025-12-04T12:25:43.2484009Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ceil_cuda_uint8 PASSED [0.0051s] [ 54%] 2025-12-04T12:25:43.2484139Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_complex64 PASSED [1.5561s] [ 54%] 2025-12-04T12:25:43.2484268Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cfloat_cuda_uint8 PASSED [0.0043s] [ 54%] 2025-12-04T12:25:43.2484390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_bool PASSED [1.5328s] [ 54%] 2025-12-04T12:25:43.2484533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_float16 PASSED [0.0044s] [ 54%] 2025-12-04T12:25:43.2484657Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_int64 PASSED [1.5315s] [ 54%] 2025-12-04T12:25:43.2484778Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chalf_cuda_int8 PASSED [0.0042s] [ 54%] 2025-12-04T12:25:43.2484906Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_complex64 PASSED [1.5184s] [ 54%] 2025-12-04T12:25:43.2485028Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_int16 PASSED [0.0041s] [ 54%] 2025-12-04T12:25:43.2485147Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_char_cuda_int8 PASSED [1.4983s] [ 54%] 2025-12-04T12:25:43.2485292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cholesky_inverse_cuda_complex64 PASSED [0.0240s] [ 54%] 2025-12-04T12:25:43.2485432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cholesky_inverse_cuda_float32 PASSED [0.0127s] [ 54%] 2025-12-04T12:25:43.2485572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cholesky_solve_cuda_complex64 PASSED [1.5836s] [ 54%] 2025-12-04T12:25:43.2485701Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_complex128 PASSED [0.0052s] [ 54%] 2025-12-04T12:25:43.2485825Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_float16 PASSED [1.5365s] [ 54%] 2025-12-04T12:25:43.2485949Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_float32 PASSED [0.0056s] [ 54%] 2025-12-04T12:25:43.2486082Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_int32 PASSED [1.5496s] [ 54%] 2025-12-04T12:25:43.2486215Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_int8 PASSED [0.0058s] [ 54%] 2025-12-04T12:25:43.2486336Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_chunk_cuda_uint8 PASSED [1.5451s] [ 54%] 2025-12-04T12:25:43.2486463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_float16 PASSED [0.0200s] [ 54%] 2025-12-04T12:25:43.2486597Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_float32 PASSED [0.0138s] [ 55%] 2025-12-04T12:25:43.2486721Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_float64 PASSED [0.0136s] [ 55%] 2025-12-04T12:25:43.2486842Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_cuda_uint8 PASSED [0.0138s] [ 55%] 2025-12-04T12:25:43.2486971Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_bool PASSED [0.0138s] [ 55%] 2025-12-04T12:25:43.2487101Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_float64 PASSED [0.0148s] [ 55%] 2025-12-04T12:25:43.2487229Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_max_cuda_int32 PASSED [0.0141s] [ 55%] 2025-12-04T12:25:43.2487354Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_min_cuda_bool PASSED [0.0139s] [ 55%] 2025-12-04T12:25:43.2487488Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clamp_min_cuda_float16 PASSED [0.0175s] [ 55%] 2025-12-04T12:25:43.2487610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_clone_cuda_int8 PASSED [1.5176s] [ 55%] 2025-12-04T12:25:43.2487748Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_complex128 PASSED [0.0077s] [ 55%] 2025-12-04T12:25:43.2487883Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_float16 PASSED [1.5853s] [ 55%] 2025-12-04T12:25:43.2488016Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_column_stack_cuda_float32 PASSED [0.0083s] [ 55%] 2025-12-04T12:25:43.2488193Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_bool PASSED [0.0920s] [ 55%] 2025-12-04T12:25:43.2488349Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_complex128 PASSED [0.0977s] [ 55%] 2025-12-04T12:25:43.2488490Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_complex64 PASSED [0.0916s] [ 55%] 2025-12-04T12:25:43.2488624Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_float16 PASSED [0.0909s] [ 55%] 2025-12-04T12:25:43.2488759Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_combinations_cuda_float32 PASSED [0.0907s] [ 55%] 2025-12-04T12:25:43.2488886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_complex_cuda_float32 PASSED [0.0118s] [ 55%] 2025-12-04T12:25:43.2489011Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_float64 PASSED [1.5833s] [ 55%] 2025-12-04T12:25:43.2489131Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_cuda_int8 PASSED [0.0038s] [ 55%] 2025-12-04T12:25:43.2489269Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_bfloat16 PASSED [1.5720s] [ 55%] 2025-12-04T12:25:43.2489401Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_bool PASSED [0.0043s] [ 55%] 2025-12-04T12:25:43.2489539Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_float16 PASSED [1.5703s] [ 55%] 2025-12-04T12:25:43.2489670Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_conj_physical_cuda_int16 PASSED [0.0043s] [ 55%] 2025-12-04T12:25:43.2489805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_bool PASSED [0.0306s] [ 55%] 2025-12-04T12:25:43.2489962Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_complex128 PASSED [0.0297s] [ 55%] 2025-12-04T12:25:43.2490112Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_float32 PASSED [0.0292s] [ 55%] 2025-12-04T12:25:43.2490250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_constant_pad_nd_cuda_int32 PASSED [0.0289s] [ 55%] 2025-12-04T12:25:43.2490387Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_complex128 PASSED [1.3499s] [ 55%] 2025-12-04T12:25:43.2490540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_float16 PASSED [0.0036s] [ 55%] 2025-12-04T12:25:43.2490670Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_float64 PASSED [1.1329s] [ 55%] 2025-12-04T12:25:43.2490800Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_int16 PASSED [0.0034s] [ 55%] 2025-12-04T12:25:43.2490930Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_int32 PASSED [1.0718s] [ 55%] 2025-12-04T12:25:43.2491058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_contiguous_cuda_int64 PASSED [0.0034s] [ 55%] 2025-12-04T12:25:43.2491186Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_copysign_cuda_float64 PASSED [0.0179s] [ 55%] 2025-12-04T12:25:43.2491317Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_copysign_cuda_int64 PASSED [0.0166s] [ 55%] 2025-12-04T12:25:43.2491450Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_corrcoef_cuda_complex128 PASSED [0.0433s] [ 55%] 2025-12-04T12:25:43.2491582Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_corrcoef_cuda_float32 PASSED [0.0331s] [ 55%] 2025-12-04T12:25:43.2491705Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_corrcoef_cuda_int16 PASSED [0.0353s] [ 55%] 2025-12-04T12:25:43.2491834Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_corrcoef_cuda_int8 PASSED [0.0351s] [ 55%] 2025-12-04T12:25:43.2491961Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_float32 PASSED [1.0742s] [ 55%] 2025-12-04T12:25:43.2492086Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_float64 PASSED [0.0070s] [ 55%] 2025-12-04T12:25:43.2492220Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_int16 PASSED [0.0048s] [ 55%] 2025-12-04T12:25:43.2492338Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_int32 PASSED [1.0454s] [ 55%] 2025-12-04T12:25:43.2492458Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cos_cuda_uint8 PASSED [0.0059s] [ 55%] 2025-12-04T12:25:43.2492580Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_float16 PASSED [0.0045s] [ 55%] 2025-12-04T12:25:43.2492702Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_float64 PASSED [1.0356s] [ 55%] 2025-12-04T12:25:43.2492821Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_int8 PASSED [0.0059s] [ 55%] 2025-12-04T12:25:43.2492950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cosh_cuda_uint8 PASSED [0.0043s] [ 55%] 2025-12-04T12:25:43.2493086Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_bfloat16 PASSED [0.0231s] [ 55%] 2025-12-04T12:25:43.2493229Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_complex128 PASSED [1.0621s] [ 55%] 2025-12-04T12:25:43.2493368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_complex64 PASSED [0.0238s] [ 55%] 2025-12-04T12:25:43.2493503Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_int16 PASSED [0.0203s] [ 55%] 2025-12-04T12:25:43.2493633Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_count_nonzero_cuda_int8 PASSED [0.0201s] [ 55%] 2025-12-04T12:25:43.2493768Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_bfloat16 PASSED [0.4061s] [ 55%] 2025-12-04T12:25:43.2493906Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_complex64 PASSED [0.3693s] [ 55%] 2025-12-04T12:25:43.2494025Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_int32 PASSED [0.3794s] [ 55%] 2025-12-04T12:25:43.2494144Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cov_cuda_int64 PASSED [0.3893s] [ 55%] 2025-12-04T12:25:43.2494283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_complex64 PASSED [1.0560s] [ 55%] 2025-12-04T12:25:43.2494404Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_int64 PASSED [0.0065s] [ 55%] 2025-12-04T12:25:43.2494526Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cross_cuda_int8 PASSED [0.0046s] [ 55%] 2025-12-04T12:25:43.2494652Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_float32 PASSED [1.0532s] [ 55%] 2025-12-04T12:25:43.2494777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_float64 PASSED [0.0049s] [ 55%] 2025-12-04T12:25:43.2494901Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_int32 PASSED [1.0363s] [ 55%] 2025-12-04T12:25:43.2495023Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_int8 PASSED [0.0048s] [ 55%] 2025-12-04T12:25:43.2495149Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummax_cuda_uint8 PASSED [1.0299s] [ 55%] 2025-12-04T12:25:43.2495276Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_float32 PASSED [0.0051s] [ 55%] 2025-12-04T12:25:43.2495403Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_float64 PASSED [1.0428s] [ 55%] 2025-12-04T12:25:43.2495524Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_int64 PASSED [0.0052s] [ 55%] 2025-12-04T12:25:43.2495646Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cummin_cuda_int8 PASSED [1.0500s] [ 55%] 2025-12-04T12:25:43.2495774Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumprod_cuda_float64 PASSED [0.0171s] [ 55%] 2025-12-04T12:25:43.2495898Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumprod_cuda_int16 PASSED [0.0165s] [ 55%] 2025-12-04T12:25:43.2496037Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_complex128 PASSED [0.0069s] [ 55%] 2025-12-04T12:25:43.2496167Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_complex64 PASSED [0.0068s] [ 55%] 2025-12-04T12:25:43.2496291Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_float32 PASSED [0.0069s] [ 55%] 2025-12-04T12:25:43.2496413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_int16 PASSED [0.0074s] [ 55%] 2025-12-04T12:25:43.2496535Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumsum_cuda_uint8 PASSED [0.0075s] [ 55%] 2025-12-04T12:25:43.2496687Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_bfloat16 PASSED [0.0513s] [ 55%] 2025-12-04T12:25:43.2496843Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_complex64 PASSED [0.0363s] [ 55%] 2025-12-04T12:25:43.2496991Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_float32 PASSED [0.0361s] [ 55%] 2025-12-04T12:25:43.2497136Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_int8 PASSED [0.0396s] [ 55%] 2025-12-04T12:25:43.2497279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_cumulative_trapezoid_cuda_uint8 PASSED [1.0760s] [ 55%] 2025-12-04T12:25:43.2497410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_float32 PASSED [0.0049s] [ 55%] 2025-12-04T12:25:43.2497533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_int32 PASSED [0.0039s] [ 55%] 2025-12-04T12:25:43.2497670Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_deg2rad_cuda_uint8 PASSED [1.0621s] [ 55%] 2025-12-04T12:25:43.2497803Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_bfloat16 PASSED [0.0142s] [ 55%] 2025-12-04T12:25:43.2497928Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_bool PASSED [0.0119s] [ 55%] 2025-12-04T12:25:43.2498050Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_float16 PASSED [0.0115s] [ 55%] 2025-12-04T12:25:43.2498255Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_cuda_float32 PASSED [1.0627s] [ 55%] 2025-12-04T12:25:43.2498393Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_complex128 PASSED [0.0287s] [ 55%] 2025-12-04T12:25:43.2498527Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_float32 PASSED [0.0259s] [ 55%] 2025-12-04T12:25:43.2498655Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diag_embed_cuda_int32 PASSED [0.0256s] [ 55%] 2025-12-04T12:25:43.2498793Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_complex128 PASSED [1.0623s] [ 55%] 2025-12-04T12:25:43.2498921Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_float16 PASSED [0.0134s] [ 55%] 2025-12-04T12:25:43.2499052Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_float32 PASSED [0.0108s] [ 55%] 2025-12-04T12:25:43.2499180Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagflat_cuda_float64 PASSED [0.0106s] [ 55%] 2025-12-04T12:25:43.2499316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_bfloat16 PASSED [1.0455s] [ 55%] 2025-12-04T12:25:43.2499458Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_complex128 PASSED [0.0128s] [ 55%] 2025-12-04T12:25:43.2499598Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_complex32 PASSED [0.0109s] [ 55%] 2025-12-04T12:25:43.2499735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_float16 PASSED [0.0104s] [ 55%] 2025-12-04T12:25:43.2499866Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_copy_cuda_int16 PASSED [0.0103s] [ 55%] 2025-12-04T12:25:43.2500005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_bool PASSED [0.0064s] [ 55%] 2025-12-04T12:25:43.2500134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_float32 PASSED [0.0064s] [ 55%] 2025-12-04T12:25:43.2500263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_int32 PASSED [0.0064s] [ 55%] 2025-12-04T12:25:43.2500388Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_int64 PASSED [0.0063s] [ 55%] 2025-12-04T12:25:43.2500513Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_cuda_int8 PASSED [0.0064s] [ 55%] 2025-12-04T12:25:43.2500657Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_scatter_cuda_complex64 PASSED [0.0132s] [ 55%] 2025-12-04T12:25:43.2500798Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_scatter_cuda_float32 PASSED [0.0128s] [ 55%] 2025-12-04T12:25:43.2500937Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_scatter_cuda_float64 PASSED [1.0401s] [ 55%] 2025-12-04T12:25:43.2501074Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diagonal_scatter_cuda_int64 PASSED [0.0157s] [ 55%] 2025-12-04T12:25:43.2501201Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_complex128 PASSED [0.2286s] [ 55%] 2025-12-04T12:25:43.2501327Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_complex64 PASSED [0.2268s] [ 55%] 2025-12-04T12:25:43.2501451Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_float16 PASSED [0.3139s] [ 55%] 2025-12-04T12:25:43.2501584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_int16 PASSED [0.2290s] [ 55%] 2025-12-04T12:25:43.2501720Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_int32 PASSED [0.2269s] [ 55%] 2025-12-04T12:25:43.2501840Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_diff_cuda_uint8 PASSED [0.2265s] [ 55%] 2025-12-04T12:25:43.2501964Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_digamma_cuda_bool PASSED [0.0043s] [ 55%] 2025-12-04T12:25:43.2502098Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_digamma_cuda_int8 PASSED [0.0041s] [ 55%] 2025-12-04T12:25:43.2502224Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_digamma_cuda_uint8 PASSED [1.0424s] [ 55%] 2025-12-04T12:25:43.2502346Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dist_cuda_float32 PASSED [0.0654s] [ 55%] 2025-12-04T12:25:43.2502470Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dist_cuda_float64 PASSED [0.0625s] [ 55%] 2025-12-04T12:25:43.2502612Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_floor_rounding_cuda_float32 PASSED [0.0372s] [ 55%] 2025-12-04T12:25:43.2502754Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_bool PASSED [0.0128s] [ 55%] 2025-12-04T12:25:43.2502893Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_int32 PASSED [0.0127s] [ 55%] 2025-12-04T12:25:43.2503033Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_no_rounding_mode_cuda_int8 PASSED [0.0126s] [ 55%] 2025-12-04T12:25:43.2503174Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_bfloat16 PASSED [0.0143s] [ 55%] 2025-12-04T12:25:43.2503316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_float16 PASSED [0.0143s] [ 55%] 2025-12-04T12:25:43.2503455Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_float64 PASSED [0.0111s] [ 55%] 2025-12-04T12:25:43.2503596Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_int16 PASSED [0.0103s] [ 55%] 2025-12-04T12:25:43.2503735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_int32 PASSED [0.0103s] [ 55%] 2025-12-04T12:25:43.2503888Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_int8 PASSED [0.0133s] [ 55%] 2025-12-04T12:25:43.2504027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_div_trunc_rounding_cuda_uint8 PASSED [0.0108s] [ 55%] 2025-12-04T12:25:43.2504152Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dot_cuda_complex64 PASSED [1.0742s] [ 55%] 2025-12-04T12:25:43.2504283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_complex32 PASSED [0.0042s] [ 55%] 2025-12-04T12:25:43.2504412Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_complex64 PASSED [1.0435s] [ 55%] 2025-12-04T12:25:43.2504540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_float64 PASSED [0.0042s] [ 55%] 2025-12-04T12:25:43.2504662Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_double_cuda_int32 PASSED [1.0642s] [ 55%] 2025-12-04T12:25:43.2504794Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_complex128 PASSED [0.0099s] [ 56%] 2025-12-04T12:25:43.2504921Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_complex32 PASSED [0.0075s] [ 56%] 2025-12-04T12:25:43.2505048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_complex64 PASSED [1.0550s] [ 56%] 2025-12-04T12:25:43.2505171Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_float64 PASSED [0.0095s] [ 56%] 2025-12-04T12:25:43.2505293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_int32 PASSED [0.0075s] [ 56%] 2025-12-04T12:25:43.2505426Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_int64 PASSED [1.0590s] [ 56%] 2025-12-04T12:25:43.2505564Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_dstack_cuda_int8 PASSED [0.0091s] [ 56%] 2025-12-04T12:25:43.2505690Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_einsum_cuda_float16 PASSED [0.0290s] [ 56%] 2025-12-04T12:25:43.2505816Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_einsum_cuda_float64 PASSED [0.0247s] [ 56%] 2025-12-04T12:25:43.2505950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_bfloat16 PASSED [0.0038s] [ 56%] 2025-12-04T12:25:43.2506072Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_bool PASSED [0.0037s] [ 56%] 2025-12-04T12:25:43.2506196Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_cuda_float64 PASSED [0.0038s] [ 56%] 2025-12-04T12:25:43.2506334Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_complex64 PASSED [0.0065s] [ 56%] 2025-12-04T12:25:43.2506467Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_float64 PASSED [0.0063s] [ 56%] 2025-12-04T12:25:43.2506594Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_int16 PASSED [0.0063s] [ 56%] 2025-12-04T12:25:43.2506723Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_like_cuda_int8 PASSED [0.0064s] [ 56%] 2025-12-04T12:25:43.2506861Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_bfloat16 PASSED [0.0151s] [ 56%] 2025-12-04T12:25:43.2506995Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_bool PASSED [0.0150s] [ 56%] 2025-12-04T12:25:43.2507127Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_int32 PASSED [0.0151s] [ 56%] 2025-12-04T12:25:43.2507261Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_int64 PASSED [0.0150s] [ 56%] 2025-12-04T12:25:43.2507394Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_permuted_cuda_int8 PASSED [0.0151s] [ 56%] 2025-12-04T12:25:43.2507526Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_bool XFAIL [0.0045s] [ 56%] 2025-12-04T12:25:43.2507668Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_empty_strided_cuda_int16 XFAIL [1.0546s] [ 56%] 2025-12-04T12:25:43.2507791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_complex64 PASSED [1.0472s] [ 56%] 2025-12-04T12:25:43.2507911Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_float16 PASSED [0.0141s] [ 56%] 2025-12-04T12:25:43.2508033Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_float64 PASSED [0.0108s] [ 56%] 2025-12-04T12:25:43.2508227Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eq_cuda_int64 PASSED [1.0451s] [ 56%] 2025-12-04T12:25:43.2508354Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_bfloat16 PASSED [0.0066s] [ 56%] 2025-12-04T12:25:43.2508485Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_complex128 PASSED [0.0051s] [ 56%] 2025-12-04T12:25:43.2508609Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_float64 PASSED [0.0050s] [ 56%] 2025-12-04T12:25:43.2508731Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_int64 PASSED [0.0049s] [ 56%] 2025-12-04T12:25:43.2508853Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_equal_cuda_uint8 PASSED [1.0615s] [ 56%] 2025-12-04T12:25:43.2508978Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erf_cuda_float16 PASSED [0.0055s] [ 56%] 2025-12-04T12:25:43.2509098Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfc_cuda_bool PASSED [0.0050s] [ 56%] 2025-12-04T12:25:43.2509233Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfc_cuda_uint8 PASSED [0.0043s] [ 56%] 2025-12-04T12:25:43.2509376Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_bfloat16 PASSED [1.0378s] [ 56%] 2025-12-04T12:25:43.2509499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_bool PASSED [0.0051s] [ 56%] 2025-12-04T12:25:43.2509620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_erfinv_cuda_uint8 PASSED [0.0036s] [ 56%] 2025-12-04T12:25:43.2509761Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_complex128 PASSED [1.0513s] [ 56%] 2025-12-04T12:25:43.2509884Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_float32 PASSED [0.0072s] [ 56%] 2025-12-04T12:25:43.2510006Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_int64 PASSED [0.0045s] [ 56%] 2025-12-04T12:25:43.2510125Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp2_cuda_uint8 PASSED [1.0344s] [ 56%] 2025-12-04T12:25:43.2510246Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_int32 PASSED [0.0065s] [ 56%] 2025-12-04T12:25:43.2510364Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_int8 PASSED [0.0045s] [ 56%] 2025-12-04T12:25:43.2510483Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exp_cuda_uint8 PASSED [1.0444s] [ 56%] 2025-12-04T12:25:43.2510616Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_bfloat16 PASSED [0.0049s] [ 56%] 2025-12-04T12:25:43.2510745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_bool PASSED [1.0531s] [ 56%] 2025-12-04T12:25:43.2510875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_as_cuda_float64 PASSED [0.0054s] [ 56%] 2025-12-04T12:25:43.2511008Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_copy_cuda_bfloat16 PASSED [0.0087s] [ 56%] 2025-12-04T12:25:43.2511146Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_copy_cuda_complex64 PASSED [0.0080s] [ 56%] 2025-12-04T12:25:43.2511281Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_copy_cuda_float16 PASSED [0.0084s] [ 56%] 2025-12-04T12:25:43.2511411Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_bfloat16 PASSED [0.0058s] [ 56%] 2025-12-04T12:25:43.2511549Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_bool PASSED [0.0054s] [ 56%] 2025-12-04T12:25:43.2511677Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_float16 PASSED [1.0395s] [ 56%] 2025-12-04T12:25:43.2511801Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_float64 PASSED [0.0073s] [ 56%] 2025-12-04T12:25:43.2511922Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expand_cuda_int8 PASSED [0.0054s] [ 56%] 2025-12-04T12:25:43.2512042Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_bool PASSED [0.0035s] [ 56%] 2025-12-04T12:25:43.2512168Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_float32 PASSED [1.0357s] [ 56%] 2025-12-04T12:25:43.2512292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_expm1_cuda_int16 PASSED [0.0052s] [ 56%] 2025-12-04T12:25:43.2512429Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_exponential_cuda_float64 PASSED [0.0059s] [ 56%] 2025-12-04T12:25:43.2512550Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_float16 PASSED [0.0628s] [ 56%] 2025-12-04T12:25:43.2512680Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_float8_e4m3fn PASSED [0.0620s] [ 56%] 2025-12-04T12:25:43.2512815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_float8_e4m3fnuz PASSED [0.0619s] [ 56%] 2025-12-04T12:25:43.2512933Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_int16 PASSED [0.0709s] [ 56%] 2025-12-04T12:25:43.2513061Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_int64 PASSED [1.0910s] [ 56%] 2025-12-04T12:25:43.2513188Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_int8 PASSED [0.0653s] [ 56%] 2025-12-04T12:25:43.2513306Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_eye_cuda_uint8 PASSED [0.0618s] [ 56%] 2025-12-04T12:25:43.2513440Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_complex128 PASSED [0.3208s] [ 56%] 2025-12-04T12:25:43.2513585Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_float16 PASSED [0.0182s] [ 56%] 2025-12-04T12:25:43.2513711Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_float32 PASSED [0.0104s] [ 56%] 2025-12-04T12:25:43.2513840Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_float64 PASSED [0.0079s] [ 56%] 2025-12-04T12:25:43.2513963Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_int16 PASSED [0.0079s] [ 56%] 2025-12-04T12:25:43.2514088Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_int8 PASSED [0.0079s] [ 56%] 2025-12-04T12:25:43.2514213Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft2_cuda_uint8 PASSED [0.0080s] [ 56%] 2025-12-04T12:25:43.2514340Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_bool PASSED [0.0086s] [ 56%] 2025-12-04T12:25:43.2514468Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_complex32 PASSED [0.0081s] [ 56%] 2025-12-04T12:25:43.2514597Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_float64 PASSED [0.0282s] [ 56%] 2025-12-04T12:25:43.2514719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_int32 PASSED [0.0086s] [ 56%] 2025-12-04T12:25:43.2514845Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_int64 PASSED [0.0085s] [ 56%] 2025-12-04T12:25:43.2514972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_int8 PASSED [0.0084s] [ 56%] 2025-12-04T12:25:43.2515099Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fft_cuda_uint8 PASSED [0.0084s] [ 56%] 2025-12-04T12:25:43.2515229Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_complex32 PASSED [0.0094s] [ 56%] 2025-12-04T12:25:43.2515372Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_complex64 PASSED [0.0093s] [ 56%] 2025-12-04T12:25:43.2515500Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_float32 PASSED [1.0453s] [ 56%] 2025-12-04T12:25:43.2515625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_float64 PASSED [0.0122s] [ 56%] 2025-12-04T12:25:43.2515750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_int8 PASSED [0.0101s] [ 56%] 2025-12-04T12:25:43.2515874Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftn_cuda_uint8 PASSED [0.0097s] [ 56%] 2025-12-04T12:25:43.2516003Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_bool PASSED [0.0078s] [ 56%] 2025-12-04T12:25:43.2516132Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_int32 PASSED [0.0074s] [ 56%] 2025-12-04T12:25:43.2516263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_fftshift_cuda_int64 PASSED [0.0075s] [ 56%] 2025-12-04T12:25:43.2516394Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_complex32 PASSED [0.0318s] [ 56%] 2025-12-04T12:25:43.2516523Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_float16 PASSED [0.0153s] [ 56%] 2025-12-04T12:25:43.2516651Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_int16 PASSED [1.0512s] [ 56%] 2025-12-04T12:25:43.2516779Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_int8 PASSED [0.0191s] [ 56%] 2025-12-04T12:25:43.2516916Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft2_cuda_uint8 PASSED [1.0536s] [ 56%] 2025-12-04T12:25:43.2517052Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_bool PASSED [0.0170s] [ 56%] 2025-12-04T12:25:43.2517179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_float32 PASSED [0.0150s] [ 56%] 2025-12-04T12:25:43.2517304Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfft_cuda_int32 PASSED [0.0147s] [ 56%] 2025-12-04T12:25:43.2517440Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_bool PASSED [1.0541s] [ 56%] 2025-12-04T12:25:43.2517573Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_complex32 PASSED [0.0193s] [ 56%] 2025-12-04T12:25:43.2517700Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_int16 PASSED [0.0190s] [ 56%] 2025-12-04T12:25:43.2517827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_hfftn_cuda_int64 PASSED [0.0186s] [ 56%] 2025-12-04T12:25:43.2517954Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft2_cuda_int32 PASSED [0.0081s] [ 56%] 2025-12-04T12:25:43.2518157Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_bool PASSED [0.0144s] [ 56%] 2025-12-04T12:25:43.2518291Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_complex64 PASSED [1.0430s] [ 56%] 2025-12-04T12:25:43.2518419Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_float64 PASSED [0.0167s] [ 56%] 2025-12-04T12:25:43.2518544Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_int16 PASSED [0.0148s] [ 56%] 2025-12-04T12:25:43.2518667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_int32 PASSED [0.0144s] [ 56%] 2025-12-04T12:25:43.2518792Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifft_cuda_uint8 PASSED [1.0475s] [ 56%] 2025-12-04T12:25:43.2518924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_complex128 PASSED [0.0116s] [ 56%] 2025-12-04T12:25:43.2519056Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_complex32 PASSED [0.0097s] [ 56%] 2025-12-04T12:25:43.2519200Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_float16 PASSED [0.0099s] [ 56%] 2025-12-04T12:25:43.2519332Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_float64 PASSED [0.0097s] [ 56%] 2025-12-04T12:25:43.2519459Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftn_cuda_int8 PASSED [1.0467s] [ 56%] 2025-12-04T12:25:43.2519600Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_complex128 PASSED [0.0099s] [ 56%] 2025-12-04T12:25:43.2519737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_complex32 PASSED [1.0421s] [ 56%] 2025-12-04T12:25:43.2519875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_float32 PASSED [0.0098s] [ 56%] 2025-12-04T12:25:43.2520010Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ifftshift_cuda_float64 PASSED [1.0448s] [ 56%] 2025-12-04T12:25:43.2520137Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_int32 PASSED [0.0166s] [ 56%] 2025-12-04T12:25:43.2520265Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft2_cuda_int64 PASSED [1.0528s] [ 56%] 2025-12-04T12:25:43.2520388Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_bool PASSED [0.0169s] [ 56%] 2025-12-04T12:25:43.2520519Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_float16 PASSED [0.0133s] [ 56%] 2025-12-04T12:25:43.2520645Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_float32 PASSED [0.0140s] [ 56%] 2025-12-04T12:25:43.2520792Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_float64 PASSED [1.0542s] [ 56%] 2025-12-04T12:25:43.2520933Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfft_cuda_int64 PASSED [0.0168s] [ 56%] 2025-12-04T12:25:43.2521063Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_float16 PASSED [0.0201s] [ 56%] 2025-12-04T12:25:43.2521192Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_float32 PASSED [0.0154s] [ 56%] 2025-12-04T12:25:43.2521336Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_float64 PASSED [0.3290s] [ 56%] 2025-12-04T12:25:43.2521464Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_ihfftn_cuda_int8 PASSED [0.0276s] [ 56%] 2025-12-04T12:25:43.2521599Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_complex32 PASSED [0.0087s] [ 56%] 2025-12-04T12:25:43.2521732Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_complex64 PASSED [0.0082s] [ 56%] 2025-12-04T12:25:43.2521863Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_float16 PASSED [0.0088s] [ 56%] 2025-12-04T12:25:43.2521992Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_float32 PASSED [1.0368s] [ 56%] 2025-12-04T12:25:43.2522126Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_float64 PASSED [0.1594s] [ 56%] 2025-12-04T12:25:43.2522251Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_int8 PASSED [0.0088s] [ 56%] 2025-12-04T12:25:43.2522378Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft2_cuda_uint8 PASSED [0.0085s] [ 56%] 2025-12-04T12:25:43.2522506Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_float64 PASSED [0.0088s] [ 56%] 2025-12-04T12:25:43.2522634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_int32 PASSED [0.0087s] [ 57%] 2025-12-04T12:25:43.2522761Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfft_cuda_int64 PASSED [0.0087s] [ 57%] 2025-12-04T12:25:43.2522894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_complex64 PASSED [0.0099s] [ 57%] 2025-12-04T12:25:43.2523034Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_float64 PASSED [1.0451s] [ 57%] 2025-12-04T12:25:43.2523162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_int32 PASSED [0.0123s] [ 57%] 2025-12-04T12:25:43.2523288Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_irfftn_cuda_int8 PASSED [0.0105s] [ 57%] 2025-12-04T12:25:43.2523412Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft2_cuda_bool PASSED [0.0085s] [ 57%] 2025-12-04T12:25:43.2523537Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fft_rfft2_cuda_int16 PASSED [0.0083s] [ 57%] 2025-12-04T12:25:43.2523664Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_float32 PASSED [0.0039s] [ 57%] 2025-12-04T12:25:43.2523788Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_int8 PASSED [1.0486s] [ 57%] 2025-12-04T12:25:43.2523910Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fill_cuda_uint8 PASSED [0.0058s] [ 57%] 2025-12-04T12:25:43.2524039Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_bfloat16 PASSED [0.0042s] [ 57%] 2025-12-04T12:25:43.2524172Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_complex128 PASSED [1.0338s] [ 57%] 2025-12-04T12:25:43.2524299Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_float16 PASSED [0.0058s] [ 57%] 2025-12-04T12:25:43.2524423Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_int16 PASSED [0.0041s] [ 57%] 2025-12-04T12:25:43.2524558Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flatten_cuda_int32 PASSED [1.0351s] [ 57%] 2025-12-04T12:25:43.2524686Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_bool PASSED [0.0101s] [ 57%] 2025-12-04T12:25:43.2524814Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_complex128 PASSED [0.0083s] [ 57%] 2025-12-04T12:25:43.2524941Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_float16 PASSED [0.0080s] [ 57%] 2025-12-04T12:25:43.2525075Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_float32 PASSED [0.0079s] [ 57%] 2025-12-04T12:25:43.2525197Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_int32 PASSED [0.0078s] [ 57%] 2025-12-04T12:25:43.2525315Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flip_cuda_int64 PASSED [0.0079s] [ 57%] 2025-12-04T12:25:43.2525444Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fliplr_cuda_complex64 PASSED [0.0034s] [ 57%] 2025-12-04T12:25:43.2525568Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fliplr_cuda_float16 PASSED [0.0033s] [ 57%] 2025-12-04T12:25:43.2525691Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fliplr_cuda_int8 PASSED [0.0032s] [ 57%] 2025-12-04T12:25:43.2525815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fliplr_cuda_uint8 PASSED [0.0033s] [ 57%] 2025-12-04T12:25:43.2525940Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_bool PASSED [0.0034s] [ 57%] 2025-12-04T12:25:43.2526070Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_flipud_cuda_complex64 PASSED [0.0032s] [ 57%] 2025-12-04T12:25:43.2526196Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_bfloat16 PASSED [1.0311s] [ 57%] 2025-12-04T12:25:43.2526316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_bool PASSED [0.0039s] [ 57%] 2025-12-04T12:25:43.2526439Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_int16 PASSED [1.0520s] [ 57%] 2025-12-04T12:25:43.2526560Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_cuda_int64 PASSED [0.0041s] [ 57%] 2025-12-04T12:25:43.2526693Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_float16 PASSED [0.0118s] [ 57%] 2025-12-04T12:25:43.2526834Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_int64 PASSED [0.0105s] [ 57%] 2025-12-04T12:25:43.2526962Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_int8 PASSED [0.0104s] [ 57%] 2025-12-04T12:25:43.2527090Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_float_power_cuda_uint8 PASSED [0.0103s] [ 57%] 2025-12-04T12:25:43.2527215Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_cuda_bfloat16 PASSED [1.0695s] [ 57%] 2025-12-04T12:25:43.2527352Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_bfloat16 PASSED [0.0420s] [ 57%] 2025-12-04T12:25:43.2527485Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_float16 PASSED [0.0391s] [ 57%] 2025-12-04T12:25:43.2527619Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_float64 PASSED [0.0361s] [ 57%] 2025-12-04T12:25:43.2527749Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_floor_divide_cuda_uint8 PASSED [0.0106s] [ 57%] 2025-12-04T12:25:43.2527874Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_bfloat16 PASSED [0.0129s] [ 57%] 2025-12-04T12:25:43.2527994Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_int64 PASSED [0.0096s] [ 57%] 2025-12-04T12:25:43.2528155Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmax_cuda_int8 PASSED [0.0096s] [ 57%] 2025-12-04T12:25:43.2528276Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_float32 PASSED [0.0097s] [ 57%] 2025-12-04T12:25:43.2528409Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_int16 PASSED [0.0096s] [ 57%] 2025-12-04T12:25:43.2528540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmin_cuda_int32 PASSED [0.0095s] [ 57%] 2025-12-04T12:25:43.2528665Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_bfloat16 PASSED [0.0158s] [ 57%] 2025-12-04T12:25:43.2528786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_float32 PASSED [0.0106s] [ 57%] 2025-12-04T12:25:43.2528920Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_float64 PASSED [0.0103s] [ 57%] 2025-12-04T12:25:43.2529039Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_int16 PASSED [0.0101s] [ 57%] 2025-12-04T12:25:43.2529158Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_fmod_cuda_int64 PASSED [0.0103s] [ 57%] 2025-12-04T12:25:43.2529284Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_frac_cuda_bfloat16 PASSED [1.0691s] [ 57%] 2025-12-04T12:25:43.2529405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_frac_cuda_float16 PASSED [0.0061s] [ 57%] 2025-12-04T12:25:43.2529525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_frac_cuda_float64 PASSED [0.0040s] [ 57%] 2025-12-04T12:25:43.2529643Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_bool PASSED [1.0542s] [ 57%] 2025-12-04T12:25:43.2529770Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_complex128 PASSED [0.0057s] [ 57%] 2025-12-04T12:25:43.2529890Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_int16 PASSED [0.0041s] [ 57%] 2025-12-04T12:25:43.2530011Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_cuda_int32 PASSED [1.0419s] [ 57%] 2025-12-04T12:25:43.2530141Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_bfloat16 PASSED [0.0089s] [ 57%] 2025-12-04T12:25:43.2530279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_complex128 PASSED [1.0456s] [ 57%] 2025-12-04T12:25:43.2530411Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_complex64 PASSED [0.0087s] [ 57%] 2025-12-04T12:25:43.2530554Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_float32 PASSED [1.0407s] [ 57%] 2025-12-04T12:25:43.2530679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_int16 PASSED [0.0088s] [ 57%] 2025-12-04T12:25:43.2530805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_full_like_cuda_int32 PASSED [1.0454s] [ 57%] 2025-12-04T12:25:43.2530933Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_complex128 PASSED [0.0096s] [ 57%] 2025-12-04T12:25:43.2531060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_complex64 PASSED [1.0518s] [ 57%] 2025-12-04T12:25:43.2531184Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gather_cuda_float32 PASSED [0.0087s] [ 57%] 2025-12-04T12:25:43.2531302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gcd_cuda_int64 PASSED [0.0101s] [ 57%] 2025-12-04T12:25:43.2531417Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_bool PASSED [0.0096s] [ 57%] 2025-12-04T12:25:43.2531539Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_float16 PASSED [0.0121s] [ 57%] 2025-12-04T12:25:43.2531655Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_int16 PASSED [0.0095s] [ 57%] 2025-12-04T12:25:43.2531771Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_int32 PASSED [0.0095s] [ 57%] 2025-12-04T12:25:43.2531886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ge_cuda_int8 PASSED [0.0095s] [ 57%] 2025-12-04T12:25:43.2532011Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_int32 PASSED [0.0053s] [ 57%] 2025-12-04T12:25:43.2532147Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geometric_cuda_int8 PASSED [0.0050s] [ 57%] 2025-12-04T12:25:43.2532282Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geqrf_cuda_complex64 PASSED [0.0190s] [ 57%] 2025-12-04T12:25:43.2532407Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geqrf_cuda_float32 PASSED [0.0179s] [ 57%] 2025-12-04T12:25:43.2532529Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_geqrf_cuda_float64 PASSED [0.0173s] [ 57%] 2025-12-04T12:25:43.2532673Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gradient_cuda_complex128 PASSED [0.3023s] [ 57%] 2025-12-04T12:25:43.2532804Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gradient_cuda_complex64 PASSED [0.3003s] [ 57%] 2025-12-04T12:25:43.2532931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gradient_cuda_float16 PASSED [0.2529s] [ 57%] 2025-12-04T12:25:43.2533057Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gradient_cuda_float64 PASSED [0.1829s] [ 57%] 2025-12-04T12:25:43.2533197Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_2d_cuda_bfloat16 PASSED [0.7599s] [ 57%] 2025-12-04T12:25:43.2533334Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_2d_cuda_float16 PASSED [0.7642s] [ 57%] 2025-12-04T12:25:43.2533472Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_2d_cuda_float64 PASSED [0.7503s] [ 57%] 2025-12-04T12:25:43.2533627Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_grid_sampler_3d_cuda_float64 SKIPPED [0.0001s] (Skipped!) [ 57%] 2025-12-04T12:25:43.2533748Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_float16 PASSED [0.0123s] [ 57%] 2025-12-04T12:25:43.2533866Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_int16 PASSED [0.0097s] [ 57%] 2025-12-04T12:25:43.2533982Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_int32 PASSED [0.0097s] [ 57%] 2025-12-04T12:25:43.2534099Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_int8 PASSED [0.0096s] [ 57%] 2025-12-04T12:25:43.2534214Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_gt_cuda_uint8 PASSED [0.0096s] [ 57%] 2025-12-04T12:25:43.2534355Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_half_cuda_float64 PASSED [1.0479s] [ 57%] 2025-12-04T12:25:43.2534476Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_half_cuda_int32 PASSED [0.0040s] [ 57%] 2025-12-04T12:25:43.2534608Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_int32 PASSED [0.0120s] [ 57%] 2025-12-04T12:25:43.2534735Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_int64 PASSED [0.0111s] [ 57%] 2025-12-04T12:25:43.2534864Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hash_tensor_cuda_uint8 PASSED [0.0109s] [ 57%] 2025-12-04T12:25:43.2534994Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_bfloat16 PASSED [0.0209s] [ 57%] 2025-12-04T12:25:43.2535125Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_float64 PASSED [0.0177s] [ 57%] 2025-12-04T12:25:43.2535250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_int32 PASSED [0.0188s] [ 57%] 2025-12-04T12:25:43.2535376Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_int8 PASSED [0.0188s] [ 57%] 2025-12-04T12:25:43.2535501Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_heaviside_cuda_uint8 PASSED [0.0188s] [ 57%] 2025-12-04T12:25:43.2535625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_histc_cuda_float64 PASSED [0.0525s] [ 57%] 2025-12-04T12:25:43.2535750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_bfloat16 PASSED [0.0033s] [ 57%] 2025-12-04T12:25:43.2535891Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_complex128 PASSED [1.0629s] [ 57%] 2025-12-04T12:25:43.2536029Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_complex32 PASSED [0.0047s] [ 57%] 2025-12-04T12:25:43.2536152Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_float16 PASSED [0.0035s] [ 57%] 2025-12-04T12:25:43.2536278Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_float32 PASSED [1.0460s] [ 57%] 2025-12-04T12:25:43.2536409Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hsplit_cuda_int64 PASSED [0.0049s] [ 57%] 2025-12-04T12:25:43.2536539Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_complex128 PASSED [0.0046s] [ 57%] 2025-12-04T12:25:43.2536666Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_complex32 PASSED [0.0041s] [ 57%] 2025-12-04T12:25:43.2536791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hstack_cuda_float16 PASSED [1.0478s] [ 57%] 2025-12-04T12:25:43.2536914Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_hypot_cuda_float16 PASSED [0.0155s] [ 57%] 2025-12-04T12:25:43.2537032Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_bool PASSED [1.0350s] [ 57%] 2025-12-04T12:25:43.2537151Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_float32 PASSED [0.0055s] [ 57%] 2025-12-04T12:25:43.2537271Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_float64 PASSED [0.0037s] [ 57%] 2025-12-04T12:25:43.2537386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_int16 PASSED [1.0418s] [ 57%] 2025-12-04T12:25:43.2537502Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_i0_cuda_uint8 PASSED [0.0056s] [ 57%] 2025-12-04T12:25:43.2537625Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_igamma_cuda_float64 PASSED [0.0106s] [ 57%] 2025-12-04T12:25:43.2537753Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_imag_cuda_complex128 PASSED [1.0456s] [ 57%] 2025-12-04T12:25:43.2537877Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_imag_cuda_complex32 PASSED [0.0053s] [ 57%] 2025-12-04T12:25:43.2538005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_float64 PASSED [0.0118s] [ 57%] 2025-12-04T12:25:43.2538182Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_int16 PASSED [0.0108s] [ 57%] 2025-12-04T12:25:43.2538309Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_add_cuda_int32 PASSED [1.0624s] [ 57%] 2025-12-04T12:25:43.2538444Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_complex128 PASSED [0.0068s] [ 57%] 2025-12-04T12:25:43.2538576Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_complex64 PASSED [0.0050s] [ 57%] 2025-12-04T12:25:43.2538710Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_float16 PASSED [0.0047s] [ 57%] 2025-12-04T12:25:43.2538843Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_float32 PASSED [0.0047s] [ 57%] 2025-12-04T12:25:43.2538974Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_float64 PASSED [0.0047s] [ 57%] 2025-12-04T12:25:43.2539102Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_int32 PASSED [0.0046s] [ 57%] 2025-12-04T12:25:43.2539230Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_copy_cuda_int64 PASSED [0.0046s] [ 57%] 2025-12-04T12:25:43.2539362Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_complex32 PASSED [0.0078s] [ 57%] 2025-12-04T12:25:43.2539492Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_float16 PASSED [0.0075s] [ 57%] 2025-12-04T12:25:43.2539620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_fill_cuda_uint8 PASSED [0.0074s] [ 57%] 2025-12-04T12:25:43.2539759Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_put_cuda_bool PASSED [0.0058s] [ 57%] 2025-12-04T12:25:43.2539913Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_bfloat16 PASSED [0.0078s] [ 57%] 2025-12-04T12:25:43.2540055Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_float16 PASSED [1.0497s] [ 57%] 2025-12-04T12:25:43.2540210Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_float32 PASSED [0.0100s] [ 58%] 2025-12-04T12:25:43.2540347Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_int16 PASSED [0.0081s] [ 58%] 2025-12-04T12:25:43.2540484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amax_cuda_uint8 PASSED [0.0078s] [ 58%] 2025-12-04T12:25:43.2540623Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amin_cuda_float16 PASSED [0.0078s] [ 58%] 2025-12-04T12:25:43.2540761Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amin_cuda_int16 PASSED [0.0077s] [ 58%] 2025-12-04T12:25:43.2540898Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amin_cuda_int64 PASSED [1.0522s] [ 58%] 2025-12-04T12:25:43.2541039Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amin_cuda_int8 PASSED [0.0098s] [ 58%] 2025-12-04T12:25:43.2541182Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_amin_cuda_uint8 PASSED [0.0080s] [ 58%] 2025-12-04T12:25:43.2541323Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_mean_cuda_float32 PASSED [0.0086s] [ 58%] 2025-12-04T12:25:43.2541458Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_mean_cuda_int16 PASSED [0.0083s] [ 58%] 2025-12-04T12:25:43.2541594Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_mean_cuda_int64 PASSED [0.0083s] [ 58%] 2025-12-04T12:25:43.2541730Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_mean_cuda_uint8 PASSED [1.0492s] [ 58%] 2025-12-04T12:25:43.2541870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_float32 PASSED [0.0097s] [ 58%] 2025-12-04T12:25:43.2542022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_float64 PASSED [0.0080s] [ 58%] 2025-12-04T12:25:43.2542160Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_int32 PASSED [0.0079s] [ 58%] 2025-12-04T12:25:43.2542295Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_reduce_prod_cuda_uint8 PASSED [0.0077s] [ 58%] 2025-12-04T12:25:43.2542432Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_complex128 PASSED [0.0046s] [ 58%] 2025-12-04T12:25:43.2542567Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_complex64 PASSED [0.0043s] [ 58%] 2025-12-04T12:25:43.2542698Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_index_select_cuda_int8 PASSED [1.0612s] [ 58%] 2025-12-04T12:25:43.2542826Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_inner_cuda_complex128 PASSED [0.0077s] [ 58%] 2025-12-04T12:25:43.2542951Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_inner_cuda_float32 PASSED [1.0506s] [ 58%] 2025-12-04T12:25:43.2543075Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_inner_cuda_float64 PASSED [0.0075s] [ 58%] 2025-12-04T12:25:43.2543197Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_bfloat16 PASSED [1.0483s] [ 58%] 2025-12-04T12:25:43.2543317Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_bool PASSED [0.0040s] [ 58%] 2025-12-04T12:25:43.2543441Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_complex128 PASSED [1.0395s] [ 58%] 2025-12-04T12:25:43.2543574Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_int_cuda_float32 PASSED [0.0040s] [ 58%] 2025-12-04T12:25:43.2543712Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_bfloat16 PASSED [0.1468s] [ 58%] 2025-12-04T12:25:43.2543843Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_complex64 PASSED [0.1038s] [ 58%] 2025-12-04T12:25:43.2543969Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_float32 PASSED [0.1023s] [ 58%] 2025-12-04T12:25:43.2544104Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isclose_cuda_int64 PASSED [0.1001s] [ 58%] 2025-12-04T12:25:43.2544237Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_complex128 PASSED [0.0182s] [ 58%] 2025-12-04T12:25:43.2544369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_complex32 PASSED [0.0192s] [ 58%] 2025-12-04T12:25:43.2544496Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_float32 PASSED [0.0084s] [ 58%] 2025-12-04T12:25:43.2544621Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_int32 PASSED [1.0779s] [ 58%] 2025-12-04T12:25:43.2544747Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isfinite_cuda_int64 PASSED [0.0061s] [ 58%] 2025-12-04T12:25:43.2544870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_float32 PASSED [0.0046s] [ 58%] 2025-12-04T12:25:43.2544991Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_float64 PASSED [1.0629s] [ 58%] 2025-12-04T12:25:43.2545112Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_int16 PASSED [0.0062s] [ 58%] 2025-12-04T12:25:43.2545233Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isin_cuda_uint8 PASSED [0.0044s] [ 58%] 2025-12-04T12:25:43.2545361Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_complex128 PASSED [1.0500s] [ 58%] 2025-12-04T12:25:43.2545489Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_complex64 PASSED [0.0059s] [ 58%] 2025-12-04T12:25:43.2545612Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_float16 PASSED [0.0039s] [ 58%] 2025-12-04T12:25:43.2545749Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_float32 PASSED [1.0596s] [ 58%] 2025-12-04T12:25:43.2545868Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isinf_cuda_int8 PASSED [0.0047s] [ 58%] 2025-12-04T12:25:43.2545995Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_bfloat16 PASSED [0.0037s] [ 58%] 2025-12-04T12:25:43.2546117Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_float32 PASSED [1.0517s] [ 58%] 2025-12-04T12:25:43.2546238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isnan_cuda_int16 PASSED [0.0045s] [ 58%] 2025-12-04T12:25:43.2546366Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_float16 PASSED [0.0035s] [ 58%] 2025-12-04T12:25:43.2546496Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_float64 PASSED [1.0489s] [ 58%] 2025-12-04T12:25:43.2546620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_int8 PASSED [0.0040s] [ 58%] 2025-12-04T12:25:43.2546747Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isneginf_cuda_uint8 PASSED [1.0480s] [ 58%] 2025-12-04T12:25:43.2546872Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isposinf_cuda_int32 PASSED [0.0044s] [ 58%] 2025-12-04T12:25:43.2546997Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isposinf_cuda_int64 PASSED [1.0339s] [ 58%] 2025-12-04T12:25:43.2547123Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_complex32 PASSED [0.0081s] [ 58%] 2025-12-04T12:25:43.2547247Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_float64 PASSED [1.0386s] [ 58%] 2025-12-04T12:25:43.2547380Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_int16 PASSED [0.0057s] [ 58%] 2025-12-04T12:25:43.2547515Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_isreal_cuda_uint8 PASSED [0.0041s] [ 58%] 2025-12-04T12:25:43.2547642Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_bfloat16 PASSED [1.0434s] [ 58%] 2025-12-04T12:25:43.2547767Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_complex64 PASSED [0.0055s] [ 58%] 2025-12-04T12:25:43.2547904Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_int32 PASSED [1.0295s] [ 58%] 2025-12-04T12:25:43.2548024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_int64 PASSED [0.0055s] [ 58%] 2025-12-04T12:25:43.2548193Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_int8 PASSED [1.0385s] [ 58%] 2025-12-04T12:25:43.2548313Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_item_cuda_uint8 PASSED [0.0055s] [ 58%] 2025-12-04T12:25:43.2548473Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_complex128 PASSED [0.0067s] [ 58%] 2025-12-04T12:25:43.2548627Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_float16 PASSED [0.0037s] [ 58%] 2025-12-04T12:25:43.2548782Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_float32 PASSED [0.0039s] [ 58%] 2025-12-04T12:25:43.2548934Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_float64 PASSED [0.0048s] [ 58%] 2025-12-04T12:25:43.2549085Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_int32 PASSED [0.0049s] [ 58%] 2025-12-04T12:25:43.2549234Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_2inputs_2outputs_cuda_int64 PASSED [0.2634s] [ 58%] 2025-12-04T12:25:43.2549395Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_4inputs_with_extra_args_cuda_bool PASSED [0.2938s] [ 58%] 2025-12-04T12:25:43.2549561Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_4inputs_with_extra_args_cuda_complex64 PASSED [0.0066s] [ 58%] 2025-12-04T12:25:43.2549737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_4inputs_with_extra_args_cuda_int16 PASSED [0.3040s] [ 58%] 2025-12-04T12:25:43.2549881Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_cuda_complex64 PASSED [0.0045s] [ 58%] 2025-12-04T12:25:43.2550021Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_cuda_float64 PASSED [0.0038s] [ 58%] 2025-12-04T12:25:43.2550179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_float32 PASSED [0.2593s] [ 58%] 2025-12-04T12:25:43.2550335Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_int16 PASSED [0.3123s] [ 58%] 2025-12-04T12:25:43.2550491Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_int8 PASSED [0.2563s] [ 58%] 2025-12-04T12:25:43.2550645Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_binary_return_by_ref_cuda_uint8 PASSED [0.2567s] [ 58%] 2025-12-04T12:25:43.2550789Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_unary_cuda_complex64 PASSED [1.1984s] [ 58%] 2025-12-04T12:25:43.2550926Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_unary_cuda_int16 PASSED [0.1210s] [ 58%] 2025-12-04T12:25:43.2551064Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_jiterator_unary_cuda_uint8 PASSED [1.0365s] [ 58%] 2025-12-04T12:25:43.2551185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_int16 PASSED [0.0064s] [ 58%] 2025-12-04T12:25:43.2551321Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kron_cuda_uint8 PASSED [0.0046s] [ 58%] 2025-12-04T12:25:43.2551464Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_float64 PASSED [0.0059s] [ 58%] 2025-12-04T12:25:43.2551590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_int64 PASSED [1.0629s] [ 58%] 2025-12-04T12:25:43.2551714Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_int8 PASSED [0.0073s] [ 58%] 2025-12-04T12:25:43.2551852Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_kthvalue_cuda_uint8 PASSED [0.0057s] [ 58%] 2025-12-04T12:25:43.2551972Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lcm_cuda_int16 PASSED [0.0210s] [ 58%] 2025-12-04T12:25:43.2552090Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lcm_cuda_int64 PASSED [0.0147s] [ 58%] 2025-12-04T12:25:43.2552209Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lcm_cuda_uint8 PASSED [0.0148s] [ 58%] 2025-12-04T12:25:43.2552336Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_bfloat16 PASSED [0.0203s] [ 58%] 2025-12-04T12:25:43.2552464Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_complex64 PASSED [0.0148s] [ 58%] 2025-12-04T12:25:43.2552586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ldexp_cuda_int16 PASSED [0.0159s] [ 58%] 2025-12-04T12:25:43.2552706Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_float16 PASSED [0.0120s] [ 58%] 2025-12-04T12:25:43.2552825Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_le_cuda_float32 PASSED [0.0096s] [ 58%] 2025-12-04T12:25:43.2552950Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_complex32 PASSED [0.0194s] [ 58%] 2025-12-04T12:25:43.2553072Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lerp_cuda_float32 PASSED [0.0134s] [ 58%] 2025-12-04T12:25:43.2553199Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_bfloat16 PASSED [0.0042s] [ 58%] 2025-12-04T12:25:43.2553321Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_bool PASSED [1.0697s] [ 58%] 2025-12-04T12:25:43.2553446Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_float16 PASSED [0.0060s] [ 58%] 2025-12-04T12:25:43.2553580Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lgamma_cuda_int8 PASSED [0.0043s] [ 58%] 2025-12-04T12:25:43.2553719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cholesky_cuda_float64 PASSED [0.0177s] [ 58%] 2025-12-04T12:25:43.2553861Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cholesky_ex_cuda_float64 PASSED [1.0648s] [ 58%] 2025-12-04T12:25:43.2553995Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cond_cuda_float32 PASSED [1.0660s] [ 58%] 2025-12-04T12:25:43.2554127Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cond_cuda_float64 PASSED [1.0582s] [ 58%] 2025-12-04T12:25:43.2554269Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_complex128 PASSED [0.0059s] [ 58%] 2025-12-04T12:25:43.2554404Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_float16 PASSED [0.0043s] [ 58%] 2025-12-04T12:25:43.2554535Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_int16 PASSED [0.0042s] [ 58%] 2025-12-04T12:25:43.2554667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_cross_cuda_int64 PASSED [1.0493s] [ 58%] 2025-12-04T12:25:43.2554797Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_det_cuda_float32 PASSED [0.0144s] [ 58%] 2025-12-04T12:25:43.2554930Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_bool PASSED [0.0068s] [ 58%] 2025-12-04T12:25:43.2555067Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_float16 PASSED [0.0068s] [ 58%] 2025-12-04T12:25:43.2555216Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_float64 PASSED [0.0071s] [ 58%] 2025-12-04T12:25:43.2555357Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_int8 PASSED [0.0066s] [ 58%] 2025-12-04T12:25:43.2555495Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_diagonal_cuda_uint8 PASSED [0.0064s] [ 58%] 2025-12-04T12:25:43.2555639Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eig_cuda_complex128 PASSED [0.0473s] [ 58%] 2025-12-04T12:25:43.2555772Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigh_cuda_float64 PASSED [0.0079s] [ 58%] 2025-12-04T12:25:43.2555912Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigvals_cuda_complex128 PASSED [0.0466s] [ 58%] 2025-12-04T12:25:43.2556054Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigvalsh_cuda_complex128 PASSED [0.0115s] [ 58%] 2025-12-04T12:25:43.2556195Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigvalsh_cuda_complex64 PASSED [0.0110s] [ 58%] 2025-12-04T12:25:43.2556333Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_eigvalsh_cuda_float64 PASSED [0.0104s] [ 58%] 2025-12-04T12:25:43.2556585Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_householder_product_cuda_complex128 SKIPPED [0.0007s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 58%] 2025-12-04T12:25:43.2556830Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_householder_product_cuda_complex64 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 58%] 2025-12-04T12:25:43.2556965Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_inv_cuda_complex64 PASSED [0.0120s] [ 58%] 2025-12-04T12:25:43.2557104Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_inv_ex_cuda_complex128 PASSED [0.0070s] [ 58%] 2025-12-04T12:25:43.2557250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_factor_cuda_complex128 PASSED [0.0219s] [ 58%] 2025-12-04T12:25:43.2557388Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_factor_cuda_float32 PASSED [0.0049s] [ 58%] 2025-12-04T12:25:43.2557630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_factor_cuda_float64 PASSED [0.0048s] [ 58%] 2025-12-04T12:25:43.2557776Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_ldl_factor_ex_cuda_float64 PASSED [0.0040s] [ 58%] 2025-12-04T12:25:43.2557914Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lstsq_cuda_complex128 PASSED [1.7642s] [ 58%] 2025-12-04T12:25:43.2558048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lstsq_cuda_float64 PASSED [0.1385s] [ 58%] 2025-12-04T12:25:43.2558242Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lstsq_grad_oriented_cuda_complex128 PASSED [1.7568s] [ 58%] 2025-12-04T12:25:43.2558386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_cuda_complex128 PASSED [0.0436s] [ 58%] 2025-12-04T12:25:43.2558534Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_ex_cuda_complex128 PASSED [0.0174s] [ 58%] 2025-12-04T12:25:43.2558677Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_factor_ex_cuda_float64 PASSED [0.0166s] [ 58%] 2025-12-04T12:25:43.2558821Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_solve_cuda_complex64 PASSED [0.0981s] [ 58%] 2025-12-04T12:25:43.2558959Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_solve_cuda_float32 PASSED [0.0872s] [ 58%] 2025-12-04T12:25:43.2559095Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_lu_solve_cuda_float64 PASSED [0.0862s] [ 59%] 2025-12-04T12:25:43.2559263Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_power_cuda_complex128 PASSED [0.0565s] [ 59%] 2025-12-04T12:25:43.2559418Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_matrix_power_cuda_float32 PASSED [0.0566s] [ 59%] 2025-12-04T12:25:43.2559552Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_cuda_float32 PASSED [0.1504s] [ 59%] 2025-12-04T12:25:43.2559714Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_subgradients_at_zero_cuda_bfloat16 PASSED [0.0864s] [ 59%] 2025-12-04T12:25:43.2559892Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_subgradients_at_zero_cuda_complex128 PASSED [0.1118s] [ 59%] 2025-12-04T12:25:43.2560054Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_subgradients_at_zero_cuda_complex64 PASSED [0.1096s] [ 59%] 2025-12-04T12:25:43.2560215Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_norm_subgradients_at_zero_cuda_float64 PASSED [0.1157s] [ 59%] 2025-12-04T12:25:43.2560352Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_cuda_complex64 PASSED [0.0606s] [ 59%] 2025-12-04T12:25:43.2560486Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_cuda_float32 PASSED [0.0571s] [ 59%] 2025-12-04T12:25:43.2560637Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_hermitian_cuda_complex128 PASSED [0.0208s] [ 59%] 2025-12-04T12:25:43.2560787Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_hermitian_cuda_complex64 PASSED [0.0214s] [ 59%] 2025-12-04T12:25:43.2560932Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_hermitian_cuda_float64 PASSED [0.0194s] [ 59%] 2025-12-04T12:25:43.2561162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_singular_cuda_complex128 SKIPPED [0.0007s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 59%] 2025-12-04T12:25:43.2561386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_pinv_singular_cuda_complex64 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 59%] 2025-12-04T12:25:43.2561519Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_qr_cuda_float32 PASSED [0.0233s] [ 59%] 2025-12-04T12:25:43.2561678Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_slogdet_cuda_complex128 PASSED [0.0089s] [ 59%] 2025-12-04T12:25:43.2561815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_slogdet_cuda_float64 PASSED [1.6350s] [ 59%] 2025-12-04T12:25:43.2561951Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_cuda_float64 PASSED [0.0513s] [ 59%] 2025-12-04T12:25:43.2562091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_ex_cuda_complex128 PASSED [0.0197s] [ 59%] 2025-12-04T12:25:43.2562231Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_ex_cuda_complex64 PASSED [0.0193s] [ 59%] 2025-12-04T12:25:43.2562369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_ex_cuda_float32 PASSED [0.0191s] [ 59%] 2025-12-04T12:25:43.2562506Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_ex_cuda_float64 PASSED [0.0187s] [ 59%] 2025-12-04T12:25:43.2562659Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_triangular_cuda_complex64 PASSED [0.2712s] [ 59%] 2025-12-04T12:25:43.2562809Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_solve_triangular_cuda_float32 PASSED [0.1189s] [ 59%] 2025-12-04T12:25:43.2562944Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svd_cuda_complex128 PASSED [1.7444s] [ 59%] 2025-12-04T12:25:43.2563078Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svd_cuda_complex64 PASSED [1.7140s] [ 59%] 2025-12-04T12:25:43.2563228Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svdvals_cuda_complex64 PASSED [0.0269s] [ 59%] 2025-12-04T12:25:43.2563386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_svdvals_cuda_float64 PASSED [0.0304s] [ 59%] 2025-12-04T12:25:43.2563525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_tensorinv_cuda_float32 PASSED [0.0084s] [ 59%] 2025-12-04T12:25:43.2563664Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_tensorinv_cuda_float64 PASSED [0.0077s] [ 59%] 2025-12-04T12:25:43.2563818Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_tensorsolve_cuda_float32 PASSED [0.0213s] [ 59%] 2025-12-04T12:25:43.2563958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_tensorsolve_cuda_float64 PASSED [0.0138s] [ 59%] 2025-12-04T12:25:43.2564097Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_complex128 PASSED [1.6566s] [ 59%] 2025-12-04T12:25:43.2564232Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_float64 PASSED [0.0324s] [ 59%] 2025-12-04T12:25:43.2564367Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_int16 PASSED [1.6386s] [ 59%] 2025-12-04T12:25:43.2564500Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_int32 PASSED [0.0348s] [ 59%] 2025-12-04T12:25:43.2564632Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vander_cuda_int64 PASSED [1.6641s] [ 59%] 2025-12-04T12:25:43.2564771Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vecdot_cuda_complex64 PASSED [0.0747s] [ 59%] 2025-12-04T12:25:43.2564905Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vecdot_cuda_float16 PASSED [0.0611s] [ 59%] 2025-12-04T12:25:43.2565053Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linalg_vector_norm_cuda_complex128 PASSED [0.1514s] [ 59%] 2025-12-04T12:25:43.2565185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_complex64 PASSED [0.0217s] [ 59%] 2025-12-04T12:25:43.2565315Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_float32 PASSED [0.0216s] [ 59%] 2025-12-04T12:25:43.2565440Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_int16 PASSED [0.0216s] [ 59%] 2025-12-04T12:25:43.2565578Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_int8 PASSED [0.0216s] [ 59%] 2025-12-04T12:25:43.2565704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_cuda_uint8 PASSED [0.0141s] [ 59%] 2025-12-04T12:25:43.2565859Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_complex128 PASSED [0.1141s] [ 59%] 2025-12-04T12:25:43.2566009Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_float16 PASSED [0.1125s] [ 59%] 2025-12-04T12:25:43.2566158Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_int16 PASSED [0.1125s] [ 59%] 2025-12-04T12:25:43.2566304Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_linspace_tensor_overload_cuda_int32 PASSED [0.1132s] [ 59%] 2025-12-04T12:25:43.2566433Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_complex64 PASSED [1.6414s] [ 59%] 2025-12-04T12:25:43.2566557Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_float16 PASSED [0.0060s] [ 59%] 2025-12-04T12:25:43.2566680Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log10_cuda_uint8 PASSED [0.0042s] [ 59%] 2025-12-04T12:25:43.2566803Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_bfloat16 PASSED [0.0033s] [ 59%] 2025-12-04T12:25:43.2566926Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_bool PASSED [1.6030s] [ 59%] 2025-12-04T12:25:43.2567048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_float32 PASSED [0.0048s] [ 59%] 2025-12-04T12:25:43.2567180Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_int16 PASSED [0.0035s] [ 59%] 2025-12-04T12:25:43.2567311Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log1p_cuda_int64 PASSED [1.6332s] [ 59%] 2025-12-04T12:25:43.2567434Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_bfloat16 PASSED [0.0062s] [ 59%] 2025-12-04T12:25:43.2567561Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_complex128 PASSED [0.1573s] [ 59%] 2025-12-04T12:25:43.2567693Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_float16 PASSED [1.6317s] [ 59%] 2025-12-04T12:25:43.2567818Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_float32 PASSED [0.0066s] [ 59%] 2025-12-04T12:25:43.2567940Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_float64 PASSED [0.0043s] [ 59%] 2025-12-04T12:25:43.2568061Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log2_cuda_int8 PASSED [1.6386s] [ 59%] 2025-12-04T12:25:43.2568217Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_complex32 PASSED [0.1360s] [ 59%] 2025-12-04T12:25:43.2568338Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_float32 PASSED [0.0043s] [ 59%] 2025-12-04T12:25:43.2568459Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_float64 PASSED [1.6255s] [ 59%] 2025-12-04T12:25:43.2568579Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_int16 PASSED [0.0066s] [ 59%] 2025-12-04T12:25:43.2568696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_cuda_int8 PASSED [0.0043s] [ 59%] 2025-12-04T12:25:43.2568828Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_normal_cuda_float64 PASSED [0.0051s] [ 59%] 2025-12-04T12:25:43.2568976Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_bfloat16 PASSED [0.0119s] [ 59%] 2025-12-04T12:25:43.2569125Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_float16 PASSED [1.6148s] [ 59%] 2025-12-04T12:25:43.2569268Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_int16 PASSED [1.6380s] [ 59%] 2025-12-04T12:25:43.2569428Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_log_softmax_with_dtype_cuda_int32 PASSED [1.6277s] [ 59%] 2025-12-04T12:25:43.2569559Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp2_cuda_float16 PASSED [0.0079s] [ 59%] 2025-12-04T12:25:43.2569688Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp_cuda_float16 PASSED [0.0291s] [ 59%] 2025-12-04T12:25:43.2569819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp_cuda_float32 PASSED [0.0250s] [ 59%] 2025-12-04T12:25:43.2569949Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logaddexp_cuda_float64 PASSED [0.0250s] [ 59%] 2025-12-04T12:25:43.2570088Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logcumsumexp_cuda_bfloat16 PASSED [1.6093s] [ 59%] 2025-12-04T12:25:43.2570225Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logcumsumexp_cuda_float16 PASSED [0.0073s] [ 59%] 2025-12-04T12:25:43.2570360Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logcumsumexp_cuda_float32 PASSED [0.0055s] [ 59%] 2025-12-04T12:25:43.2570495Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logcumsumexp_cuda_float64 PASSED [0.0053s] [ 59%] 2025-12-04T12:25:43.2570634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_complex128 PASSED [0.0155s] [ 59%] 2025-12-04T12:25:43.2570766Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_float32 PASSED [0.0123s] [ 59%] 2025-12-04T12:25:43.2570895Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_int16 PASSED [0.0123s] [ 59%] 2025-12-04T12:25:43.2571037Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_and_cuda_int64 PASSED [0.0122s] [ 59%] 2025-12-04T12:25:43.2571185Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_bfloat16 PASSED [0.0044s] [ 59%] 2025-12-04T12:25:43.2571321Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_complex64 PASSED [1.5946s] [ 59%] 2025-12-04T12:25:43.2571465Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_float16 PASSED [0.0072s] [ 59%] 2025-12-04T12:25:43.2571593Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_int16 PASSED [0.0047s] [ 59%] 2025-12-04T12:25:43.2571721Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_int64 PASSED [0.0045s] [ 59%] 2025-12-04T12:25:43.2571851Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_not_cuda_uint8 PASSED [1.6361s] [ 59%] 2025-12-04T12:25:43.2571984Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_complex64 PASSED [0.0144s] [ 59%] 2025-12-04T12:25:43.2572115Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_float32 PASSED [0.0125s] [ 59%] 2025-12-04T12:25:43.2572244Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_int32 PASSED [0.0123s] [ 59%] 2025-12-04T12:25:43.2572372Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_int64 PASSED [0.0123s] [ 59%] 2025-12-04T12:25:43.2572498Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_int8 PASSED [0.0122s] [ 59%] 2025-12-04T12:25:43.2572630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_or_cuda_uint8 PASSED [0.0121s] [ 59%] 2025-12-04T12:25:43.2572763Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_bfloat16 PASSED [0.0152s] [ 59%] 2025-12-04T12:25:43.2572896Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_int16 PASSED [0.0122s] [ 59%] 2025-12-04T12:25:43.2573024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logical_xor_cuda_int64 PASSED [0.0123s] [ 59%] 2025-12-04T12:25:43.2573149Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_bfloat16 PASSED [0.0087s] [ 59%] 2025-12-04T12:25:43.2573283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logit_cuda_uint8 PASSED [0.0080s] [ 59%] 2025-12-04T12:25:43.2573414Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_float64 PASSED [0.1277s] [ 59%] 2025-12-04T12:25:43.2573538Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_int32 PASSED [0.1193s] [ 59%] 2025-12-04T12:25:43.2573663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_int8 PASSED [0.0496s] [ 59%] 2025-12-04T12:25:43.2573786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_cuda_uint8 PASSED [0.0403s] [ 59%] 2025-12-04T12:25:43.2573934Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_int16 PASSED [0.6895s] [ 59%] 2025-12-04T12:25:43.2574082Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logspace_tensor_overload_cuda_int64 PASSED [0.6885s] [ 59%] 2025-12-04T12:25:43.2574216Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_complex128 PASSED [1.6589s] [ 59%] 2025-12-04T12:25:43.2574351Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_complex64 PASSED [0.0340s] [ 59%] 2025-12-04T12:25:43.2574481Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_float16 PASSED [0.0121s] [ 59%] 2025-12-04T12:25:43.2574609Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_int16 PASSED [0.0116s] [ 59%] 2025-12-04T12:25:43.2574733Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_int32 PASSED [0.0115s] [ 59%] 2025-12-04T12:25:43.2574870Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_int64 PASSED [0.0114s] [ 59%] 2025-12-04T12:25:43.2575005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_logsumexp_cuda_uint8 PASSED [1.6284s] [ 59%] 2025-12-04T12:25:43.2575131Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_bfloat16 PASSED [0.0043s] [ 59%] 2025-12-04T12:25:43.2575250Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_bool PASSED [1.6304s] [ 59%] 2025-12-04T12:25:43.2575383Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_float32 PASSED [0.0045s] [ 59%] 2025-12-04T12:25:43.2575503Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_int16 PASSED [1.6304s] [ 59%] 2025-12-04T12:25:43.2575622Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_int8 PASSED [0.0043s] [ 59%] 2025-12-04T12:25:43.2575742Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_long_cuda_uint8 PASSED [1.6349s] [ 59%] 2025-12-04T12:25:43.2575863Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_float32 PASSED [0.0121s] [ 59%] 2025-12-04T12:25:43.2575979Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_int32 PASSED [0.0099s] [ 59%] 2025-12-04T12:25:43.2576097Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lt_cuda_int8 PASSED [0.0096s] [ 59%] 2025-12-04T12:25:43.2576221Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_cuda_complex64 PASSED [0.0292s] [ 59%] 2025-12-04T12:25:43.2576340Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_cuda_float64 PASSED [0.0265s] [ 59%] 2025-12-04T12:25:43.2576471Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_solve_cuda_complex64 PASSED [0.0296s] [ 59%] 2025-12-04T12:25:43.2576600Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_solve_cuda_float64 PASSED [0.0287s] [ 59%] 2025-12-04T12:25:43.2576734Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_unpack_cuda_complex128 PASSED [0.0225s] [ 59%] 2025-12-04T12:25:43.2576863Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_lu_unpack_cuda_float32 PASSED [0.0217s] [ 59%] 2025-12-04T12:25:43.2576999Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_complex128 PASSED [0.0048s] [ 59%] 2025-12-04T12:25:43.2577120Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_complex64 PASSED [0.0047s] [ 59%] 2025-12-04T12:25:43.2577240Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_int32 PASSED [1.6311s] [ 59%] 2025-12-04T12:25:43.2577356Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_int64 PASSED [0.0058s] [ 59%] 2025-12-04T12:25:43.2577472Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mH_cuda_int8 PASSED [0.0040s] [ 59%] 2025-12-04T12:25:43.2577588Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_bool PASSED [1.6485s] [ 59%] 2025-12-04T12:25:43.2577713Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_complex32 PASSED [0.0065s] [ 60%] 2025-12-04T12:25:43.2577835Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_complex64 PASSED [0.0055s] [ 60%] 2025-12-04T12:25:43.2577956Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_float16 PASSED [1.6267s] [ 60%] 2025-12-04T12:25:43.2578089Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_int16 PASSED [0.0057s] [ 60%] 2025-12-04T12:25:43.2578206Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mT_cuda_uint8 PASSED [0.0040s] [ 60%] 2025-12-04T12:25:43.2578338Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amax_cuda_float64 PASSED [0.1519s] [ 60%] 2025-12-04T12:25:43.2578467Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amax_cuda_int64 PASSED [0.1273s] [ 60%] 2025-12-04T12:25:43.2578618Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amax_cuda_uint8 PASSED [0.1281s] [ 60%] 2025-12-04T12:25:43.2578762Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amin_cuda_float16 PASSED [0.1758s] [ 60%] 2025-12-04T12:25:43.2578895Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_amin_cuda_float32 PASSED [0.1506s] [ 60%] 2025-12-04T12:25:43.2579025Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_int64 PASSED [0.0740s] [ 60%] 2025-12-04T12:25:43.2579169Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmax_cuda_int8 PASSED [0.0747s] [ 60%] 2025-12-04T12:25:43.2579305Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmin_cuda_bfloat16 PASSED [0.0878s] [ 60%] 2025-12-04T12:25:43.2579437Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmin_cuda_int32 PASSED [0.0747s] [ 60%] 2025-12-04T12:25:43.2579568Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_argmin_cuda_int64 PASSED [0.0736s] [ 60%] 2025-12-04T12:25:43.2579705Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_float64 PASSED [0.0462s] [ 60%] 2025-12-04T12:25:43.2579839Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumprod_cuda_int16 PASSED [0.0458s] [ 60%] 2025-12-04T12:25:43.2579979Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_complex128 PASSED [0.0465s] [ 60%] 2025-12-04T12:25:43.2580114Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_float64 PASSED [0.0457s] [ 60%] 2025-12-04T12:25:43.2580245Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_int16 PASSED [0.0460s] [ 60%] 2025-12-04T12:25:43.2580376Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_int32 PASSED [0.0458s] [ 60%] 2025-12-04T12:25:43.2580508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_cumsum_cuda_int64 PASSED [0.0460s] [ 60%] 2025-12-04T12:25:43.2580636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_int32 PASSED [0.0107s] [ 60%] 2025-12-04T12:25:43.2580782Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_fill_cuda_int64 PASSED [0.0104s] [ 60%] 2025-12-04T12:25:43.2580923Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_log_softmax_cuda_float16 PASSED [0.0596s] [ 60%] 2025-12-04T12:25:43.2581063Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_log_softmax_cuda_float64 PASSED [0.0534s] [ 60%] 2025-12-04T12:25:43.2581204Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logaddexp_cuda_bfloat16 PASSED [0.0592s] [ 60%] 2025-12-04T12:25:43.2581343Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logaddexp_cuda_float16 PASSED [0.0590s] [ 60%] 2025-12-04T12:25:43.2581483Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logaddexp_cuda_float32 PASSED [0.0538s] [ 60%] 2025-12-04T12:25:43.2581623Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_bfloat16 PASSED [0.2852s] [ 60%] 2025-12-04T12:25:43.2581764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_float16 PASSED [0.2832s] [ 60%] 2025-12-04T12:25:43.2581898Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_int64 PASSED [0.2341s] [ 60%] 2025-12-04T12:25:43.2582035Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_int8 PASSED [0.2334s] [ 60%] 2025-12-04T12:25:43.2582170Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_logsumexp_cuda_uint8 PASSED [0.2336s] [ 60%] 2025-12-04T12:25:43.2582305Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_mean_cuda_complex64 PASSED [0.2381s] [ 60%] 2025-12-04T12:25:43.2582450Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_median_cuda_bfloat16 PASSED [1.6666s] [ 60%] 2025-12-04T12:25:43.2582595Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_median_cuda_float16 PASSED [0.0311s] [ 60%] 2025-12-04T12:25:43.2582726Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_norm_cuda_float16 PASSED [1.0520s] [ 60%] 2025-12-04T12:25:43.2582857Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_float32 PASSED [0.1774s] [ 60%] 2025-12-04T12:25:43.2582994Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_prod_cuda_int8 PASSED [0.1531s] [ 60%] 2025-12-04T12:25:43.2583130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_bool PASSED [0.0060s] [ 60%] 2025-12-04T12:25:43.2583266Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_float16 PASSED [0.0058s] [ 60%] 2025-12-04T12:25:43.2583402Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_float32 PASSED [0.0057s] [ 60%] 2025-12-04T12:25:43.2583539Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_float64 PASSED [0.0058s] [ 60%] 2025-12-04T12:25:43.2583671Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_scatter_cuda_uint8 PASSED [0.0057s] [ 60%] 2025-12-04T12:25:43.2583806Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_bfloat16 PASSED [0.0072s] [ 60%] 2025-12-04T12:25:43.2583944Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_complex128 PASSED [0.0069s] [ 60%] 2025-12-04T12:25:43.2584082Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_complex64 PASSED [0.0069s] [ 60%] 2025-12-04T12:25:43.2584215Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_float64 PASSED [0.0069s] [ 60%] 2025-12-04T12:25:43.2584349Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_select_cuda_int16 PASSED [0.0068s] [ 60%] 2025-12-04T12:25:43.2584486Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_softmin_cuda_bfloat16 PASSED [0.0541s] [ 60%] 2025-12-04T12:25:43.2584622Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_softmin_cuda_float32 PASSED [0.0478s] [ 60%] 2025-12-04T12:25:43.2584765Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_complex64 PASSED [0.5151s] [ 60%] 2025-12-04T12:25:43.2584897Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_float32 PASSED [0.4623s] [ 60%] 2025-12-04T12:25:43.2585024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_int16 PASSED [0.4901s] [ 60%] 2025-12-04T12:25:43.2585152Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_std_cuda_int64 PASSED [0.4936s] [ 60%] 2025-12-04T12:25:43.2585279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_sum_cuda_int16 PASSED [0.1457s] [ 60%] 2025-12-04T12:25:43.2585410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_float16 PASSED [0.5439s] [ 60%] 2025-12-04T12:25:43.2585541Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_float32 PASSED [0.4396s] [ 60%] 2025-12-04T12:25:43.2585672Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_float64 PASSED [0.4410s] [ 60%] 2025-12-04T12:25:43.2585802Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_int64 PASSED [0.4686s] [ 60%] 2025-12-04T12:25:43.2585928Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_masked_var_cuda_uint8 PASSED [0.4688s] [ 60%] 2025-12-04T12:25:43.2586057Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matmul_cuda_complex128 PASSED [0.0298s] [ 60%] 2025-12-04T12:25:43.2586183Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matmul_cuda_float32 PASSED [0.0294s] [ 60%] 2025-12-04T12:25:43.2586324Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matrix_exp_cuda_float16 PASSED [1.6372s] [ 60%] 2025-12-04T12:25:43.2586467Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_matrix_exp_cuda_float64 PASSED [0.0086s] [ 60%] 2025-12-04T12:25:43.2586598Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_bfloat16 PASSED [0.0137s] [ 60%] 2025-12-04T12:25:43.2586736Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_float16 PASSED [0.0129s] [ 60%] 2025-12-04T12:25:43.2586865Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_float64 PASSED [0.0097s] [ 60%] 2025-12-04T12:25:43.2586990Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_int8 PASSED [0.0096s] [ 60%] 2025-12-04T12:25:43.2587118Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_binary_cuda_uint8 PASSED [0.0096s] [ 60%] 2025-12-04T12:25:43.2587278Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_pool2d_with_indices_backward_cuda_float16 PASSED [3.9487s] [ 60%] 2025-12-04T12:25:43.2587439Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_pool2d_with_indices_backward_cuda_float32 PASSED [2.3180s] [ 60%] 2025-12-04T12:25:43.2587600Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_pool2d_with_indices_backward_cuda_float64 PASSED [2.3075s] [ 60%] 2025-12-04T12:25:43.2587746Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_no_dim_cuda_bfloat16 PASSED [1.6508s] [ 60%] 2025-12-04T12:25:43.2587887Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_no_dim_cuda_int8 PASSED [0.0050s] [ 60%] 2025-12-04T12:25:43.2588027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_no_dim_cuda_uint8 PASSED [0.0036s] [ 60%] 2025-12-04T12:25:43.2588221Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_bfloat16 PASSED [1.6529s] [ 60%] 2025-12-04T12:25:43.2588363Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_bool PASSED [0.0053s] [ 60%] 2025-12-04T12:25:43.2588509Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_float16 PASSED [1.6238s] [ 60%] 2025-12-04T12:25:43.2588670Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_float32 PASSED [0.0055s] [ 60%] 2025-12-04T12:25:43.2588815Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_float64 PASSED [1.6268s] [ 60%] 2025-12-04T12:25:43.2588958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_int16 PASSED [0.0059s] [ 60%] 2025-12-04T12:25:43.2589099Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_max_reduction_with_dim_cuda_int8 PASSED [1.6290s] [ 60%] 2025-12-04T12:25:43.2589224Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_int16 PASSED [0.0120s] [ 60%] 2025-12-04T12:25:43.2589349Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_maximum_cuda_int8 PASSED [0.0099s] [ 60%] 2025-12-04T12:25:43.2589475Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mean_cuda_bfloat16 PASSED [1.6485s] [ 60%] 2025-12-04T12:25:43.2589599Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mean_cuda_float16 PASSED [0.0227s] [ 60%] 2025-12-04T12:25:43.2589729Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_median_cuda_bfloat16 PASSED [1.6727s] [ 60%] 2025-12-04T12:25:43.2589851Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_median_cuda_int64 PASSED [0.0130s] [ 60%] 2025-12-04T12:25:43.2589973Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_median_cuda_int8 PASSED [0.0107s] [ 60%] 2025-12-04T12:25:43.2590138Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_bfloat16 PASSED [1.6407s] [ 60%] 2025-12-04T12:25:43.2590298Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_bool PASSED [0.0147s] [ 60%] 2025-12-04T12:25:43.2590444Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_int16 PASSED [1.5756s] [ 60%] 2025-12-04T12:25:43.2590590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_list_of_tensors_cuda_int64 PASSED [0.0144s] [ 60%] 2025-12-04T12:25:43.2590754Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_float32 PASSED [1.5934s] [ 60%] 2025-12-04T12:25:43.2590903Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_meshgrid_variadic_tensors_cuda_int64 PASSED [0.0146s] [ 60%] 2025-12-04T12:25:43.2591032Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_float64 PASSED [0.0110s] [ 60%] 2025-12-04T12:25:43.2591160Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_binary_cuda_int8 PASSED [0.0098s] [ 60%] 2025-12-04T12:25:43.2591300Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_no_dim_cuda_int32 PASSED [1.4612s] [ 60%] 2025-12-04T12:25:43.2591443Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_min_reduction_with_dim_cuda_int64 PASSED [0.0055s] [ 60%] 2025-12-04T12:25:43.2591567Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_minimum_cuda_int16 PASSED [0.0107s] [ 60%] 2025-12-04T12:25:43.2591690Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_minimum_cuda_int8 PASSED [0.0101s] [ 60%] 2025-12-04T12:25:43.2591810Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mode_cuda_int16 PASSED [0.0051s] [ 60%] 2025-12-04T12:25:43.2591931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mode_cuda_uint8 PASSED [0.0048s] [ 60%] 2025-12-04T12:25:43.2592060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_bfloat16 PASSED [1.1001s] [ 60%] 2025-12-04T12:25:43.2592190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_float32 PASSED [0.0044s] [ 60%] 2025-12-04T12:25:43.2592317Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_float64 PASSED [1.0722s] [ 60%] 2025-12-04T12:25:43.2592461Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_int16 PASSED [0.0047s] [ 60%] 2025-12-04T12:25:43.2592584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_movedim_cuda_int64 PASSED [1.0796s] [ 60%] 2025-12-04T12:25:43.2592704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_bool PASSED [0.0058s] [ 60%] 2025-12-04T12:25:43.2592827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_float32 PASSED [0.0041s] [ 60%] 2025-12-04T12:25:43.2592947Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_int16 PASSED [1.0767s] [ 60%] 2025-12-04T12:25:43.2593069Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_msort_cuda_int32 PASSED [0.0063s] [ 60%] 2025-12-04T12:25:43.2593190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_bfloat16 PASSED [0.0139s] [ 60%] 2025-12-04T12:25:43.2593315Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_complex128 PASSED [0.0100s] [ 60%] 2025-12-04T12:25:43.2593435Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_float16 PASSED [0.0130s] [ 60%] 2025-12-04T12:25:43.2593556Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_int16 PASSED [0.0096s] [ 60%] 2025-12-04T12:25:43.2593673Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_int8 PASSED [0.0096s] [ 60%] 2025-12-04T12:25:43.2593792Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mul_cuda_uint8 PASSED [0.0096s] [ 60%] 2025-12-04T12:25:43.2593924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_multinomial_cuda_float64 PASSED [0.0089s] [ 60%] 2025-12-04T12:25:43.2594056Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mv_cuda_float16 PASSED [0.0037s] [ 60%] 2025-12-04T12:25:43.2594214Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_bfloat16 PASSED [0.0197s] [ 60%] 2025-12-04T12:25:43.2594362Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float32 PASSED [0.0169s] [ 60%] 2025-12-04T12:25:43.2594519Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float64 PASSED [0.0169s] [ 60%] 2025-12-04T12:25:43.2594663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int16 PASSED [0.0185s] [ 60%] 2025-12-04T12:25:43.2594805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int64 PASSED [0.0183s] [ 60%] 2025-12-04T12:25:43.2594946Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_uint8 PASSED [0.0183s] [ 60%] 2025-12-04T12:25:43.2595093Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_bfloat16 PASSED [0.0193s] [ 60%] 2025-12-04T12:25:43.2595237Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_float32 PASSED [0.0169s] [ 60%] 2025-12-04T12:25:43.2595380Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int32 PASSED [0.0183s] [ 60%] 2025-12-04T12:25:43.2595524Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int8 PASSED [0.0184s] [ 60%] 2025-12-04T12:25:43.2595671Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_float64 PASSED [0.0169s] [ 60%] 2025-12-04T12:25:43.2595811Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_uint8 PASSED [0.0183s] [ 60%] 2025-12-04T12:25:43.2595940Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_bool PASSED [0.0038s] [ 60%] 2025-12-04T12:25:43.2596070Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_float16 PASSED [1.0821s] [ 60%] 2025-12-04T12:25:43.2596200Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_float32 PASSED [0.0056s] [ 60%] 2025-12-04T12:25:43.2596339Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_int64 PASSED [0.0040s] [ 61%] 2025-12-04T12:25:43.2596469Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nan_to_num_cuda_uint8 PASSED [1.0860s] [ 61%] 2025-12-04T12:25:43.2596599Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmean_cuda_complex128 PASSED [0.0891s] [ 61%] 2025-12-04T12:25:43.2596726Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmean_cuda_float32 PASSED [0.0809s] [ 61%] 2025-12-04T12:25:43.2596853Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_int32 PASSED [0.0108s] [ 61%] 2025-12-04T12:25:43.2596979Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_int64 PASSED [0.0105s] [ 61%] 2025-12-04T12:25:43.2597107Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nanmedian_cuda_int8 PASSED [0.0105s] [ 61%] 2025-12-04T12:25:43.2597235Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_complex64 PASSED [0.0229s] [ 61%] 2025-12-04T12:25:43.2597361Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_float16 PASSED [0.0194s] [ 61%] 2025-12-04T12:25:43.2597484Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_float32 PASSED [1.0935s] [ 61%] 2025-12-04T12:25:43.2597611Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_float64 PASSED [0.0211s] [ 61%] 2025-12-04T12:25:43.2597733Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_int32 PASSED [0.0148s] [ 61%] 2025-12-04T12:25:43.2597868Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_int64 PASSED [0.0142s] [ 61%] 2025-12-04T12:25:43.2597999Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nansum_cuda_int8 PASSED [0.0145s] [ 61%] 2025-12-04T12:25:43.2598187Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_bfloat16 XFAIL [0.0031s] [ 61%] 2025-12-04T12:25:43.2598313Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_bool XFAIL [1.0873s] [ 61%] 2025-12-04T12:25:43.2598460Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_complex32 XFAIL [1.0851s] [ 61%] 2025-12-04T12:25:43.2598590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_float16 XFAIL [1.0878s] [ 61%] 2025-12-04T12:25:43.2598717Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_int64 XFAIL [1.1045s] [ 61%] 2025-12-04T12:25:43.2598847Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_copy_cuda_uint8 XFAIL [1.0877s] [ 61%] 2025-12-04T12:25:43.2598976Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_complex128 PASSED [1.0946s] [ 61%] 2025-12-04T12:25:43.2599103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_complex64 PASSED [1.0981s] [ 61%] 2025-12-04T12:25:43.2599226Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_float16 PASSED [0.0076s] [ 61%] 2025-12-04T12:25:43.2599350Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_int16 PASSED [0.0058s] [ 61%] 2025-12-04T12:25:43.2599470Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_int8 PASSED [0.0056s] [ 61%] 2025-12-04T12:25:43.2599592Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_narrow_cuda_uint8 PASSED [0.0055s] [ 61%] 2025-12-04T12:25:43.2599732Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_batch_norm_cuda_float64 PASSED [0.0169s] [ 61%] 2025-12-04T12:25:43.2599887Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_dropout_backward_cuda_bfloat16 PASSED [0.0150s] [ 61%] 2025-12-04T12:25:43.2600040Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_dropout_backward_cuda_float16 PASSED [0.0145s] [ 61%] 2025-12-04T12:25:43.2600196Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_layer_norm_cuda_bfloat16 PASSED [0.0452s] [ 61%] 2025-12-04T12:25:43.2600336Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_layer_norm_cuda_float16 PASSED [0.0449s] [ 61%] 2025-12-04T12:25:43.2600474Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_native_layer_norm_cuda_float64 PASSED [0.0378s] [ 61%] 2025-12-04T12:25:43.2600595Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_float64 PASSED [0.0132s] [ 61%] 2025-12-04T12:25:43.2600712Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ne_cuda_int64 PASSED [0.0099s] [ 61%] 2025-12-04T12:25:43.2600833Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_float64 PASSED [1.0896s] [ 61%] 2025-12-04T12:25:43.2600953Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_int16 PASSED [0.0047s] [ 61%] 2025-12-04T12:25:43.2601073Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_neg_cuda_uint8 PASSED [1.0799s] [ 61%] 2025-12-04T12:25:43.2601202Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_bfloat16 PASSED [0.0083s] [ 61%] 2025-12-04T12:25:43.2601337Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_complex128 PASSED [1.0918s] [ 61%] 2025-12-04T12:25:43.2601465Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_float16 PASSED [0.0089s] [ 61%] 2025-12-04T12:25:43.2601595Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_float32 PASSED [1.0850s] [ 61%] 2025-12-04T12:25:43.2601736Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_float64 PASSED [0.0088s] [ 61%] 2025-12-04T12:25:43.2601874Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_int16 PASSED [1.0850s] [ 61%] 2025-12-04T12:25:43.2602001Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_cuda_uint8 PASSED [0.0087s] [ 61%] 2025-12-04T12:25:43.2602145Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_complex32 PASSED [1.0859s] [ 61%] 2025-12-04T12:25:43.2602296Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_complex64 PASSED [0.0089s] [ 61%] 2025-12-04T12:25:43.2602436Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_float16 PASSED [1.0826s] [ 61%] 2025-12-04T12:25:43.2602573Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_float32 PASSED [0.0084s] [ 61%] 2025-12-04T12:25:43.2602712Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_float64 PASSED [1.0837s] [ 61%] 2025-12-04T12:25:43.2602849Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_empty_strided_cuda_uint8 PASSED [0.0084s] [ 61%] 2025-12-04T12:25:43.2602979Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_bfloat16 PASSED [1.1088s] [ 61%] 2025-12-04T12:25:43.2603102Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_bool PASSED [0.0125s] [ 61%] 2025-12-04T12:25:43.2603233Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_float32 PASSED [1.1015s] [ 61%] 2025-12-04T12:25:43.2603360Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_float64 PASSED [0.0089s] [ 61%] 2025-12-04T12:25:43.2603483Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_full_cuda_int8 PASSED [1.0949s] [ 61%] 2025-12-04T12:25:43.2603615Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_complex32 PASSED [0.0088s] [ 61%] 2025-12-04T12:25:43.2603742Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_float16 PASSED [1.1007s] [ 61%] 2025-12-04T12:25:43.2603869Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_float64 PASSED [0.0086s] [ 61%] 2025-12-04T12:25:43.2604004Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_int64 PASSED [1.1024s] [ 61%] 2025-12-04T12:25:43.2604128Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_int8 PASSED [0.0093s] [ 61%] 2025-12-04T12:25:43.2604251Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_ones_cuda_uint8 PASSED [1.0908s] [ 61%] 2025-12-04T12:25:43.2604378Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_bool PASSED [0.0095s] [ 61%] 2025-12-04T12:25:43.2604509Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_complex32 PASSED [1.0928s] [ 61%] 2025-12-04T12:25:43.2604634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_new_zeros_cuda_int8 PASSED [0.0090s] [ 61%] 2025-12-04T12:25:43.2604763Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nextafter_cuda_float32 PASSED [0.0102s] [ 61%] 2025-12-04T12:25:43.2604927Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float32 PASSED [0.0160s] [ 61%] 2025-12-04T12:25:43.2605090Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float64 PASSED [0.0154s] [ 61%] 2025-12-04T12:25:43.2605253Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_bfloat16 PASSED [0.0248s] [ 61%] 2025-12-04T12:25:43.2605415Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_avg_pool2d_cuda_float16 PASSED [0.0246s] [ 61%] 2025-12-04T12:25:43.2605583Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float64 PASSED [0.0118s] [ 61%] 2025-12-04T12:25:43.2605756Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_bfloat16 PASSED [0.0176s] [ 61%] 2025-12-04T12:25:43.2605914Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float32 PASSED [0.0175s] [ 61%] 2025-12-04T12:25:43.2606085Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_alpha_dropout_cuda_float16 PASSED [1.0983s] [ 61%] 2025-12-04T12:25:43.2606236Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool1d_cuda_float16 PASSED [0.0129s] [ 61%] 2025-12-04T12:25:43.2606388Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool1d_cuda_float32 PASSED [0.0110s] [ 61%] 2025-12-04T12:25:43.2606541Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool2d_cuda_bfloat16 PASSED [0.0059s] [ 61%] 2025-12-04T12:25:43.2606691Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_avg_pool2d_cuda_float16 PASSED [0.0059s] [ 61%] 2025-12-04T12:25:43.2606841Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_batch_norm_cuda_bfloat16 PASSED [0.0435s] [ 61%] 2025-12-04T12:25:43.2607003Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_cuda_bfloat16 XFAIL [0.0230s] [ 61%] 2025-12-04T12:25:43.2607164Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_cuda_float16 XFAIL [1.1103s] [ 61%] 2025-12-04T12:25:43.2607325Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_cuda_float32 XFAIL [0.0200s] [ 61%] 2025-12-04T12:25:43.2607505Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16 PASSED [1.2046s] [ 61%] 2025-12-04T12:25:43.2607683Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_float16 PASSED [0.1066s] [ 61%] 2025-12-04T12:25:43.2607837Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_channel_shuffle_cuda_int8 PASSED [1.1096s] [ 61%] 2025-12-04T12:25:43.2607993Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv1d_cuda_bfloat16 PASSED [0.3498s] [ 61%] 2025-12-04T12:25:43.2608196Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv1d_cuda_float16 PASSED [1.4419s] [ 61%] 2025-12-04T12:25:43.2608339Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv1d_cuda_float64 PASSED [0.0152s] [ 61%] 2025-12-04T12:25:43.2608675Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv2d_cuda_complex32 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 1200, provided ptr: 0x7da707e00c00 size: 768 2025-12-04T12:25:43.2608863Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 1200, provided ptr: 0x7da707e00c00 size: 768 2025-12-04T12:25:43.2608907Z PASSED [2.7936s] [ 61%] 2025-12-04T12:25:43.2609054Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv2d_cuda_float32 PASSED [0.0352s] [ 61%] 2025-12-04T12:25:43.2609385Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_bfloat16 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 26400, provided ptr: 0x7da6e9003a00 size: 5888 2025-12-04T12:25:43.2609575Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 26400, provided ptr: 0x7da6e9003a00 size: 5888 2025-12-04T12:25:43.2609793Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 168960, provided ptr: 0x7da6e9003600 size: 6656 2025-12-04T12:25:43.2609995Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 168960, provided ptr: 0x7da6e9003600 size: 6656 2025-12-04T12:25:43.2610037Z PASSED [0.0711s] [ 61%] 2025-12-04T12:25:43.2610387Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv3d_cuda_complex64 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 52800, provided ptr: 0x7da6f4c05a00 size: 11008 2025-12-04T12:25:43.2610574Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 52800, provided ptr: 0x7da6f4c05a00 size: 11008 2025-12-04T12:25:43.2610776Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 337920, provided ptr: 0x7da6f4c06000 size: 12544 2025-12-04T12:25:43.2610965Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 337920, provided ptr: 0x7da6f4c06000 size: 12544 2025-12-04T12:25:43.2611006Z PASSED [0.2104s] [ 61%] 2025-12-04T12:25:43.2611169Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose2d_cuda_complex32 PASSED [0.1737s] [ 61%] 2025-12-04T12:25:43.2611413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_bfloat16 PASSED [1.7607s] [ 61%] 2025-12-04T12:25:43.2611577Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_complex128 PASSED [0.1012s] [ 61%] 2025-12-04T12:25:43.2611738Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_complex32 PASSED [0.1303s] [ 61%] 2025-12-04T12:25:43.2611894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_float16 PASSED [1.1067s] [ 61%] 2025-12-04T12:25:43.2612049Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_float32 PASSED [0.0478s] [ 61%] 2025-12-04T12:25:43.2612206Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_conv_transpose3d_cuda_float64 PASSED [1.0990s] [ 61%] 2025-12-04T12:25:43.2612386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_float16 PASSED [0.0771s] [ 61%] 2025-12-04T12:25:43.2612550Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_float32 PASSED [0.0595s] [ 61%] 2025-12-04T12:25:43.2612714Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_float64 PASSED [0.0593s] [ 61%] 2025-12-04T12:25:43.2612874Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int32 PASSED [0.0629s] [ 61%] 2025-12-04T12:25:43.2613031Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_similarity_cuda_float16 PASSED [0.0774s] [ 61%] 2025-12-04T12:25:43.2613188Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cosine_similarity_cuda_float64 PASSED [0.0568s] [ 61%] 2025-12-04T12:25:43.2613342Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cross_entropy_cuda_bfloat16 PASSED [0.0901s] [ 61%] 2025-12-04T12:25:43.2613496Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_cross_entropy_cuda_float32 PASSED [0.0672s] [ 61%] 2025-12-04T12:25:43.2613648Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout3d_cuda_bfloat16 PASSED [0.0171s] [ 61%] 2025-12-04T12:25:43.2613800Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout3d_cuda_float16 PASSED [0.0169s] [ 61%] 2025-12-04T12:25:43.2613946Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_dropout_cuda_float16 PASSED [0.0158s] [ 61%] 2025-12-04T12:25:43.2614096Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_elu_cuda_bfloat16 PASSED [0.0059s] [ 61%] 2025-12-04T12:25:43.2614249Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_elu_cuda_float16 PASSED [0.0059s] [ 61%] 2025-12-04T12:25:43.2614390Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_elu_cuda_float32 PASSED [0.0056s] [ 61%] 2025-12-04T12:25:43.2614540Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_elu_cuda_float64 PASSED [0.0056s] [ 61%] 2025-12-04T12:25:43.2614690Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_embedding_cuda_float16 PASSED [0.0102s] [ 61%] 2025-12-04T12:25:43.2614838Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_embedding_cuda_float32 PASSED [0.0099s] [ 61%] 2025-12-04T12:25:43.2615016Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_float64 PASSED [0.0107s] [ 61%] 2025-12-04T12:25:43.2615200Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_float16 PASSED [0.0045s] [ 61%] 2025-12-04T12:25:43.2615380Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int16 PASSED [1.1033s] [ 61%] 2025-12-04T12:25:43.2615561Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_int32 PASSED [0.0057s] [ 61%] 2025-12-04T12:25:43.2615726Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool2d_cuda_bfloat16 PASSED [0.0403s] [ 61%] 2025-12-04T12:25:43.2615893Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float16 PASSED [0.0399s] [ 61%] 2025-12-04T12:25:43.2616058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_fractional_max_pool2d_cuda_float32 PASSED [1.1350s] [ 61%] 2025-12-04T12:25:43.2616216Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float32 PASSED [1.9507s] [ 61%] 2025-12-04T12:25:43.2616371Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_gelu_cuda_bfloat16 PASSED [1.1110s] [ 61%] 2025-12-04T12:25:43.2616511Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_gelu_cuda_float64 PASSED [0.0155s] [ 61%] 2025-12-04T12:25:43.2616651Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_glu_cuda_float32 PASSED [1.1300s] [ 61%] 2025-12-04T12:25:43.2616788Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_glu_cuda_float64 PASSED [0.0404s] [ 61%] 2025-12-04T12:25:43.2616939Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_grid_sample_cuda_bfloat16 PASSED [0.7585s] [ 61%] 2025-12-04T12:25:43.2617092Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_grid_sample_cuda_float64 PASSED [0.7532s] [ 61%] 2025-12-04T12:25:43.2617241Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_group_norm_cuda_float16 PASSED [0.0765s] [ 61%] 2025-12-04T12:25:43.2617393Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardshrink_cuda_bfloat16 PASSED [0.0098s] [ 61%] 2025-12-04T12:25:43.2617545Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardshrink_cuda_float64 PASSED [0.0074s] [ 61%] 2025-12-04T12:25:43.2617695Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardsigmoid_cuda_float32 PASSED [0.0071s] [ 61%] 2025-12-04T12:25:43.2617842Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardswish_cuda_float16 PASSED [0.0114s] [ 61%] 2025-12-04T12:25:43.2618000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardswish_cuda_float32 PASSED [0.0099s] [ 61%] 2025-12-04T12:25:43.2618236Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardswish_cuda_float64 PASSED [0.0098s] [ 61%] 2025-12-04T12:25:43.2618384Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_float32 PASSED [0.0094s] [ 61%] 2025-12-04T12:25:43.2618528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_int16 PASSED [0.0093s] [ 62%] 2025-12-04T12:25:43.2618686Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_hardtanh_cuda_int32 PASSED [0.0092s] [ 62%] 2025-12-04T12:25:43.2618835Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_huber_loss_cuda_float16 PASSED [0.0204s] [ 62%] 2025-12-04T12:25:43.2618984Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_huber_loss_cuda_float32 PASSED [0.0172s] [ 62%] 2025-12-04T12:25:43.2619134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_huber_loss_cuda_float64 PASSED [1.1064s] [ 62%] 2025-12-04T12:25:43.2619294Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_area_cuda_bfloat16 PASSED [0.0421s] [ 62%] 2025-12-04T12:25:43.2619452Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_area_cuda_float32 PASSED [1.1261s] [ 62%] 2025-12-04T12:25:43.2619610Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_area_cuda_float64 PASSED [0.0384s] [ 62%] 2025-12-04T12:25:43.2619771Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_bicubic_cuda_float32 PASSED [0.9890s] [ 62%] 2025-12-04T12:25:43.2619933Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_bicubic_cuda_float64 PASSED [0.9893s] [ 62%] 2025-12-04T12:25:43.2620094Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_linear_cuda_float32 PASSED [0.0640s] [ 62%] 2025-12-04T12:25:43.2620253Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_linear_cuda_float64 PASSED [0.0635s] [ 62%] 2025-12-04T12:25:43.2620428Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_float32 PASSED [0.0310s] [ 62%] 2025-12-04T12:25:43.2620613Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_uint8 PASSED [0.0249s] [ 62%] 2025-12-04T12:25:43.2620777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_interpolate_nearest_cuda_float16 PASSED [0.0357s] [ 62%] 2025-12-04T12:25:43.2620923Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_l1_loss_cuda_float64 PASSED [0.0118s] [ 62%] 2025-12-04T12:25:43.2621075Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_layer_norm_cuda_bfloat16 PASSED [0.0169s] [ 62%] 2025-12-04T12:25:43.2621224Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_leaky_relu_cuda_float32 PASSED [0.0108s] [ 62%] 2025-12-04T12:25:43.2621373Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_linear_cuda_complex64 PASSED [0.1040s] [ 62%] 2025-12-04T12:25:43.2621525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_logsigmoid_cuda_bfloat16 PASSED [0.0077s] [ 62%] 2025-12-04T12:25:43.2621676Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_logsigmoid_cuda_float32 PASSED [1.0831s] [ 62%] 2025-12-04T12:25:43.2621825Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_logsigmoid_cuda_float64 PASSED [0.0091s] [ 62%] 2025-12-04T12:25:43.2621987Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_float32 PASSED [0.0700s] [ 62%] 2025-12-04T12:25:43.2622162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_float64 PASSED [0.0691s] [ 62%] 2025-12-04T12:25:43.2622337Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_int32 PASSED [0.0730s] [ 62%] 2025-12-04T12:25:43.2622495Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_margin_ranking_loss_cuda_int8 PASSED [0.0767s] [ 62%] 2025-12-04T12:25:43.2622656Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool1d_cuda_bfloat16 PASSED [2.0551s] [ 62%] 2025-12-04T12:25:43.2622805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool1d_cuda_float16 PASSED [0.9605s] [ 62%] 2025-12-04T12:25:43.2622952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool2d_cuda_float16 PASSED [0.8574s] [ 62%] 2025-12-04T12:25:43.2623103Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool3d_cuda_bfloat16 PASSED [0.4192s] [ 62%] 2025-12-04T12:25:43.2623252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_pool3d_cuda_float32 PASSED [0.4227s] [ 62%] 2025-12-04T12:25:43.2623404Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool1d_cuda_float16 PASSED [0.3205s] [ 62%] 2025-12-04T12:25:43.2623555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool1d_cuda_float32 PASSED [0.3213s] [ 62%] 2025-12-04T12:25:43.2623713Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool1d_grad_cuda_float64 PASSED [0.0541s] [ 62%] 2025-12-04T12:25:43.2623863Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool2d_cuda_float64 PASSED [0.4565s] [ 62%] 2025-12-04T12:25:43.2624014Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_cuda_float16 PASSED [0.1565s] [ 62%] 2025-12-04T12:25:43.2624167Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_cuda_float64 PASSED [0.1574s] [ 62%] 2025-12-04T12:25:43.2624324Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_grad_cuda_float16 PASSED [0.0227s] [ 62%] 2025-12-04T12:25:43.2624494Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_max_unpool3d_grad_cuda_float64 PASSED [0.0273s] [ 62%] 2025-12-04T12:25:43.2624637Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_mish_cuda_float64 PASSED [0.0058s] [ 62%] 2025-12-04T12:25:43.2624785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_mse_loss_cuda_bfloat16 PASSED [0.0106s] [ 62%] 2025-12-04T12:25:43.2624931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_mse_loss_cuda_float16 PASSED [0.0102s] [ 62%] 2025-12-04T12:25:43.2625084Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_mse_loss_cuda_float64 PASSED [0.0089s] [ 62%] 2025-12-04T12:25:43.2625257Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multi_head_attention_forward_cuda_float16 PASSED [3.9738s] [ 62%] 2025-12-04T12:25:43.2625429Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multi_head_attention_forward_cuda_float64 PASSED [3.2953s] [ 62%] 2025-12-04T12:25:43.2625590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multi_margin_loss_cuda_bfloat16 PASSED [0.0368s] [ 62%] 2025-12-04T12:25:43.2625749Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multi_margin_loss_cuda_float32 PASSED [0.0243s] [ 62%] 2025-12-04T12:25:43.2625912Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_margin_loss_cuda_float64 PASSED [0.0263s] [ 62%] 2025-12-04T12:25:43.2626084Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float16 PASSED [0.0350s] [ 62%] 2025-12-04T12:25:43.2626266Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_multilabel_soft_margin_loss_cuda_float32 PASSED [1.1227s] [ 62%] 2025-12-04T12:25:43.2626423Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_nll_loss_cuda_float32 PASSED [0.1192s] [ 62%] 2025-12-04T12:25:43.2626570Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_nll_loss_cuda_float64 PASSED [0.1160s] [ 62%] 2025-12-04T12:25:43.2626734Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_bfloat16 PASSED [0.0213s] [ 62%] 2025-12-04T12:25:43.2626893Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_complex128 PASSED [0.0212s] [ 62%] 2025-12-04T12:25:43.2627047Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_complex64 PASSED [0.0211s] [ 62%] 2025-12-04T12:25:43.2627200Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_float16 PASSED [0.0210s] [ 62%] 2025-12-04T12:25:43.2627354Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_float64 PASSED [0.0215s] [ 62%] 2025-12-04T12:25:43.2627505Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_circular_cuda_int8 PASSED [0.0226s] [ 62%] 2025-12-04T12:25:43.2627657Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_bfloat16 PASSED [0.0295s] [ 62%] 2025-12-04T12:25:43.2627809Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_float64 PASSED [0.0290s] [ 62%] 2025-12-04T12:25:43.2627955Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_int8 PASSED [0.0291s] [ 62%] 2025-12-04T12:25:43.2628234Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_constant_cuda_uint8 PASSED [0.0289s] [ 62%] 2025-12-04T12:25:43.2628388Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_bfloat16 PASSED [0.0086s] [ 62%] 2025-12-04T12:25:43.2628538Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_float16 PASSED [1.0913s] [ 62%] 2025-12-04T12:25:43.2628703Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_float64 PASSED [0.0106s] [ 62%] 2025-12-04T12:25:43.2628851Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_int8 PASSED [0.0088s] [ 62%] 2025-12-04T12:25:43.2628999Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_reflect_cuda_uint8 PASSED [0.0085s] [ 62%] 2025-12-04T12:25:43.2629151Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_cuda_float16 PASSED [0.0086s] [ 62%] 2025-12-04T12:25:43.2629302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_cuda_int32 PASSED [1.1095s] [ 62%] 2025-12-04T12:25:43.2629470Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_complex64 PASSED [0.0078s] [ 62%] 2025-12-04T12:25:43.2629636Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_float32 PASSED [0.0056s] [ 62%] 2025-12-04T12:25:43.2629798Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_int16 PASSED [0.0054s] [ 62%] 2025-12-04T12:25:43.2629960Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pad_replicate_negative_cuda_int8 PASSED [0.0054s] [ 62%] 2025-12-04T12:25:43.2630121Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_complex128 PASSED [0.0171s] [ 62%] 2025-12-04T12:25:43.2630292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_int32 PASSED [0.0169s] [ 62%] 2025-12-04T12:25:43.2630461Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pairwise_distance_cuda_int64 PASSED [0.0168s] [ 62%] 2025-12-04T12:25:43.2630605Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pdist_cuda_float32 PASSED [0.0070s] [ 62%] 2025-12-04T12:25:43.2630763Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_complex64 PASSED [0.0057s] [ 62%] 2025-12-04T12:25:43.2630928Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_float32 PASSED [0.0054s] [ 62%] 2025-12-04T12:25:43.2631081Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_float64 PASSED [0.0054s] [ 62%] 2025-12-04T12:25:43.2631231Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_int64 PASSED [0.0054s] [ 62%] 2025-12-04T12:25:43.2631381Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_shuffle_cuda_uint8 PASSED [0.0054s] [ 62%] 2025-12-04T12:25:43.2631539Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_bfloat16 PASSED [0.0059s] [ 62%] 2025-12-04T12:25:43.2631691Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_bool PASSED [0.0059s] [ 62%] 2025-12-04T12:25:43.2631853Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_complex128 PASSED [0.0059s] [ 62%] 2025-12-04T12:25:43.2632010Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_float64 PASSED [0.0059s] [ 62%] 2025-12-04T12:25:43.2632162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_pixel_unshuffle_cuda_uint8 PASSED [0.0059s] [ 62%] 2025-12-04T12:25:43.2632319Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_poisson_nll_loss_cuda_float32 PASSED [0.3237s] [ 62%] 2025-12-04T12:25:43.2632474Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_poisson_nll_loss_cuda_int16 PASSED [0.3769s] [ 62%] 2025-12-04T12:25:43.2632637Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_poisson_nll_loss_cuda_int32 PASSED [0.3779s] [ 62%] 2025-12-04T12:25:43.2632784Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_prelu_cuda_bfloat16 PASSED [0.0263s] [ 62%] 2025-12-04T12:25:43.2632924Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu6_cuda_int8 PASSED [1.1014s] [ 62%] 2025-12-04T12:25:43.2633066Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu_cuda_float32 PASSED [0.0073s] [ 62%] 2025-12-04T12:25:43.2633204Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu_cuda_int64 PASSED [0.0056s] [ 62%] 2025-12-04T12:25:43.2633343Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_relu_cuda_int8 PASSED [0.0054s] [ 62%] 2025-12-04T12:25:43.2633491Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rms_norm_cuda_bfloat16 PASSED [0.0087s] [ 62%] 2025-12-04T12:25:43.2633638Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rms_norm_cuda_float64 PASSED [1.0824s] [ 62%] 2025-12-04T12:25:43.2633780Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_rrelu_cuda_float16 PASSED [0.0103s] [ 62%] 2025-12-04T12:25:43.2633952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float64 PASSED [0.2131s] [ 62%] 2025-12-04T12:25:43.2634092Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_silu_cuda_float32 PASSED [0.0040s] [ 62%] 2025-12-04T12:25:43.2634262Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_smooth_l1_loss_cuda_bfloat16 PASSED [0.0189s] [ 62%] 2025-12-04T12:25:43.2634427Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_smooth_l1_loss_cuda_float32 PASSED [0.0161s] [ 62%] 2025-12-04T12:25:43.2634585Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_soft_margin_loss_cuda_bfloat16 PASSED [0.0107s] [ 62%] 2025-12-04T12:25:43.2634742Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_soft_margin_loss_cuda_float16 PASSED [0.0105s] [ 62%] 2025-12-04T12:25:43.2634899Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_cuda_bfloat16 PASSED [0.0106s] [ 62%] 2025-12-04T12:25:43.2635048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_cuda_float32 PASSED [0.0086s] [ 62%] 2025-12-04T12:25:43.2635207Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_bfloat16 PASSED [1.0822s] [ 62%] 2025-12-04T12:25:43.2635367Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_float32 PASSED [1.1021s] [ 62%] 2025-12-04T12:25:43.2635523Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softmin_with_dtype_cuda_int64 PASSED [1.1127s] [ 62%] 2025-12-04T12:25:43.2635675Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softshrink_cuda_bfloat16 PASSED [0.0149s] [ 62%] 2025-12-04T12:25:43.2635826Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_complex128 PASSED [0.0077s] [ 62%] 2025-12-04T12:25:43.2635971Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_float16 PASSED [1.1027s] [ 62%] 2025-12-04T12:25:43.2636116Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_float32 PASSED [0.0093s] [ 62%] 2025-12-04T12:25:43.2636261Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_int64 PASSED [0.0075s] [ 62%] 2025-12-04T12:25:43.2636405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_softsign_cuda_int8 PASSED [0.0072s] [ 62%] 2025-12-04T12:25:43.2636557Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_tanhshrink_cuda_bfloat16 PASSED [0.0063s] [ 62%] 2025-12-04T12:25:43.2636722Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_tanhshrink_cuda_complex128 PASSED [0.0055s] [ 62%] 2025-12-04T12:25:43.2636871Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_tanhshrink_cuda_float16 PASSED [0.0061s] [ 62%] 2025-12-04T12:25:43.2637020Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_tanhshrink_cuda_uint8 PASSED [0.0058s] [ 62%] 2025-12-04T12:25:43.2637165Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_threshold_cuda_int32 PASSED [0.0058s] [ 62%] 2025-12-04T12:25:43.2637329Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_float32 PASSED [0.0464s] [ 62%] 2025-12-04T12:25:43.2637487Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_int16 PASSED [0.0502s] [ 62%] 2025-12-04T12:25:43.2637645Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_int32 PASSED [0.0501s] [ 62%] 2025-12-04T12:25:43.2637802Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_int64 PASSED [0.0545s] [ 62%] 2025-12-04T12:25:43.2637958Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_loss_cuda_uint8 PASSED [0.0507s] [ 62%] 2025-12-04T12:25:43.2638163Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_float32 PASSED [0.0466s] [ 62%] 2025-12-04T12:25:43.2638365Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_float64 PASSED [0.0460s] [ 62%] 2025-12-04T12:25:43.2638555Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int16 PASSED [0.0503s] [ 62%] 2025-12-04T12:25:43.2638703Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_unfold_cuda_bfloat16 PASSED [0.3056s] [ 62%] 2025-12-04T12:25:43.2638873Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nn_functional_upsample_nearest_cuda_float32 PASSED [0.0181s] [ 62%] 2025-12-04T12:25:43.2639003Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_bfloat16 PASSED [0.0148s] [ 62%] 2025-12-04T12:25:43.2639131Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_float16 PASSED [0.0143s] [ 62%] 2025-12-04T12:25:43.2639258Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_cuda_float32 PASSED [0.0142s] [ 62%] 2025-12-04T12:25:43.2639425Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_complex128 SKIPPED [0.0006s] (Only runs on cpu) [ 62%] 2025-12-04T12:25:43.2639590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_complex64 SKIPPED [0.0005s] (Only runs on cpu) [ 62%] 2025-12-04T12:25:43.2639754Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_float16 SKIPPED [0.0005s] (Only runs on cpu) [ 62%] 2025-12-04T12:25:43.2639914Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_float64 SKIPPED [0.0005s] (Only runs on cpu) [ 62%] 2025-12-04T12:25:43.2640072Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_int64 SKIPPED [0.0005s] (Only runs on cpu) [ 63%] 2025-12-04T12:25:43.2640229Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_nonzero_static_cuda_uint8 SKIPPED [0.0005s] (Only runs on cpu) [ 63%] 2025-12-04T12:25:43.2640358Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_cuda_complex128 PASSED [0.0431s] [ 63%] 2025-12-04T12:25:43.2640483Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_cuda_float16 PASSED [0.0518s] [ 63%] 2025-12-04T12:25:43.2640630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_fro_cuda_bfloat16 PASSED [1.1117s] [ 63%] 2025-12-04T12:25:43.2640762Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_fro_cuda_complex128 PASSED [0.0078s] [ 63%] 2025-12-04T12:25:43.2640895Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_fro_cuda_complex64 PASSED [0.0055s] [ 63%] 2025-12-04T12:25:43.2641027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_inf_cuda_complex64 PASSED [0.0051s] [ 63%] 2025-12-04T12:25:43.2641155Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_inf_cuda_float16 PASSED [0.0072s] [ 63%] 2025-12-04T12:25:43.2641283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_inf_cuda_float64 PASSED [0.0050s] [ 63%] 2025-12-04T12:25:43.2641415Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_nuc_cuda_complex128 PASSED [0.0057s] [ 63%] 2025-12-04T12:25:43.2641542Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_norm_nuc_cuda_float64 PASSED [0.0061s] [ 63%] 2025-12-04T12:25:43.2641685Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_number_mean_cuda_bfloat16 PASSED [0.0062s] [ 63%] 2025-12-04T12:25:43.2641827Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_number_mean_cuda_float16 PASSED [0.0060s] [ 63%] 2025-12-04T12:25:43.2641968Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_normal_number_mean_cuda_float64 PASSED [1.1001s] [ 63%] 2025-12-04T12:25:43.2642091Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_float64 PASSED [0.0046s] [ 63%] 2025-12-04T12:25:43.2642227Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_cuda_int32 PASSED [1.0974s] [ 63%] 2025-12-04T12:25:43.2642377Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_complex128 PASSED [0.0090s] [ 63%] 2025-12-04T12:25:43.2642508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_complex32 PASSED [1.0873s] [ 63%] 2025-12-04T12:25:43.2642638Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_float16 PASSED [0.0085s] [ 63%] 2025-12-04T12:25:43.2642777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ones_like_cuda_float64 PASSED [1.0828s] [ 63%] 2025-12-04T12:25:43.2642903Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_bfloat16 PASSED [0.0061s] [ 63%] 2025-12-04T12:25:43.2643031Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_complex128 PASSED [0.0038s] [ 63%] 2025-12-04T12:25:43.2643157Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_complex64 PASSED [1.0773s] [ 63%] 2025-12-04T12:25:43.2643283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_float64 PASSED [0.0056s] [ 63%] 2025-12-04T12:25:43.2643405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_outer_cuda_int16 PASSED [0.0039s] [ 63%] 2025-12-04T12:25:43.2643543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pca_lowrank_cuda_complex128 PASSED [0.2488s] [ 63%] 2025-12-04T12:25:43.2643674Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pca_lowrank_cuda_float32 PASSED [0.2483s] [ 63%] 2025-12-04T12:25:43.2643804Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_bool PASSED [0.0046s] [ 63%] 2025-12-04T12:25:43.2643933Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_int32 PASSED [1.0963s] [ 63%] 2025-12-04T12:25:43.2644065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_copy_cuda_uint8 PASSED [0.0067s] [ 63%] 2025-12-04T12:25:43.2644191Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_float64 PASSED [0.0038s] [ 63%] 2025-12-04T12:25:43.2644316Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_int64 PASSED [1.0849s] [ 63%] 2025-12-04T12:25:43.2644449Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_permute_cuda_int8 PASSED [0.0052s] [ 63%] 2025-12-04T12:25:43.2644580Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pinverse_cuda_complex64 PASSED [0.0278s] [ 63%] 2025-12-04T12:25:43.2644707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pinverse_cuda_float32 PASSED [0.0250s] [ 63%] 2025-12-04T12:25:43.2644835Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pinverse_cuda_float64 PASSED [0.0237s] [ 63%] 2025-12-04T12:25:43.2644986Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_bfloat16 PASSED [0.0076s] [ 63%] 2025-12-04T12:25:43.2645137Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_float16 PASSED [0.0075s] [ 63%] 2025-12-04T12:25:43.2645287Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_float64 PASSED [0.0074s] [ 63%] 2025-12-04T12:25:43.2645437Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_int16 PASSED [0.0075s] [ 63%] 2025-12-04T12:25:43.2645583Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_int32 PASSED [1.0860s] [ 63%] 2025-12-04T12:25:43.2645728Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_int8 PASSED [0.0096s] [ 63%] 2025-12-04T12:25:43.2645873Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_0_cuda_uint8 PASSED [0.0078s] [ 63%] 2025-12-04T12:25:43.2646022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_bfloat16 PASSED [0.0076s] [ 63%] 2025-12-04T12:25:43.2646181Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_float32 PASSED [0.0076s] [ 63%] 2025-12-04T12:25:43.2646339Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_int16 PASSED [0.0075s] [ 63%] 2025-12-04T12:25:43.2646486Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_1_cuda_int64 PASSED [1.0780s] [ 63%] 2025-12-04T12:25:43.2646641Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_2_cuda_int64 PASSED [0.0096s] [ 63%] 2025-12-04T12:25:43.2646787Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_2_cuda_uint8 PASSED [0.0078s] [ 63%] 2025-12-04T12:25:43.2646934Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_float16 PASSED [0.0077s] [ 63%] 2025-12-04T12:25:43.2647083Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_float64 PASSED [0.0077s] [ 63%] 2025-12-04T12:25:43.2647228Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_int8 PASSED [0.0075s] [ 63%] 2025-12-04T12:25:43.2647373Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_3_cuda_uint8 PASSED [1.0807s] [ 63%] 2025-12-04T12:25:43.2647526Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_4_cuda_bfloat16 PASSED [0.0096s] [ 63%] 2025-12-04T12:25:43.2647675Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_4_cuda_float32 PASSED [0.0079s] [ 63%] 2025-12-04T12:25:43.2647822Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_polygamma_polygamma_n_4_cuda_uint8 PASSED [0.0077s] [ 63%] 2025-12-04T12:25:43.2647951Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_bfloat16 PASSED [1.0849s] [ 63%] 2025-12-04T12:25:43.2648112Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_complex64 PASSED [0.0048s] [ 63%] 2025-12-04T12:25:43.2648241Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_float64 PASSED [1.0893s] [ 63%] 2025-12-04T12:25:43.2648367Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_int64 PASSED [0.0032s] [ 63%] 2025-12-04T12:25:43.2648508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_positive_cuda_uint8 PASSED [1.0625s] [ 63%] 2025-12-04T12:25:43.2648634Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_complex128 PASSED [0.0118s] [ 63%] 2025-12-04T12:25:43.2648758Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_complex32 PASSED [0.0135s] [ 63%] 2025-12-04T12:25:43.2648880Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_float16 PASSED [0.0129s] [ 63%] 2025-12-04T12:25:43.2649001Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_float64 PASSED [0.0096s] [ 63%] 2025-12-04T12:25:43.2649120Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_int8 PASSED [0.0096s] [ 63%] 2025-12-04T12:25:43.2649240Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_pow_cuda_uint8 PASSED [0.0096s] [ 63%] 2025-12-04T12:25:43.2649366Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_complex32 PASSED [1.0880s] [ 63%] 2025-12-04T12:25:43.2649489Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_complex64 PASSED [0.0264s] [ 63%] 2025-12-04T12:25:43.2649613Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_float16 PASSED [1.0987s] [ 63%] 2025-12-04T12:25:43.2649739Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_float32 PASSED [0.0265s] [ 63%] 2025-12-04T12:25:43.2649860Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_int16 PASSED [1.0933s] [ 63%] 2025-12-04T12:25:43.2649993Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_prod_cuda_int64 PASSED [0.0265s] [ 63%] 2025-12-04T12:25:43.2650125Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_bool PASSED [1.0916s] [ 63%] 2025-12-04T12:25:43.2650246Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_float64 PASSED [0.0249s] [ 63%] 2025-12-04T12:25:43.2650366Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_int64 PASSED [0.0227s] [ 63%] 2025-12-04T12:25:43.2650499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_put_cuda_uint8 PASSED [0.0225s] [ 63%] 2025-12-04T12:25:43.2650631Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_quantile_cuda_float64 PASSED [0.4319s] [ 63%] 2025-12-04T12:25:43.2650755Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_int16 PASSED [1.0685s] [ 63%] 2025-12-04T12:25:43.2650881Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_int64 PASSED [0.0051s] [ 63%] 2025-12-04T12:25:43.2651006Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rad2deg_cuda_uint8 PASSED [0.0035s] [ 63%] 2025-12-04T12:25:43.2651138Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rand_like_cuda_bfloat16 PASSED [0.0126s] [ 63%] 2025-12-04T12:25:43.2651271Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rand_like_cuda_complex64 PASSED [0.0107s] [ 63%] 2025-12-04T12:25:43.2651400Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rand_like_cuda_float16 PASSED [0.0120s] [ 63%] 2025-12-04T12:25:43.2651528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_bfloat16 PASSED [0.0104s] [ 63%] 2025-12-04T12:25:43.2651654Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_float16 PASSED [0.0104s] [ 63%] 2025-12-04T12:25:43.2651777Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_cuda_int32 PASSED [0.0102s] [ 63%] 2025-12-04T12:25:43.2651913Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_like_cuda_bfloat16 PASSED [0.0135s] [ 63%] 2025-12-04T12:25:43.2652045Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randint_like_cuda_int16 PASSED [0.0133s] [ 63%] 2025-12-04T12:25:43.2652186Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_cuda_float64 PASSED [0.0041s] [ 63%] 2025-12-04T12:25:43.2652317Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_like_cuda_bfloat16 PASSED [0.0121s] [ 63%] 2025-12-04T12:25:43.2652454Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_like_cuda_complex128 PASSED [0.0106s] [ 63%] 2025-12-04T12:25:43.2652586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_randn_like_cuda_complex32 PASSED [0.0121s] [ 63%] 2025-12-04T12:25:43.2652711Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_bfloat16 PASSED [0.0038s] [ 63%] 2025-12-04T12:25:43.2652838Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_complex128 PASSED [1.0692s] [ 63%] 2025-12-04T12:25:43.2652961Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_ravel_cuda_float64 PASSED [0.0054s] [ 63%] 2025-12-04T12:25:43.2653085Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_complex32 PASSED [0.0038s] [ 63%] 2025-12-04T12:25:43.2653209Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_float16 PASSED [1.0701s] [ 63%] 2025-12-04T12:25:43.2653330Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_int16 PASSED [0.0034s] [ 63%] 2025-12-04T12:25:43.2653451Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_real_cuda_int32 PASSED [1.0637s] [ 63%] 2025-12-04T12:25:43.2653582Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_bfloat16 PASSED [0.0062s] [ 63%] 2025-12-04T12:25:43.2653710Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_int32 PASSED [0.0043s] [ 63%] 2025-12-04T12:25:43.2653850Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reciprocal_cuda_int64 PASSED [1.0628s] [ 63%] 2025-12-04T12:25:43.2653990Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_remainder_cuda_float64 PASSED [0.0124s] [ 63%] 2025-12-04T12:25:43.2654117Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_remainder_cuda_uint8 PASSED [0.0105s] [ 63%] 2025-12-04T12:25:43.2654256Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_complex64 PASSED [1.0668s] [ 63%] 2025-12-04T12:25:43.2654381Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_float16 PASSED [0.0154s] [ 63%] 2025-12-04T12:25:43.2654505Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_float32 PASSED [1.0572s] [ 63%] 2025-12-04T12:25:43.2654628Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_renorm_cuda_float64 PASSED [0.0123s] [ 63%] 2025-12-04T12:25:43.2654755Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_bfloat16 PASSED [0.0298s] [ 63%] 2025-12-04T12:25:43.2654886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_complex128 PASSED [0.0294s] [ 63%] 2025-12-04T12:25:43.2655011Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_float32 PASSED [0.0291s] [ 63%] 2025-12-04T12:25:43.2655135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_float64 PASSED [0.0292s] [ 63%] 2025-12-04T12:25:43.2655258Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_int16 PASSED [0.0291s] [ 63%] 2025-12-04T12:25:43.2655381Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_int32 PASSED [0.0292s] [ 63%] 2025-12-04T12:25:43.2655503Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_cuda_uint8 PASSED [0.0293s] [ 63%] 2025-12-04T12:25:43.2655647Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_bfloat16 PASSED [0.0091s] [ 63%] 2025-12-04T12:25:43.2655785Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_bool PASSED [1.0608s] [ 63%] 2025-12-04T12:25:43.2655931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_complex128 PASSED [0.0109s] [ 63%] 2025-12-04T12:25:43.2656083Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_float16 PASSED [0.0088s] [ 63%] 2025-12-04T12:25:43.2656227Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_float32 PASSED [0.0086s] [ 63%] 2025-12-04T12:25:43.2656366Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_int16 PASSED [0.0085s] [ 63%] 2025-12-04T12:25:43.2656504Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_int32 PASSED [0.0084s] [ 63%] 2025-12-04T12:25:43.2656643Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_int8 PASSED [1.0660s] [ 63%] 2025-12-04T12:25:43.2656781Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_repeat_interleave_cuda_uint8 PASSED [0.0113s] [ 63%] 2025-12-04T12:25:43.2656916Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_complex64 PASSED [1.0517s] [ 63%] 2025-12-04T12:25:43.2657048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_float32 PASSED [0.0053s] [ 63%] 2025-12-04T12:25:43.2657177Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_as_cuda_int64 PASSED [0.0038s] [ 63%] 2025-12-04T12:25:43.2657303Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_bfloat16 PASSED [1.0460s] [ 63%] 2025-12-04T12:25:43.2657427Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_int16 PASSED [0.0062s] [ 63%] 2025-12-04T12:25:43.2657549Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_reshape_cuda_int64 PASSED [0.0047s] [ 63%] 2025-12-04T12:25:43.2657683Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_bool PASSED [0.0048s] [ 63%] 2025-12-04T12:25:43.2657824Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_complex128 PASSED [0.0045s] [ 63%] 2025-12-04T12:25:43.2657947Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_int32 PASSED [0.0044s] [ 63%] 2025-12-04T12:25:43.2658118Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize__cuda_int8 PASSED [0.0044s] [ 63%] 2025-12-04T12:25:43.2658251Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_bfloat16 PASSED [0.0046s] [ 63%] 2025-12-04T12:25:43.2658377Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_bool PASSED [0.0045s] [ 63%] 2025-12-04T12:25:43.2658509Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resize_as__cuda_float16 PASSED [0.0045s] [ 64%] 2025-12-04T12:25:43.2658639Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_bool PASSED [1.0579s] [ 64%] 2025-12-04T12:25:43.2658769Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_int64 PASSED [0.0033s] [ 64%] 2025-12-04T12:25:43.2658899Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_int8 PASSED [1.0683s] [ 64%] 2025-12-04T12:25:43.2659029Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_conj_cuda_uint8 PASSED [0.0033s] [ 64%] 2025-12-04T12:25:43.2659163Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_float32 PASSED [1.0546s] [ 64%] 2025-12-04T12:25:43.2659292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_int32 PASSED [0.0034s] [ 64%] 2025-12-04T12:25:43.2659420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_resolve_neg_cuda_uint8 PASSED [1.0535s] [ 64%] 2025-12-04T12:25:43.2659543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_bfloat16 PASSED [0.0175s] [ 64%] 2025-12-04T12:25:43.2659670Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_complex64 PASSED [1.0792s] [ 64%] 2025-12-04T12:25:43.2659791Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_float16 PASSED [0.0180s] [ 64%] 2025-12-04T12:25:43.2659927Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_float32 PASSED [1.0699s] [ 64%] 2025-12-04T12:25:43.2660048Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_float64 PASSED [0.0180s] [ 64%] 2025-12-04T12:25:43.2660170Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_int64 PASSED [1.0665s] [ 64%] 2025-12-04T12:25:43.2660288Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_roll_cuda_int8 PASSED [0.0176s] [ 64%] 2025-12-04T12:25:43.2660408Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rot90_cuda_bool PASSED [1.0734s] [ 64%] 2025-12-04T12:25:43.2660535Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rot90_cuda_float64 PASSED [0.0256s] [ 64%] 2025-12-04T12:25:43.2660658Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rot90_cuda_uint8 PASSED [1.0775s] [ 64%] 2025-12-04T12:25:43.2660782Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_float16 PASSED [0.0048s] [ 64%] 2025-12-04T12:25:43.2660903Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_int16 PASSED [0.0031s] [ 64%] 2025-12-04T12:25:43.2661024Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_cuda_int32 PASSED [1.0430s] [ 64%] 2025-12-04T12:25:43.2661163Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_0_cuda_float32 PASSED [0.0058s] [ 64%] 2025-12-04T12:25:43.2661303Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_0_cuda_float64 PASSED [0.0042s] [ 64%] 2025-12-04T12:25:43.2661455Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_3_cuda_bfloat16 PASSED [0.0044s] [ 64%] 2025-12-04T12:25:43.2661605Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_3_cuda_float16 PASSED [1.0507s] [ 64%] 2025-12-04T12:25:43.2661749Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_round_decimals_neg_3_cuda_float16 PASSED [0.0059s] [ 64%] 2025-12-04T12:25:43.2661875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_bfloat16 PASSED [0.0045s] [ 64%] 2025-12-04T12:25:43.2662006Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_bool PASSED [1.0563s] [ 64%] 2025-12-04T12:25:43.2662134Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_complex64 PASSED [0.0063s] [ 64%] 2025-12-04T12:25:43.2662255Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_float32 PASSED [0.0044s] [ 64%] 2025-12-04T12:25:43.2662377Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_int64 PASSED [1.0704s] [ 64%] 2025-12-04T12:25:43.2662497Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsqrt_cuda_int8 PASSED [0.0061s] [ 64%] 2025-12-04T12:25:43.2662623Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_complex64 PASSED [0.0125s] [ 64%] 2025-12-04T12:25:43.2662745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_float32 PASSED [0.0119s] [ 64%] 2025-12-04T12:25:43.2662868Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_float64 PASSED [0.0117s] [ 64%] 2025-12-04T12:25:43.2662986Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_int16 PASSED [1.0730s] [ 64%] 2025-12-04T12:25:43.2663106Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_rsub_cuda_uint8 PASSED [0.0144s] [ 64%] 2025-12-04T12:25:43.2663243Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_complex64 PASSED [0.0036s] [ 64%] 2025-12-04T12:25:43.2663375Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scalar_tensor_cuda_int8 PASSED [1.0666s] [ 64%] 2025-12-04T12:25:43.2663510Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_bfloat16 PASSED [0.0112s] [ 64%] 2025-12-04T12:25:43.2663648Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_bool PASSED [0.0086s] [ 64%] 2025-12-04T12:25:43.2663784Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_add_cuda_complex64 PASSED [0.0084s] [ 64%] 2025-12-04T12:25:43.2663911Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_bfloat16 PASSED [0.0205s] [ 64%] 2025-12-04T12:25:43.2664033Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_bool PASSED [0.0148s] [ 64%] 2025-12-04T12:25:43.2664163Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_complex128 PASSED [0.0149s] [ 64%] 2025-12-04T12:25:43.2664292Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_float16 PASSED [1.0747s] [ 64%] 2025-12-04T12:25:43.2664418Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_float32 PASSED [0.0226s] [ 64%] 2025-12-04T12:25:43.2664543Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_cuda_int16 PASSED [0.0152s] [ 64%] 2025-12-04T12:25:43.2664686Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_bfloat16 PASSED [0.0186s] [ 64%] 2025-12-04T12:25:43.2664831Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_float16 PASSED [0.0186s] [ 64%] 2025-12-04T12:25:43.2664976Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amax_cuda_float32 PASSED [0.0185s] [ 64%] 2025-12-04T12:25:43.2665120Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_amin_cuda_float32 PASSED [0.0184s] [ 64%] 2025-12-04T12:25:43.2665270Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_mean_cuda_float16 PASSED [0.0201s] [ 64%] 2025-12-04T12:25:43.2665422Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_mean_cuda_uint8 PASSED [0.0201s] [ 64%] 2025-12-04T12:25:43.2665563Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_float16 PASSED [0.0184s] [ 64%] 2025-12-04T12:25:43.2665705Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_float32 PASSED [0.0184s] [ 64%] 2025-12-04T12:25:43.2665862Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_prod_cuda_int32 PASSED [0.0183s] [ 64%] 2025-12-04T12:25:43.2666001Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_float16 PASSED [0.0184s] [ 64%] 2025-12-04T12:25:43.2666141Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_float64 PASSED [0.0182s] [ 64%] 2025-12-04T12:25:43.2666279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_int16 PASSED [0.0182s] [ 64%] 2025-12-04T12:25:43.2666419Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_int32 PASSED [0.0183s] [ 64%] 2025-12-04T12:25:43.2666557Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_scatter_reduce_sum_cuda_int8 PASSED [0.0184s] [ 64%] 2025-12-04T12:25:43.2666695Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_bfloat16 PASSED [0.2005s] [ 64%] 2025-12-04T12:25:43.2666829Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_float16 PASSED [0.2014s] [ 64%] 2025-12-04T12:25:43.2666963Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_float32 PASSED [0.2018s] [ 64%] 2025-12-04T12:25:43.2667095Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_int32 PASSED [0.2004s] [ 64%] 2025-12-04T12:25:43.2667229Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_searchsorted_cuda_int8 PASSED [0.1995s] [ 64%] 2025-12-04T12:25:43.2668927Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_bfloat16 PASSED [1.0632s] [ 64%] 2025-12-04T12:25:43.2669076Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_bool PASSED [0.0055s] [ 64%] 2025-12-04T12:25:43.2669203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_float64 PASSED [0.0042s] [ 64%] 2025-12-04T12:25:43.2669327Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_cuda_int8 PASSED [1.0611s] [ 64%] 2025-12-04T12:25:43.2669465Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_scatter_cuda_bfloat16 PASSED [0.0093s] [ 64%] 2025-12-04T12:25:43.2669606Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_scatter_cuda_float32 PASSED [1.0626s] [ 64%] 2025-12-04T12:25:43.2669745Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_select_scatter_cuda_float64 PASSED [0.0100s] [ 64%] 2025-12-04T12:25:43.2669871Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_complex32 PASSED [1.1861s] [ 64%] 2025-12-04T12:25:43.2669993Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_float32 PASSED [0.0048s] [ 64%] 2025-12-04T12:25:43.2670113Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_int16 PASSED [1.0586s] [ 64%] 2025-12-04T12:25:43.2670235Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sgn_cuda_int32 PASSED [0.0046s] [ 64%] 2025-12-04T12:25:43.2670360Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_bfloat16 PASSED [1.0577s] [ 64%] 2025-12-04T12:25:43.2670489Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_complex128 PASSED [0.0042s] [ 64%] 2025-12-04T12:25:43.2670611Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_float16 PASSED [1.0519s] [ 64%] 2025-12-04T12:25:43.2670752Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_float32 PASSED [0.0041s] [ 64%] 2025-12-04T12:25:43.2670889Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_int16 PASSED [1.0469s] [ 64%] 2025-12-04T12:25:43.2671015Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_int64 PASSED [0.0040s] [ 64%] 2025-12-04T12:25:43.2671137Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_short_cuda_uint8 PASSED [1.0586s] [ 64%] 2025-12-04T12:25:43.2671279Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_float32 PASSED [0.0056s] [ 64%] 2025-12-04T12:25:43.2671406Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_float64 PASSED [0.0041s] [ 64%] 2025-12-04T12:25:43.2671531Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sigmoid_cuda_int8 PASSED [1.0510s] [ 64%] 2025-12-04T12:25:43.2671654Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_float64 PASSED [0.0048s] [ 64%] 2025-12-04T12:25:43.2671776Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_int16 PASSED [1.0505s] [ 64%] 2025-12-04T12:25:43.2671896Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sign_cuda_uint8 PASSED [0.0045s] [ 64%] 2025-12-04T12:25:43.2672046Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_cosine_cuda_float64 PASSED [0.0110s] [ 64%] 2025-12-04T12:25:43.2672203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_exponential_cuda_float32 PASSED [0.0180s] [ 64%] 2025-12-04T12:25:43.2672359Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_general_cosine_cuda_float32 PASSED [0.0325s] [ 64%] 2025-12-04T12:25:43.2672508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_hamming_cuda_float32 PASSED [0.0319s] [ 64%] 2025-12-04T12:25:43.2672656Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_hamming_cuda_float64 PASSED [0.0317s] [ 64%] 2025-12-04T12:25:43.2672801Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signal_windows_hann_cuda_float32 PASSED [0.0317s] [ 64%] 2025-12-04T12:25:43.2672938Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_bool PASSED [1.0556s] [ 64%] 2025-12-04T12:25:43.2673065Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_float16 PASSED [0.0051s] [ 64%] 2025-12-04T12:25:43.2673196Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_float32 PASSED [0.0035s] [ 64%] 2025-12-04T12:25:43.2673321Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_signbit_cuda_int32 PASSED [1.0609s] [ 64%] 2025-12-04T12:25:43.2673444Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_bfloat16 PASSED [0.0051s] [ 64%] 2025-12-04T12:25:43.2673564Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_bool PASSED [0.0035s] [ 64%] 2025-12-04T12:25:43.2673683Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sin_cuda_int64 PASSED [1.0575s] [ 64%] 2025-12-04T12:25:43.2673807Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_bfloat16 PASSED [0.0081s] [ 64%] 2025-12-04T12:25:43.2673934Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_complex128 PASSED [0.2492s] [ 64%] 2025-12-04T12:25:43.2674058Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_float16 PASSED [0.1825s] [ 64%] 2025-12-04T12:25:43.2674178Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinc_cuda_int32 PASSED [0.0069s] [ 64%] 2025-12-04T12:25:43.2674302Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_bfloat16 PASSED [0.0033s] [ 64%] 2025-12-04T12:25:43.2674423Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_int32 PASSED [1.0615s] [ 64%] 2025-12-04T12:25:43.2674553Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_int64 PASSED [0.0051s] [ 64%] 2025-12-04T12:25:43.2674686Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sinh_cuda_uint8 PASSED [0.0036s] [ 64%] 2025-12-04T12:25:43.2674811Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_bfloat16 PASSED [1.0539s] [ 64%] 2025-12-04T12:25:43.2674931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_bool PASSED [0.0051s] [ 64%] 2025-12-04T12:25:43.2675068Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_float64 PASSED [1.0661s] [ 64%] 2025-12-04T12:25:43.2675189Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_cuda_int8 PASSED [0.0052s] [ 64%] 2025-12-04T12:25:43.2675326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_float32 PASSED [0.0202s] [ 64%] 2025-12-04T12:25:43.2675462Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_int32 PASSED [0.0187s] [ 64%] 2025-12-04T12:25:43.2675595Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_int8 PASSED [0.0185s] [ 64%] 2025-12-04T12:25:43.2675728Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_slice_scatter_cuda_uint8 PASSED [0.0185s] [ 64%] 2025-12-04T12:25:43.2675867Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_int16 PASSED [0.0069s] [ 64%] 2025-12-04T12:25:43.2676007Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_int64 PASSED [0.0067s] [ 64%] 2025-12-04T12:25:43.2676143Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_int8 PASSED [0.0067s] [ 64%] 2025-12-04T12:25:43.2676283Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_softmax_with_dtype_cuda_uint8 PASSED [0.0067s] [ 64%] 2025-12-04T12:25:43.2676406Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sort_cuda_bool PASSED [0.0060s] [ 64%] 2025-12-04T12:25:43.2676531Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sort_cuda_int64 PASSED [0.0167s] [ 64%] 2025-12-04T12:25:43.2676691Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sparse_mm_reduce_cuda_bfloat16 SKIPPED [0.0001s] (Skipped!) [ 64%] 2025-12-04T12:25:43.2676861Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sparse_mm_reduce_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 64%] 2025-12-04T12:25:43.2677027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sparse_sampled_addmm_cuda_complex64 SKIPPED [0.0001s] (Skipped!) [ 64%] 2025-12-04T12:25:43.2677186Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sparse_sampled_addmm_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 64%] 2025-12-04T12:25:43.2677326Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_airy_ai_cuda_int16 PASSED [1.0675s] [ 64%] 2025-12-04T12:25:43.2677463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_airy_ai_cuda_int32 PASSED [0.0061s] [ 64%] 2025-12-04T12:25:43.2677605Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_float64 PASSED [0.1896s] [ 64%] 2025-12-04T12:25:43.2677743Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j0_cuda_int16 PASSED [1.0484s] [ 64%] 2025-12-04T12:25:43.2677882Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j1_cuda_int64 PASSED [0.0065s] [ 64%] 2025-12-04T12:25:43.2678018Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_j1_cuda_uint8 PASSED [0.0046s] [ 64%] 2025-12-04T12:25:43.2678201Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y0_cuda_float32 PASSED [1.0640s] [ 64%] 2025-12-04T12:25:43.2678341Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y0_cuda_float64 PASSED [0.1466s] [ 65%] 2025-12-04T12:25:43.2678497Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y0_cuda_int16 PASSED [0.0046s] [ 65%] 2025-12-04T12:25:43.2678645Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y0_cuda_int64 PASSED [0.0041s] [ 65%] 2025-12-04T12:25:43.2678787Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y1_cuda_float32 PASSED [1.0655s] [ 65%] 2025-12-04T12:25:43.2678927Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y1_cuda_int16 PASSED [0.0061s] [ 65%] 2025-12-04T12:25:43.2679076Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_bessel_y1_cuda_int32 PASSED [0.0044s] [ 65%] 2025-12-04T12:25:43.2679237Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_t_cuda_int64 PASSED [0.0121s] [ 65%] 2025-12-04T12:25:43.2679391Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_t_cuda_int8 PASSED [0.0118s] [ 65%] 2025-12-04T12:25:43.2679549Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_t_cuda_uint8 PASSED [0.0117s] [ 65%] 2025-12-04T12:25:43.2679708Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_u_cuda_float64 PASSED [0.0113s] [ 65%] 2025-12-04T12:25:43.2679867Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_u_cuda_int16 PASSED [0.0117s] [ 65%] 2025-12-04T12:25:43.2680022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_u_cuda_int32 PASSED [0.0117s] [ 65%] 2025-12-04T12:25:43.2680179Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_bool PASSED [0.5487s] [ 65%] 2025-12-04T12:25:43.2680334Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_int16 PASSED [0.0120s] [ 65%] 2025-12-04T12:25:43.2680490Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_int32 PASSED [0.0117s] [ 65%] 2025-12-04T12:25:43.2680644Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_int8 PASSED [0.0117s] [ 65%] 2025-12-04T12:25:43.2680800Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_v_cuda_uint8 PASSED [0.0116s] [ 65%] 2025-12-04T12:25:43.2680974Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_float32 PASSED [0.0108s] [ 65%] 2025-12-04T12:25:43.2681130Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_int16 PASSED [0.0116s] [ 65%] 2025-12-04T12:25:43.2681285Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_chebyshev_polynomial_w_cuda_int64 PASSED [0.0116s] [ 65%] 2025-12-04T12:25:43.2681422Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_bfloat16 PASSED [0.1368s] [ 65%] 2025-12-04T12:25:43.2681559Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_float16 PASSED [0.0063s] [ 65%] 2025-12-04T12:25:43.2681695Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_float64 PASSED [0.0051s] [ 65%] 2025-12-04T12:25:43.2681828Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_int8 PASSED [0.0049s] [ 65%] 2025-12-04T12:25:43.2681960Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_entr_cuda_uint8 PASSED [0.0049s] [ 65%] 2025-12-04T12:25:43.2682094Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_erfcx_cuda_bool PASSED [1.0612s] [ 65%] 2025-12-04T12:25:43.2682231Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_erfcx_cuda_float32 PASSED [0.0075s] [ 65%] 2025-12-04T12:25:43.2682368Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_erfcx_cuda_float64 PASSED [0.0042s] [ 65%] 2025-12-04T12:25:43.2682510Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_erfcx_cuda_int16 PASSED [1.0641s] [ 65%] 2025-12-04T12:25:43.2682663Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_erfcx_cuda_uint8 PASSED [0.0061s] [ 65%] 2025-12-04T12:25:43.2682819Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_h_cuda_int16 PASSED [0.0124s] [ 65%] 2025-12-04T12:25:43.2682985Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_h_cuda_int8 PASSED [0.0117s] [ 65%] 2025-12-04T12:25:43.2683140Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_he_cuda_int16 PASSED [0.0117s] [ 65%] 2025-12-04T12:25:43.2683293Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_he_cuda_int32 PASSED [0.0117s] [ 65%] 2025-12-04T12:25:43.2683448Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_hermite_polynomial_he_cuda_int64 PASSED [0.0116s] [ 65%] 2025-12-04T12:25:43.2683582Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_bfloat16 PASSED [0.1665s] [ 65%] 2025-12-04T12:25:43.2683713Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_bool PASSED [1.0544s] [ 65%] 2025-12-04T12:25:43.2683847Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i0e_cuda_float32 PASSED [0.0052s] [ 65%] 2025-12-04T12:25:43.2683977Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1_cuda_int32 PASSED [0.0040s] [ 65%] 2025-12-04T12:25:43.2684104Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1_cuda_int64 PASSED [1.0583s] [ 65%] 2025-12-04T12:25:43.2684231Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1_cuda_uint8 PASSED [0.0056s] [ 65%] 2025-12-04T12:25:43.2684363Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_bfloat16 PASSED [0.0039s] [ 65%] 2025-12-04T12:25:43.2684493Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_bool PASSED [1.0579s] [ 65%] 2025-12-04T12:25:43.2684623Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_int16 PASSED [0.0054s] [ 65%] 2025-12-04T12:25:43.2684764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_i1e_cuda_uint8 PASSED [0.0038s] [ 65%] 2025-12-04T12:25:43.2684923Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_float64 PASSED [0.0114s] [ 65%] 2025-12-04T12:25:43.2685076Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_laguerre_polynomial_l_cuda_int32 PASSED [0.0124s] [ 65%] 2025-12-04T12:25:43.2685234Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_legendre_polynomial_p_cuda_float32 PASSED [0.0094s] [ 65%] 2025-12-04T12:25:43.2685391Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_legendre_polynomial_p_cuda_int64 PASSED [0.0140s] [ 65%] 2025-12-04T12:25:43.2685528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_bool PASSED [1.0797s] [ 65%] 2025-12-04T12:25:43.2685667Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_float32 PASSED [0.0105s] [ 65%] 2025-12-04T12:25:43.2685809Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_float64 PASSED [0.0073s] [ 65%] 2025-12-04T12:25:43.2685948Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_int16 PASSED [0.0073s] [ 65%] 2025-12-04T12:25:43.2686084Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_log_ndtr_cuda_int64 PASSED [0.0072s] [ 65%] 2025-12-04T12:25:43.2686236Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i0_cuda_int16 PASSED [1.0628s] [ 65%] 2025-12-04T12:25:43.2686405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i0_cuda_int32 PASSED [0.0064s] [ 65%] 2025-12-04T12:25:43.2686569Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_i1_cuda_int16 PASSED [0.0044s] [ 65%] 2025-12-04T12:25:43.2686719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k0_cuda_int64 PASSED [0.0041s] [ 65%] 2025-12-04T12:25:43.2686868Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k0_cuda_uint8 PASSED [1.0501s] [ 65%] 2025-12-04T12:25:43.2687027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k1_cuda_int16 PASSED [0.0055s] [ 65%] 2025-12-04T12:25:43.2687175Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k1_cuda_int32 PASSED [0.0041s] [ 65%] 2025-12-04T12:25:43.2687325Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_modified_bessel_k1_cuda_int64 PASSED [0.0040s] [ 65%] 2025-12-04T12:25:43.2687461Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtr_cuda_bfloat16 PASSED [0.0082s] [ 65%] 2025-12-04T12:25:43.2687598Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtr_cuda_float16 PASSED [0.0080s] [ 65%] 2025-12-04T12:25:43.2687734Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtr_cuda_float32 PASSED [0.0072s] [ 65%] 2025-12-04T12:25:43.2687868Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtr_cuda_float64 PASSED [0.0073s] [ 65%] 2025-12-04T12:25:43.2688000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtr_cuda_int64 PASSED [1.1051s] [ 65%] 2025-12-04T12:25:43.2688192Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtri_cuda_float32 PASSED [0.0075s] [ 65%] 2025-12-04T12:25:43.2688327Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_ndtri_cuda_int16 PASSED [0.0045s] [ 65%] 2025-12-04T12:25:43.2688502Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_bfloat16 PASSED [0.0078s] [ 65%] 2025-12-04T12:25:43.2688670Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_bool PASSED [0.0075s] [ 65%] 2025-12-04T12:25:43.2688856Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_float64 PASSED [0.0075s] [ 65%] 2025-12-04T12:25:43.2689027Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int64 PASSED [1.0861s] [ 65%] 2025-12-04T12:25:43.2689190Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k0_cuda_float32 PASSED [0.0061s] [ 65%] 2025-12-04T12:25:43.2689348Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k0_cuda_int16 PASSED [0.0046s] [ 65%] 2025-12-04T12:25:43.2689507Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k1_cuda_int16 PASSED [0.0042s] [ 65%] 2025-12-04T12:25:43.2689664Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_scaled_modified_bessel_k1_cuda_int64 PASSED [1.0963s] [ 65%] 2025-12-04T12:25:43.2689835Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int32 PASSED [0.0141s] [ 65%] 2025-12-04T12:25:43.2690003Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_int64 PASSED [0.0119s] [ 65%] 2025-12-04T12:25:43.2690170Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_uint8 PASSED [0.0117s] [ 65%] 2025-12-04T12:25:43.2690335Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_bool PASSED [0.0115s] [ 65%] 2025-12-04T12:25:43.2690518Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_float32 PASSED [0.0093s] [ 65%] 2025-12-04T12:25:43.2690696Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int16 PASSED [0.0115s] [ 65%] 2025-12-04T12:25:43.2690862Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_bool PASSED [0.0113s] [ 65%] 2025-12-04T12:25:43.2691044Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_float64 PASSED [0.0092s] [ 65%] 2025-12-04T12:25:43.2691212Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int64 PASSED [0.0115s] [ 65%] 2025-12-04T12:25:43.2691376Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_int32 PASSED [0.0115s] [ 65%] 2025-12-04T12:25:43.2691547Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_int8 PASSED [0.0115s] [ 65%] 2025-12-04T12:25:43.2691701Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_spherical_bessel_j0_cuda_int32 PASSED [1.1254s] [ 65%] 2025-12-04T12:25:43.2691840Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_float16 PASSED [0.0211s] [ 65%] 2025-12-04T12:25:43.2691979Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_float64 PASSED [0.0154s] [ 65%] 2025-12-04T12:25:43.2692117Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_int32 PASSED [0.0178s] [ 65%] 2025-12-04T12:25:43.2692253Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_xlog1py_cuda_int64 PASSED [0.0178s] [ 65%] 2025-12-04T12:25:43.2692384Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_zeta_cuda_int16 PASSED [0.0122s] [ 65%] 2025-12-04T12:25:43.2692517Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_special_zeta_cuda_int32 PASSED [0.0120s] [ 65%] 2025-12-04T12:25:43.2692638Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_bool PASSED [1.0967s] [ 65%] 2025-12-04T12:25:43.2692764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_float64 PASSED [0.0046s] [ 65%] 2025-12-04T12:25:43.2692897Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_int16 PASSED [1.0827s] [ 65%] 2025-12-04T12:25:43.2693021Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_int64 PASSED [0.0045s] [ 65%] 2025-12-04T12:25:43.2693141Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_cuda_uint8 PASSED [1.0809s] [ 65%] 2025-12-04T12:25:43.2693280Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_float16 PASSED [0.0049s] [ 65%] 2025-12-04T12:25:43.2693419Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_list_args_cuda_float64 PASSED [1.0923s] [ 65%] 2025-12-04T12:25:43.2693571Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_complex64 PASSED [0.0059s] [ 65%] 2025-12-04T12:25:43.2693719Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_float32 PASSED [0.0041s] [ 65%] 2025-12-04T12:25:43.2693862Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_copy_cuda_int64 PASSED [1.0924s] [ 65%] 2025-12-04T12:25:43.2694005Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_complex32 PASSED [0.0052s] [ 65%] 2025-12-04T12:25:43.2694143Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_float32 PASSED [0.0038s] [ 65%] 2025-12-04T12:25:43.2694282Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_split_with_sizes_cuda_float64 PASSED [1.0929s] [ 65%] 2025-12-04T12:25:43.2694407Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_bfloat16 PASSED [0.0056s] [ 65%] 2025-12-04T12:25:43.2694545Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_complex128 PASSED [0.0049s] [ 65%] 2025-12-04T12:25:43.2694678Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_float16 PASSED [1.0942s] [ 65%] 2025-12-04T12:25:43.2694802Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_float32 PASSED [0.0045s] [ 65%] 2025-12-04T12:25:43.2694933Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sqrt_cuda_int32 PASSED [0.0035s] [ 65%] 2025-12-04T12:25:43.2695056Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_int32 PASSED [0.0046s] [ 65%] 2025-12-04T12:25:43.2695178Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_int64 PASSED [1.1005s] [ 65%] 2025-12-04T12:25:43.2695301Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_square_cuda_uint8 PASSED [0.0063s] [ 65%] 2025-12-04T12:25:43.2695438Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_complex64 PASSED [0.0075s] [ 65%] 2025-12-04T12:25:43.2695572Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_int64 PASSED [0.0070s] [ 65%] 2025-12-04T12:25:43.2695707Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_copy_cuda_int8 PASSED [0.0069s] [ 65%] 2025-12-04T12:25:43.2695830Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_bool PASSED [0.0046s] [ 65%] 2025-12-04T12:25:43.2695960Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_complex64 PASSED [0.0047s] [ 65%] 2025-12-04T12:25:43.2696087Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_float16 PASSED [1.1031s] [ 65%] 2025-12-04T12:25:43.2696211Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_int64 PASSED [0.0066s] [ 65%] 2025-12-04T12:25:43.2696333Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_cuda_int8 PASSED [0.0048s] [ 65%] 2025-12-04T12:25:43.2696473Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_float64 PASSED [1.0983s] [ 65%] 2025-12-04T12:25:43.2696609Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_int16 PASSED [0.0059s] [ 65%] 2025-12-04T12:25:43.2696757Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_int32 PASSED [0.0043s] [ 65%] 2025-12-04T12:25:43.2696894Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_squeeze_multiple_cuda_uint8 PASSED [1.0821s] [ 65%] 2025-12-04T12:25:43.2697022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_bfloat16 PASSED [0.0124s] [ 65%] 2025-12-04T12:25:43.2697149Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_complex128 PASSED [0.0102s] [ 65%] 2025-12-04T12:25:43.2697277Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_complex64 PASSED [0.0100s] [ 65%] 2025-12-04T12:25:43.2697402Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_float16 PASSED [0.0097s] [ 65%] 2025-12-04T12:25:43.2697525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stack_cuda_int16 PASSED [0.0098s] [ 65%] 2025-12-04T12:25:43.2697651Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_cuda_complex128 PASSED [0.0127s] [ 65%] 2025-12-04T12:25:43.2697776Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_cuda_float32 PASSED [0.0116s] [ 65%] 2025-12-04T12:25:43.2697919Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_unbiased_cuda_complex64 PASSED [1.1027s] [ 65%] 2025-12-04T12:25:43.2698060Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_unbiased_cuda_float16 PASSED [0.0069s] [ 65%] 2025-12-04T12:25:43.2698261Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_mean_unbiased_cuda_float64 PASSED [0.0043s] [ 66%] 2025-12-04T12:25:43.2698420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_unbiased_cuda_complex128 PASSED [1.0840s] [ 66%] 2025-12-04T12:25:43.2698568Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_std_unbiased_cuda_complex64 PASSED [0.0063s] [ 66%] 2025-12-04T12:25:43.2698694Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stft_cuda_complex64 PASSED [0.6703s] [ 66%] 2025-12-04T12:25:43.2698831Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stft_cuda_float32 PASSED [0.0223s] [ 66%] 2025-12-04T12:25:43.2698952Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_stft_cuda_float64 PASSED [0.3571s] [ 66%] 2025-12-04T12:25:43.2699075Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_complex32 PASSED [1.1154s] [ 66%] 2025-12-04T12:25:43.2699197Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_int16 PASSED [0.0139s] [ 66%] 2025-12-04T12:25:43.2699320Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_int32 PASSED [0.0114s] [ 66%] 2025-12-04T12:25:43.2699440Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_int8 PASSED [0.0113s] [ 66%] 2025-12-04T12:25:43.2699560Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sub_cuda_uint8 PASSED [0.0111s] [ 66%] 2025-12-04T12:25:43.2699678Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_bool PASSED [1.1151s] [ 66%] 2025-12-04T12:25:43.2699803Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_complex32 PASSED [0.0202s] [ 66%] 2025-12-04T12:25:43.2699923Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_cuda_int16 PASSED [0.0171s] [ 66%] 2025-12-04T12:25:43.2700055Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_int16 PASSED [0.0130s] [ 66%] 2025-12-04T12:25:43.2700184Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_int32 PASSED [0.0128s] [ 66%] 2025-12-04T12:25:43.2700313Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_int64 PASSED [0.0102s] [ 66%] 2025-12-04T12:25:43.2700441Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_int8 PASSED [1.0894s] [ 66%] 2025-12-04T12:25:43.2700584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_sum_to_size_cuda_uint8 PASSED [0.0153s] [ 66%] 2025-12-04T12:25:43.2700723Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_svd_lowrank_cuda_complex64 PASSED [0.3391s] [ 66%] 2025-12-04T12:25:43.2700854Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_svd_lowrank_cuda_float32 PASSED [0.3343s] [ 66%] 2025-12-04T12:25:43.2700979Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_float64 PASSED [0.0044s] [ 66%] 2025-12-04T12:25:43.2701101Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_int32 PASSED [0.0039s] [ 66%] 2025-12-04T12:25:43.2701225Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_int64 PASSED [1.0957s] [ 66%] 2025-12-04T12:25:43.2701347Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_copy_cuda_uint8 PASSED [0.0061s] [ 66%] 2025-12-04T12:25:43.2701466Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_bool PASSED [0.0035s] [ 66%] 2025-12-04T12:25:43.2701584Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_float32 PASSED [1.0902s] [ 66%] 2025-12-04T12:25:43.2701704Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_float64 PASSED [0.0047s] [ 66%] 2025-12-04T12:25:43.2701820Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_t_cuda_int32 PASSED [1.0730s] [ 66%] 2025-12-04T12:25:43.2701964Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_complex128 PASSED [0.0140s] [ 66%] 2025-12-04T12:25:43.2702113Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_along_dim_cuda_float64 PASSED [1.0679s] [ 66%] 2025-12-04T12:25:43.2702252Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_complex128 PASSED [0.0099s] [ 66%] 2025-12-04T12:25:43.2702377Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_complex64 PASSED [0.0083s] [ 66%] 2025-12-04T12:25:43.2702502Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_float32 PASSED [0.0081s] [ 66%] 2025-12-04T12:25:43.2702633Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_float64 PASSED [1.0554s] [ 66%] 2025-12-04T12:25:43.2702756Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_int16 PASSED [0.0101s] [ 66%] 2025-12-04T12:25:43.2702875Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_int64 PASSED [0.0084s] [ 66%] 2025-12-04T12:25:43.2702995Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_take_cuda_uint8 PASSED [0.0081s] [ 66%] 2025-12-04T12:25:43.2703114Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_bool PASSED [1.0571s] [ 66%] 2025-12-04T12:25:43.2703238Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_complex32 PASSED [0.0051s] [ 66%] 2025-12-04T12:25:43.2703365Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_complex64 PASSED [0.0033s] [ 66%] 2025-12-04T12:25:43.2703487Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tan_cuda_float32 PASSED [1.0568s] [ 66%] 2025-12-04T12:25:43.2703616Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_complex128 PASSED [0.0044s] [ 66%] 2025-12-04T12:25:43.2703737Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_float32 PASSED [1.0538s] [ 66%] 2025-12-04T12:25:43.2703862Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_float64 PASSED [0.0044s] [ 66%] 2025-12-04T12:25:43.2703982Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tanh_cuda_uint8 PASSED [0.0034s] [ 66%] 2025-12-04T12:25:43.2704119Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_bfloat16 PASSED [0.0081s] [ 66%] 2025-12-04T12:25:43.2704254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_float16 PASSED [0.0078s] [ 66%] 2025-12-04T12:25:43.2704400Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensor_split_cuda_float64 PASSED [1.0627s] [ 66%] 2025-12-04T12:25:43.2704533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensordot_cuda_complex64 PASSED [0.0145s] [ 66%] 2025-12-04T12:25:43.2704664Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tensordot_cuda_float32 PASSED [0.0121s] [ 66%] 2025-12-04T12:25:43.2704783Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_int64 PASSED [0.0387s] [ 66%] 2025-12-04T12:25:43.2704905Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tile_cuda_uint8 PASSED [0.0384s] [ 66%] 2025-12-04T12:25:43.2705022Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_bool PASSED [1.0602s] [ 66%] 2025-12-04T12:25:43.2705148Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_complex128 PASSED [0.0079s] [ 66%] 2025-12-04T12:25:43.2705266Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_cuda_uint8 PASSED [1.0772s] [ 66%] 2025-12-04T12:25:43.2705397Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_bfloat16 PASSED [0.0050s] [ 66%] 2025-12-04T12:25:43.2705534Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_complex128 PASSED [1.0608s] [ 66%] 2025-12-04T12:25:43.2705665Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_complex64 PASSED [0.0049s] [ 66%] 2025-12-04T12:25:43.2705794Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_float16 PASSED [1.0583s] [ 66%] 2025-12-04T12:25:43.2705932Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_float32 PASSED [0.0050s] [ 66%] 2025-12-04T12:25:43.2706070Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_to_sparse_cuda_int16 PASSED [1.0636s] [ 66%] 2025-12-04T12:25:43.2706192Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_topk_cuda_int32 PASSED [0.0084s] [ 66%] 2025-12-04T12:25:43.2706314Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_topk_cuda_int8 PASSED [0.0068s] [ 66%] 2025-12-04T12:25:43.2706593Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__efficient_attention_forward_cuda_float32 SKIPPED [0.0006s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 66%] 2025-12-04T12:25:43.2706764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__flash_attention_forward_cuda_float16 PASSED [0.0181s] [ 66%] 2025-12-04T12:25:43.2706930Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16 PASSED [0.0102s] [ 66%] 2025-12-04T12:25:43.2707093Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_bool PASSED [0.0096s] [ 66%] 2025-12-04T12:25:43.2707253Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_float16 PASSED [0.0096s] [ 66%] 2025-12-04T12:25:43.2707413Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_uint8 PASSED [0.0096s] [ 66%] 2025-12-04T12:25:43.2707544Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_complex64 PASSED [1.0720s] [ 66%] 2025-12-04T12:25:43.2707668Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_float16 PASSED [0.0049s] [ 66%] 2025-12-04T12:25:43.2707793Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trace_cuda_float64 PASSED [1.0872s] [ 66%] 2025-12-04T12:25:43.2707934Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_complex32 PASSED [0.0093s] [ 66%] 2025-12-04T12:25:43.2708121Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_complex64 PASSED [1.0683s] [ 66%] 2025-12-04T12:25:43.2708258Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_float32 PASSED [0.0087s] [ 66%] 2025-12-04T12:25:43.2708410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_int16 PASSED [1.0842s] [ 66%] 2025-12-04T12:25:43.2708545Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_copy_cuda_int32 PASSED [0.0089s] [ 66%] 2025-12-04T12:25:43.2708672Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_bool PASSED [1.0637s] [ 66%] 2025-12-04T12:25:43.2708805Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_complex32 PASSED [0.0063s] [ 66%] 2025-12-04T12:25:43.2708942Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_complex64 PASSED [0.0048s] [ 66%] 2025-12-04T12:25:43.2709072Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_float16 PASSED [0.0046s] [ 66%] 2025-12-04T12:25:43.2709203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_float64 PASSED [1.0624s] [ 66%] 2025-12-04T12:25:43.2709332Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_int32 PASSED [0.0063s] [ 66%] 2025-12-04T12:25:43.2709459Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_transpose_cuda_uint8 PASSED [0.0048s] [ 66%] 2025-12-04T12:25:43.2709590Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_bfloat16 PASSED [0.0461s] [ 66%] 2025-12-04T12:25:43.2709724Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_complex128 PASSED [0.0333s] [ 66%] 2025-12-04T12:25:43.2709853Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_float16 PASSED [0.0472s] [ 66%] 2025-12-04T12:25:43.2710000Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_float32 PASSED [1.1082s] [ 66%] 2025-12-04T12:25:43.2710146Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapezoid_cuda_float64 PASSED [0.0359s] [ 66%] 2025-12-04T12:25:43.2710270Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_float32 PASSED [0.0335s] [ 66%] 2025-12-04T12:25:43.2710407Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trapz_cuda_int32 PASSED [0.0359s] [ 66%] 2025-12-04T12:25:43.2710528Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_cuda_int16 PASSED [0.0125s] [ 66%] 2025-12-04T12:25:43.2710659Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_tril_indices_cuda_int64 PASSED [0.0130s] [ 66%] 2025-12-04T12:25:43.2710786Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_complex128 PASSED [0.0125s] [ 66%] 2025-12-04T12:25:43.2710910Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_float32 PASSED [0.0123s] [ 66%] 2025-12-04T12:25:43.2711035Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_int16 PASSED [0.0122s] [ 66%] 2025-12-04T12:25:43.2711157Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_int32 PASSED [1.0747s] [ 66%] 2025-12-04T12:25:43.2711277Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_int64 PASSED [0.0144s] [ 66%] 2025-12-04T12:25:43.2711397Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_triu_cuda_uint8 PASSED [0.0123s] [ 66%] 2025-12-04T12:25:43.2711525Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_bool PASSED [0.0129s] [ 66%] 2025-12-04T12:25:43.2711662Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_complex128 PASSED [0.0103s] [ 66%] 2025-12-04T12:25:43.2711800Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_complex32 PASSED [0.0136s] [ 66%] 2025-12-04T12:25:43.2711934Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_float16 PASSED [0.0135s] [ 66%] 2025-12-04T12:25:43.2712064Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_float32 PASSED [0.0102s] [ 66%] 2025-12-04T12:25:43.2712214Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_float64 PASSED [0.0102s] [ 66%] 2025-12-04T12:25:43.2712345Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_int32 PASSED [0.0125s] [ 66%] 2025-12-04T12:25:43.2712472Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_true_divide_cuda_int8 PASSED [0.0126s] [ 66%] 2025-12-04T12:25:43.2712596Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_int32 PASSED [0.0028s] [ 66%] 2025-12-04T12:25:43.2712717Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_trunc_cuda_int64 PASSED [1.0730s] [ 66%] 2025-12-04T12:25:43.2712854Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_complex128 PASSED [0.0077s] [ 66%] 2025-12-04T12:25:43.2712987Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_float32 PASSED [0.0056s] [ 66%] 2025-12-04T12:25:43.2713120Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_float64 PASSED [0.0054s] [ 66%] 2025-12-04T12:25:43.2713333Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_int16 PASSED [0.0053s] [ 66%] 2025-12-04T12:25:43.2713463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_copy_cuda_int32 PASSED [1.0590s] [ 66%] 2025-12-04T12:25:43.2713586Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_bool PASSED [0.0065s] [ 66%] 2025-12-04T12:25:43.2713711Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_float16 PASSED [0.0051s] [ 66%] 2025-12-04T12:25:43.2713848Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_float32 PASSED [0.0048s] [ 66%] 2025-12-04T12:25:43.2713984Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_int32 PASSED [1.0787s] [ 66%] 2025-12-04T12:25:43.2714105Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unbind_cuda_int64 PASSED [0.0066s] [ 66%] 2025-12-04T12:25:43.2714235Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_bool PASSED [0.0056s] [ 66%] 2025-12-04T12:25:43.2714377Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_float64 PASSED [0.0053s] [ 66%] 2025-12-04T12:25:43.2714505Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_int16 PASSED [0.0053s] [ 66%] 2025-12-04T12:25:43.2714630Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_int32 PASSED [0.0052s] [ 66%] 2025-12-04T12:25:43.2714758Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_int64 PASSED [0.0053s] [ 66%] 2025-12-04T12:25:43.2714886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unflatten_cuda_uint8 PASSED [0.0053s] [ 66%] 2025-12-04T12:25:43.2715019Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_bfloat16 PASSED [0.0147s] [ 66%] 2025-12-04T12:25:43.2715155Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_complex32 PASSED [0.0145s] [ 66%] 2025-12-04T12:25:43.2715287Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_float64 PASSED [0.0143s] [ 66%] 2025-12-04T12:25:43.2715420Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_int16 PASSED [0.0143s] [ 66%] 2025-12-04T12:25:43.2715552Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_copy_cuda_uint8 PASSED [0.0143s] [ 66%] 2025-12-04T12:25:43.2715679Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_float64 PASSED [0.0084s] [ 66%] 2025-12-04T12:25:43.2715801Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unfold_cuda_int8 PASSED [0.0084s] [ 66%] 2025-12-04T12:25:43.2715931Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_uniform_cuda_bfloat16 PASSED [1.0762s] [ 66%] 2025-12-04T12:25:43.2716073Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_uniform_cuda_complex128 PASSED [0.0052s] [ 66%] 2025-12-04T12:25:43.2716203Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_uniform_cuda_complex64 PASSED [1.0878s] [ 66%] 2025-12-04T12:25:43.2716329Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_uniform_cuda_float16 PASSED [0.0057s] [ 66%] 2025-12-04T12:25:43.2716471Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_float16 PASSED [0.1002s] [ 67%] 2025-12-04T12:25:43.2716614Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_float32 PASSED [0.0997s] [ 67%] 2025-12-04T12:25:43.2716756Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_int32 PASSED [0.0984s] [ 67%] 2025-12-04T12:25:43.2716895Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_int8 PASSED [0.0983s] [ 67%] 2025-12-04T12:25:43.2717035Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_consecutive_cuda_uint8 PASSED [0.1016s] [ 67%] 2025-12-04T12:25:43.2717162Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_bfloat16 PASSED [0.2051s] [ 67%] 2025-12-04T12:25:43.2717284Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_bool PASSED [0.1963s] [ 67%] 2025-12-04T12:25:43.2717410Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_float16 PASSED [0.2112s] [ 67%] 2025-12-04T12:25:43.2717533Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_float32 PASSED [0.2039s] [ 67%] 2025-12-04T12:25:43.2717669Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unique_cuda_int16 PASSED [0.2021s] [ 67%] 2025-12-04T12:25:43.2717814Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unravel_index_cuda_int32 PASSED [0.0618s] [ 67%] 2025-12-04T12:25:43.2717949Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_bfloat16 PASSED [1.0824s] [ 67%] 2025-12-04T12:25:43.2718144Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_bool PASSED [0.0050s] [ 67%] 2025-12-04T12:25:43.2718299Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_complex64 PASSED [1.0953s] [ 67%] 2025-12-04T12:25:43.2718430Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_chunk_cuda_uint8 PASSED [0.0049s] [ 67%] 2025-12-04T12:25:43.2718565Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_bfloat16 PASSED [1.0709s] [ 67%] 2025-12-04T12:25:43.2718702Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_complex128 PASSED [0.0048s] [ 67%] 2025-12-04T12:25:43.2718835Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_int32 PASSED [1.0720s] [ 67%] 2025-12-04T12:25:43.2718965Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_int64 PASSED [0.0049s] [ 67%] 2025-12-04T12:25:43.2719096Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsafe_split_cuda_uint8 PASSED [1.0670s] [ 67%] 2025-12-04T12:25:43.2719237Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_complex64 PASSED [0.0097s] [ 67%] 2025-12-04T12:25:43.2719373Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_float16 PASSED [0.0076s] [ 67%] 2025-12-04T12:25:43.2719508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_copy_cuda_int8 PASSED [0.0073s] [ 67%] 2025-12-04T12:25:43.2719639Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_bfloat16 PASSED [0.0048s] [ 67%] 2025-12-04T12:25:43.2719769Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_float16 PASSED [0.0048s] [ 67%] 2025-12-04T12:25:43.2719901Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_float64 PASSED [1.0752s] [ 67%] 2025-12-04T12:25:43.2720047Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_unsqueeze_cuda_int64 PASSED [0.0070s] [ 67%] 2025-12-04T12:25:43.2720173Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_cuda_complex128 PASSED [0.0113s] [ 67%] 2025-12-04T12:25:43.2720295Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_cuda_float16 PASSED [0.0139s] [ 67%] 2025-12-04T12:25:43.2720417Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_cuda_float32 PASSED [0.0101s] [ 67%] 2025-12-04T12:25:43.2720547Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_cuda_complex64 PASSED [0.0156s] [ 67%] 2025-12-04T12:25:43.2720674Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_cuda_float16 PASSED [0.0210s] [ 67%] 2025-12-04T12:25:43.2720818Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_mean_unbiased_cuda_complex64 PASSED [0.0040s] [ 67%] 2025-12-04T12:25:43.2720954Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_unbiased_cuda_bfloat16 PASSED [1.0737s] [ 67%] 2025-12-04T12:25:43.2721088Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_var_unbiased_cuda_float16 PASSED [0.0059s] [ 67%] 2025-12-04T12:25:43.2721213Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vdot_cuda_bfloat16 PASSED [0.0038s] [ 67%] 2025-12-04T12:25:43.2721334Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vdot_cuda_float16 PASSED [1.0758s] [ 67%] 2025-12-04T12:25:43.2721463Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_complex64 PASSED [0.0057s] [ 67%] 2025-12-04T12:25:43.2721603Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_float16 PASSED [0.0040s] [ 67%] 2025-12-04T12:25:43.2721742Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_float64 PASSED [1.0734s] [ 67%] 2025-12-04T12:25:43.2721866Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_int16 PASSED [0.0053s] [ 67%] 2025-12-04T12:25:43.2721989Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_int32 PASSED [0.0038s] [ 67%] 2025-12-04T12:25:43.2722122Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_as_cuda_int64 PASSED [1.0716s] [ 67%] 2025-12-04T12:25:43.2722258Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_complex64 PASSED [0.0089s] [ 67%] 2025-12-04T12:25:43.2722386Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_float16 PASSED [1.0598s] [ 67%] 2025-12-04T12:25:43.2722516Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_float32 PASSED [0.0087s] [ 67%] 2025-12-04T12:25:43.2722641Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_copy_cuda_int64 PASSED [1.0665s] [ 67%] 2025-12-04T12:25:43.2722763Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_bool PASSED [0.0067s] [ 67%] 2025-12-04T12:25:43.2722886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_float16 PASSED [0.0048s] [ 67%] 2025-12-04T12:25:43.2723006Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_view_cuda_int8 PASSED [1.0595s] [ 67%] 2025-12-04T12:25:43.2723135Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_complex32 PASSED [0.0051s] [ 67%] 2025-12-04T12:25:43.2723260Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_float32 PASSED [0.0037s] [ 67%] 2025-12-04T12:25:43.2723385Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_float64 PASSED [1.0781s] [ 67%] 2025-12-04T12:25:43.2723508Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_int16 PASSED [0.0049s] [ 67%] 2025-12-04T12:25:43.2723631Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_int64 PASSED [0.0036s] [ 67%] 2025-12-04T12:25:43.2723764Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vsplit_cuda_int8 PASSED [1.0665s] [ 67%] 2025-12-04T12:25:43.2723886Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_bool PASSED [0.0072s] [ 67%] 2025-12-04T12:25:43.2724012Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_float32 PASSED [1.0783s] [ 67%] 2025-12-04T12:25:43.2724133Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_int8 PASSED [0.0072s] [ 67%] 2025-12-04T12:25:43.2724254Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_vstack_cuda_uint8 PASSED [1.1482s] [ 67%] 2025-12-04T12:25:43.2724383Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_complex64 PASSED [0.0110s] [ 67%] 2025-12-04T12:25:43.2724506Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_float16 PASSED [0.0086s] [ 67%] 2025-12-04T12:25:43.2724629Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_float32 PASSED [0.0084s] [ 67%] 2025-12-04T12:25:43.2724750Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_int16 PASSED [1.1658s] [ 67%] 2025-12-04T12:25:43.2724871Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_where_cuda_int8 PASSED [0.0108s] [ 67%] 2025-12-04T12:25:43.2724995Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_xlogy_cuda_bfloat16 PASSED [0.0196s] [ 67%] 2025-12-04T12:25:43.2725116Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_xlogy_cuda_bool PASSED [0.0179s] [ 67%] 2025-12-04T12:25:43.2725236Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_xlogy_cuda_int32 PASSED [0.0177s] [ 67%] 2025-12-04T12:25:43.2725369Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_float32 PASSED [0.0045s] [ 67%] 2025-12-04T12:25:43.2725499Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_int64 PASSED [0.0044s] [ 67%] 2025-12-04T12:25:43.2725620Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zero__cuda_int8 PASSED [0.0044s] [ 67%] 2025-12-04T12:25:43.2725741Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_float32 PASSED [1.1702s] [ 67%] 2025-12-04T12:25:43.2725879Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_int16 PASSED [0.0047s] [ 67%] 2025-12-04T12:25:43.2726001Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_cuda_uint8 PASSED [1.1623s] [ 67%] 2025-12-04T12:25:43.2726133Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_bfloat16 PASSED [0.0090s] [ 67%] 2025-12-04T12:25:43.2726270Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_complex128 PASSED [1.1540s] [ 67%] 2025-12-04T12:25:43.2726405Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_float64 PASSED [0.0093s] [ 67%] 2025-12-04T12:25:43.2726534Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_int16 PASSED [1.1712s] [ 67%] 2025-12-04T12:25:43.2726662Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_int64 PASSED [0.0089s] [ 67%] 2025-12-04T12:25:43.2726790Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_int8 PASSED [1.1573s] [ 67%] 2025-12-04T12:25:43.2726917Z test_meta.py::TestMetaCUDA::test_dispatch_symbolic_meta_outplace_zeros_like_cuda_uint8 PASSED [0.0095s] [ 67%] 2025-12-04T12:25:43.2727020Z test_meta.py::TestMetaCUDA::test_embedding_bag_byte_unpack_cuda PASSED [0.0027s] [ 67%] 2025-12-04T12:25:43.2727146Z test_meta.py::TestMetaCUDA::test_embedding_bag_dense_backward_per_sample_weights_cuda PASSED [0.0111s] [ 67%] 2025-12-04T12:25:43.2727240Z test_meta.py::TestMetaCUDA::test_fill_stride_cuda PASSED [0.0012s] [ 67%] 2025-12-04T12:25:43.2727374Z test_meta.py::TestMetaCUDA::test_group_norm_backward_output_mask0_cuda SKIPPED [0.0006s] (Only runs on cpu) [ 67%] 2025-12-04T12:25:43.2727519Z test_meta.py::TestMetaCUDA::test_group_norm_backward_output_mask1_cuda SKIPPED [0.0005s] (Only runs on cpu) [ 67%] 2025-12-04T12:25:43.2727651Z test_meta.py::TestMetaCUDA::test_group_norm_backward_output_mask2_cuda SKIPPED [0.0005s] (Only runs on cpu) [ 67%] 2025-12-04T12:25:43.2727830Z test_meta.py::TestMetaCUDA::test_group_norm_backward_output_mask4_cuda SKIPPED [0.0006s] (Only runs on cpu) [ 67%] 2025-12-04T12:25:43.2727930Z test_meta.py::TestMetaCUDA::test_inplace_masked_fill_error_cuda PASSED [0.0010s] [ 67%] 2025-12-04T12:25:43.2728043Z test_meta.py::TestMetaCUDA::test_meta__fused_moving_avg_obs_fq_helper_cuda PASSED [0.0493s] [ 67%] 2025-12-04T12:25:43.2728262Z test_meta.py::TestMetaCUDA::test_meta_consistency_out_dtype_mismatch_pow_Tensor_Scalar_cuda PASSED [0.0016s] [ 67%] 2025-12-04T12:25:43.2728406Z test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_complex32 SKIPPED [0.0011s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2728549Z test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2728688Z test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2728825Z test_meta.py::TestMetaCUDA::test_meta_inplace_H_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2728968Z test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2729105Z test_meta.py::TestMetaCUDA::test_meta_inplace_T_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2729275Z test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2729435Z test_meta.py::TestMetaCUDA::test_meta_inplace___getitem___cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2729577Z test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2729728Z test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2729890Z test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2730033Z test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2730174Z test_meta.py::TestMetaCUDA::test_meta_inplace___radd___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2730315Z test_meta.py::TestMetaCUDA::test_meta_inplace___rand___cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2730455Z test_meta.py::TestMetaCUDA::test_meta_inplace___rdiv___cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2730606Z test_meta.py::TestMetaCUDA::test_meta_inplace___rdiv___cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2730751Z test_meta.py::TestMetaCUDA::test_meta_inplace___rdiv___cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2730893Z test_meta.py::TestMetaCUDA::test_meta_inplace___rdiv___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2731036Z test_meta.py::TestMetaCUDA::test_meta_inplace___rdiv___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2731190Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmatmul___cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2731351Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmatmul___cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2731496Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2731655Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2731796Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2731937Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2732075Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmod___cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2732223Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2732363Z test_meta.py::TestMetaCUDA::test_meta_inplace___rmul___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2732505Z test_meta.py::TestMetaCUDA::test_meta_inplace___ror___cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2732645Z test_meta.py::TestMetaCUDA::test_meta_inplace___ror___cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2732792Z test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2732941Z test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2733085Z test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2733240Z test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2733392Z test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2733533Z test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2733683Z test_meta.py::TestMetaCUDA::test_meta_inplace___rpow___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2733831Z test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2733974Z test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2734115Z test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2734256Z test_meta.py::TestMetaCUDA::test_meta_inplace___rsub___cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2734398Z test_meta.py::TestMetaCUDA::test_meta_inplace___rxor___cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2734565Z test_meta.py::TestMetaCUDA::test_meta_inplace__batch_norm_with_update_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2734715Z test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2734870Z test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2735018Z test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 67%] 2025-12-04T12:25:43.2735168Z test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T12:25:43.2735315Z test_meta.py::TestMetaCUDA::test_meta_inplace__chunk_cat_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T12:25:43.2735439Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_float64 PASSED [0.0150s] [ 68%] 2025-12-04T12:25:43.2735547Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_abs_cuda_int64 PASSED [0.0144s] [ 68%] 2025-12-04T12:25:43.2735660Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_bfloat16 PASSED [0.0145s] [ 68%] 2025-12-04T12:25:43.2735765Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_bool XFAIL [0.0055s] [ 68%] 2025-12-04T12:25:43.2735877Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_complex64 PASSED [1.1769s] [ 68%] 2025-12-04T12:25:43.2735984Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_int16 XFAIL [0.0058s] [ 68%] 2025-12-04T12:25:43.2736090Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_int64 XFAIL [1.1582s] [ 68%] 2025-12-04T12:25:43.2736195Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_acos_cuda_uint8 XFAIL [1.1676s] [ 68%] 2025-12-04T12:25:43.2736300Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_bool XFAIL [1.1588s] [ 68%] 2025-12-04T12:25:43.2736410Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_complex128 XFAIL [1.1034s] [ 68%] 2025-12-04T12:25:43.2736519Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_float16 XFAIL [1.0828s] [ 68%] 2025-12-04T12:25:43.2736622Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_add_cuda_int32 XFAIL [1.0769s] [ 68%] 2025-12-04T12:25:43.2736737Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_bfloat16 PASSED [1.2358s] [ 68%] 2025-12-04T12:25:43.2736857Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_bool XFAIL [0.0074s] [ 68%] 2025-12-04T12:25:43.2736984Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_complex128 PASSED [1.2841s] [ 68%] 2025-12-04T12:25:43.2737100Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_complex64 PASSED [0.1978s] [ 68%] 2025-12-04T12:25:43.2737214Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_float64 PASSED [0.1496s] [ 68%] 2025-12-04T12:25:43.2737323Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_int32 XFAIL [0.0077s] [ 68%] 2025-12-04T12:25:43.2737446Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcdiv_cuda_int64 XFAIL [1.0826s] [ 68%] 2025-12-04T12:25:43.2737554Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcmul_cuda_bool XFAIL [1.0736s] [ 68%] 2025-12-04T12:25:43.2737667Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcmul_cuda_float64 PASSED [1.2183s] [ 68%] 2025-12-04T12:25:43.2737777Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcmul_cuda_int64 PASSED [0.1030s] [ 68%] 2025-12-04T12:25:43.2737886Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcmul_cuda_int8 PASSED [0.1020s] [ 68%] 2025-12-04T12:25:43.2737996Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_addcmul_cuda_uint8 PASSED [0.1020s] [ 68%] 2025-12-04T12:25:43.2738153Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_complex128 PASSED [0.0145s] [ 68%] 2025-12-04T12:25:43.2738266Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_complex64 PASSED [0.0144s] [ 68%] 2025-12-04T12:25:43.2738373Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_float16 PASSED [0.0144s] [ 68%] 2025-12-04T12:25:43.2738481Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_float64 PASSED [0.0143s] [ 68%] 2025-12-04T12:25:43.2738586Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_int16 XFAIL [0.0054s] [ 68%] 2025-12-04T12:25:43.2738690Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_asin_cuda_int32 XFAIL [0.0054s] [ 68%] 2025-12-04T12:25:43.2738799Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_bfloat16 PASSED [1.0922s] [ 68%] 2025-12-04T12:25:43.2738911Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_complex64 PASSED [0.0148s] [ 68%] 2025-12-04T12:25:43.2739033Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_float64 PASSED [0.0143s] [ 68%] 2025-12-04T12:25:43.2739138Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_int64 XFAIL [0.0055s] [ 68%] 2025-12-04T12:25:43.2739244Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_int8 XFAIL [1.0860s] [ 68%] 2025-12-04T12:25:43.2739349Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_atan_cuda_uint8 XFAIL [1.0648s] [ 68%] 2025-12-04T12:25:43.2739456Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_float16 PASSED [1.0827s] [ 68%] 2025-12-04T12:25:43.2739566Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_ceil_cuda_int16 PASSED [0.0145s] [ 68%] 2025-12-04T12:25:43.2739676Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_bool XFAIL [0.0140s] [ 68%] 2025-12-04T12:25:43.2739793Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_complex64 XFAIL [1.0789s] [ 68%] 2025-12-04T12:25:43.2739907Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_int64 PASSED [1.1825s] [ 68%] 2025-12-04T12:25:43.2740019Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_max_cuda_uint8 PASSED [0.1012s] [ 68%] 2025-12-04T12:25:43.2740136Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_float16 PASSED [0.1504s] [ 68%] 2025-12-04T12:25:43.2740249Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_float32 PASSED [0.1484s] [ 68%] 2025-12-04T12:25:43.2740360Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_int8 PASSED [0.1014s] [ 68%] 2025-12-04T12:25:43.2740471Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_clamp_min_cuda_uint8 PASSED [0.1012s] [ 68%] 2025-12-04T12:25:43.2740592Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_float32 PASSED [0.0150s] [ 68%] 2025-12-04T12:25:43.2740716Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_copy_cuda_int8 PASSED [0.0149s] [ 68%] 2025-12-04T12:25:43.2740821Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_bool XFAIL [0.0054s] [ 68%] 2025-12-04T12:25:43.2740933Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_float32 PASSED [0.0145s] [ 68%] 2025-12-04T12:25:43.2741053Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_float64 PASSED [0.0143s] [ 68%] 2025-12-04T12:25:43.2741155Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_int16 XFAIL [0.0054s] [ 68%] 2025-12-04T12:25:43.2741258Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cos_cuda_int8 XFAIL [0.0053s] [ 68%] 2025-12-04T12:25:43.2741370Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_complex128 PASSED [1.0944s] [ 68%] 2025-12-04T12:25:43.2741475Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_int32 XFAIL [0.0059s] [ 68%] 2025-12-04T12:25:43.2741577Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_int8 XFAIL [1.0700s] [ 68%] 2025-12-04T12:25:43.2741682Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_cosh_cuda_uint8 XFAIL [1.0632s] [ 68%] 2025-12-04T12:25:43.2741790Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_bfloat16 PASSED [1.1479s] [ 68%] 2025-12-04T12:25:43.2741894Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_bool XFAIL [0.0066s] [ 68%] 2025-12-04T12:25:43.2742005Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_complex128 PASSED [1.1597s] [ 68%] 2025-12-04T12:25:43.2742112Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_float16 PASSED [0.0749s] [ 68%] 2025-12-04T12:25:43.2742216Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_div_cuda_uint8 XFAIL [0.0065s] [ 68%] 2025-12-04T12:25:43.2742323Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erf_cuda_complex64 XFAIL [1.0693s] [ 68%] 2025-12-04T12:25:43.2742431Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erf_cuda_float64 PASSED [1.0918s] [ 68%] 2025-12-04T12:25:43.2742540Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_bfloat16 PASSED [0.0149s] [ 68%] 2025-12-04T12:25:43.2742665Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_complex128 XFAIL [0.0057s] [ 68%] 2025-12-04T12:25:43.2742775Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_complex64 XFAIL [0.0055s] [ 68%] 2025-12-04T12:25:43.2742884Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_float32 PASSED [1.0820s] [ 68%] 2025-12-04T12:25:43.2742987Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_erfc_cuda_int8 XFAIL [0.0059s] [ 68%] 2025-12-04T12:25:43.2743098Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_complex64 PASSED [1.0975s] [ 68%] 2025-12-04T12:25:43.2743206Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_float16 PASSED [0.0148s] [ 68%] 2025-12-04T12:25:43.2743314Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_float32 PASSED [0.0144s] [ 68%] 2025-12-04T12:25:43.2743416Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_exp_cuda_int32 XFAIL [0.0056s] [ 68%] 2025-12-04T12:25:43.2743528Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_bfloat16 PASSED [1.0826s] [ 68%] 2025-12-04T12:25:43.2743634Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_int16 XFAIL [0.0059s] [ 68%] 2025-12-04T12:25:43.2743743Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_int64 XFAIL [1.0814s] [ 68%] 2025-12-04T12:25:43.2743848Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_expm1_cuda_uint8 XFAIL [1.0856s] [ 68%] 2025-12-04T12:25:43.2743954Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_bool XFAIL [1.0825s] [ 68%] 2025-12-04T12:25:43.2744062Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_float16 PASSED [1.0780s] [ 68%] 2025-12-04T12:25:43.2744181Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_floor_cuda_int16 PASSED [0.0147s] [ 68%] 2025-12-04T12:25:43.2744296Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_int16 XFAIL [0.0056s] [ 68%] 2025-12-04T12:25:43.2744401Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_int32 XFAIL [0.0055s] [ 68%] 2025-12-04T12:25:43.2744504Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_frac_cuda_int8 XFAIL [1.0780s] [ 68%] 2025-12-04T12:25:43.2744618Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_bool XFAIL [1.1018s] [ 68%] 2025-12-04T12:25:43.2744727Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_float64 PASSED [1.1883s] [ 68%] 2025-12-04T12:25:43.2744829Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_int8 XFAIL [0.0075s] [ 68%] 2025-12-04T12:25:43.2744933Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lerp_cuda_uint8 XFAIL [1.1333s] [ 68%] 2025-12-04T12:25:43.2745115Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_int32 SKIPPED [0.0002s] (In-place lgamma not supported for integral tensors) [ 68%] 2025-12-04T12:25:43.2745293Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_lgamma_cuda_int8 SKIPPED [0.0001s] (In-place lgamma not supported for integral tensors) [ 68%] 2025-12-04T12:25:43.2745406Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_bfloat16 PASSED [1.1040s] [ 68%] 2025-12-04T12:25:43.2745521Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log10_cuda_complex128 PASSED [0.0148s] [ 68%] 2025-12-04T12:25:43.2745626Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_bool XFAIL [0.0055s] [ 68%] 2025-12-04T12:25:43.2745739Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_complex64 PASSED [0.0144s] [ 68%] 2025-12-04T12:25:43.2745850Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_float32 PASSED [0.0142s] [ 68%] 2025-12-04T12:25:43.2745959Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log1p_cuda_int32 XFAIL [0.0054s] [ 68%] 2025-12-04T12:25:43.2746067Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_float16 PASSED [1.0937s] [ 68%] 2025-12-04T12:25:43.2746175Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_float32 PASSED [0.0148s] [ 68%] 2025-12-04T12:25:43.2746291Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log2_cuda_int32 XFAIL [0.0055s] [ 68%] 2025-12-04T12:25:43.2746403Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_complex128 PASSED [0.0145s] [ 68%] 2025-12-04T12:25:43.2746510Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_float16 PASSED [0.0144s] [ 68%] 2025-12-04T12:25:43.2746618Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_float32 PASSED [0.0143s] [ 68%] 2025-12-04T12:25:43.2746724Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_float64 PASSED [0.0142s] [ 68%] 2025-12-04T12:25:43.2746829Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_log_cuda_uint8 XFAIL [0.0054s] [ 68%] 2025-12-04T12:25:43.2746982Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_bool SKIPPED [1.0646s] (No inplace variable for this op) [ 68%] 2025-12-04T12:25:43.2747141Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_complex128 SKIPPED [0.0015s] (No inplace variable for this op) [ 68%] 2025-12-04T12:25:43.2747300Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 68%] 2025-12-04T12:25:43.2747455Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T12:25:43.2747606Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_max_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 68%] 2025-12-04T12:25:43.2747720Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_complex128 XFAIL [0.0074s] [ 68%] 2025-12-04T12:25:43.2747842Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_int16 PASSED [1.1721s] [ 68%] 2025-12-04T12:25:43.2747962Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_int64 PASSED [0.1016s] [ 68%] 2025-12-04T12:25:43.2748106Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_maximum_cuda_uint8 PASSED [0.1014s] [ 68%] 2025-12-04T12:25:43.2748215Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_bool XFAIL [0.0139s] [ 68%] 2025-12-04T12:25:43.2748347Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_complex64 XFAIL [1.0885s] [ 68%] 2025-12-04T12:25:43.2748460Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_float16 PASSED [1.2174s] [ 68%] 2025-12-04T12:25:43.2748572Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_float32 PASSED [0.1487s] [ 68%] 2025-12-04T12:25:43.2748681Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_int16 PASSED [0.1015s] [ 68%] 2025-12-04T12:25:43.2748795Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_minimum_cuda_uint8 PASSED [0.1014s] [ 68%] 2025-12-04T12:25:43.2748904Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_bfloat16 PASSED [0.0750s] [ 68%] 2025-12-04T12:25:43.2749006Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_bool XFAIL [0.0096s] [ 68%] 2025-12-04T12:25:43.2749112Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_int16 PASSED [1.1349s] [ 68%] 2025-12-04T12:25:43.2749219Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_mul_cuda_uint8 PASSED [0.0559s] [ 68%] 2025-12-04T12:25:43.2749330Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_complex128 PASSED [0.0146s] [ 68%] 2025-12-04T12:25:43.2749438Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_float16 PASSED [0.0146s] [ 68%] 2025-12-04T12:25:43.2749546Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_float64 PASSED [0.0143s] [ 68%] 2025-12-04T12:25:43.2749652Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_neg_cuda_int64 PASSED [0.0143s] [ 68%] 2025-12-04T12:25:43.2749812Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T12:25:43.2749967Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 68%] 2025-12-04T12:25:43.2750137Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_norm_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 68%] 2025-12-04T12:25:43.2750242Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_bool XFAIL [0.0063s] [ 68%] 2025-12-04T12:25:43.2750348Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_int16 PASSED [1.1330s] [ 68%] 2025-12-04T12:25:43.2750452Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_int32 PASSED [0.0568s] [ 68%] 2025-12-04T12:25:43.2750557Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_pow_cuda_int8 PASSED [0.0584s] [ 68%] 2025-12-04T12:25:43.2750668Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_bool XFAIL [0.0059s] [ 68%] 2025-12-04T12:25:43.2750788Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_float16 PASSED [0.0148s] [ 68%] 2025-12-04T12:25:43.2750901Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_int32 XFAIL [0.0054s] [ 69%] 2025-12-04T12:25:43.2751014Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_int64 XFAIL [0.0055s] [ 69%] 2025-12-04T12:25:43.2751126Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_reciprocal_cuda_uint8 XFAIL [1.0684s] [ 69%] 2025-12-04T12:25:43.2751237Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_bfloat16 PASSED [1.0812s] [ 69%] 2025-12-04T12:25:43.2751345Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_float16 PASSED [0.0145s] [ 69%] 2025-12-04T12:25:43.2751454Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_float32 PASSED [0.0142s] [ 69%] 2025-12-04T12:25:43.2751575Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_round_cuda_float64 PASSED [0.0140s] [ 69%] 2025-12-04T12:25:43.2751693Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_rsqrt_cuda_bool XFAIL [0.0055s] [ 69%] 2025-12-04T12:25:43.2751801Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_rsqrt_cuda_int64 XFAIL [0.0054s] [ 69%] 2025-12-04T12:25:43.2751905Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_rsqrt_cuda_int8 XFAIL [1.0977s] [ 69%] 2025-12-04T12:25:43.2752022Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_rsqrt_cuda_uint8 XFAIL [1.0911s] [ 69%] 2025-12-04T12:25:43.2752135Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_bfloat16 PASSED [1.1086s] [ 69%] 2025-12-04T12:25:43.2752251Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_complex64 PASSED [0.0230s] [ 69%] 2025-12-04T12:25:43.2752363Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_float16 PASSED [0.0228s] [ 69%] 2025-12-04T12:25:43.2752473Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_int64 XFAIL [0.0056s] [ 69%] 2025-12-04T12:25:43.2752581Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_int8 XFAIL [0.0054s] [ 69%] 2025-12-04T12:25:43.2752690Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sigmoid_cuda_uint8 XFAIL [1.0726s] [ 69%] 2025-12-04T12:25:43.2752797Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sign_cuda_bfloat16 PASSED [1.0718s] [ 69%] 2025-12-04T12:25:43.2752906Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sign_cuda_float32 PASSED [0.0147s] [ 69%] 2025-12-04T12:25:43.2753016Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_complex128 PASSED [0.0145s] [ 69%] 2025-12-04T12:25:43.2753124Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_float16 PASSED [0.0146s] [ 69%] 2025-12-04T12:25:43.2753231Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_float32 PASSED [0.0144s] [ 69%] 2025-12-04T12:25:43.2753335Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_int32 XFAIL [0.0056s] [ 69%] 2025-12-04T12:25:43.2753438Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sin_cuda_uint8 XFAIL [1.0579s] [ 69%] 2025-12-04T12:25:43.2753547Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_bfloat16 PASSED [1.0668s] [ 69%] 2025-12-04T12:25:43.2753664Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_bool XFAIL [0.0059s] [ 69%] 2025-12-04T12:25:43.2753767Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sinh_cuda_int8 XFAIL [1.0702s] [ 69%] 2025-12-04T12:25:43.2753879Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_complex128 PASSED [1.0839s] [ 69%] 2025-12-04T12:25:43.2753987Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_float64 PASSED [0.0148s] [ 69%] 2025-12-04T12:25:43.2754091Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_int16 XFAIL [0.0057s] [ 69%] 2025-12-04T12:25:43.2754197Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sqrt_cuda_int64 XFAIL [0.0055s] [ 69%] 2025-12-04T12:25:43.2754308Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_bfloat16 XFAIL [1.0882s] [ 69%] 2025-12-04T12:25:43.2754417Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_complex128 XFAIL [0.0160s] [ 69%] 2025-12-04T12:25:43.2754521Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_int8 XFAIL [1.0978s] [ 69%] 2025-12-04T12:25:43.2754624Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_sub_cuda_uint8 XFAIL [1.0955s] [ 69%] 2025-12-04T12:25:43.2754732Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_float16 PASSED [1.0935s] [ 69%] 2025-12-04T12:25:43.2754838Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_float64 PASSED [0.0146s] [ 69%] 2025-12-04T12:25:43.2754940Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tan_cuda_int16 XFAIL [0.0055s] [ 69%] 2025-12-04T12:25:43.2755058Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_bool XFAIL [0.0054s] [ 69%] 2025-12-04T12:25:43.2755180Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_complex128 PASSED [1.0953s] [ 69%] 2025-12-04T12:25:43.2755288Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_float32 PASSED [0.0148s] [ 69%] 2025-12-04T12:25:43.2755391Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_tanh_cuda_int8 XFAIL [0.0057s] [ 69%] 2025-12-04T12:25:43.2755499Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_float64 PASSED [0.0144s] [ 69%] 2025-12-04T12:25:43.2755619Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_int16 PASSED [0.0142s] [ 69%] 2025-12-04T12:25:43.2755725Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_trunc_cuda_int64 PASSED [0.0142s] [ 69%] 2025-12-04T12:25:43.2755830Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_bool PASSED [0.0122s] [ 69%] 2025-12-04T12:25:43.2755941Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_complex64 PASSED [0.0123s] [ 69%] 2025-12-04T12:25:43.2756049Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_float16 PASSED [0.0122s] [ 69%] 2025-12-04T12:25:43.2756157Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_int16 PASSED [0.0122s] [ 69%] 2025-12-04T12:25:43.2756267Z test_meta.py::TestMetaCUDA::test_meta_inplace__foreach_zero_cuda_int32 PASSED [0.0122s] [ 69%] 2025-12-04T12:25:43.2756441Z test_meta.py::TestMetaCUDA::test_meta_inplace__softmax_backward_data_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2756611Z test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2756776Z test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2756940Z test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2757101Z test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2757288Z test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2757487Z test_meta.py::TestMetaCUDA::test_meta_inplace__unsafe_masked_index_put_accumulate_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2757657Z test_meta.py::TestMetaCUDA::test_meta_inplace__upsample_bilinear2d_aa_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2757758Z test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_complex128 XFAIL [0.0028s] [ 69%] 2025-12-04T12:25:43.2757853Z test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_int16 PASSED [2.1931s] [ 69%] 2025-12-04T12:25:43.2757948Z test_meta.py::TestMetaCUDA::test_meta_inplace_abs_cuda_int32 PASSED [0.0040s] [ 69%] 2025-12-04T12:25:43.2758047Z test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_bfloat16 PASSED [0.0037s] [ 69%] 2025-12-04T12:25:43.2758276Z test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 69%] 2025-12-04T12:25:43.2758477Z test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 69%] 2025-12-04T12:25:43.2758677Z test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 69%] 2025-12-04T12:25:43.2758777Z test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_float32 PASSED [0.0035s] [ 69%] 2025-12-04T12:25:43.2758980Z test_meta.py::TestMetaCUDA::test_meta_inplace_acos_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 69%] 2025-12-04T12:25:43.2759093Z test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_bfloat16 PASSED [0.0035s] [ 69%] 2025-12-04T12:25:43.2759296Z test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 69%] 2025-12-04T12:25:43.2759513Z test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 69%] 2025-12-04T12:25:43.2759612Z test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_float16 PASSED [1.0629s] [ 69%] 2025-12-04T12:25:43.2759807Z test_meta.py::TestMetaCUDA::test_meta_inplace_acosh_cuda_int64 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 69%] 2025-12-04T12:25:43.2759908Z test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_complex128 PASSED [0.0074s] [ 69%] 2025-12-04T12:25:43.2760009Z test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_float32 PASSED [0.0061s] [ 69%] 2025-12-04T12:25:43.2760104Z test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_int16 PASSED [0.0059s] [ 69%] 2025-12-04T12:25:43.2760196Z test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_int32 PASSED [0.0058s] [ 69%] 2025-12-04T12:25:43.2760288Z test_meta.py::TestMetaCUDA::test_meta_inplace_add_cuda_int64 PASSED [0.0058s] [ 69%] 2025-12-04T12:25:43.2760392Z test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_bfloat16 PASSED [1.0773s] [ 69%] 2025-12-04T12:25:43.2760495Z test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_complex64 PASSED [0.0066s] [ 69%] 2025-12-04T12:25:43.2760593Z test_meta.py::TestMetaCUDA::test_meta_inplace_addbmm_cuda_float16 PASSED [0.0045s] [ 69%] 2025-12-04T12:25:43.2760694Z test_meta.py::TestMetaCUDA::test_meta_inplace_addcdiv_cuda_bfloat16 PASSED [0.0077s] [ 69%] 2025-12-04T12:25:43.2760794Z test_meta.py::TestMetaCUDA::test_meta_inplace_addcdiv_cuda_float16 PASSED [0.0073s] [ 69%] 2025-12-04T12:25:43.2760894Z test_meta.py::TestMetaCUDA::test_meta_inplace_addcdiv_cuda_float64 PASSED [0.0073s] [ 69%] 2025-12-04T12:25:43.2760997Z test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_bfloat16 PASSED [0.0073s] [ 69%] 2025-12-04T12:25:43.2761115Z test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_complex64 PASSED [0.0074s] [ 69%] 2025-12-04T12:25:43.2761215Z test_meta.py::TestMetaCUDA::test_meta_inplace_addcmul_cuda_float32 PASSED [0.0073s] [ 69%] 2025-12-04T12:25:43.2761315Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_cuda_bfloat16 PASSED [0.0052s] [ 69%] 2025-12-04T12:25:43.2761414Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_cuda_complex64 PASSED [0.0059s] [ 69%] 2025-12-04T12:25:43.2761511Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_cuda_float16 PASSED [0.0050s] [ 69%] 2025-12-04T12:25:43.2761607Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_cuda_float64 PASSED [0.0049s] [ 69%] 2025-12-04T12:25:43.2761729Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_decomposed_cuda_complex128 PASSED [0.0058s] [ 69%] 2025-12-04T12:25:43.2761848Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmm_decomposed_cuda_complex64 PASSED [0.0057s] [ 69%] 2025-12-04T12:25:43.2761949Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_bfloat16 PASSED [0.0045s] [ 69%] 2025-12-04T12:25:43.2762046Z test_meta.py::TestMetaCUDA::test_meta_inplace_addmv_cuda_float16 PASSED [0.0045s] [ 69%] 2025-12-04T12:25:43.2762141Z test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_bool PASSED [1.0857s] [ 69%] 2025-12-04T12:25:43.2762240Z test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_complex128 PASSED [0.0059s] [ 69%] 2025-12-04T12:25:43.2762337Z test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_float64 PASSED [0.0047s] [ 69%] 2025-12-04T12:25:43.2762432Z test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_int16 PASSED [0.0041s] [ 69%] 2025-12-04T12:25:43.2762527Z test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_int32 PASSED [1.0824s] [ 69%] 2025-12-04T12:25:43.2762634Z test_meta.py::TestMetaCUDA::test_meta_inplace_addr_cuda_uint8 PASSED [0.0061s] [ 69%] 2025-12-04T12:25:43.2762803Z test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2762959Z test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_complex32 SKIPPED [0.0011s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2763118Z test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2763265Z test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2763411Z test_meta.py::TestMetaCUDA::test_meta_inplace_alias_copy_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2763551Z test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2763690Z test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2763826Z test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2763964Z test_meta.py::TestMetaCUDA::test_meta_inplace_all_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2764117Z test_meta.py::TestMetaCUDA::test_meta_inplace_allclose_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2764263Z test_meta.py::TestMetaCUDA::test_meta_inplace_allclose_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2764411Z test_meta.py::TestMetaCUDA::test_meta_inplace_allclose_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2764554Z test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2764696Z test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2764846Z test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2764986Z test_meta.py::TestMetaCUDA::test_meta_inplace_amax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2765123Z test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2765265Z test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2765401Z test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2765540Z test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2765678Z test_meta.py::TestMetaCUDA::test_meta_inplace_amin_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2765819Z test_meta.py::TestMetaCUDA::test_meta_inplace_aminmax_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2765966Z test_meta.py::TestMetaCUDA::test_meta_inplace_angle_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2766104Z test_meta.py::TestMetaCUDA::test_meta_inplace_angle_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2766239Z test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2766379Z test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2766529Z test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2766675Z test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2766813Z test_meta.py::TestMetaCUDA::test_meta_inplace_any_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2766968Z test_meta.py::TestMetaCUDA::test_meta_inplace_arange_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2767115Z test_meta.py::TestMetaCUDA::test_meta_inplace_arange_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2767255Z test_meta.py::TestMetaCUDA::test_meta_inplace_arange_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2767408Z test_meta.py::TestMetaCUDA::test_meta_inplace_argmax_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2767551Z test_meta.py::TestMetaCUDA::test_meta_inplace_argmax_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2767692Z test_meta.py::TestMetaCUDA::test_meta_inplace_argmax_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2767834Z test_meta.py::TestMetaCUDA::test_meta_inplace_argmax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2767978Z test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 69%] 2025-12-04T12:25:43.2768160Z test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2768299Z test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2768441Z test_meta.py::TestMetaCUDA::test_meta_inplace_argmin_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2768586Z test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2768731Z test_meta.py::TestMetaCUDA::test_meta_inplace_argsort_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2768897Z test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2769045Z test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2769192Z test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2769341Z test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2769494Z test_meta.py::TestMetaCUDA::test_meta_inplace_argwhere_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2769658Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2769814Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2769969Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2770121Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2770248Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_partial_views_cuda_complex128 XFAIL [0.0041s] [ 70%] 2025-12-04T12:25:43.2770427Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_bfloat16 SKIPPED [1.0804s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2770596Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_bool SKIPPED [0.0015s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2770761Z test_meta.py::TestMetaCUDA::test_meta_inplace_as_strided_scatter_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2770975Z test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_complex128 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T12:25:43.2771076Z test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_float16 PASSED [1.0750s] [ 70%] 2025-12-04T12:25:43.2771174Z test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_float64 PASSED [0.0041s] [ 70%] 2025-12-04T12:25:43.2771367Z test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T12:25:43.2771554Z test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T12:25:43.2771748Z test_meta.py::TestMetaCUDA::test_meta_inplace_asin_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T12:25:43.2771954Z test_meta.py::TestMetaCUDA::test_meta_inplace_asinh_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T12:25:43.2772146Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T12:25:43.2772337Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan2_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T12:25:43.2772535Z test_meta.py::TestMetaCUDA::test_meta_inplace_atan_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T12:25:43.2772638Z test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_bfloat16 PASSED [1.0787s] [ 70%] 2025-12-04T12:25:43.2772838Z test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_bool SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T12:25:43.2772940Z test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_float64 PASSED [1.0822s] [ 70%] 2025-12-04T12:25:43.2773131Z test_meta.py::TestMetaCUDA::test_meta_inplace_atanh_cuda_int16 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 70%] 2025-12-04T12:25:43.2773287Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_bfloat16 SKIPPED [0.0014s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2773434Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2773581Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_1d_cuda_int8 SKIPPED [0.0012s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2773732Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2773883Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2774033Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2774179Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2774324Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_2d_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2774489Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2774652Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2774804Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2774963Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2775110Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2775457Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2775643Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2775807Z test_meta.py::TestMetaCUDA::test_meta_inplace_atleast_3d_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2775934Z test_meta.py::TestMetaCUDA::test_meta_inplace_baddbmm_cuda_complex64 PASSED [0.0076s] [ 70%] 2025-12-04T12:25:43.2776087Z test_meta.py::TestMetaCUDA::test_meta_inplace_bernoulli_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2776239Z test_meta.py::TestMetaCUDA::test_meta_inplace_bernoulli_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2776417Z test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2776563Z test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2776711Z test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2776859Z test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2777017Z test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2777163Z test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2777305Z test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2777451Z test_meta.py::TestMetaCUDA::test_meta_inplace_bfloat16_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2777593Z test_meta.py::TestMetaCUDA::test_meta_inplace_bincount_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2777701Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_and_cuda_int16 PASSED [1.0857s] [ 70%] 2025-12-04T12:25:43.2777806Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_and_cuda_int32 PASSED [0.0075s] [ 70%] 2025-12-04T12:25:43.2777912Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_and_cuda_int64 PASSED [0.0057s] [ 70%] 2025-12-04T12:25:43.2778026Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_left_shift_cuda_int8 PASSED [0.0055s] [ 70%] 2025-12-04T12:25:43.2778182Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_not_cuda_bool PASSED [0.0034s] [ 70%] 2025-12-04T12:25:43.2778285Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_not_cuda_int32 PASSED [1.0730s] [ 70%] 2025-12-04T12:25:43.2778389Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_not_cuda_int8 PASSED [0.0053s] [ 70%] 2025-12-04T12:25:43.2778491Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_or_cuda_bool PASSED [0.0060s] [ 70%] 2025-12-04T12:25:43.2778608Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_or_cuda_uint8 PASSED [0.0056s] [ 70%] 2025-12-04T12:25:43.2778741Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_right_shift_cuda_int16 PASSED [0.0055s] [ 70%] 2025-12-04T12:25:43.2778856Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_right_shift_cuda_int32 PASSED [0.0055s] [ 70%] 2025-12-04T12:25:43.2778972Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_right_shift_cuda_int8 PASSED [0.0054s] [ 70%] 2025-12-04T12:25:43.2779091Z test_meta.py::TestMetaCUDA::test_meta_inplace_bitwise_xor_cuda_int16 PASSED [0.0054s] [ 70%] 2025-12-04T12:25:43.2779247Z test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2779400Z test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2779548Z test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2779694Z test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2779843Z test_meta.py::TestMetaCUDA::test_meta_inplace_block_diag_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2779985Z test_meta.py::TestMetaCUDA::test_meta_inplace_bmm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2780132Z test_meta.py::TestMetaCUDA::test_meta_inplace_bmm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2780274Z test_meta.py::TestMetaCUDA::test_meta_inplace_bmm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2780415Z test_meta.py::TestMetaCUDA::test_meta_inplace_bmm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2780557Z test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2780698Z test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2780845Z test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2781004Z test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2781147Z test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2781283Z test_meta.py::TestMetaCUDA::test_meta_inplace_bool_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2781446Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_shapes_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2781611Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2781772Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2781929Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2782089Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_tensors_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2782243Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2782395Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2782561Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2782723Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2782876Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2783041Z test_meta.py::TestMetaCUDA::test_meta_inplace_broadcast_to_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2783192Z test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2783344Z test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2783492Z test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2783638Z test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2783783Z test_meta.py::TestMetaCUDA::test_meta_inplace_bucketize_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2783927Z test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2784065Z test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2784206Z test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2784344Z test_meta.py::TestMetaCUDA::test_meta_inplace_byte_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2784501Z test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2784656Z test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2784820Z test_meta.py::TestMetaCUDA::test_meta_inplace_cartesian_prod_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2784964Z test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2785109Z test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2785248Z test_meta.py::TestMetaCUDA::test_meta_inplace_cat_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2785352Z test_meta.py::TestMetaCUDA::test_meta_inplace_cauchy_cuda_bfloat16 PASSED [0.0045s] [ 70%] 2025-12-04T12:25:43.2785453Z test_meta.py::TestMetaCUDA::test_meta_inplace_cauchy_cuda_float16 PASSED [0.0043s] [ 70%] 2025-12-04T12:25:43.2785596Z test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2785746Z test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2785894Z test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2786037Z test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2786180Z test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2786320Z test_meta.py::TestMetaCUDA::test_meta_inplace_cdouble_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2786432Z test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_bfloat16 PASSED [1.0736s] [ 70%] 2025-12-04T12:25:43.2786541Z test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_float64 PASSED [0.0043s] [ 70%] 2025-12-04T12:25:43.2786637Z test_meta.py::TestMetaCUDA::test_meta_inplace_ceil_cuda_int64 PASSED [1.0752s] [ 70%] 2025-12-04T12:25:43.2786788Z test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_complex64 SKIPPED [0.0016s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2786941Z test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_int16 SKIPPED [0.0013s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2787083Z test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2787224Z test_meta.py::TestMetaCUDA::test_meta_inplace_cfloat_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2787371Z test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2787515Z test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2787654Z test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2787796Z test_meta.py::TestMetaCUDA::test_meta_inplace_chalf_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2787942Z test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2788116Z test_meta.py::TestMetaCUDA::test_meta_inplace_char_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2788265Z test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2788431Z test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_inverse_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 70%] 2025-12-04T12:25:43.2788591Z test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_solve_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2788761Z test_meta.py::TestMetaCUDA::test_meta_inplace_cholesky_solve_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2788908Z test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2789055Z test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2789198Z test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2789339Z test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2789478Z test_meta.py::TestMetaCUDA::test_meta_inplace_chunk_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2789580Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_float32 PASSED [0.0094s] [ 71%] 2025-12-04T12:25:43.2789682Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_float64 PASSED [0.0082s] [ 71%] 2025-12-04T12:25:43.2789781Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_int32 PASSED [0.0081s] [ 71%] 2025-12-04T12:25:43.2789878Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_cuda_int8 PASSED [0.0081s] [ 71%] 2025-12-04T12:25:43.2789987Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_bfloat16 PASSED [1.0793s] [ 71%] 2025-12-04T12:25:43.2790086Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_bool PASSED [0.0080s] [ 71%] 2025-12-04T12:25:43.2790192Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_float16 PASSED [1.0768s] [ 71%] 2025-12-04T12:25:43.2790308Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_float32 PASSED [0.0083s] [ 71%] 2025-12-04T12:25:43.2790422Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_int64 PASSED [1.0836s] [ 71%] 2025-12-04T12:25:43.2790520Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_max_cuda_int8 PASSED [0.0082s] [ 71%] 2025-12-04T12:25:43.2790623Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_int16 PASSED [1.0721s] [ 71%] 2025-12-04T12:25:43.2790735Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_int64 PASSED [0.0086s] [ 71%] 2025-12-04T12:25:43.2790834Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_int8 PASSED [1.0945s] [ 71%] 2025-12-04T12:25:43.2790932Z test_meta.py::TestMetaCUDA::test_meta_inplace_clamp_min_cuda_uint8 PASSED [0.0086s] [ 71%] 2025-12-04T12:25:43.2791081Z test_meta.py::TestMetaCUDA::test_meta_inplace_clone_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2791226Z test_meta.py::TestMetaCUDA::test_meta_inplace_clone_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2791368Z test_meta.py::TestMetaCUDA::test_meta_inplace_clone_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2791526Z test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2791683Z test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2791836Z test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2791986Z test_meta.py::TestMetaCUDA::test_meta_inplace_column_stack_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2792144Z test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2792294Z test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2792452Z test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2792617Z test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2792773Z test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2792923Z test_meta.py::TestMetaCUDA::test_meta_inplace_combinations_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2793070Z test_meta.py::TestMetaCUDA::test_meta_inplace_complex_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2793213Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2793352Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2793499Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2793644Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2793786Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2793922Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2794062Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2794182Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_bool PASSED [1.0876s] [ 71%] 2025-12-04T12:25:43.2794304Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_float64 PASSED [0.0039s] [ 71%] 2025-12-04T12:25:43.2794413Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_int64 PASSED [1.0754s] [ 71%] 2025-12-04T12:25:43.2794520Z test_meta.py::TestMetaCUDA::test_meta_inplace_conj_physical_cuda_int8 PASSED [0.0040s] [ 71%] 2025-12-04T12:25:43.2794689Z test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2794841Z test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_bool SKIPPED [0.0012s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2795000Z test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2795152Z test_meta.py::TestMetaCUDA::test_meta_inplace_constant_pad_nd_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2795300Z test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2795450Z test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2795598Z test_meta.py::TestMetaCUDA::test_meta_inplace_contiguous_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2795705Z test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_bfloat16 PASSED [1.0792s] [ 71%] 2025-12-04T12:25:43.2795906Z test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2796105Z test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_int32 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2796303Z test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2796511Z test_meta.py::TestMetaCUDA::test_meta_inplace_copysign_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2796665Z test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2796812Z test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2796960Z test_meta.py::TestMetaCUDA::test_meta_inplace_corrcoef_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2797157Z test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2797354Z test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_complex32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2797545Z test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2797735Z test_meta.py::TestMetaCUDA::test_meta_inplace_cos_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2797837Z test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_bfloat16 PASSED [0.0044s] [ 71%] 2025-12-04T12:25:43.2798034Z test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2798200Z test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_float16 PASSED [0.0035s] [ 71%] 2025-12-04T12:25:43.2798324Z test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_float64 PASSED [1.0783s] [ 71%] 2025-12-04T12:25:43.2798512Z test_meta.py::TestMetaCUDA::test_meta_inplace_cosh_cuda_int8 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2798681Z test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_int16 SKIPPED [0.0012s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2798833Z test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2798984Z test_meta.py::TestMetaCUDA::test_meta_inplace_count_nonzero_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2799130Z test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2799270Z test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2799410Z test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2799547Z test_meta.py::TestMetaCUDA::test_meta_inplace_cov_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2799697Z test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2799843Z test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2799981Z test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2800123Z test_meta.py::TestMetaCUDA::test_meta_inplace_cross_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2800267Z test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2800412Z test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2800570Z test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2800714Z test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2800855Z test_meta.py::TestMetaCUDA::test_meta_inplace_cummax_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2801000Z test_meta.py::TestMetaCUDA::test_meta_inplace_cummin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2801141Z test_meta.py::TestMetaCUDA::test_meta_inplace_cummin_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2801265Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumprod_cuda_complex128 SKIPPED [0.0009s] (Skipped) [ 71%] 2025-12-04T12:25:43.2801382Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_bfloat16 SKIPPED [0.0009s] (Skipped) [ 71%] 2025-12-04T12:25:43.2801500Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_float16 SKIPPED [0.0010s] (Skipped) [ 71%] 2025-12-04T12:25:43.2801614Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_float64 SKIPPED [0.0009s] (Skipped) [ 71%] 2025-12-04T12:25:43.2801730Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_int64 SKIPPED [0.0009s] (Skipped) [ 71%] 2025-12-04T12:25:43.2801841Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumsum_cuda_int8 SKIPPED [0.0009s] (Skipped) [ 71%] 2025-12-04T12:25:43.2802013Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2802193Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2802369Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2802533Z test_meta.py::TestMetaCUDA::test_meta_inplace_cumulative_trapezoid_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2802742Z test_meta.py::TestMetaCUDA::test_meta_inplace_deg2rad_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2802882Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2803025Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2803165Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2803303Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2803444Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2803588Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2803740Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2803886Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2804032Z test_meta.py::TestMetaCUDA::test_meta_inplace_diag_embed_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2804183Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2804330Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2804488Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2804632Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2804777Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagflat_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2804934Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2805092Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2805249Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2805404Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2805557Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2805712Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2805862Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_copy_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2806024Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2806183Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2806330Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2806477Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2806640Z test_meta.py::TestMetaCUDA::test_meta_inplace_diagonal_scatter_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2806785Z test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2806922Z test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2807062Z test_meta.py::TestMetaCUDA::test_meta_inplace_diff_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2807258Z test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2807363Z test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_float16 PASSED [0.0044s] [ 71%] 2025-12-04T12:25:43.2807467Z test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_float32 PASSED [1.0814s] [ 71%] 2025-12-04T12:25:43.2807666Z test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2807861Z test_meta.py::TestMetaCUDA::test_meta_inplace_digamma_cuda_int64 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 71%] 2025-12-04T12:25:43.2808006Z test_meta.py::TestMetaCUDA::test_meta_inplace_dist_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2808324Z test_meta.py::TestMetaCUDA::test_meta_inplace_dist_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 71%] 2025-12-04T12:25:43.2808439Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_float16 PASSED [0.0132s] [ 71%] 2025-12-04T12:25:43.2808576Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_int32 PASSED [0.0080s] [ 71%] 2025-12-04T12:25:43.2808692Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_int64 PASSED [0.0078s] [ 72%] 2025-12-04T12:25:43.2808804Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_floor_rounding_cuda_uint8 PASSED [1.0914s] [ 72%] 2025-12-04T12:25:43.2809013Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_bool SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2809225Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_int16 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2809435Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_no_rounding_mode_cuda_int32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2809554Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_trunc_rounding_cuda_float64 PASSED [1.4279s] [ 72%] 2025-12-04T12:25:43.2809667Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_trunc_rounding_cuda_int16 PASSED [0.0082s] [ 72%] 2025-12-04T12:25:43.2809782Z test_meta.py::TestMetaCUDA::test_meta_inplace_div_trunc_rounding_cuda_int8 PASSED [0.0061s] [ 72%] 2025-12-04T12:25:43.2809929Z test_meta.py::TestMetaCUDA::test_meta_inplace_dot_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2810076Z test_meta.py::TestMetaCUDA::test_meta_inplace_dot_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2810236Z test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2810399Z test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2810542Z test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2810696Z test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2810837Z test_meta.py::TestMetaCUDA::test_meta_inplace_double_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2810985Z test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2811134Z test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2811279Z test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2811424Z test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2811568Z test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2811712Z test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2811851Z test_meta.py::TestMetaCUDA::test_meta_inplace_dsplit_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2812001Z test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2812147Z test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2812288Z test_meta.py::TestMetaCUDA::test_meta_inplace_dstack_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2812450Z test_meta.py::TestMetaCUDA::test_meta_inplace_einsum_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2812600Z test_meta.py::TestMetaCUDA::test_meta_inplace_einsum_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2812747Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2812894Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2813039Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2813192Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2813348Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2813503Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2813652Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_like_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2813810Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2813968Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2814137Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2814302Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_permuted_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2814458Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2814619Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2814773Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2814927Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2815079Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2815228Z test_meta.py::TestMetaCUDA::test_meta_inplace_empty_strided_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2815324Z test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_int64 PASSED [0.0061s] [ 72%] 2025-12-04T12:25:43.2815420Z test_meta.py::TestMetaCUDA::test_meta_inplace_eq_cuda_int8 PASSED [0.0058s] [ 72%] 2025-12-04T12:25:43.2815561Z test_meta.py::TestMetaCUDA::test_meta_inplace_equal_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2815705Z test_meta.py::TestMetaCUDA::test_meta_inplace_equal_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2815846Z test_meta.py::TestMetaCUDA::test_meta_inplace_equal_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2815984Z test_meta.py::TestMetaCUDA::test_meta_inplace_equal_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2816086Z test_meta.py::TestMetaCUDA::test_meta_inplace_erf_cuda_bfloat16 PASSED [1.6605s] [ 72%] 2025-12-04T12:25:43.2816186Z test_meta.py::TestMetaCUDA::test_meta_inplace_erf_cuda_float64 PASSED [0.0046s] [ 72%] 2025-12-04T12:25:43.2816390Z test_meta.py::TestMetaCUDA::test_meta_inplace_erf_cuda_int32 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2816492Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_bfloat16 PASSED [1.8026s] [ 72%] 2025-12-04T12:25:43.2816591Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_float32 PASSED [0.0052s] [ 72%] 2025-12-04T12:25:43.2816785Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2816976Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2817167Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfc_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2817271Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_bfloat16 PASSED [0.0028s] [ 72%] 2025-12-04T12:25:43.2817374Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_float16 PASSED [1.6659s] [ 72%] 2025-12-04T12:25:43.2817473Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_float32 PASSED [0.0046s] [ 72%] 2025-12-04T12:25:43.2817572Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_float64 PASSED [1.6650s] [ 72%] 2025-12-04T12:25:43.2817768Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_int16 SKIPPED [0.0017s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2817974Z test_meta.py::TestMetaCUDA::test_meta_inplace_erfinv_cuda_int64 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2818223Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp2_cuda_bool SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2818422Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_complex64 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2818625Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2818815Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2819000Z test_meta.py::TestMetaCUDA::test_meta_inplace_exp_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2819158Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2819306Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_as_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2819463Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2819617Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2819761Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2819912Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_copy_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2820053Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2820205Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2820368Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2820514Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2820657Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2820799Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2820939Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2821083Z test_meta.py::TestMetaCUDA::test_meta_inplace_expand_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2821186Z test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_bfloat16 PASSED [1.6710s] [ 72%] 2025-12-04T12:25:43.2821379Z test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_bool SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2821581Z test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_complex64 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2821774Z test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_int16 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2821967Z test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_int64 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2822169Z test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2822376Z test_meta.py::TestMetaCUDA::test_meta_inplace_expm1_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 72%] 2025-12-04T12:25:43.2822488Z test_meta.py::TestMetaCUDA::test_meta_inplace_exponential_cuda_float64 PASSED [1.6543s] [ 72%] 2025-12-04T12:25:43.2822645Z test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_complex128 SKIPPED [0.0016s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2822792Z test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2822944Z test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_float8_e4m3fnuz SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2823085Z test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2823223Z test_meta.py::TestMetaCUDA::test_meta_inplace_eye_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2823370Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2823522Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2823671Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2823813Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2823958Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft2_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2824098Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2824248Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2824403Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2824547Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fft_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2824696Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2824840Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2824985Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftn_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2825145Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2825303Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2825462Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2825610Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_fftshift_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2825762Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2825915Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2826072Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2826224Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft2_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2826365Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_bool SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2826523Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2826666Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfft_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2826816Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2826965Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2827109Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2827256Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2827397Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2827546Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_hfftn_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2827692Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2827837Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft2_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2827978Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2828172Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2828338Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2828481Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifft_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2828628Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2828773Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2828920Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftn_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2829078Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2829235Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2829467Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2829622Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 72%] 2025-12-04T12:25:43.2829770Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ifftshift_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2829919Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft2_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2830088Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft2_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2830251Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft2_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2830397Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft2_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2830562Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfft_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2830707Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2830857Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2831004Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2831149Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2831294Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2831440Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_ihfftn_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2831585Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2831733Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2831878Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2832022Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft2_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2832171Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2832339Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2832492Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2832638Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2832786Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2832930Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2833081Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2833226Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2833370Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfft_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2833524Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2833676Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2833828Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2833983Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2834141Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_irfftn_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2834285Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft2_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2834443Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2834588Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2834734Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2834875Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfft_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2835021Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2835167Z test_meta.py::TestMetaCUDA::test_meta_inplace_fft_rfftn_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2835268Z test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_float32 PASSED [1.6480s] [ 73%] 2025-12-04T12:25:43.2835369Z test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_float64 PASSED [0.0049s] [ 73%] 2025-12-04T12:25:43.2835531Z test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_int64 PASSED [0.0035s] [ 73%] 2025-12-04T12:25:43.2835627Z test_meta.py::TestMetaCUDA::test_meta_inplace_fill_cuda_int8 PASSED [1.6680s] [ 73%] 2025-12-04T12:25:43.2835771Z test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_bool SKIPPED [0.0016s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2835920Z test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_float16 SKIPPED [0.0013s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2836066Z test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2836223Z test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2836367Z test_meta.py::TestMetaCUDA::test_meta_inplace_flatten_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2836513Z test_meta.py::TestMetaCUDA::test_meta_inplace_flip_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2836652Z test_meta.py::TestMetaCUDA::test_meta_inplace_flip_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2836793Z test_meta.py::TestMetaCUDA::test_meta_inplace_flip_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2836943Z test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2837095Z test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2837239Z test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2837385Z test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2837525Z test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2837667Z test_meta.py::TestMetaCUDA::test_meta_inplace_fliplr_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2837815Z test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2837967Z test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2838167Z test_meta.py::TestMetaCUDA::test_meta_inplace_flipud_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2838316Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2838482Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2838626Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2838767Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 73%] 2025-12-04T12:25:43.2838980Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_complex128 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 73%] 2025-12-04T12:25:43.2839091Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_float16 XFAIL [0.0034s] [ 73%] 2025-12-04T12:25:43.2839199Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_float32 XFAIL [0.0029s] [ 73%] 2025-12-04T12:25:43.2839402Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_int32 SKIPPED [1.6794s] (Op promotes to float, which is impossible for inplace with non-float input) [ 73%] 2025-12-04T12:25:43.2839606Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_int64 SKIPPED [0.0016s] (Op promotes to float, which is impossible for inplace with non-float input) [ 73%] 2025-12-04T12:25:43.2839811Z test_meta.py::TestMetaCUDA::test_meta_inplace_float_power_cuda_int8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 73%] 2025-12-04T12:25:43.2839930Z test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_bfloat16 PASSED [0.0038s] [ 73%] 2025-12-04T12:25:43.2840008Z test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_float16 2025-12-04T12:25:43.2840013Z 2025-12-04T12:25:43.2840192Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_meta/test_meta-aadb16c8e7d2ceb7.xml - 2025-12-04T12:25:43.2840271Z !!!!!!!!!!!!!!!!!!!!!!!!!!!!!! KeyboardInterrupt !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T12:25:43.2840434Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py:2653: KeyboardInterrupt 2025-12-04T12:25:43.2840516Z (to show a full traceback on KeyboardInterrupt use --full-trace) 2025-12-04T12:25:43.2840598Z ========= 5866 passed, 3923 skipped, 289 xfailed in 1794.67s (0:29:54) ========= 2025-12-04T12:25:43.2840646Z Command took >30min, returning 124 2025-12-04T12:25:43.2840686Z Got exit code 124 2025-12-04T12:25:43.2840729Z Retrying single test... 2025-12-04T12:25:43.2840860Z Test results will be stored in test-reports/python-pytest/test_meta/test_meta-0ab73ce12fc5c265.xml 2025-12-04T12:25:43.2840921Z ============================= test session starts ============================== 2025-12-04T12:25:43.2841039Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:25:43.2841081Z cachedir: .pytest_cache 2025-12-04T12:25:43.2841250Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:25:43.2841303Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:25:43.2841347Z configfile: pytest.ini 2025-12-04T12:25:43.2841516Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:25:43.2841766Z collecting ... /var/lib/jenkins/pytorch/test/test_meta.py:0: PytestCollectionWarning: cannot collect test class 'TestExpect' because it has a __new__ constructor (from: test/test_meta.py) 2025-12-04T12:25:43.2841852Z collected 40725 items / 13710 deselected / 27015 selected 2025-12-04T12:25:43.2842029Z stepcurrent: skipping 10078 already run items. Running only test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_float16 2025-12-04T12:25:43.2842075Z Running 1 items in this shard 2025-12-04T12:25:43.2842079Z 2025-12-04T12:25:43.2842184Z test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_float16 PASSED [0.0899s] [100%] 2025-12-04T12:25:43.2842186Z 2025-12-04T12:25:43.2842368Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_meta/test_meta-0ab73ce12fc5c265.xml - 2025-12-04T12:25:43.2842437Z ===================== 1 passed, 13710 deselected in 1.66s ====================== 2025-12-04T12:25:43.2842476Z Got exit code 0 2025-12-04T12:25:43.2842560Z Test succeeded in new process, continuing with the rest of the tests 2025-12-04T12:25:43.2842685Z Test results will be stored in test-reports/python-pytest/test_meta/test_meta-2408f44677edc4c0.xml 2025-12-04T12:25:43.2842747Z ============================= test session starts ============================== 2025-12-04T12:25:43.2842859Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T12:25:43.2842901Z cachedir: .pytest_cache 2025-12-04T12:25:43.2843063Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T12:25:43.2843109Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T12:25:43.2843152Z configfile: pytest.ini 2025-12-04T12:25:43.2843314Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T12:25:43.2843563Z collecting ... /var/lib/jenkins/pytorch/test/test_meta.py:0: PytestCollectionWarning: cannot collect test class 'TestExpect' because it has a __new__ constructor (from: test/test_meta.py) 2025-12-04T12:25:43.2843631Z collected 40725 items / 10079 deselected / 30646 selected 2025-12-04T12:25:43.2843691Z stepcurrent: skipping 10079 already run items. 2025-12-04T12:25:43.2843737Z Running 3632 items in this shard 2025-12-04T12:25:43.2843740Z 2025-12-04T12:25:43.2843851Z test_meta.py::TestMetaCUDA::test_meta_inplace_floor_divide_cuda_uint8 PASSED [0.1804s] [ 0%] 2025-12-04T12:25:43.2844012Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_bfloat16 SKIPPED [0.0012s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2844156Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2844295Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2844434Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmax_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2844578Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmin_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2844718Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmin_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2844820Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_bfloat16 PASSED [0.0954s] [ 0%] 2025-12-04T12:25:43.2844921Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_float32 PASSED [0.0542s] [ 0%] 2025-12-04T12:25:43.2845018Z test_meta.py::TestMetaCUDA::test_meta_inplace_fmod_cuda_int32 PASSED [0.0266s] [ 0%] 2025-12-04T12:25:43.2845117Z test_meta.py::TestMetaCUDA::test_meta_inplace_frac_cuda_bfloat16 PASSED [1.0440s] [ 0%] 2025-12-04T12:25:43.2845265Z test_meta.py::TestMetaCUDA::test_meta_inplace_frexp_cuda_bfloat16 SKIPPED [0.0015s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2845411Z test_meta.py::TestMetaCUDA::test_meta_inplace_frexp_cuda_float16 SKIPPED [0.0012s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2845566Z test_meta.py::TestMetaCUDA::test_meta_inplace_frexp_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2845725Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2845867Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2846008Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2846160Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2846297Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2846435Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2846572Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2846712Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2846852Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2847003Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2847157Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2847301Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2847444Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2847590Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_uint16 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2847737Z test_meta.py::TestMetaCUDA::test_meta_inplace_full_like_cuda_uint32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2847897Z test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2848053Z test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2848279Z test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2848424Z test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2848569Z test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2848713Z test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2848854Z test_meta.py::TestMetaCUDA::test_meta_inplace_gather_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 0%] 2025-12-04T12:25:43.2848952Z test_meta.py::TestMetaCUDA::test_meta_inplace_gcd_cuda_int16 PASSED [0.8853s] [ 1%] 2025-12-04T12:25:43.2849049Z test_meta.py::TestMetaCUDA::test_meta_inplace_gcd_cuda_int64 PASSED [0.0098s] [ 1%] 2025-12-04T12:25:43.2849151Z test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_bfloat16 PASSED [0.0142s] [ 1%] 2025-12-04T12:25:43.2849249Z test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_float16 PASSED [0.0055s] [ 1%] 2025-12-04T12:25:43.2849345Z test_meta.py::TestMetaCUDA::test_meta_inplace_ge_cuda_float64 PASSED [0.0054s] [ 1%] 2025-12-04T12:25:43.2849462Z test_meta.py::TestMetaCUDA::test_meta_inplace_geometric_cuda_int32 PASSED [0.0074s] [ 1%] 2025-12-04T12:25:43.2849578Z test_meta.py::TestMetaCUDA::test_meta_inplace_geometric_cuda_uint8 PASSED [0.8456s] [ 1%] 2025-12-04T12:25:43.2849731Z test_meta.py::TestMetaCUDA::test_meta_inplace_geqrf_cuda_complex128 SKIPPED [0.0015s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2849880Z test_meta.py::TestMetaCUDA::test_meta_inplace_geqrf_cuda_complex64 SKIPPED [0.0012s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2850044Z test_meta.py::TestMetaCUDA::test_meta_inplace_gradient_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2850190Z test_meta.py::TestMetaCUDA::test_meta_inplace_gradient_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2850351Z test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_2d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2850507Z test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2850638Z test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_3d_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 1%] 2025-12-04T12:25:43.2850766Z test_meta.py::TestMetaCUDA::test_meta_inplace_grid_sampler_3d_cuda_float64 SKIPPED [0.0002s] (Skipped!) [ 1%] 2025-12-04T12:25:43.2850862Z test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_int32 PASSED [0.0079s] [ 1%] 2025-12-04T12:25:43.2850958Z test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_int64 PASSED [0.0070s] [ 1%] 2025-12-04T12:25:43.2851054Z test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_int8 PASSED [0.0063s] [ 1%] 2025-12-04T12:25:43.2851146Z test_meta.py::TestMetaCUDA::test_meta_inplace_gt_cuda_uint8 PASSED [0.0067s] [ 1%] 2025-12-04T12:25:43.2851294Z test_meta.py::TestMetaCUDA::test_meta_inplace_half_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2851434Z test_meta.py::TestMetaCUDA::test_meta_inplace_half_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2851573Z test_meta.py::TestMetaCUDA::test_meta_inplace_half_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2851736Z test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2853357Z test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2853506Z test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2853651Z test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2853801Z test_meta.py::TestMetaCUDA::test_meta_inplace_hash_tensor_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2853913Z test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_bfloat16 PASSED [0.0304s] [ 1%] 2025-12-04T12:25:43.2854018Z test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_bool PASSED [0.0086s] [ 1%] 2025-12-04T12:25:43.2854125Z test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_float32 PASSED [0.8574s] [ 1%] 2025-12-04T12:25:43.2854232Z test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_float64 PASSED [0.0107s] [ 1%] 2025-12-04T12:25:43.2854334Z test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_int32 PASSED [0.8930s] [ 1%] 2025-12-04T12:25:43.2854435Z test_meta.py::TestMetaCUDA::test_meta_inplace_heaviside_cuda_int64 PASSED [0.0090s] [ 1%] 2025-12-04T12:25:43.2854581Z test_meta.py::TestMetaCUDA::test_meta_inplace_histc_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2854727Z test_meta.py::TestMetaCUDA::test_meta_inplace_histc_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2854923Z test_meta.py::TestMetaCUDA::test_meta_inplace_histc_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 1%] 2025-12-04T12:25:43.2855103Z test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2855255Z test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2855417Z test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2855563Z test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2855706Z test_meta.py::TestMetaCUDA::test_meta_inplace_hsplit_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2855856Z test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2856002Z test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2856150Z test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2856292Z test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2856435Z test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2856574Z test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2856716Z test_meta.py::TestMetaCUDA::test_meta_inplace_hstack_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2856908Z test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.2857008Z test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_float64 PASSED [0.8592s] [ 2%] 2025-12-04T12:25:43.2857209Z test_meta.py::TestMetaCUDA::test_meta_inplace_i0_cuda_int64 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 2%] 2025-12-04T12:25:43.2857359Z test_meta.py::TestMetaCUDA::test_meta_inplace_imag_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2857504Z test_meta.py::TestMetaCUDA::test_meta_inplace_imag_cuda_complex32 SKIPPED [0.0011s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2857650Z test_meta.py::TestMetaCUDA::test_meta_inplace_imag_cuda_complex64 SKIPPED [0.0012s] (No inplace variable for this op) [ 2%] 2025-12-04T12:25:43.2857759Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_bfloat16 PASSED [0.0863s] [ 2%] 2025-12-04T12:25:43.2857862Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_bool PASSED [0.0069s] [ 2%] 2025-12-04T12:25:43.2857973Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_complex64 PASSED [0.0090s] [ 2%] 2025-12-04T12:25:43.2858111Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_float32 PASSED [0.0083s] [ 2%] 2025-12-04T12:25:43.2858216Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_float64 PASSED [0.8718s] [ 2%] 2025-12-04T12:25:43.2858318Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_int16 PASSED [0.0093s] [ 2%] 2025-12-04T12:25:43.2858420Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_add_cuda_int32 PASSED [0.0077s] [ 2%] 2025-12-04T12:25:43.2858528Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_bfloat16 PASSED [0.0039s] [ 2%] 2025-12-04T12:25:43.2858640Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_complex64 PASSED [0.8630s] [ 2%] 2025-12-04T12:25:43.2858759Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_int16 PASSED [0.0053s] [ 2%] 2025-12-04T12:25:43.2858882Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_int32 PASSED [0.0040s] [ 2%] 2025-12-04T12:25:43.2858985Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_copy_cuda_int64 PASSED [0.0051s] [ 2%] 2025-12-04T12:25:43.2859099Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_complex128 PASSED [0.0069s] [ 2%] 2025-12-04T12:25:43.2859220Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_float32 PASSED [0.0062s] [ 2%] 2025-12-04T12:25:43.2859324Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_fill_cuda_int64 PASSED [0.0052s] [ 2%] 2025-12-04T12:25:43.2859430Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_float16 PASSED [0.9123s] [ 2%] 2025-12-04T12:25:43.2859531Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_int64 PASSED [0.8643s] [ 2%] 2025-12-04T12:25:43.2859632Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_put_cuda_uint8 PASSED [0.0061s] [ 2%] 2025-12-04T12:25:43.2859751Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_float16 PASSED [0.0063s] [ 3%] 2025-12-04T12:25:43.2859867Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_float64 PASSED [0.0061s] [ 3%] 2025-12-04T12:25:43.2859982Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amax_cuda_uint8 PASSED [0.0060s] [ 3%] 2025-12-04T12:25:43.2860099Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amin_cuda_bfloat16 PASSED [0.0073s] [ 3%] 2025-12-04T12:25:43.2860213Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amin_cuda_float16 PASSED [0.0073s] [ 3%] 2025-12-04T12:25:43.2860328Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amin_cuda_float32 PASSED [0.0074s] [ 3%] 2025-12-04T12:25:43.2860442Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_amin_cuda_int64 PASSED [0.0069s] [ 3%] 2025-12-04T12:25:43.2860560Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_bfloat16 PASSED [0.0176s] [ 3%] 2025-12-04T12:25:43.2860674Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_float16 PASSED [0.0065s] [ 3%] 2025-12-04T12:25:43.2860792Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_int16 PASSED [0.0258s] [ 3%] 2025-12-04T12:25:43.2860918Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_mean_cuda_int32 PASSED [0.0067s] [ 3%] 2025-12-04T12:25:43.2861035Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_prod_cuda_float16 PASSED [0.0075s] [ 3%] 2025-12-04T12:25:43.2861147Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_prod_cuda_int32 PASSED [0.0067s] [ 3%] 2025-12-04T12:25:43.2861261Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_reduce_prod_cuda_int64 PASSED [0.0073s] [ 3%] 2025-12-04T12:25:43.2861419Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2861580Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2861738Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2861893Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2862045Z test_meta.py::TestMetaCUDA::test_meta_inplace_index_select_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2862196Z test_meta.py::TestMetaCUDA::test_meta_inplace_inner_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2862344Z test_meta.py::TestMetaCUDA::test_meta_inplace_inner_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2862493Z test_meta.py::TestMetaCUDA::test_meta_inplace_int_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2862655Z test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2862801Z test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2862955Z test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2863110Z test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2863258Z test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2863404Z test_meta.py::TestMetaCUDA::test_meta_inplace_isclose_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2863562Z test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2863711Z test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2863857Z test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2864002Z test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2864149Z test_meta.py::TestMetaCUDA::test_meta_inplace_isfinite_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2864293Z test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2864441Z test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2864585Z test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2864731Z test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 3%] 2025-12-04T12:25:43.2864892Z test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2865034Z test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2865177Z test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2865317Z test_meta.py::TestMetaCUDA::test_meta_inplace_isinf_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2865465Z test_meta.py::TestMetaCUDA::test_meta_inplace_isnan_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2865611Z test_meta.py::TestMetaCUDA::test_meta_inplace_isnan_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2865752Z test_meta.py::TestMetaCUDA::test_meta_inplace_isnan_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2865900Z test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2866048Z test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2866193Z test_meta.py::TestMetaCUDA::test_meta_inplace_isneginf_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2866344Z test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2866501Z test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2866657Z test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_int16 SKIPPED [0.0013s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2866806Z test_meta.py::TestMetaCUDA::test_meta_inplace_isposinf_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2866964Z test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2867115Z test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2867256Z test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2867401Z test_meta.py::TestMetaCUDA::test_meta_inplace_isreal_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2867547Z test_meta.py::TestMetaCUDA::test_meta_inplace_istft_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2867687Z test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2867833Z test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2867978Z test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2868149Z test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2868287Z test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2868425Z test_meta.py::TestMetaCUDA::test_meta_inplace_item_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2868600Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_2inputs_2outputs_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2868782Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2868976Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2869157Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2869335Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_4inputs_with_extra_args_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2869494Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2869654Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2869812Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2869968Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2870140Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2870320Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 4%] 2025-12-04T12:25:43.2870510Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2870699Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2870875Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2871059Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_binary_return_by_ref_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2871215Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2871373Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2871528Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2871689Z test_meta.py::TestMetaCUDA::test_meta_inplace_jiterator_unary_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2871836Z test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2871980Z test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2872124Z test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2872267Z test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2872404Z test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2872545Z test_meta.py::TestMetaCUDA::test_meta_inplace_kron_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2872696Z test_meta.py::TestMetaCUDA::test_meta_inplace_kthvalue_cuda_bfloat16 SKIPPED [0.0008s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2872858Z test_meta.py::TestMetaCUDA::test_meta_inplace_kthvalue_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2873005Z test_meta.py::TestMetaCUDA::test_meta_inplace_kthvalue_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2873102Z test_meta.py::TestMetaCUDA::test_meta_inplace_lcm_cuda_int16 PASSED [0.0109s] [ 5%] 2025-12-04T12:25:43.2873201Z test_meta.py::TestMetaCUDA::test_meta_inplace_lcm_cuda_int64 PASSED [0.0081s] [ 5%] 2025-12-04T12:25:43.2873296Z test_meta.py::TestMetaCUDA::test_meta_inplace_lcm_cuda_int8 PASSED [0.0088s] [ 5%] 2025-12-04T12:25:43.2873492Z test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.2873690Z test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.2873889Z test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.2874084Z test_meta.py::TestMetaCUDA::test_meta_inplace_ldexp_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.2874185Z test_meta.py::TestMetaCUDA::test_meta_inplace_le_cuda_bfloat16 PASSED [0.0055s] [ 5%] 2025-12-04T12:25:43.2874279Z test_meta.py::TestMetaCUDA::test_meta_inplace_le_cuda_int8 PASSED [0.0053s] [ 5%] 2025-12-04T12:25:43.2874384Z test_meta.py::TestMetaCUDA::test_meta_inplace_lerp_cuda_complex128 PASSED [0.0155s] [ 5%] 2025-12-04T12:25:43.2874501Z test_meta.py::TestMetaCUDA::test_meta_inplace_lerp_cuda_complex32 PASSED [0.8659s] [ 5%] 2025-12-04T12:25:43.2874707Z test_meta.py::TestMetaCUDA::test_meta_inplace_lgamma_cuda_bool SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.2874808Z test_meta.py::TestMetaCUDA::test_meta_inplace_lgamma_cuda_float64 PASSED [0.8452s] [ 5%] 2025-12-04T12:25:43.2875017Z test_meta.py::TestMetaCUDA::test_meta_inplace_lgamma_cuda_int64 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.2875216Z test_meta.py::TestMetaCUDA::test_meta_inplace_lgamma_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 5%] 2025-12-04T12:25:43.2875377Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cholesky_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2875537Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2875690Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2875845Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_cross_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 5%] 2025-12-04T12:25:43.2875996Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_det_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2876154Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2876317Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2876475Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2876632Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2876787Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_diagonal_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2876952Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eig_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2877107Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eig_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2877258Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eig_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2877414Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigh_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2877570Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigh_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2877726Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvals_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2877884Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_eigvalsh_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2878037Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2878231Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2878387Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_ex_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2878556Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_inv_ex_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2878730Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2878897Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_factor_ex_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2879108Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_ldl_solve_cuda_float64 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 6%] 2025-12-04T12:25:43.2879263Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lstsq_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2879442Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lstsq_grad_oriented_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2879596Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2879761Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_factor_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2879922Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_factor_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2880084Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_lu_solve_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2880247Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_norm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2880407Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_norm_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2880568Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2880738Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_power_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2880916Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_rank_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2881082Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_matrix_rank_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2881246Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_multi_dot_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2881400Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2881555Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2881739Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_norm_subgradients_at_zero_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2881893Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 6%] 2025-12-04T12:25:43.2882045Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2882211Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_hermitian_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2882407Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_pinv_singular_cuda_float64 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 7%] 2025-12-04T12:25:43.2882570Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_qr_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2882739Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_slogdet_cuda_complex128 SKIPPED [0.0008s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2882900Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_slogdet_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2883057Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_ex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2883241Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_solve_triangular_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2883397Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svd_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2883547Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svd_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2883710Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_svdvals_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2883875Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorinv_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2884040Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_tensorsolve_cuda_float64 SKIPPED [0.0008s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2884197Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vander_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2884352Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vander_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2884509Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vecdot_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2884666Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vecdot_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2884831Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vector_norm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2885008Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vector_norm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2885172Z test_meta.py::TestMetaCUDA::test_meta_inplace_linalg_vector_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2885325Z test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2885477Z test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2885627Z test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2885773Z test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2885944Z test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_tensor_overload_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2886112Z test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_tensor_overload_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2886277Z test_meta.py::TestMetaCUDA::test_meta_inplace_linspace_tensor_overload_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 7%] 2025-12-04T12:25:43.2886481Z test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T12:25:43.2886692Z test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T12:25:43.2886898Z test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T12:25:43.2887096Z test_meta.py::TestMetaCUDA::test_meta_inplace_log10_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T12:25:43.2887209Z test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_float16 PASSED [0.8583s] [ 7%] 2025-12-04T12:25:43.2887311Z test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_float64 PASSED [0.8464s] [ 7%] 2025-12-04T12:25:43.2887506Z test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T12:25:43.2887700Z test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_int32 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T12:25:43.2887895Z test_meta.py::TestMetaCUDA::test_meta_inplace_log1p_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 7%] 2025-12-04T12:25:43.2888126Z test_meta.py::TestMetaCUDA::test_meta_inplace_log2_cuda_bool SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.2888324Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.2888519Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_complex64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.2888619Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_float16 PASSED [0.8687s] [ 8%] 2025-12-04T12:25:43.2888720Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_float32 PASSED [0.0049s] [ 8%] 2025-12-04T12:25:43.2888912Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_int64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.2889124Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 8%] 2025-12-04T12:25:43.2889234Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_normal_cuda_float16 PASSED [0.0081s] [ 8%] 2025-12-04T12:25:43.2889389Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2889543Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2889705Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2889875Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2890042Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2890207Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2890369Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2890531Z test_meta.py::TestMetaCUDA::test_meta_inplace_log_softmax_with_dtype_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2890696Z test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2890866Z test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2891019Z test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2891173Z test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2891338Z test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2891488Z test_meta.py::TestMetaCUDA::test_meta_inplace_logaddexp_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2891646Z test_meta.py::TestMetaCUDA::test_meta_inplace_logcumsumexp_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2891801Z test_meta.py::TestMetaCUDA::test_meta_inplace_logcumsumexp_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2891957Z test_meta.py::TestMetaCUDA::test_meta_inplace_logcumsumexp_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2892112Z test_meta.py::TestMetaCUDA::test_meta_inplace_logdet_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2892266Z test_meta.py::TestMetaCUDA::test_meta_inplace_logdet_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2892413Z test_meta.py::TestMetaCUDA::test_meta_inplace_logdet_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2892559Z test_meta.py::TestMetaCUDA::test_meta_inplace_logdet_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 8%] 2025-12-04T12:25:43.2892672Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_complex128 PASSED [0.8623s] [ 8%] 2025-12-04T12:25:43.2892788Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_complex64 PASSED [0.0093s] [ 8%] 2025-12-04T12:25:43.2892898Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_float32 PASSED [0.0278s] [ 8%] 2025-12-04T12:25:43.2893017Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_int32 PASSED [0.0053s] [ 8%] 2025-12-04T12:25:43.2893122Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_and_cuda_int8 PASSED [0.0050s] [ 8%] 2025-12-04T12:25:43.2893227Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_not_cuda_bool PASSED [0.8754s] [ 8%] 2025-12-04T12:25:43.2893341Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_not_cuda_complex128 PASSED [0.0047s] [ 8%] 2025-12-04T12:25:43.2893449Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_bfloat16 PASSED [0.0054s] [ 9%] 2025-12-04T12:25:43.2893561Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_complex64 PASSED [0.0077s] [ 9%] 2025-12-04T12:25:43.2893669Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_float32 PASSED [0.0051s] [ 9%] 2025-12-04T12:25:43.2893773Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_or_cuda_int8 PASSED [0.0050s] [ 9%] 2025-12-04T12:25:43.2893883Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_complex128 PASSED [0.0070s] [ 9%] 2025-12-04T12:25:43.2893997Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_complex64 PASSED [0.2765s] [ 9%] 2025-12-04T12:25:43.2894103Z test_meta.py::TestMetaCUDA::test_meta_inplace_logical_xor_cuda_int64 PASSED [0.0054s] [ 9%] 2025-12-04T12:25:43.2894205Z test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_bfloat16 PASSED [0.0150s] [ 9%] 2025-12-04T12:25:43.2894304Z test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_float64 PASSED [0.0054s] [ 9%] 2025-12-04T12:25:43.2894500Z test_meta.py::TestMetaCUDA::test_meta_inplace_logit_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 9%] 2025-12-04T12:25:43.2894665Z test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2894857Z test_meta.py::TestMetaCUDA::test_meta_inplace_logspace_tensor_overload_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2895014Z test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2895173Z test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2895321Z test_meta.py::TestMetaCUDA::test_meta_inplace_logsumexp_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2895466Z test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2895605Z test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2895751Z test_meta.py::TestMetaCUDA::test_meta_inplace_long_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2895849Z test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_float16 PASSED [0.8559s] [ 9%] 2025-12-04T12:25:43.2895947Z test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_float64 PASSED [0.0072s] [ 9%] 2025-12-04T12:25:43.2896044Z test_meta.py::TestMetaCUDA::test_meta_inplace_lt_cuda_int64 PASSED [0.0057s] [ 9%] 2025-12-04T12:25:43.2896187Z test_meta.py::TestMetaCUDA::test_meta_inplace_lu_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2896343Z test_meta.py::TestMetaCUDA::test_meta_inplace_lu_solve_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2896494Z test_meta.py::TestMetaCUDA::test_meta_inplace_lu_solve_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2896643Z test_meta.py::TestMetaCUDA::test_meta_inplace_lu_solve_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2896792Z test_meta.py::TestMetaCUDA::test_meta_inplace_lu_unpack_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2896945Z test_meta.py::TestMetaCUDA::test_meta_inplace_mH_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2897082Z test_meta.py::TestMetaCUDA::test_meta_inplace_mH_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2897222Z test_meta.py::TestMetaCUDA::test_meta_inplace_mH_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2897360Z test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2897498Z test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2897634Z test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2897771Z test_meta.py::TestMetaCUDA::test_meta_inplace_mT_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2897923Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2898145Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amax_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2898301Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2898453Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 9%] 2025-12-04T12:25:43.2898618Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_amin_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2898791Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2898947Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2899114Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2899269Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2899419Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2899570Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmax_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2899726Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmin_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2899883Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmin_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2900036Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_argmin_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2900199Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2900355Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2900508Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumprod_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2900663Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2900820Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2900991Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2901145Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2901299Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2901450Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2901603Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_cumsum_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2901717Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_complex32 PASSED [0.0065s] [ 10%] 2025-12-04T12:25:43.2901828Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_float16 PASSED [0.0064s] [ 10%] 2025-12-04T12:25:43.2901938Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_float64 PASSED [0.0062s] [ 10%] 2025-12-04T12:25:43.2902045Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_int16 PASSED [0.0061s] [ 10%] 2025-12-04T12:25:43.2902149Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_fill_cuda_uint8 PASSED [0.0062s] [ 10%] 2025-12-04T12:25:43.2902311Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_log_softmax_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2902475Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_log_softmax_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2902646Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logaddexp_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2902820Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2902978Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2903150Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2903305Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_logsumexp_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2903459Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_mean_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2903610Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_mean_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2903765Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_median_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2903918Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_norm_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 10%] 2025-12-04T12:25:43.2904070Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2904236Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_normalize_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2904397Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_normalize_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2904552Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2904699Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2904866Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2905016Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2905168Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2905315Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2905467Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_prod_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2905584Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_bfloat16 PASSED [0.8667s] [ 11%] 2025-12-04T12:25:43.2905699Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_float16 PASSED [0.0054s] [ 11%] 2025-12-04T12:25:43.2905811Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_float64 PASSED [0.0041s] [ 11%] 2025-12-04T12:25:43.2905921Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_int8 PASSED [0.0039s] [ 11%] 2025-12-04T12:25:43.2906033Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_scatter_cuda_uint8 PASSED [0.8542s] [ 11%] 2025-12-04T12:25:43.2906186Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_select_cuda_int16 SKIPPED [0.0015s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2906342Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_softmax_cuda_float64 SKIPPED [0.0012s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2906507Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_softmin_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2906675Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_softmin_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2906828Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2906989Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2907136Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2907283Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2907430Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2907578Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2907729Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_std_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2907886Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2908039Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2908236Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2908384Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_sum_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2908532Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2908679Z test_meta.py::TestMetaCUDA::test_meta_inplace_masked_var_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2908841Z test_meta.py::TestMetaCUDA::test_meta_inplace_matmul_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2908990Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2909139Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2909286Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_binary_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 11%] 2025-12-04T12:25:43.2909467Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_pool2d_with_indices_backward_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2909646Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_pool2d_with_indices_backward_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2909808Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2909972Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2910130Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_no_dim_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2910292Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2910452Z test_meta.py::TestMetaCUDA::test_meta_inplace_max_reduction_with_dim_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2910698Z test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2910862Z test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2911007Z test_meta.py::TestMetaCUDA::test_meta_inplace_maximum_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2911163Z test_meta.py::TestMetaCUDA::test_meta_inplace_mean_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2911311Z test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2911457Z test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2911601Z test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2911746Z test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_int16 SKIPPED [0.0011s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2911890Z test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2912034Z test_meta.py::TestMetaCUDA::test_meta_inplace_median_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2912201Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2912372Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2912536Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2912698Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_list_of_tensors_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2912865Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2913048Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2913215Z test_meta.py::TestMetaCUDA::test_meta_inplace_meshgrid_variadic_tensors_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2913361Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2913511Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2913657Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_binary_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2913821Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2913979Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2914137Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_no_dim_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2914301Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2914461Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2914637Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2914806Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2914966Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2915135Z test_meta.py::TestMetaCUDA::test_meta_inplace_min_reduction_with_dim_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2915281Z test_meta.py::TestMetaCUDA::test_meta_inplace_minimum_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 12%] 2025-12-04T12:25:43.2915423Z test_meta.py::TestMetaCUDA::test_meta_inplace_mm_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2915570Z test_meta.py::TestMetaCUDA::test_meta_inplace_mm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2915711Z test_meta.py::TestMetaCUDA::test_meta_inplace_mm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2915854Z test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2915992Z test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2916132Z test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2916268Z test_meta.py::TestMetaCUDA::test_meta_inplace_mode_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2916418Z test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2916566Z test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2916711Z test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2916867Z test_meta.py::TestMetaCUDA::test_meta_inplace_movedim_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2917006Z test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2917151Z test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2917292Z test_meta.py::TestMetaCUDA::test_meta_inplace_msort_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2917395Z test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_bfloat16 PASSED [0.8555s] [ 13%] 2025-12-04T12:25:43.2917492Z test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_bool PASSED [0.0068s] [ 13%] 2025-12-04T12:25:43.2917593Z test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_complex32 PASSED [0.0083s] [ 13%] 2025-12-04T12:25:43.2917690Z test_meta.py::TestMetaCUDA::test_meta_inplace_mul_cuda_float32 PASSED [0.0052s] [ 13%] 2025-12-04T12:25:43.2917832Z test_meta.py::TestMetaCUDA::test_meta_inplace_mv_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2917957Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_bfloat16 PASSED [0.8890s] [ 13%] 2025-12-04T12:25:43.2918134Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_float32 PASSED [0.0143s] [ 13%] 2025-12-04T12:25:43.2918256Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_float64 PASSED [0.8572s] [ 13%] 2025-12-04T12:25:43.2918486Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_1_cuda_int8 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.2918627Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_bfloat16 PASSED [0.0116s] [ 13%] 2025-12-04T12:25:43.2918748Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_float64 PASSED [0.0100s] [ 13%] 2025-12-04T12:25:43.2918965Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_3_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.2919191Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int16 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.2919405Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.2919617Z test_meta.py::TestMetaCUDA::test_meta_inplace_mvlgamma_mvlgamma_p_5_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 13%] 2025-12-04T12:25:43.2919728Z test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_float64 PASSED [0.0282s] [ 13%] 2025-12-04T12:25:43.2919834Z test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_int32 PASSED [0.8501s] [ 13%] 2025-12-04T12:25:43.2919939Z test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_int64 PASSED [0.0048s] [ 13%] 2025-12-04T12:25:43.2920042Z test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_int8 PASSED [0.0035s] [ 13%] 2025-12-04T12:25:43.2920146Z test_meta.py::TestMetaCUDA::test_meta_inplace_nan_to_num_cuda_uint8 PASSED [0.8565s] [ 13%] 2025-12-04T12:25:43.2920297Z test_meta.py::TestMetaCUDA::test_meta_inplace_nanmean_cuda_bfloat16 SKIPPED [0.0015s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2920451Z test_meta.py::TestMetaCUDA::test_meta_inplace_nanmean_cuda_complex64 SKIPPED [0.0012s] (No inplace variable for this op) [ 13%] 2025-12-04T12:25:43.2920601Z test_meta.py::TestMetaCUDA::test_meta_inplace_nanmedian_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2920747Z test_meta.py::TestMetaCUDA::test_meta_inplace_nanmedian_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2920908Z test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2921056Z test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2921203Z test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2921348Z test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2921493Z test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2921634Z test_meta.py::TestMetaCUDA::test_meta_inplace_nansum_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2921793Z test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2921949Z test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2922099Z test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2922246Z test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_copy_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2922390Z test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2922548Z test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2922703Z test_meta.py::TestMetaCUDA::test_meta_inplace_narrow_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2922872Z test_meta.py::TestMetaCUDA::test_meta_inplace_native_dropout_backward_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2923055Z test_meta.py::TestMetaCUDA::test_meta_inplace_native_dropout_backward_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2923215Z test_meta.py::TestMetaCUDA::test_meta_inplace_native_layer_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2923376Z test_meta.py::TestMetaCUDA::test_meta_inplace_native_layer_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2923480Z test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_complex128 PASSED [0.8685s] [ 14%] 2025-12-04T12:25:43.2923579Z test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_float16 PASSED [0.0073s] [ 14%] 2025-12-04T12:25:43.2923677Z test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_float32 PASSED [0.0057s] [ 14%] 2025-12-04T12:25:43.2923771Z test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_int32 PASSED [0.0055s] [ 14%] 2025-12-04T12:25:43.2923864Z test_meta.py::TestMetaCUDA::test_meta_inplace_ne_cuda_uint8 PASSED [0.0054s] [ 14%] 2025-12-04T12:25:43.2923964Z test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_complex32 PASSED [0.8745s] [ 14%] 2025-12-04T12:25:43.2924064Z test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_float16 PASSED [0.0041s] [ 14%] 2025-12-04T12:25:43.2924158Z test_meta.py::TestMetaCUDA::test_meta_inplace_neg_cuda_uint8 PASSED [0.8497s] [ 14%] 2025-12-04T12:25:43.2924311Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_bfloat16 SKIPPED [0.0015s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2924466Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_complex32 SKIPPED [0.0013s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2924619Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2924797Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_bfloat16 SKIPPED [0.0012s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2924962Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2925123Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2925278Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2925436Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_empty_strided_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2925586Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 14%] 2025-12-04T12:25:43.2925735Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2925883Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2926026Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2926168Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2926322Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_full_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2926481Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2926631Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2926777Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2926930Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2927072Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_ones_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2927215Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2927370Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2927523Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2927673Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2927823Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2927976Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2928161Z test_meta.py::TestMetaCUDA::test_meta_inplace_new_zeros_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2928268Z test_meta.py::TestMetaCUDA::test_meta_inplace_nextafter_cuda_float32 PASSED [0.0066s] [ 15%] 2025-12-04T12:25:43.2928452Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2928634Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2928829Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool1d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2929013Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2929191Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool2d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2929372Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2929552Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_avg_pool3d_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2929731Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2929911Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool1d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2930090Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2930268Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool2d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2930463Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2930655Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_adaptive_max_pool3d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2930787Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_alpha_dropout_cuda_bfloat16 PASSED [0.8895s] [ 15%] 2025-12-04T12:25:43.2930930Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_alpha_dropout_cuda_float64 PASSED [0.0162s] [ 15%] 2025-12-04T12:25:43.2931098Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool1d_cuda_float32 SKIPPED [0.0012s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2931268Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool2d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2931437Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2931605Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool3d_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 15%] 2025-12-04T12:25:43.2931773Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool3d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2931940Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_avg_pool3d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2932108Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_batch_norm_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2932280Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_bilinear_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2932447Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_bilinear_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2932629Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_binary_cross_entropy_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2932821Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_binary_cross_entropy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2933018Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_binary_cross_entropy_with_logits_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2933138Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_celu_cuda_bfloat16 PASSED [0.8811s] [ 16%] 2025-12-04T12:25:43.2933253Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_celu_cuda_float32 PASSED [0.0060s] [ 16%] 2025-12-04T12:25:43.2933430Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2933610Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2933785Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2933958Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2934126Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2934298Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_channel_shuffle_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2934484Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2934660Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2934825Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv1d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2935004Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv2d_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2935168Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv3d_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2935334Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv3d_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2935498Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2935675Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose1d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2935858Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose1d_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2936035Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2936212Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2936386Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_conv_transpose3d_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2936570Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2936753Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2936951Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2937133Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_embedding_loss_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2937312Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cosine_similarity_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2937485Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cross_entropy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2937657Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_cross_entropy_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 16%] 2025-12-04T12:25:43.2937785Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout2d_cuda_bfloat16 PASSED [0.0107s] [ 16%] 2025-12-04T12:25:43.2937912Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout2d_cuda_float32 PASSED [0.0104s] [ 17%] 2025-12-04T12:25:43.2938037Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout2d_cuda_float64 PASSED [0.0102s] [ 17%] 2025-12-04T12:25:43.2938201Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout3d_cuda_float16 PASSED [0.0091s] [ 17%] 2025-12-04T12:25:43.2938323Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout_cuda_bfloat16 PASSED [0.0139s] [ 17%] 2025-12-04T12:25:43.2938441Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout_cuda_float16 PASSED [0.0137s] [ 17%] 2025-12-04T12:25:43.2938575Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout_cuda_float32 PASSED [0.0138s] [ 17%] 2025-12-04T12:25:43.2938706Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_dropout_cuda_float64 PASSED [0.0137s] [ 17%] 2025-12-04T12:25:43.2938823Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_elu_cuda_bfloat16 PASSED [0.0044s] [ 17%] 2025-12-04T12:25:43.2938996Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_embedding_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2939182Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_embedding_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2939351Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_embedding_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2939503Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float16 PASSED [0.0071s] [ 17%] 2025-12-04T12:25:43.2939657Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_with_train_cuda_float64 PASSED [0.0069s] [ 17%] 2025-12-04T12:25:43.2939817Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex64 PASSED [0.0081s] [ 17%] 2025-12-04T12:25:43.2939972Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int32 PASSED [0.0080s] [ 17%] 2025-12-04T12:25:43.2940125Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_feature_alpha_dropout_without_train_cuda_int8 PASSED [0.0081s] [ 17%] 2025-12-04T12:25:43.2940307Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2940490Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool2d_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2940673Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_fractional_max_pool3d_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2940849Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gaussian_nll_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2941040Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gaussian_nll_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2941200Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_gelu_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2941359Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_glu_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2941521Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_glu_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2941693Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_grid_sample_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2941863Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_grid_sample_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2942033Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_group_norm_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2942203Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardshrink_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2942332Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardsigmoid_cuda_float64 PASSED [0.8708s] [ 17%] 2025-12-04T12:25:43.2942498Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_float32 SKIPPED [0.0014s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2942673Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2942846Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2943008Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2943180Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hardtanh_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2943361Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_hinge_embedding_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2943526Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_huber_loss_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 17%] 2025-12-04T12:25:43.2943706Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_instance_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2943878Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_instance_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2944051Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_instance_norm_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2944230Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_area_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2944406Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_area_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2944581Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_area_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2944764Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_bicubic_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2944947Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_bilinear_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2945143Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_bilinear_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2945336Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_nearest-exact_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2945516Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_nearest_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2945699Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_interpolate_trilinear_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2945867Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_l1_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2946035Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_l1_loss_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2946199Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_l1_loss_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2946362Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_l1_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2946527Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_layer_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2946663Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_leaky_relu_cuda_float32 PASSED [0.8731s] [ 18%] 2025-12-04T12:25:43.2946798Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_leaky_relu_cuda_float64 PASSED [0.0094s] [ 18%] 2025-12-04T12:25:43.2946962Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_linear_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2947125Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_linear_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2947316Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_local_response_norm_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2947495Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_local_response_norm_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2947665Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_logsigmoid_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2947845Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2948023Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2948243Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2948419Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2948595Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_margin_ranking_loss_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2948764Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool1d_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2948931Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool1d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2949119Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool1d_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2949284Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool2d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2949452Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_pool3d_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2949621Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool1d_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2949799Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool1d_grad_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2949974Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool1d_grad_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 18%] 2025-12-04T12:25:43.2950149Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_max_unpool3d_grad_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2950266Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mish_cuda_bfloat16 PASSED [0.0126s] [ 19%] 2025-12-04T12:25:43.2950387Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mish_cuda_float32 PASSED [0.0042s] [ 19%] 2025-12-04T12:25:43.2950504Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mish_cuda_float64 PASSED [0.0041s] [ 19%] 2025-12-04T12:25:43.2950673Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_mse_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2950880Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multi_head_attention_forward_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2951071Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multi_margin_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2951259Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multi_margin_loss_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2951452Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2951642Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2951831Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_multilabel_soft_margin_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2952000Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_nll_loss_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2952165Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_nll_loss_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2952336Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_normalize_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2952508Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_normalize_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2952684Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2952854Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2953021Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_circular_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2953207Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_constant_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2953377Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2953549Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2953715Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2953884Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_reflect_cuda_float64 SKIPPED [0.0008s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2954058Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2954231Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2954402Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2954575Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2954745Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2954927Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2955126Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2955308Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2955504Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2955684Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pad_replicate_negative_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2955865Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2956040Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2956217Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pairwise_distance_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 19%] 2025-12-04T12:25:43.2956383Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pdist_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2956556Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2956725Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2956896Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2957069Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2957249Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2957419Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_shuffle_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2957590Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2957761Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_pixel_unshuffle_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2957931Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_poisson_nll_loss_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2958154Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_poisson_nll_loss_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2958325Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_poisson_nll_loss_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2958497Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_poisson_nll_loss_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2958659Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu6_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2958817Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu6_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2958992Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2959163Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2959324Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2959493Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2959651Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2959805Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2959962Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_relu_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2960130Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rms_norm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2960300Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rms_norm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2960419Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_rrelu_cuda_float32 PASSED [0.0083s] [ 20%] 2025-12-04T12:25:43.2960614Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_scaled_dot_product_attention_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2960730Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_selu_cuda_float16 PASSED [0.0043s] [ 20%] 2025-12-04T12:25:43.2960862Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_silu_complex_cuda_complex128 PASSED [0.0095s] [ 20%] 2025-12-04T12:25:43.2961036Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_smooth_l1_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2961210Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_soft_margin_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2961389Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2961553Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2961739Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2961919Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2962099Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softmin_with_dtype_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2962265Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softplus_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 20%] 2025-12-04T12:25:43.2962432Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softplus_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2962602Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softshrink_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2962767Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_softsign_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2962936Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2963110Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2963283Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_tanhshrink_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2963409Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_float16 PASSED [0.8859s] [ 21%] 2025-12-04T12:25:43.2963551Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_float32 PASSED [0.0067s] [ 21%] 2025-12-04T12:25:43.2963670Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_int16 PASSED [0.0051s] [ 21%] 2025-12-04T12:25:43.2963789Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_int64 PASSED [0.0047s] [ 21%] 2025-12-04T12:25:43.2963906Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_threshold_cuda_uint8 PASSED [0.0046s] [ 21%] 2025-12-04T12:25:43.2964088Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2964265Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2964443Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2964618Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_loss_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2964813Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2965006Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_triplet_margin_with_distance_loss_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2965171Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_unfold_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2965349Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_unfold_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2965529Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_upsample_bilinear_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2965706Z test_meta.py::TestMetaCUDA::test_meta_inplace_nn_functional_upsample_nearest_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2965857Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2966004Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2966153Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2966297Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2966439Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_bfloat16 SKIPPED [0.0005s] (Only runs on cpu) [ 21%] 2025-12-04T12:25:43.2966572Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_bool SKIPPED [0.0006s] (Only runs on cpu) [ 21%] 2025-12-04T12:25:43.2966712Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_complex32 SKIPPED [0.0005s] (Only runs on cpu) [ 21%] 2025-12-04T12:25:43.2966851Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_complex64 SKIPPED [0.0005s] (Only runs on cpu) [ 21%] 2025-12-04T12:25:43.2966999Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_float32 SKIPPED [0.0005s] (Only runs on cpu) [ 21%] 2025-12-04T12:25:43.2967147Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_float64 SKIPPED [0.0005s] (Only runs on cpu) [ 21%] 2025-12-04T12:25:43.2967283Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_int32 SKIPPED [0.0006s] (Only runs on cpu) [ 21%] 2025-12-04T12:25:43.2967418Z test_meta.py::TestMetaCUDA::test_meta_inplace_nonzero_static_cuda_int64 SKIPPED [0.0005s] (Only runs on cpu) [ 21%] 2025-12-04T12:25:43.2967578Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2967718Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2967866Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_fro_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2968011Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_fro_cuda_float64 SKIPPED [0.0008s] (No inplace variable for this op) [ 21%] 2025-12-04T12:25:43.2968240Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_inf_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2968386Z test_meta.py::TestMetaCUDA::test_meta_inplace_norm_nuc_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2968535Z test_meta.py::TestMetaCUDA::test_meta_inplace_normal_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2968652Z test_meta.py::TestMetaCUDA::test_meta_inplace_normal_in_place_cuda_bfloat16 PASSED [0.8659s] [ 22%] 2025-12-04T12:25:43.2968814Z test_meta.py::TestMetaCUDA::test_meta_inplace_normal_number_mean_cuda_bfloat16 SKIPPED [0.0016s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2968977Z test_meta.py::TestMetaCUDA::test_meta_inplace_normal_number_mean_cuda_float16 SKIPPED [0.0013s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2969124Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2969266Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2969419Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2969558Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2969700Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2969843Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2969985Z test_meta.py::TestMetaCUDA::test_meta_inplace_ones_like_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2970129Z test_meta.py::TestMetaCUDA::test_meta_inplace_ormqr_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2970273Z test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2970421Z test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2970567Z test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2970714Z test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2970857Z test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2971008Z test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2971162Z test_meta.py::TestMetaCUDA::test_meta_inplace_outer_cuda_uint8 SKIPPED [0.0008s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2971322Z test_meta.py::TestMetaCUDA::test_meta_inplace_pca_lowrank_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2971490Z test_meta.py::TestMetaCUDA::test_meta_inplace_pca_lowrank_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2971648Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2971802Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2971951Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2972101Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2972252Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2972403Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2972553Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2972699Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2972843Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2972985Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2973132Z test_meta.py::TestMetaCUDA::test_meta_inplace_permute_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2973296Z test_meta.py::TestMetaCUDA::test_meta_inplace_pinverse_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2973446Z test_meta.py::TestMetaCUDA::test_meta_inplace_pinverse_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 22%] 2025-12-04T12:25:43.2973589Z test_meta.py::TestMetaCUDA::test_meta_inplace_polar_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 23%] 2025-12-04T12:25:43.2973716Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_0_cuda_bfloat16 PASSED [0.0101s] [ 23%] 2025-12-04T12:25:43.2973841Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_0_cuda_float32 PASSED [0.0076s] [ 23%] 2025-12-04T12:25:43.2974061Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_0_cuda_int32 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 23%] 2025-12-04T12:25:43.2974277Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_0_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 23%] 2025-12-04T12:25:43.2974404Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_1_cuda_float32 PASSED [0.8657s] [ 23%] 2025-12-04T12:25:43.2974531Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_bfloat16 PASSED [0.0081s] [ 23%] 2025-12-04T12:25:43.2974656Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_float32 PASSED [0.8650s] [ 23%] 2025-12-04T12:25:43.2974872Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_int32 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 23%] 2025-12-04T12:25:43.2975097Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_2_cuda_int8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 23%] 2025-12-04T12:25:43.2975323Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_3_cuda_bool SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 23%] 2025-12-04T12:25:43.2975447Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_3_cuda_float16 PASSED [0.0091s] [ 23%] 2025-12-04T12:25:43.2975581Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_3_cuda_float64 PASSED [0.0073s] [ 23%] 2025-12-04T12:25:43.2975704Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_float32 PASSED [0.0060s] [ 23%] 2025-12-04T12:25:43.2975920Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_int8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 23%] 2025-12-04T12:25:43.2976136Z test_meta.py::TestMetaCUDA::test_meta_inplace_polygamma_polygamma_n_4_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 23%] 2025-12-04T12:25:43.2976290Z test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 23%] 2025-12-04T12:25:43.2976442Z test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 23%] 2025-12-04T12:25:43.2976593Z test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 23%] 2025-12-04T12:25:43.2976739Z test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 23%] 2025-12-04T12:25:43.2976884Z test_meta.py::TestMetaCUDA::test_meta_inplace_positive_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 23%] 2025-12-04T12:25:43.2976989Z test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_bfloat16 PASSED [0.8905s] [ 23%] 2025-12-04T12:25:43.2977090Z test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_complex32 PASSED [0.0100s] [ 23%] 2025-12-04T12:25:43.2977191Z test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_float16 PASSED [0.0056s] [ 23%] 2025-12-04T12:25:43.2977297Z test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_int8 PASSED [0.0054s] [ 23%] 2025-12-04T12:25:43.2977392Z test_meta.py::TestMetaCUDA::test_meta_inplace_pow_cuda_uint8 PASSED [0.0054s] [ 23%] 2025-12-04T12:25:43.2977540Z test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 23%] 2025-12-04T12:25:43.2977681Z test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 23%] 2025-12-04T12:25:43.2977820Z test_meta.py::TestMetaCUDA::test_meta_inplace_prod_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 23%] 2025-12-04T12:25:43.2977919Z test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_int16 PASSED [0.0145s] [ 23%] 2025-12-04T12:25:43.2978013Z test_meta.py::TestMetaCUDA::test_meta_inplace_put_cuda_int64 PASSED [0.0143s] [ 23%] 2025-12-04T12:25:43.2978192Z test_meta.py::TestMetaCUDA::test_meta_inplace_qr_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 23%] 2025-12-04T12:25:43.2978297Z test_meta.py::TestMetaCUDA::test_meta_inplace_rad2deg_cuda_bfloat16 PASSED [0.8678s] [ 23%] 2025-12-04T12:25:43.2978448Z test_meta.py::TestMetaCUDA::test_meta_inplace_rand_like_cuda_bfloat16 SKIPPED [0.0015s] (No inplace variable for this op) [ 23%] 2025-12-04T12:25:43.2978599Z test_meta.py::TestMetaCUDA::test_meta_inplace_rand_like_cuda_complex32 SKIPPED [0.0012s] (No inplace variable for this op) [ 23%] 2025-12-04T12:25:43.2978746Z test_meta.py::TestMetaCUDA::test_meta_inplace_rand_like_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 23%] 2025-12-04T12:25:43.2978907Z test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2979074Z test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2979220Z test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2979362Z test_meta.py::TestMetaCUDA::test_meta_inplace_randint_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2979527Z test_meta.py::TestMetaCUDA::test_meta_inplace_randint_like_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2979679Z test_meta.py::TestMetaCUDA::test_meta_inplace_randint_like_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2979832Z test_meta.py::TestMetaCUDA::test_meta_inplace_randint_like_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2979980Z test_meta.py::TestMetaCUDA::test_meta_inplace_randint_like_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2980125Z test_meta.py::TestMetaCUDA::test_meta_inplace_randn_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2980271Z test_meta.py::TestMetaCUDA::test_meta_inplace_randn_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2980426Z test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2980580Z test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2980736Z test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2980887Z test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2981036Z test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2981201Z test_meta.py::TestMetaCUDA::test_meta_inplace_randn_like_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2981348Z test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2981496Z test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2981634Z test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2981775Z test_meta.py::TestMetaCUDA::test_meta_inplace_ravel_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2981919Z test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2982061Z test_meta.py::TestMetaCUDA::test_meta_inplace_real_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2982272Z test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_complex64 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 24%] 2025-12-04T12:25:43.2982384Z test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_float64 PASSED [0.8819s] [ 24%] 2025-12-04T12:25:43.2982587Z test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_int32 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 24%] 2025-12-04T12:25:43.2982791Z test_meta.py::TestMetaCUDA::test_meta_inplace_reciprocal_cuda_int64 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 24%] 2025-12-04T12:25:43.2982910Z test_meta.py::TestMetaCUDA::test_meta_inplace_remainder_cuda_bfloat16 PASSED [0.0075s] [ 24%] 2025-12-04T12:25:43.2983025Z test_meta.py::TestMetaCUDA::test_meta_inplace_remainder_cuda_int8 PASSED [0.0061s] [ 24%] 2025-12-04T12:25:43.2983129Z test_meta.py::TestMetaCUDA::test_meta_inplace_renorm_cuda_complex64 PASSED [0.0206s] [ 24%] 2025-12-04T12:25:43.2983230Z test_meta.py::TestMetaCUDA::test_meta_inplace_renorm_cuda_float32 PASSED [0.0063s] [ 24%] 2025-12-04T12:25:43.2983396Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2983538Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2983682Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2983846Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2984009Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2984171Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 24%] 2025-12-04T12:25:43.2984333Z test_meta.py::TestMetaCUDA::test_meta_inplace_repeat_interleave_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2984484Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2984631Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2984785Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2984938Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2985085Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2985244Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2985393Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2985538Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_as_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2985685Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2985832Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2985978Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2986123Z test_meta.py::TestMetaCUDA::test_meta_inplace_reshape_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2986227Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_float64 PASSED [0.8567s] [ 25%] 2025-12-04T12:25:43.2986328Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_int32 PASSED [0.0047s] [ 25%] 2025-12-04T12:25:43.2986428Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_int64 PASSED [0.0035s] [ 25%] 2025-12-04T12:25:43.2986527Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize__cuda_int8 PASSED [0.8568s] [ 25%] 2025-12-04T12:25:43.2986630Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize_as__cuda_int32 PASSED [0.0048s] [ 25%] 2025-12-04T12:25:43.2986744Z test_meta.py::TestMetaCUDA::test_meta_inplace_resize_as__cuda_int64 PASSED [0.0036s] [ 25%] 2025-12-04T12:25:43.2986912Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2987068Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2987233Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2987383Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2987533Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2987683Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_conj_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2987833Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2987984Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2988192Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_int8 SKIPPED [0.0011s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2988341Z test_meta.py::TestMetaCUDA::test_meta_inplace_resolve_neg_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2988480Z test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2988629Z test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2988774Z test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2988917Z test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2989072Z test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2989211Z test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2989351Z test_meta.py::TestMetaCUDA::test_meta_inplace_roll_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2989491Z test_meta.py::TestMetaCUDA::test_meta_inplace_rot90_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 25%] 2025-12-04T12:25:43.2989591Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_float32 PASSED [0.8535s] [ 26%] 2025-12-04T12:25:43.2989691Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_cuda_int8 PASSED [0.0041s] [ 26%] 2025-12-04T12:25:43.2989808Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_0_cuda_bfloat16 PASSED [0.0040s] [ 26%] 2025-12-04T12:25:43.2989922Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_0_cuda_float16 PASSED [0.8669s] [ 26%] 2025-12-04T12:25:43.2990038Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_0_cuda_float64 PASSED [0.0050s] [ 26%] 2025-12-04T12:25:43.2990158Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_neg_3_cuda_bfloat16 PASSED [0.0037s] [ 26%] 2025-12-04T12:25:43.2990280Z test_meta.py::TestMetaCUDA::test_meta_inplace_round_decimals_neg_3_cuda_float64 PASSED [0.8757s] [ 26%] 2025-12-04T12:25:43.2990481Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_complex128 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 26%] 2025-12-04T12:25:43.2990581Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_float64 PASSED [0.8640s] [ 26%] 2025-12-04T12:25:43.2990792Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsqrt_cuda_int32 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 26%] 2025-12-04T12:25:43.2990955Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_complex128 SKIPPED [0.0012s] (No inplace variable for this op) [ 26%] 2025-12-04T12:25:43.2991093Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 26%] 2025-12-04T12:25:43.2991243Z test_meta.py::TestMetaCUDA::test_meta_inplace_rsub_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 26%] 2025-12-04T12:25:43.2991400Z test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 26%] 2025-12-04T12:25:43.2991556Z test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 26%] 2025-12-04T12:25:43.2991709Z test_meta.py::TestMetaCUDA::test_meta_inplace_scalar_tensor_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 26%] 2025-12-04T12:25:43.2991816Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_bool PASSED [0.0154s] [ 26%] 2025-12-04T12:25:43.2991926Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_add_cuda_float64 PASSED [0.0075s] [ 26%] 2025-12-04T12:25:43.2992025Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_bool PASSED [0.0117s] [ 26%] 2025-12-04T12:25:43.2992134Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_complex128 PASSED [0.0115s] [ 26%] 2025-12-04T12:25:43.2992236Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_float64 PASSED [0.0152s] [ 26%] 2025-12-04T12:25:43.2992334Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_int32 PASSED [0.0114s] [ 26%] 2025-12-04T12:25:43.2992431Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_cuda_uint8 PASSED [0.0114s] [ 26%] 2025-12-04T12:25:43.2992552Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_float32 PASSED [0.0131s] [ 26%] 2025-12-04T12:25:43.2992669Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_float64 PASSED [0.0131s] [ 26%] 2025-12-04T12:25:43.2992784Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_int16 PASSED [0.0130s] [ 26%] 2025-12-04T12:25:43.2992910Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amax_cuda_int8 PASSED [0.0132s] [ 26%] 2025-12-04T12:25:43.2993031Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amin_cuda_float16 PASSED [0.0131s] [ 26%] 2025-12-04T12:25:43.2993148Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amin_cuda_float64 PASSED [0.0132s] [ 26%] 2025-12-04T12:25:43.2993264Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amin_cuda_int16 PASSED [0.0131s] [ 26%] 2025-12-04T12:25:43.2993377Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_amin_cuda_int64 PASSED [0.0131s] [ 26%] 2025-12-04T12:25:43.2993498Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_bfloat16 PASSED [0.0140s] [ 26%] 2025-12-04T12:25:43.2993614Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_float16 PASSED [0.0140s] [ 26%] 2025-12-04T12:25:43.2993727Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_int16 PASSED [0.0140s] [ 26%] 2025-12-04T12:25:43.2993841Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_mean_cuda_uint8 PASSED [0.0138s] [ 26%] 2025-12-04T12:25:43.2993960Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_prod_cuda_float32 PASSED [0.0131s] [ 26%] 2025-12-04T12:25:43.2994071Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_prod_cuda_int16 PASSED [0.0130s] [ 27%] 2025-12-04T12:25:43.2994184Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_sum_cuda_int32 PASSED [0.0130s] [ 27%] 2025-12-04T12:25:43.2994297Z test_meta.py::TestMetaCUDA::test_meta_inplace_scatter_reduce_sum_cuda_int8 PASSED [0.0130s] [ 27%] 2025-12-04T12:25:43.2994469Z test_meta.py::TestMetaCUDA::test_meta_inplace_searchsorted_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2994634Z test_meta.py::TestMetaCUDA::test_meta_inplace_searchsorted_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2994787Z test_meta.py::TestMetaCUDA::test_meta_inplace_searchsorted_cuda_int32 SKIPPED [0.0011s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2994950Z test_meta.py::TestMetaCUDA::test_meta_inplace_searchsorted_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2995097Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2995245Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2995390Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2995533Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2995689Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2995843Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2996000Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2996153Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2996305Z test_meta.py::TestMetaCUDA::test_meta_inplace_select_scatter_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2996407Z test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_bfloat16 PASSED [0.8657s] [ 27%] 2025-12-04T12:25:43.2996502Z test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_bool PASSED [0.0042s] [ 27%] 2025-12-04T12:25:43.2996602Z test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_complex32 PASSED [0.8610s] [ 27%] 2025-12-04T12:25:43.2996714Z test_meta.py::TestMetaCUDA::test_meta_inplace_sgn_cuda_complex64 PASSED [0.0061s] [ 27%] 2025-12-04T12:25:43.2996860Z test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2997006Z test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2997144Z test_meta.py::TestMetaCUDA::test_meta_inplace_short_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2997250Z test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_bfloat16 PASSED [0.0043s] [ 27%] 2025-12-04T12:25:43.2997352Z test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_float32 PASSED [0.0038s] [ 27%] 2025-12-04T12:25:43.2997456Z test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_float64 PASSED [0.8964s] [ 27%] 2025-12-04T12:25:43.2997655Z test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_int16 SKIPPED [0.0014s] (Op promotes to float, which is impossible for inplace with non-float input) [ 27%] 2025-12-04T12:25:43.2997852Z test_meta.py::TestMetaCUDA::test_meta_inplace_sigmoid_cuda_int8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 27%] 2025-12-04T12:25:43.2997951Z test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_bfloat16 PASSED [0.8716s] [ 27%] 2025-12-04T12:25:43.2998047Z test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_bool PASSED [0.0041s] [ 27%] 2025-12-04T12:25:43.2998228Z test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_float64 PASSED [0.8488s] [ 27%] 2025-12-04T12:25:43.2998336Z test_meta.py::TestMetaCUDA::test_meta_inplace_sign_cuda_int8 PASSED [0.0042s] [ 27%] 2025-12-04T12:25:43.2998519Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_bartlett_cuda_float32 SKIPPED [0.0011s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2998689Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_blackman_cuda_float64 SKIPPED [0.0011s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2998867Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_gaussian_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2999034Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_gaussian_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 27%] 2025-12-04T12:25:43.2999210Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_general_cosine_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.2999374Z test_meta.py::TestMetaCUDA::test_meta_inplace_signal_windows_hamming_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.2999523Z test_meta.py::TestMetaCUDA::test_meta_inplace_signbit_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.2999670Z test_meta.py::TestMetaCUDA::test_meta_inplace_signbit_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.2999863Z test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_bool SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 28%] 2025-12-04T12:25:43.2999962Z test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_float16 PASSED [0.8713s] [ 28%] 2025-12-04T12:25:43.3000060Z test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_float64 PASSED [0.0040s] [ 28%] 2025-12-04T12:25:43.3000249Z test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_int64 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 28%] 2025-12-04T12:25:43.3000438Z test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_int8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 28%] 2025-12-04T12:25:43.3000629Z test_meta.py::TestMetaCUDA::test_meta_inplace_sin_cuda_uint8 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 28%] 2025-12-04T12:25:43.3000742Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinc_cuda_float16 PASSED [0.0058s] [ 28%] 2025-12-04T12:25:43.3000840Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinc_cuda_float32 PASSED [0.0050s] [ 28%] 2025-12-04T12:25:43.3000938Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_bfloat16 PASSED [0.8614s] [ 28%] 2025-12-04T12:25:43.3001136Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_complex64 SKIPPED [0.0014s] (Op promotes to float, which is impossible for inplace with non-float input) [ 28%] 2025-12-04T12:25:43.3001325Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_int16 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 28%] 2025-12-04T12:25:43.3001519Z test_meta.py::TestMetaCUDA::test_meta_inplace_sinh_cuda_int64 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 28%] 2025-12-04T12:25:43.3001669Z test_meta.py::TestMetaCUDA::test_meta_inplace_slice_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3001817Z test_meta.py::TestMetaCUDA::test_meta_inplace_slice_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3001962Z test_meta.py::TestMetaCUDA::test_meta_inplace_slice_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3002120Z test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3002286Z test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_float16 SKIPPED [0.0008s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3002449Z test_meta.py::TestMetaCUDA::test_meta_inplace_slice_scatter_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3002596Z test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3002759Z test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3002927Z test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3003085Z test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3003241Z test_meta.py::TestMetaCUDA::test_meta_inplace_softmax_with_dtype_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3003385Z test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3003524Z test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3003667Z test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3003812Z test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3003949Z test_meta.py::TestMetaCUDA::test_meta_inplace_sort_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3004092Z test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_mm_reduce_cuda_bfloat16 SKIPPED [0.0005s] (Only runs on cpu) [ 28%] 2025-12-04T12:25:43.3004229Z test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_mm_reduce_cuda_float32 SKIPPED [0.0005s] (Only runs on cpu) [ 28%] 2025-12-04T12:25:43.3004401Z test_meta.py::TestMetaCUDA::test_meta_inplace_sparse_sampled_addmm_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3004559Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3004727Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 28%] 2025-12-04T12:25:43.3004880Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3005034Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3005186Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_airy_ai_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3005340Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3005498Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3005654Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j0_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3005815Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3005968Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3006123Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_j1_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3006287Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y0_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3006456Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y0_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3006609Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y0_cuda_int64 SKIPPED [0.0011s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3006772Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_bessel_y1_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3006952Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3007127Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3007300Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_t_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3007474Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_bool SKIPPED [0.0008s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3007652Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3007825Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3007995Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3008209Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_u_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3008390Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_float64 SKIPPED [0.0008s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3008565Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3008764Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_int32 SKIPPED [0.0008s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3008937Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3009109Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3009282Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_v_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3009454Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_w_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3009627Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_chebyshev_polynomial_w_cuda_int8 SKIPPED [0.0008s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3009778Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3009929Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3010078Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_entr_cuda_uint8 SKIPPED [0.0008s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3010235Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3010399Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3010564Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_erfcx_cuda_int8 SKIPPED [0.0008s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3010737Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 29%] 2025-12-04T12:25:43.3010919Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_h_cuda_int8 SKIPPED [0.0008s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3011097Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_he_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3011272Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_he_cuda_float64 SKIPPED [0.0008s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3011443Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_hermite_polynomial_he_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3011597Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_float64 SKIPPED [0.0008s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3011748Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3011896Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3012044Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3012190Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i0e_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3012343Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3012491Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3012641Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3012806Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3012959Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3013120Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3013270Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3013418Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_i1e_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3013594Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_laguerre_polynomial_l_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3013772Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_laguerre_polynomial_l_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3013945Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_laguerre_polynomial_l_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3014116Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_legendre_polynomial_p_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3014272Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_log_ndtr_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3014438Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_log_ndtr_cuda_int32 SKIPPED [0.0008s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3014600Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_log_ndtr_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3014755Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_log_ndtr_cuda_uint8 SKIPPED [0.0008s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3014932Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i0_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3015097Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3015264Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_i1_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3015430Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3015597Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3015762Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3015929Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k0_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3016093Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3016262Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3016430Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3016595Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 30%] 2025-12-04T12:25:43.3016771Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_modified_bessel_k1_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3016925Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3017075Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3017223Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtr_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3017374Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_bool SKIPPED [0.0008s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3017530Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3017683Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3017834Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3017987Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_ndtri_cuda_uint8 SKIPPED [0.0008s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3018230Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3018435Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3018633Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3018820Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_polygamma_special_polygamma_n_0_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3019012Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k0_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3019185Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k1_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3019363Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k1_cuda_float32 SKIPPED [0.0008s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3019541Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k1_cuda_float64 SKIPPED [0.0008s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3019717Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_scaled_modified_bessel_k1_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3019902Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3020088Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3020273Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_t_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3020459Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3020643Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_int64 SKIPPED [0.0008s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3020826Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_u_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3021031Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3021216Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3021399Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_v_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3021586Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3021770Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3021953Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_shifted_chebyshev_polynomial_w_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3022122Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3022290Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_spherical_bessel_j0_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3022445Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3022611Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3022775Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3022931Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3023101Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_xlog1py_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 31%] 2025-12-04T12:25:43.3023253Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3023403Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3023553Z test_meta.py::TestMetaCUDA::test_meta_inplace_special_zeta_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3023703Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3023851Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3023996Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3024138Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3024291Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3024445Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3024598Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_list_args_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3024758Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3024934Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3025100Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3025261Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3025422Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3025582Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3025739Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_copy_cuda_int8 SKIPPED [0.0008s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3025899Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3026058Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3026209Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3026361Z test_meta.py::TestMetaCUDA::test_meta_inplace_split_with_sizes_cuda_uint8 SKIPPED [0.0008s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3026572Z test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_complex128 SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 32%] 2025-12-04T12:25:43.3026685Z test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_float16 PASSED [0.8603s] [ 32%] 2025-12-04T12:25:43.3026879Z test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_int32 SKIPPED [0.0014s] (Op promotes to float, which is impossible for inplace with non-float input) [ 32%] 2025-12-04T12:25:43.3027082Z test_meta.py::TestMetaCUDA::test_meta_inplace_sqrt_cuda_uint8 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 32%] 2025-12-04T12:25:43.3027180Z test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_bool XFAIL [0.0038s] [ 32%] 2025-12-04T12:25:43.3027287Z test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_complex128 PASSED [0.8655s] [ 32%] 2025-12-04T12:25:43.3027388Z test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_float16 PASSED [0.0039s] [ 32%] 2025-12-04T12:25:43.3027490Z test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_float32 PASSED [0.8705s] [ 32%] 2025-12-04T12:25:43.3027587Z test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_int8 PASSED [0.0055s] [ 32%] 2025-12-04T12:25:43.3027685Z test_meta.py::TestMetaCUDA::test_meta_inplace_square_cuda_uint8 PASSED [0.0040s] [ 32%] 2025-12-04T12:25:43.3027837Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_copy_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3027994Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_copy_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3028190Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_copy_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3028340Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 32%] 2025-12-04T12:25:43.3028445Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_bfloat16 PASSED [0.0051s] [ 32%] 2025-12-04T12:25:43.3028554Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_complex128 PASSED [0.0050s] [ 33%] 2025-12-04T12:25:43.3028659Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_complex32 PASSED [0.0049s] [ 33%] 2025-12-04T12:25:43.3028780Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_complex64 PASSED [0.0050s] [ 33%] 2025-12-04T12:25:43.3028883Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_int32 PASSED [0.0049s] [ 33%] 2025-12-04T12:25:43.3028981Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_cuda_int64 PASSED [0.0049s] [ 33%] 2025-12-04T12:25:43.3029096Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_float64 PASSED [0.0042s] [ 33%] 2025-12-04T12:25:43.3029206Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_int32 PASSED [0.8614s] [ 33%] 2025-12-04T12:25:43.3029316Z test_meta.py::TestMetaCUDA::test_meta_inplace_squeeze_multiple_cuda_int8 PASSED [0.0059s] [ 33%] 2025-12-04T12:25:43.3029458Z test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3029604Z test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3031819Z test_meta.py::TestMetaCUDA::test_meta_inplace_stack_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3031962Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3032100Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3032248Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3032396Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3032560Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3032721Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3032898Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_mean_unbiased_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3033066Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_unbiased_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3033220Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_unbiased_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3033374Z test_meta.py::TestMetaCUDA::test_meta_inplace_std_unbiased_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3033518Z test_meta.py::TestMetaCUDA::test_meta_inplace_stft_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3033619Z test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_bfloat16 PASSED [0.0061s] [ 33%] 2025-12-04T12:25:43.3033719Z test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_complex64 PASSED [0.0060s] [ 33%] 2025-12-04T12:25:43.3033817Z test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_float16 PASSED [0.0059s] [ 33%] 2025-12-04T12:25:43.3033915Z test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_float64 PASSED [0.0058s] [ 33%] 2025-12-04T12:25:43.3034008Z test_meta.py::TestMetaCUDA::test_meta_inplace_sub_cuda_int8 PASSED [0.0058s] [ 33%] 2025-12-04T12:25:43.3034153Z test_meta.py::TestMetaCUDA::test_meta_inplace_sum_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3034290Z test_meta.py::TestMetaCUDA::test_meta_inplace_sum_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3034430Z test_meta.py::TestMetaCUDA::test_meta_inplace_sum_cuda_uint8 SKIPPED [0.0011s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3034580Z test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3034726Z test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3034870Z test_meta.py::TestMetaCUDA::test_meta_inplace_sum_to_size_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3035017Z test_meta.py::TestMetaCUDA::test_meta_inplace_svd_cuda_complex128 SKIPPED [0.0011s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3035156Z test_meta.py::TestMetaCUDA::test_meta_inplace_svd_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3035297Z test_meta.py::TestMetaCUDA::test_meta_inplace_svd_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 33%] 2025-12-04T12:25:43.3035449Z test_meta.py::TestMetaCUDA::test_meta_inplace_svd_lowrank_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3035596Z test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3035793Z test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3035940Z test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3036084Z test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3036226Z test_meta.py::TestMetaCUDA::test_meta_inplace_t_copy_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3036339Z test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_complex64 PASSED [0.0032s] [ 34%] 2025-12-04T12:25:43.3036441Z test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_int8 PASSED [0.8669s] [ 34%] 2025-12-04T12:25:43.3036535Z test_meta.py::TestMetaCUDA::test_meta_inplace_t_cuda_uint8 PASSED [0.0049s] [ 34%] 2025-12-04T12:25:43.3036695Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_complex64 SKIPPED [0.0011s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3036864Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_float16 SKIPPED [0.0011s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3037015Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_along_dim_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3037156Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3037293Z test_meta.py::TestMetaCUDA::test_meta_inplace_take_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3037396Z test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_bfloat16 PASSED [0.8842s] [ 34%] 2025-12-04T12:25:43.3037596Z test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_complex32 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T12:25:43.3037788Z test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_int32 SKIPPED [0.0013s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T12:25:43.3037981Z test_meta.py::TestMetaCUDA::test_meta_inplace_tan_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T12:25:43.3038219Z test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_complex32 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T12:25:43.3038323Z test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_float32 PASSED [0.8719s] [ 34%] 2025-12-04T12:25:43.3038422Z test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_float64 PASSED [0.8711s] [ 34%] 2025-12-04T12:25:43.3038615Z test_meta.py::TestMetaCUDA::test_meta_inplace_tanh_cuda_int16 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 34%] 2025-12-04T12:25:43.3038775Z test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_bfloat16 SKIPPED [0.0011s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3038933Z test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3039082Z test_meta.py::TestMetaCUDA::test_meta_inplace_tensor_split_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3039236Z test_meta.py::TestMetaCUDA::test_meta_inplace_tensordot_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3039383Z test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3039530Z test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3039672Z test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3039838Z test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3039979Z test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3040116Z test_meta.py::TestMetaCUDA::test_meta_inplace_tile_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3040258Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3040425Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_complex128 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3040577Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_int32 SKIPPED [0.0008s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3040713Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_cuda_int8 SKIPPED [0.0008s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3040881Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_bfloat16 SKIPPED [0.0010s] (No inplace variable for this op) [ 34%] 2025-12-04T12:25:43.3041024Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3041178Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3041328Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3041478Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3041621Z test_meta.py::TestMetaCUDA::test_meta_inplace_to_sparse_cuda_int8 SKIPPED [0.0008s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3041811Z test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__flash_attention_forward_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3041999Z test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__flash_attention_forward_cuda_float16 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3042184Z test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3042363Z test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3042543Z test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3042722Z test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3042899Z test_meta.py::TestMetaCUDA::test_meta_inplace_torch_ops_aten__safe_softmax_default_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3043040Z test_meta.py::TestMetaCUDA::test_meta_inplace_trace_cuda_bool SKIPPED [0.0008s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3043180Z test_meta.py::TestMetaCUDA::test_meta_inplace_trace_cuda_int64 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3043335Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3043487Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3043597Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_bfloat16 PASSED [0.8742s] [ 35%] 2025-12-04T12:25:43.3043722Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_complex128 PASSED [0.0067s] [ 35%] 2025-12-04T12:25:43.3043834Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_complex32 PASSED [0.8714s] [ 35%] 2025-12-04T12:25:43.3043939Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_float64 PASSED [0.0066s] [ 35%] 2025-12-04T12:25:43.3044042Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_int32 PASSED [0.8665s] [ 35%] 2025-12-04T12:25:43.3044143Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_int8 PASSED [0.0065s] [ 35%] 2025-12-04T12:25:43.3044256Z test_meta.py::TestMetaCUDA::test_meta_inplace_transpose_cuda_uint8 PASSED [0.8652s] [ 35%] 2025-12-04T12:25:43.3044417Z test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_complex128 SKIPPED [0.0015s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3044567Z test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_complex64 SKIPPED [0.0012s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3044714Z test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3044867Z test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3045012Z test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3045152Z test_meta.py::TestMetaCUDA::test_meta_inplace_trapz_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3045314Z test_meta.py::TestMetaCUDA::test_meta_inplace_triangular_solve_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3045415Z test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_bfloat16 PASSED [0.0147s] [ 35%] 2025-12-04T12:25:43.3045519Z test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_complex128 PASSED [0.0076s] [ 35%] 2025-12-04T12:25:43.3045619Z test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_complex32 PASSED [0.0075s] [ 35%] 2025-12-04T12:25:43.3045720Z test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_float32 PASSED [0.0074s] [ 35%] 2025-12-04T12:25:43.3045816Z test_meta.py::TestMetaCUDA::test_meta_inplace_tril_cuda_int32 PASSED [0.8664s] [ 35%] 2025-12-04T12:25:43.3045967Z test_meta.py::TestMetaCUDA::test_meta_inplace_tril_indices_cuda_int32 SKIPPED [0.0015s] (No inplace variable for this op) [ 35%] 2025-12-04T12:25:43.3046117Z test_meta.py::TestMetaCUDA::test_meta_inplace_tril_indices_cuda_int64 SKIPPED [0.0012s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3046220Z test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_complex128 PASSED [0.0090s] [ 36%] 2025-12-04T12:25:43.3046321Z test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_complex64 PASSED [0.0076s] [ 36%] 2025-12-04T12:25:43.3046417Z test_meta.py::TestMetaCUDA::test_meta_inplace_triu_cuda_int8 PASSED [0.8681s] [ 36%] 2025-12-04T12:25:43.3046631Z test_meta.py::TestMetaCUDA::test_meta_inplace_true_divide_cuda_complex32 SKIPPED [0.0015s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.3046833Z test_meta.py::TestMetaCUDA::test_meta_inplace_true_divide_cuda_int32 SKIPPED [0.0012s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.3047034Z test_meta.py::TestMetaCUDA::test_meta_inplace_true_divide_cuda_uint8 SKIPPED [0.0010s] (Op promotes to float, which is impossible for inplace with non-float input) [ 36%] 2025-12-04T12:25:43.3047134Z test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_bfloat16 PASSED [0.8711s] [ 36%] 2025-12-04T12:25:43.3047233Z test_meta.py::TestMetaCUDA::test_meta_inplace_trunc_cuda_int8 PASSED [0.0040s] [ 36%] 2025-12-04T12:25:43.3047381Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_bool SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3047549Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_complex32 SKIPPED [0.0011s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3047705Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3047857Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3048003Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_copy_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3048210Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3048364Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3048516Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3048666Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3048825Z test_meta.py::TestMetaCUDA::test_meta_inplace_unbind_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3048977Z test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3049125Z test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3049275Z test_meta.py::TestMetaCUDA::test_meta_inplace_unflatten_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3049430Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_copy_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3049577Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3049717Z test_meta.py::TestMetaCUDA::test_meta_inplace_unfold_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3049877Z test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3050039Z test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3050198Z test_meta.py::TestMetaCUDA::test_meta_inplace_unique_consecutive_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3050341Z test_meta.py::TestMetaCUDA::test_meta_inplace_unique_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3050496Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3050647Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3050800Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3050952Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_chunk_cuda_int64 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3051101Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3051252Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_int8 SKIPPED [0.0010s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3051400Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsafe_split_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 36%] 2025-12-04T12:25:43.3051578Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3051741Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3051894Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3052045Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_copy_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3052166Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_float64 PASSED [0.0055s] [ 37%] 2025-12-04T12:25:43.3052278Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_int64 PASSED [0.0055s] [ 37%] 2025-12-04T12:25:43.3052380Z test_meta.py::TestMetaCUDA::test_meta_inplace_unsqueeze_cuda_int8 PASSED [0.0053s] [ 37%] 2025-12-04T12:25:43.3052522Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3052674Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3052821Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_mean_cuda_float64 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3052973Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_unbiased_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3053130Z test_meta.py::TestMetaCUDA::test_meta_inplace_var_unbiased_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3053285Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_complex_cuda_float32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3053435Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_complex32 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3053584Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3053728Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3053868Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3054022Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_as_real_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3054166Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_copy_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3054313Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3054455Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3054595Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_int16 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3054734Z test_meta.py::TestMetaCUDA::test_meta_inplace_view_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3054881Z test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3055021Z test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_bool SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3055170Z test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_complex32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3055317Z test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3055485Z test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3055628Z test_meta.py::TestMetaCUDA::test_meta_inplace_vsplit_cuda_int32 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3055775Z test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3055925Z test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3056083Z test_meta.py::TestMetaCUDA::test_meta_inplace_vstack_cuda_complex64 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3056240Z test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_complex64 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3056383Z test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_float16 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3056524Z test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_int32 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3056673Z test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3056815Z test_meta.py::TestMetaCUDA::test_meta_inplace_where_cuda_uint8 SKIPPED [0.0010s] (No inplace variable for this op) [ 37%] 2025-12-04T12:25:43.3057010Z test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_bool SKIPPED [0.0009s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.3057111Z test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_float16 PASSED [0.0305s] [ 38%] 2025-12-04T12:25:43.3057307Z test_meta.py::TestMetaCUDA::test_meta_inplace_xlogy_cuda_uint8 SKIPPED [0.0011s] (Op promotes to float, which is impossible for inplace with non-float input) [ 38%] 2025-12-04T12:25:43.3057415Z test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_complex64 PASSED [0.8553s] [ 38%] 2025-12-04T12:25:43.3059445Z test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_float64 PASSED [0.0047s] [ 38%] 2025-12-04T12:25:43.3059553Z test_meta.py::TestMetaCUDA::test_meta_inplace_zero__cuda_uint8 PASSED [0.0035s] [ 38%] 2025-12-04T12:25:43.3059702Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_bfloat16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.3059848Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_float64 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.3059989Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_int16 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.3060131Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_cuda_int8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.3060286Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_like_cuda_complex128 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.3060439Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_like_cuda_float32 SKIPPED [0.0010s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.3060585Z test_meta.py::TestMetaCUDA::test_meta_inplace_zeros_like_cuda_uint8 SKIPPED [0.0009s] (No inplace variable for this op) [ 38%] 2025-12-04T12:25:43.3060686Z test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_bfloat16 PASSED [0.0027s] [ 38%] 2025-12-04T12:25:43.3060784Z test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_complex128 PASSED [0.8564s] [ 38%] 2025-12-04T12:25:43.3060880Z test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_int16 PASSED [0.0039s] [ 38%] 2025-12-04T12:25:43.3060975Z test_meta.py::TestMetaCUDA::test_meta_outplace_H_cuda_uint8 PASSED [0.8644s] [ 38%] 2025-12-04T12:25:43.3061076Z test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_complex64 PASSED [0.0042s] [ 38%] 2025-12-04T12:25:43.3061168Z test_meta.py::TestMetaCUDA::test_meta_outplace_T_cuda_int32 PASSED [0.8664s] [ 38%] 2025-12-04T12:25:43.3061310Z test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_complex64 PASSED [0.0101s] [ 38%] 2025-12-04T12:25:43.3061420Z test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_float32 PASSED [0.0085s] [ 38%] 2025-12-04T12:25:43.3061526Z test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_int32 PASSED [0.0083s] [ 38%] 2025-12-04T12:25:43.3061630Z test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_int64 PASSED [0.0083s] [ 38%] 2025-12-04T12:25:43.3061735Z test_meta.py::TestMetaCUDA::test_meta_outplace___getitem___cuda_uint8 PASSED [0.0083s] [ 38%] 2025-12-04T12:25:43.3061859Z test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_complex128 PASSED [0.0060s] [ 38%] 2025-12-04T12:25:43.3061981Z test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_complex64 PASSED [0.0059s] [ 38%] 2025-12-04T12:25:43.3062083Z test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_float64 PASSED [0.0058s] [ 38%] 2025-12-04T12:25:43.3062184Z test_meta.py::TestMetaCUDA::test_meta_outplace___radd___cuda_int8 PASSED [0.0058s] [ 38%] 2025-12-04T12:25:43.3062303Z test_meta.py::TestMetaCUDA::test_meta_outplace___rand___cuda_bool PASSED [0.0165s] [ 38%] 2025-12-04T12:25:43.3062404Z test_meta.py::TestMetaCUDA::test_meta_outplace___rand___cuda_int32 PASSED [0.0060s] [ 38%] 2025-12-04T12:25:43.3062501Z test_meta.py::TestMetaCUDA::test_meta_outplace___rand___cuda_int8 PASSED [0.0058s] [ 38%] 2025-12-04T12:25:43.3062600Z test_meta.py::TestMetaCUDA::test_meta_outplace___rand___cuda_uint8 PASSED [0.0058s] [ 38%] 2025-12-04T12:25:43.3062705Z test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_bfloat16 PASSED [0.0068s] [ 38%] 2025-12-04T12:25:43.3062802Z test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_bool PASSED [0.0066s] [ 38%] 2025-12-04T12:25:43.3062909Z test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_complex64 PASSED [0.0066s] [ 38%] 2025-12-04T12:25:43.3063012Z test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_float16 PASSED [0.0066s] [ 38%] 2025-12-04T12:25:43.3063114Z test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_float32 PASSED [0.0065s] [ 39%] 2025-12-04T12:25:43.3063214Z test_meta.py::TestMetaCUDA::test_meta_outplace___rdiv___cuda_int16 PASSED [0.0065s] [ 39%] 2025-12-04T12:25:43.3063328Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmatmul___cuda_complex64 PASSED [1.0247s] [ 39%] 2025-12-04T12:25:43.3063434Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmatmul___cuda_float16 PASSED [1.5316s] [ 39%] 2025-12-04T12:25:43.3063543Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmatmul___cuda_float64 PASSED [0.9456s] [ 39%] 2025-12-04T12:25:43.3063646Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmod___cuda_float32 PASSED [0.0081s] [ 39%] 2025-12-04T12:25:43.3063752Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmod___cuda_float64 PASSED [0.0061s] [ 39%] 2025-12-04T12:25:43.3063850Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmod___cuda_int32 PASSED [0.0059s] [ 39%] 2025-12-04T12:25:43.3063953Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmod___cuda_uint8 PASSED [0.0059s] [ 39%] 2025-12-04T12:25:43.3064060Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_complex128 PASSED [0.0059s] [ 39%] 2025-12-04T12:25:43.3064168Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_complex64 PASSED [0.0059s] [ 39%] 2025-12-04T12:25:43.3064268Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_float64 PASSED [0.0059s] [ 39%] 2025-12-04T12:25:43.3064368Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_int16 PASSED [0.0058s] [ 39%] 2025-12-04T12:25:43.3064466Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_int64 PASSED [0.0058s] [ 39%] 2025-12-04T12:25:43.3064567Z test_meta.py::TestMetaCUDA::test_meta_outplace___rmul___cuda_uint8 PASSED [0.0058s] [ 39%] 2025-12-04T12:25:43.3064668Z test_meta.py::TestMetaCUDA::test_meta_outplace___ror___cuda_int16 PASSED [0.0059s] [ 39%] 2025-12-04T12:25:43.3064779Z test_meta.py::TestMetaCUDA::test_meta_outplace___ror___cuda_int32 PASSED [0.0058s] [ 39%] 2025-12-04T12:25:43.3064877Z test_meta.py::TestMetaCUDA::test_meta_outplace___ror___cuda_int8 PASSED [0.0058s] [ 39%] 2025-12-04T12:25:43.3064985Z test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_complex64 PASSED [0.0060s] [ 39%] 2025-12-04T12:25:43.3065089Z test_meta.py::TestMetaCUDA::test_meta_outplace___rpow___cuda_float16 PASSED [0.0060s] [ 39%] 2025-12-04T12:25:43.3065196Z test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_complex128 PASSED [0.0060s] [ 39%] 2025-12-04T12:25:43.3065309Z test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_complex64 PASSED [0.0059s] [ 39%] 2025-12-04T12:25:43.3065418Z test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_int16 PASSED [0.0059s] [ 39%] 2025-12-04T12:25:43.3065528Z test_meta.py::TestMetaCUDA::test_meta_outplace___rsub___cuda_int32 PASSED [0.0059s] [ 39%] 2025-12-04T12:25:43.3065626Z test_meta.py::TestMetaCUDA::test_meta_outplace___rxor___cuda_bool PASSED [0.0058s] [ 39%] 2025-12-04T12:25:43.3065728Z test_meta.py::TestMetaCUDA::test_meta_outplace___rxor___cuda_int32 PASSED [0.0058s] [ 39%] 2025-12-04T12:25:43.3065838Z test_meta.py::TestMetaCUDA::test_meta_outplace___rxor___cuda_int8 PASSED [0.0058s] [ 39%] 2025-12-04T12:25:43.3065943Z test_meta.py::TestMetaCUDA::test_meta_outplace___rxor___cuda_uint8 PASSED [0.0061s] [ 39%] 2025-12-04T12:25:43.3066050Z test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_bfloat16 PASSED [0.0243s] [ 39%] 2025-12-04T12:25:43.3066154Z test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_bool PASSED [0.0112s] [ 39%] 2025-12-04T12:25:43.3066266Z test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_complex128 PASSED [0.0111s] [ 39%] 2025-12-04T12:25:43.3066379Z test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_complex32 PASSED [0.0111s] [ 39%] 2025-12-04T12:25:43.3066488Z test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_float16 PASSED [0.0109s] [ 39%] 2025-12-04T12:25:43.3066593Z test_meta.py::TestMetaCUDA::test_meta_outplace__chunk_cat_cuda_int16 PASSED [0.0110s] [ 39%] 2025-12-04T12:25:43.3066702Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_bfloat16 PASSED [0.0232s] [ 39%] 2025-12-04T12:25:43.3066818Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_complex64 PASSED [0.0100s] [ 39%] 2025-12-04T12:25:43.3066922Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_abs_cuda_int8 PASSED [0.0078s] [ 40%] 2025-12-04T12:25:43.3067034Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_complex64 PASSED [0.0079s] [ 40%] 2025-12-04T12:25:43.3067144Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_float16 PASSED [0.0079s] [ 40%] 2025-12-04T12:25:43.3067254Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_acos_cuda_uint8 PASSED [0.0124s] [ 40%] 2025-12-04T12:25:43.3067365Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_bfloat16 XFAIL [0.0453s] [ 40%] 2025-12-04T12:25:43.3067472Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_float32 XFAIL [0.8902s] [ 40%] 2025-12-04T12:25:43.3067582Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_int16 XFAIL [0.8816s] [ 40%] 2025-12-04T12:25:43.3067687Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_int32 XFAIL [0.8883s] [ 40%] 2025-12-04T12:25:43.3067794Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_int8 XFAIL [0.8758s] [ 40%] 2025-12-04T12:25:43.3067903Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_add_cuda_uint8 XFAIL [0.8744s] [ 40%] 2025-12-04T12:25:43.3068027Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_complex128 PASSED [0.9898s] [ 40%] 2025-12-04T12:25:43.3068190Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_float16 PASSED [0.0978s] [ 40%] 2025-12-04T12:25:43.3068302Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_int16 XFAIL [0.0058s] [ 40%] 2025-12-04T12:25:43.3068427Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcdiv_cuda_int8 XFAIL [0.8734s] [ 40%] 2025-12-04T12:25:43.3068547Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_complex64 PASSED [0.9951s] [ 40%] 2025-12-04T12:25:43.3068661Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_float64 PASSED [0.0960s] [ 40%] 2025-12-04T12:25:43.3068778Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_int16 PASSED [0.0944s] [ 40%] 2025-12-04T12:25:43.3068893Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_addcmul_cuda_int64 PASSED [0.0672s] [ 40%] 2025-12-04T12:25:43.3069025Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_complex64 PASSED [0.0082s] [ 40%] 2025-12-04T12:25:43.3069148Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_float16 PASSED [0.0081s] [ 40%] 2025-12-04T12:25:43.3069259Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_int64 PASSED [0.0125s] [ 40%] 2025-12-04T12:25:43.3069368Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_asin_cuda_int8 PASSED [0.0080s] [ 40%] 2025-12-04T12:25:43.3069479Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_float16 PASSED [0.0081s] [ 40%] 2025-12-04T12:25:43.3069610Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_atan_cuda_int8 PASSED [0.0107s] [ 40%] 2025-12-04T12:25:43.3069726Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_complex128 XFAIL [0.0036s] [ 40%] 2025-12-04T12:25:43.3069839Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_complex64 XFAIL [0.8705s] [ 40%] 2025-12-04T12:25:43.3069950Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_float16 PASSED [0.8743s] [ 40%] 2025-12-04T12:25:43.3070062Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_ceil_cuda_int64 PASSED [0.0084s] [ 40%] 2025-12-04T12:25:43.3070175Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_bool XFAIL [0.0424s] [ 40%] 2025-12-04T12:25:43.3070291Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_int16 PASSED [0.9358s] [ 40%] 2025-12-04T12:25:43.3070404Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_int32 PASSED [0.0748s] [ 40%] 2025-12-04T12:25:43.3070520Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_max_cuda_uint8 PASSED [0.0748s] [ 40%] 2025-12-04T12:25:43.3070638Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_complex64 XFAIL [0.0046s] [ 40%] 2025-12-04T12:25:43.3070757Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_float32 PASSED [0.9740s] [ 40%] 2025-12-04T12:25:43.3070868Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_int16 PASSED [0.0753s] [ 40%] 2025-12-04T12:25:43.3070983Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_int32 PASSED [0.0754s] [ 40%] 2025-12-04T12:25:43.3071092Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_int8 PASSED [0.0751s] [ 40%] 2025-12-04T12:25:43.3071206Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_clamp_min_cuda_uint8 PASSED [0.0750s] [ 41%] 2025-12-04T12:25:43.3071318Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_bfloat16 PASSED [0.0090s] [ 41%] 2025-12-04T12:25:43.3071434Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_complex128 PASSED [0.0091s] [ 41%] 2025-12-04T12:25:43.3071545Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_float32 PASSED [0.0089s] [ 41%] 2025-12-04T12:25:43.3071656Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_int16 PASSED [0.0088s] [ 41%] 2025-12-04T12:25:43.3071764Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_int32 PASSED [0.0089s] [ 41%] 2025-12-04T12:25:43.3071872Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_copy_cuda_int64 PASSED [0.0089s] [ 41%] 2025-12-04T12:25:43.3071976Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_bool PASSED [0.0122s] [ 41%] 2025-12-04T12:25:43.3072098Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_int32 PASSED [0.0081s] [ 41%] 2025-12-04T12:25:43.3072207Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cos_cuda_int64 PASSED [0.0079s] [ 41%] 2025-12-04T12:25:43.3072317Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_bool PASSED [0.0114s] [ 41%] 2025-12-04T12:25:43.3072430Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_complex128 PASSED [0.0079s] [ 41%] 2025-12-04T12:25:43.3072541Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_complex64 PASSED [0.0080s] [ 41%] 2025-12-04T12:25:43.3072659Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_cosh_cuda_int32 PASSED [0.0080s] [ 41%] 2025-12-04T12:25:43.3072776Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_bool PASSED [0.0418s] [ 41%] 2025-12-04T12:25:43.3072887Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_div_cuda_float16 PASSED [0.0594s] [ 41%] 2025-12-04T12:25:43.3072997Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_float64 PASSED [0.0079s] [ 41%] 2025-12-04T12:25:43.3073104Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erf_cuda_int16 PASSED [0.0081s] [ 41%] 2025-12-04T12:25:43.3073220Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_bool PASSED [0.0089s] [ 41%] 2025-12-04T12:25:43.3073332Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_float32 PASSED [0.0078s] [ 41%] 2025-12-04T12:25:43.3073439Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_int16 PASSED [0.0079s] [ 41%] 2025-12-04T12:25:43.3073549Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_int32 PASSED [0.0080s] [ 41%] 2025-12-04T12:25:43.3073656Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_erfc_cuda_int8 PASSED [0.0079s] [ 41%] 2025-12-04T12:25:43.3073772Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_complex64 PASSED [0.0080s] [ 41%] 2025-12-04T12:25:43.3073879Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_float64 PASSED [0.0078s] [ 41%] 2025-12-04T12:25:43.3073987Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_int16 PASSED [0.0081s] [ 41%] 2025-12-04T12:25:43.3074092Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_int32 PASSED [0.0080s] [ 41%] 2025-12-04T12:25:43.3074197Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_exp_cuda_int64 PASSED [0.0080s] [ 41%] 2025-12-04T12:25:43.3074308Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_bfloat16 PASSED [0.0080s] [ 41%] 2025-12-04T12:25:43.3074422Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_float64 PASSED [0.0079s] [ 41%] 2025-12-04T12:25:43.3074531Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_int32 PASSED [0.0079s] [ 41%] 2025-12-04T12:25:43.3074641Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_int8 PASSED [0.0080s] [ 41%] 2025-12-04T12:25:43.3074749Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_expm1_cuda_uint8 PASSED [0.0080s] [ 41%] 2025-12-04T12:25:43.3074864Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_complex128 XFAIL [0.0035s] [ 41%] 2025-12-04T12:25:43.3074978Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_floor_cuda_complex64 XFAIL [0.8756s] [ 41%] 2025-12-04T12:25:43.3075087Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_bfloat16 PASSED [0.8764s] [ 41%] 2025-12-04T12:25:43.3075194Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_bool XFAIL [0.0039s] [ 42%] 2025-12-04T12:25:43.3075305Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_complex128 XFAIL [0.8650s] [ 42%] 2025-12-04T12:25:43.3075421Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_complex64 XFAIL [0.8749s] [ 42%] 2025-12-04T12:25:43.3075533Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_float16 PASSED [0.8995s] [ 42%] 2025-12-04T12:25:43.3075639Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_int16 XFAIL [0.0040s] [ 42%] 2025-12-04T12:25:43.3075757Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_frac_cuda_uint8 XFAIL [0.8667s] [ 42%] 2025-12-04T12:25:43.3075862Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_bool XFAIL [0.8707s] [ 42%] 2025-12-04T12:25:43.3075975Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_complex128 PASSED [0.9719s] [ 42%] 2025-12-04T12:25:43.3076087Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_float16 PASSED [0.0595s] [ 42%] 2025-12-04T12:25:43.3076194Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_float32 PASSED [0.0594s] [ 42%] 2025-12-04T12:25:43.3076314Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_int32 XFAIL [0.0054s] [ 42%] 2025-12-04T12:25:43.3076432Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lerp_cuda_int8 XFAIL [1.2988s] [ 42%] 2025-12-04T12:25:43.3076548Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_float16 PASSED [1.5191s] [ 42%] 2025-12-04T12:25:43.3076660Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_float32 PASSED [0.0084s] [ 42%] 2025-12-04T12:25:43.3076783Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_int32 PASSED [0.0097s] [ 42%] 2025-12-04T12:25:43.3076893Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_lgamma_cuda_uint8 PASSED [0.0080s] [ 42%] 2025-12-04T12:25:43.3077002Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_bool PASSED [0.0080s] [ 42%] 2025-12-04T12:25:43.3077117Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_complex128 PASSED [0.0079s] [ 42%] 2025-12-04T12:25:43.3077232Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_complex64 PASSED [0.0079s] [ 42%] 2025-12-04T12:25:43.3077342Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_float32 PASSED [0.0077s] [ 42%] 2025-12-04T12:25:43.3077451Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_int16 PASSED [0.0080s] [ 42%] 2025-12-04T12:25:43.3077560Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_int32 PASSED [0.0079s] [ 42%] 2025-12-04T12:25:43.3077669Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log10_cuda_int64 PASSED [0.0079s] [ 42%] 2025-12-04T12:25:43.3077779Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log1p_cuda_int32 PASSED [0.0081s] [ 42%] 2025-12-04T12:25:43.3077890Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_bfloat16 PASSED [0.0080s] [ 42%] 2025-12-04T12:25:43.3077997Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_bool PASSED [0.0078s] [ 42%] 2025-12-04T12:25:43.3078139Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_complex128 PASSED [0.0079s] [ 42%] 2025-12-04T12:25:43.3078254Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_complex64 PASSED [0.0078s] [ 42%] 2025-12-04T12:25:43.3078360Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log2_cuda_int8 PASSED [0.0079s] [ 42%] 2025-12-04T12:25:43.3078470Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_bfloat16 PASSED [0.0080s] [ 42%] 2025-12-04T12:25:43.3078587Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_complex128 PASSED [0.0078s] [ 42%] 2025-12-04T12:25:43.3078697Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_log_cuda_float16 PASSED [0.0079s] [ 42%] 2025-12-04T12:25:43.3078801Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_bool PASSED [0.0111s] [ 42%] 2025-12-04T12:25:43.3078913Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_complex128 XFAIL [0.0036s] [ 42%] 2025-12-04T12:25:43.3079021Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_float16 PASSED [1.4792s] [ 42%] 2025-12-04T12:25:43.3079127Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_int8 PASSED [1.4841s] [ 42%] 2025-12-04T12:25:43.3079232Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_max_cuda_uint8 PASSED [0.0069s] [ 43%] 2025-12-04T12:25:43.3079363Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_bfloat16 PASSED [0.1113s] [ 43%] 2025-12-04T12:25:43.3079473Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_bool XFAIL [0.0156s] [ 43%] 2025-12-04T12:25:43.3079589Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_float32 PASSED [1.5862s] [ 43%] 2025-12-04T12:25:43.3079701Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_maximum_cuda_uint8 PASSED [0.0750s] [ 43%] 2025-12-04T12:25:43.3079817Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_bfloat16 PASSED [0.1115s] [ 43%] 2025-12-04T12:25:43.3079945Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_complex64 XFAIL [0.0051s] [ 43%] 2025-12-04T12:25:43.3080073Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_float32 PASSED [1.5846s] [ 43%] 2025-12-04T12:25:43.3080186Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_int16 PASSED [0.0752s] [ 43%] 2025-12-04T12:25:43.3080298Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_minimum_cuda_uint8 PASSED [0.0745s] [ 43%] 2025-12-04T12:25:43.3080421Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_bfloat16 PASSED [0.0550s] [ 43%] 2025-12-04T12:25:43.3080534Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_complex64 PASSED [0.0666s] [ 43%] 2025-12-04T12:25:43.3080646Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_float64 PASSED [0.0534s] [ 43%] 2025-12-04T12:25:43.3080753Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_mul_cuda_int16 PASSED [0.0403s] [ 43%] 2025-12-04T12:25:43.3080867Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_bfloat16 PASSED [0.0081s] [ 43%] 2025-12-04T12:25:43.3080971Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_bool XFAIL [0.0035s] [ 43%] 2025-12-04T12:25:43.3081080Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_int16 PASSED [1.4695s] [ 43%] 2025-12-04T12:25:43.3081186Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_int64 PASSED [0.0084s] [ 43%] 2025-12-04T12:25:43.3081293Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_neg_cuda_int8 PASSED [0.0080s] [ 43%] 2025-12-04T12:25:43.3081398Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_bool XFAIL [0.0035s] [ 43%] 2025-12-04T12:25:43.3081511Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_norm_cuda_float64 PASSED [1.5710s] [ 43%] 2025-12-04T12:25:43.3081620Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_float64 PASSED [0.0451s] [ 43%] 2025-12-04T12:25:43.3081728Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_pow_cuda_int32 PASSED [0.0324s] [ 43%] 2025-12-04T12:25:43.3081844Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_int16 PASSED [0.0081s] [ 43%] 2025-12-04T12:25:43.3081959Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_reciprocal_cuda_int8 PASSED [0.0080s] [ 43%] 2025-12-04T12:25:43.3082071Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_bfloat16 PASSED [0.0078s] [ 43%] 2025-12-04T12:25:43.3082184Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_complex64 XFAIL [0.0036s] [ 43%] 2025-12-04T12:25:43.3082295Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_float32 PASSED [1.4832s] [ 43%] 2025-12-04T12:25:43.3082403Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_round_cuda_uint8 PASSED [1.4955s] [ 43%] 2025-12-04T12:25:43.3082516Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_bfloat16 PASSED [0.0103s] [ 43%] 2025-12-04T12:25:43.3082633Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_complex64 PASSED [0.0085s] [ 43%] 2025-12-04T12:25:43.3082747Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_rsqrt_cuda_float64 PASSED [0.0080s] [ 43%] 2025-12-04T12:25:43.3082864Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_bfloat16 PASSED [0.0071s] [ 43%] 2025-12-04T12:25:43.3082998Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_complex128 PASSED [0.0072s] [ 43%] 2025-12-04T12:25:43.3083114Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_float16 PASSED [0.0071s] [ 43%] 2025-12-04T12:25:43.3083228Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_float32 PASSED [0.0070s] [ 43%] 2025-12-04T12:25:43.3083339Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_int16 PASSED [0.0071s] [ 43%] 2025-12-04T12:25:43.3083450Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_int32 PASSED [0.0071s] [ 44%] 2025-12-04T12:25:43.3083569Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_int8 PASSED [0.0071s] [ 44%] 2025-12-04T12:25:43.3083701Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sigmoid_cuda_uint8 PASSED [0.0070s] [ 44%] 2025-12-04T12:25:43.3083814Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_complex64 XFAIL [0.0036s] [ 44%] 2025-12-04T12:25:43.3083929Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_float16 PASSED [1.5076s] [ 44%] 2025-12-04T12:25:43.3084049Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_float32 PASSED [0.0085s] [ 44%] 2025-12-04T12:25:43.3084161Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_int64 PASSED [0.0081s] [ 44%] 2025-12-04T12:25:43.3084268Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sign_cuda_uint8 PASSED [0.0080s] [ 44%] 2025-12-04T12:25:43.3084379Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_bfloat16 PASSED [0.0080s] [ 44%] 2025-12-04T12:25:43.3084486Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_bool PASSED [0.0080s] [ 44%] 2025-12-04T12:25:43.3084596Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_float32 PASSED [0.0078s] [ 44%] 2025-12-04T12:25:43.3084704Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_float64 PASSED [0.0078s] [ 44%] 2025-12-04T12:25:43.3084816Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_int16 PASSED [0.0079s] [ 44%] 2025-12-04T12:25:43.3084923Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sin_cuda_uint8 PASSED [0.0080s] [ 44%] 2025-12-04T12:25:43.3085034Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_bool PASSED [0.0079s] [ 44%] 2025-12-04T12:25:43.3085149Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_complex128 PASSED [0.0079s] [ 44%] 2025-12-04T12:25:43.3085257Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_float16 PASSED [0.0079s] [ 44%] 2025-12-04T12:25:43.3085365Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_float32 PASSED [0.0077s] [ 44%] 2025-12-04T12:25:43.3085474Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_int64 PASSED [0.0080s] [ 44%] 2025-12-04T12:25:43.3085583Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sinh_cuda_int8 PASSED [0.0080s] [ 44%] 2025-12-04T12:25:43.3085694Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sqrt_cuda_complex64 PASSED [0.0079s] [ 44%] 2025-12-04T12:25:43.3085806Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sqrt_cuda_float64 PASSED [0.0078s] [ 44%] 2025-12-04T12:25:43.3085911Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sqrt_cuda_int8 PASSED [0.0079s] [ 44%] 2025-12-04T12:25:43.3086020Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_float64 XFAIL [0.0106s] [ 44%] 2025-12-04T12:25:43.3086125Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_int32 XFAIL [0.0087s] [ 44%] 2025-12-04T12:25:43.3086231Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_sub_cuda_int64 XFAIL [1.5065s] [ 44%] 2025-12-04T12:25:43.3086336Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_bool PASSED [1.4927s] [ 44%] 2025-12-04T12:25:43.3086452Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_complex64 PASSED [0.0126s] [ 44%] 2025-12-04T12:25:43.3086557Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_int16 PASSED [0.0093s] [ 44%] 2025-12-04T12:25:43.3086676Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tan_cuda_int64 PASSED [0.0084s] [ 44%] 2025-12-04T12:25:43.3086785Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tanh_cuda_float16 PASSED [0.0082s] [ 44%] 2025-12-04T12:25:43.3086895Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tanh_cuda_float32 PASSED [0.0079s] [ 44%] 2025-12-04T12:25:43.3087003Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_tanh_cuda_int32 PASSED [0.0081s] [ 44%] 2025-12-04T12:25:43.3087110Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_bool XFAIL [0.0035s] [ 44%] 2025-12-04T12:25:43.3087235Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_complex128 XFAIL [1.4815s] [ 44%] 2025-12-04T12:25:43.3087361Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_complex64 XFAIL [1.4884s] [ 44%] 2025-12-04T12:25:43.3087472Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_float16 PASSED [1.4712s] [ 45%] 2025-12-04T12:25:43.3087585Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_float32 PASSED [0.0086s] [ 45%] 2025-12-04T12:25:43.3087713Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_int16 PASSED [0.0083s] [ 45%] 2025-12-04T12:25:43.3087822Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_int32 PASSED [0.0080s] [ 45%] 2025-12-04T12:25:43.3087936Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_trunc_cuda_int8 PASSED [0.0079s] [ 45%] 2025-12-04T12:25:43.3088049Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_complex64 PASSED [0.0061s] [ 45%] 2025-12-04T12:25:43.3088239Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_float32 PASSED [0.0061s] [ 45%] 2025-12-04T12:25:43.3088347Z test_meta.py::TestMetaCUDA::test_meta_outplace__foreach_zero_cuda_uint8 PASSED [0.0061s] [ 45%] 2025-12-04T12:25:43.3088475Z test_meta.py::TestMetaCUDA::test_meta_outplace__native_batch_norm_legit_cuda_float16 PASSED [0.0303s] [ 45%] 2025-12-04T12:25:43.3088601Z test_meta.py::TestMetaCUDA::test_meta_outplace__native_batch_norm_legit_cuda_float32 PASSED [0.0132s] [ 45%] 2025-12-04T12:25:43.3088727Z test_meta.py::TestMetaCUDA::test_meta_outplace__segment_reduce_offsets_cuda_bfloat16 PASSED [1.5531s] [ 45%] 2025-12-04T12:25:43.3088851Z test_meta.py::TestMetaCUDA::test_meta_outplace__segment_reduce_offsets_cuda_float32 PASSED [0.0386s] [ 45%] 2025-12-04T12:25:43.3088973Z test_meta.py::TestMetaCUDA::test_meta_outplace__segment_reduce_offsets_cuda_float64 PASSED [1.6148s] [ 45%] 2025-12-04T12:25:43.3089100Z test_meta.py::TestMetaCUDA::test_meta_outplace__softmax_backward_data_cuda_float16 PASSED [0.0280s] [ 45%] 2025-12-04T12:25:43.3089228Z test_meta.py::TestMetaCUDA::test_meta_outplace__softmax_backward_data_cuda_float64 PASSED [0.0045s] [ 45%] 2025-12-04T12:25:43.3089350Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_complex128 PASSED [0.0143s] [ 45%] 2025-12-04T12:25:43.3089473Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_complex64 PASSED [0.0135s] [ 45%] 2025-12-04T12:25:43.3089593Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_float64 PASSED [0.0133s] [ 45%] 2025-12-04T12:25:43.3089711Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_int16 PASSED [0.0134s] [ 45%] 2025-12-04T12:25:43.3089826Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_cuda_int8 PASSED [0.0133s] [ 45%] 2025-12-04T12:25:43.3089969Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_bfloat16 PASSED [0.0143s] [ 45%] 2025-12-04T12:25:43.3090111Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_bool PASSED [0.0141s] [ 45%] 2025-12-04T12:25:43.3090251Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_float16 PASSED [0.0141s] [ 45%] 2025-12-04T12:25:43.3090393Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int32 PASSED [1.5442s] [ 45%] 2025-12-04T12:25:43.3090551Z test_meta.py::TestMetaCUDA::test_meta_outplace__unsafe_masked_index_put_accumulate_cuda_int8 PASSED [0.0173s] [ 45%] 2025-12-04T12:25:43.3090681Z test_meta.py::TestMetaCUDA::test_meta_outplace__upsample_bilinear2d_aa_cuda_bfloat16 PASSED [0.0095s] [ 45%] 2025-12-04T12:25:43.3090805Z test_meta.py::TestMetaCUDA::test_meta_outplace__upsample_bilinear2d_aa_cuda_float32 PASSED [1.5485s] [ 45%] 2025-12-04T12:25:43.3090903Z test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_bool PASSED [0.0046s] [ 45%] 2025-12-04T12:25:43.3091015Z test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_float16 PASSED [1.5157s] [ 45%] 2025-12-04T12:25:43.3091128Z test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_int32 PASSED [0.0045s] [ 45%] 2025-12-04T12:25:43.3091225Z test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_int64 PASSED [1.5167s] [ 45%] 2025-12-04T12:25:43.3091324Z test_meta.py::TestMetaCUDA::test_meta_outplace_abs_cuda_uint8 PASSED [0.0046s] [ 45%] 2025-12-04T12:25:43.3091428Z test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_complex64 PASSED [0.0042s] [ 45%] 2025-12-04T12:25:43.3091541Z test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_float16 PASSED [1.5327s] [ 45%] 2025-12-04T12:25:43.3091639Z test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_float64 PASSED [0.0058s] [ 45%] 2025-12-04T12:25:43.3091736Z test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_int32 PASSED [0.0039s] [ 45%] 2025-12-04T12:25:43.3091831Z test_meta.py::TestMetaCUDA::test_meta_outplace_acos_cuda_uint8 PASSED [1.5702s] [ 46%] 2025-12-04T12:25:43.3091935Z test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_bfloat16 PASSED [0.0143s] [ 46%] 2025-12-04T12:25:43.3092039Z test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_complex64 PASSED [0.0039s] [ 46%] 2025-12-04T12:25:43.3092141Z test_meta.py::TestMetaCUDA::test_meta_outplace_acosh_cuda_float64 PASSED [1.5420s] [ 46%] 2025-12-04T12:25:43.3092237Z test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_bool PASSED [0.0118s] [ 46%] 2025-12-04T12:25:43.3092338Z test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_complex128 PASSED [0.0100s] [ 46%] 2025-12-04T12:25:43.3092439Z test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_complex32 PASSED [0.0099s] [ 46%] 2025-12-04T12:25:43.3092537Z test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_complex64 PASSED [1.5348s] [ 46%] 2025-12-04T12:25:43.3092630Z test_meta.py::TestMetaCUDA::test_meta_outplace_add_cuda_int32 PASSED [0.0120s] [ 46%] 2025-12-04T12:25:43.3092730Z test_meta.py::TestMetaCUDA::test_meta_outplace_addbmm_cuda_float16 PASSED [0.0076s] [ 46%] 2025-12-04T12:25:43.3092831Z test_meta.py::TestMetaCUDA::test_meta_outplace_addbmm_cuda_float64 PASSED [0.0069s] [ 46%] 2025-12-04T12:25:43.3092938Z test_meta.py::TestMetaCUDA::test_meta_outplace_addcdiv_cuda_bfloat16 PASSED [0.0120s] [ 46%] 2025-12-04T12:25:43.3093045Z test_meta.py::TestMetaCUDA::test_meta_outplace_addcdiv_cuda_complex64 PASSED [0.0150s] [ 46%] 2025-12-04T12:25:43.3093150Z test_meta.py::TestMetaCUDA::test_meta_outplace_addcdiv_cuda_float16 PASSED [0.0120s] [ 46%] 2025-12-04T12:25:43.3093257Z test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_bfloat16 PASSED [0.0120s] [ 46%] 2025-12-04T12:25:43.3093362Z test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_float16 PASSED [0.0119s] [ 46%] 2025-12-04T12:25:43.3093465Z test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_float32 PASSED [0.0119s] [ 46%] 2025-12-04T12:25:43.3093562Z test_meta.py::TestMetaCUDA::test_meta_outplace_addcmul_cuda_int32 PASSED [0.0118s] [ 46%] 2025-12-04T12:25:43.3093667Z test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_cuda_complex128 PASSED [0.0907s] [ 46%] 2025-12-04T12:25:43.3093770Z test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_cuda_complex64 PASSED [0.0134s] [ 46%] 2025-12-04T12:25:43.3093871Z test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_cuda_float16 PASSED [0.0095s] [ 46%] 2025-12-04T12:25:43.3093980Z test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_cuda_float32 PASSED [1.1868s] [ 46%] 2025-12-04T12:25:43.3094103Z test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_decomposed_cuda_complex128 PASSED [0.0102s] [ 46%] 2025-12-04T12:25:43.3094222Z test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_decomposed_cuda_complex64 PASSED [0.0095s] [ 46%] 2025-12-04T12:25:43.3094339Z test_meta.py::TestMetaCUDA::test_meta_outplace_addmm_decomposed_cuda_float16 PASSED [0.0090s] [ 46%] 2025-12-04T12:25:43.3094439Z test_meta.py::TestMetaCUDA::test_meta_outplace_addmv_cuda_float16 PASSED [1.5360s] [ 46%] 2025-12-04T12:25:43.3094537Z test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_float32 PASSED [0.0213s] [ 46%] 2025-12-04T12:25:43.3094643Z test_meta.py::TestMetaCUDA::test_meta_outplace_addr_cuda_int16 PASSED [0.0071s] [ 46%] 2025-12-04T12:25:43.3094761Z test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_float16 PASSED [0.0032s] [ 46%] 2025-12-04T12:25:43.3094869Z test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_float32 PASSED [1.5190s] [ 46%] 2025-12-04T12:25:43.3094978Z test_meta.py::TestMetaCUDA::test_meta_outplace_alias_copy_cuda_float64 PASSED [0.0048s] [ 46%] 2025-12-04T12:25:43.3095089Z test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_complex128 PASSED [0.0254s] [ 46%] 2025-12-04T12:25:43.3095188Z test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_complex64 PASSED [0.0182s] [ 46%] 2025-12-04T12:25:43.3095286Z test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_float64 PASSED [0.0179s] [ 46%] 2025-12-04T12:25:43.3095382Z test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_int16 PASSED [0.0179s] [ 46%] 2025-12-04T12:25:43.3095480Z test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_int64 PASSED [0.0178s] [ 46%] 2025-12-04T12:25:43.3095577Z test_meta.py::TestMetaCUDA::test_meta_outplace_all_cuda_uint8 PASSED [1.5820s] [ 46%] 2025-12-04T12:25:43.3095684Z test_meta.py::TestMetaCUDA::test_meta_outplace_allclose_cuda_bfloat16 PASSED [0.0319s] [ 47%] 2025-12-04T12:25:43.3095795Z test_meta.py::TestMetaCUDA::test_meta_outplace_allclose_cuda_complex64 PASSED [0.0288s] [ 47%] 2025-12-04T12:25:43.3095897Z test_meta.py::TestMetaCUDA::test_meta_outplace_allclose_cuda_float32 PASSED [0.0276s] [ 47%] 2025-12-04T12:25:43.3095997Z test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_bfloat16 PASSED [0.0126s] [ 47%] 2025-12-04T12:25:43.3096096Z test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_float16 PASSED [0.0123s] [ 47%] 2025-12-04T12:25:43.3096192Z test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_int16 PASSED [0.0113s] [ 47%] 2025-12-04T12:25:43.3096291Z test_meta.py::TestMetaCUDA::test_meta_outplace_amax_cuda_int32 PASSED [0.0114s] [ 47%] 2025-12-04T12:25:43.3096391Z test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_bfloat16 PASSED [0.0123s] [ 47%] 2025-12-04T12:25:43.3096489Z test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_float32 PASSED [0.0113s] [ 47%] 2025-12-04T12:25:43.3096585Z test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_float64 PASSED [0.0113s] [ 47%] 2025-12-04T12:25:43.3096683Z test_meta.py::TestMetaCUDA::test_meta_outplace_amin_cuda_int16 PASSED [0.0113s] [ 47%] 2025-12-04T12:25:43.3096781Z test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_bool PASSED [0.0092s] [ 47%] 2025-12-04T12:25:43.3096880Z test_meta.py::TestMetaCUDA::test_meta_outplace_aminmax_cuda_int8 PASSED [0.0044s] [ 47%] 2025-12-04T12:25:43.3096981Z test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_complex32 PASSED [0.0037s] [ 47%] 2025-12-04T12:25:43.3097081Z test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_float32 PASSED [1.5536s] [ 47%] 2025-12-04T12:25:43.3097180Z test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_float64 PASSED [0.0043s] [ 47%] 2025-12-04T12:25:43.3097283Z test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_int64 PASSED [1.5225s] [ 47%] 2025-12-04T12:25:43.3097379Z test_meta.py::TestMetaCUDA::test_meta_outplace_angle_cuda_uint8 PASSED [0.0044s] [ 47%] 2025-12-04T12:25:43.3097480Z test_meta.py::TestMetaCUDA::test_meta_outplace_any_cuda_float16 PASSED [0.0151s] [ 47%] 2025-12-04T12:25:43.3097610Z test_meta.py::TestMetaCUDA::test_meta_outplace_arange_cuda_bfloat16 PASSED [0.0131s] [ 47%] 2025-12-04T12:25:43.3097715Z test_meta.py::TestMetaCUDA::test_meta_outplace_arange_cuda_float16 PASSED [0.0128s] [ 47%] 2025-12-04T12:25:43.3097812Z test_meta.py::TestMetaCUDA::test_meta_outplace_arange_cuda_int16 PASSED [0.0101s] [ 47%] 2025-12-04T12:25:43.3097911Z test_meta.py::TestMetaCUDA::test_meta_outplace_arange_cuda_uint8 PASSED [0.0097s] [ 47%] 2025-12-04T12:25:43.3098011Z test_meta.py::TestMetaCUDA::test_meta_outplace_argmax_cuda_int64 PASSED [0.0127s] [ 47%] 2025-12-04T12:25:43.3098162Z test_meta.py::TestMetaCUDA::test_meta_outplace_argmax_cuda_uint8 PASSED [0.0076s] [ 47%] 2025-12-04T12:25:43.3098279Z test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_bfloat16 PASSED [1.5322s] [ 47%] 2025-12-04T12:25:43.3098379Z test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_float32 PASSED [0.0094s] [ 47%] 2025-12-04T12:25:43.3098480Z test_meta.py::TestMetaCUDA::test_meta_outplace_argmin_cuda_float64 PASSED [0.0077s] [ 47%] 2025-12-04T12:25:43.3098584Z test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_float16 PASSED [0.2109s] [ 47%] 2025-12-04T12:25:43.3098703Z test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_float32 PASSED [0.0135s] [ 47%] 2025-12-04T12:25:43.3098804Z test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_float64 PASSED [0.0131s] [ 47%] 2025-12-04T12:25:43.3098902Z test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_int16 PASSED [0.0132s] [ 47%] 2025-12-04T12:25:43.3099000Z test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_int64 PASSED [0.0148s] [ 47%] 2025-12-04T12:25:43.3099103Z test_meta.py::TestMetaCUDA::test_meta_outplace_argsort_cuda_uint8 PASSED [0.0131s] [ 47%] 2025-12-04T12:25:43.3099207Z test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_float64 PASSED [1.6326s] [ 47%] 2025-12-04T12:25:43.3099306Z test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_int16 PASSED [0.0061s] [ 47%] 2025-12-04T12:25:43.3099406Z test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_int32 PASSED [0.0043s] [ 48%] 2025-12-04T12:25:43.3099505Z test_meta.py::TestMetaCUDA::test_meta_outplace_argwhere_cuda_int8 PASSED [1.5510s] [ 48%] 2025-12-04T12:25:43.3099620Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_complex32 PASSED [0.0067s] [ 48%] 2025-12-04T12:25:43.3099731Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_int64 PASSED [0.0048s] [ 48%] 2025-12-04T12:25:43.3099840Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_copy_cuda_uint8 PASSED [0.0046s] [ 48%] 2025-12-04T12:25:43.3099951Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_bfloat16 PASSED [1.5448s] [ 48%] 2025-12-04T12:25:43.3100054Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_bool PASSED [0.0053s] [ 48%] 2025-12-04T12:25:43.3100164Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_complex64 PASSED [0.0039s] [ 48%] 2025-12-04T12:25:43.3100270Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_float64 PASSED [1.5459s] [ 48%] 2025-12-04T12:25:43.3100375Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_int16 PASSED [0.0051s] [ 48%] 2025-12-04T12:25:43.3100476Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_cuda_int8 PASSED [0.0037s] [ 48%] 2025-12-04T12:25:43.3100598Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_bool PASSED [1.5367s] [ 48%] 2025-12-04T12:25:43.3100725Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_complex32 PASSED [0.0051s] [ 48%] 2025-12-04T12:25:43.3100850Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_partial_views_cuda_uint8 PASSED [0.0035s] [ 48%] 2025-12-04T12:25:43.3100964Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_bool PASSED [1.5377s] [ 48%] 2025-12-04T12:25:43.3101077Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_float32 PASSED [0.0063s] [ 48%] 2025-12-04T12:25:43.3101208Z test_meta.py::TestMetaCUDA::test_meta_outplace_as_strided_scatter_cuda_int16 PASSED [0.0047s] [ 48%] 2025-12-04T12:25:43.3101310Z test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_complex32 PASSED [1.5167s] [ 48%] 2025-12-04T12:25:43.3101410Z test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_complex64 PASSED [0.0046s] [ 48%] 2025-12-04T12:25:43.3101506Z test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_int16 PASSED [1.5502s] [ 48%] 2025-12-04T12:25:43.3101606Z test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_int32 PASSED [0.0047s] [ 48%] 2025-12-04T12:25:43.3101712Z test_meta.py::TestMetaCUDA::test_meta_outplace_asin_cuda_int8 PASSED [1.5248s] [ 48%] 2025-12-04T12:25:43.3101827Z test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_complex32 PASSED [0.0099s] [ 48%] 2025-12-04T12:25:43.3101926Z test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_float32 PASSED [1.5237s] [ 48%] 2025-12-04T12:25:43.3102025Z test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_float64 PASSED [0.0048s] [ 48%] 2025-12-04T12:25:43.3102122Z test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_int32 PASSED [1.5366s] [ 48%] 2025-12-04T12:25:43.3102234Z test_meta.py::TestMetaCUDA::test_meta_outplace_asinh_cuda_uint8 PASSED [0.0045s] [ 48%] 2025-12-04T12:25:43.3102329Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan2_cuda_int16 PASSED [0.0229s] [ 48%] 2025-12-04T12:25:43.3102426Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan2_cuda_int8 PASSED [0.0086s] [ 48%] 2025-12-04T12:25:43.3102522Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan2_cuda_uint8 PASSED [0.0084s] [ 48%] 2025-12-04T12:25:43.3102625Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_bool PASSED [1.5319s] [ 48%] 2025-12-04T12:25:43.3102727Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_complex32 PASSED [0.1761s] [ 48%] 2025-12-04T12:25:43.3102825Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_float64 PASSED [1.5505s] [ 48%] 2025-12-04T12:25:43.3102922Z test_meta.py::TestMetaCUDA::test_meta_outplace_atan_cuda_int16 PASSED [0.0047s] [ 48%] 2025-12-04T12:25:43.3103021Z test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_float32 PASSED [1.5684s] [ 48%] 2025-12-04T12:25:43.3103116Z test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_int16 PASSED [0.0048s] [ 48%] 2025-12-04T12:25:43.3103212Z test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_int64 PASSED [1.5389s] [ 49%] 2025-12-04T12:25:43.3103308Z test_meta.py::TestMetaCUDA::test_meta_outplace_atanh_cuda_int8 PASSED [0.0047s] [ 49%] 2025-12-04T12:25:43.3103417Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_bfloat16 PASSED [0.0046s] [ 49%] 2025-12-04T12:25:43.3103526Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_float32 PASSED [1.5717s] [ 49%] 2025-12-04T12:25:43.3103632Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_1d_cuda_int32 PASSED [0.0056s] [ 49%] 2025-12-04T12:25:43.3103746Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_complex128 PASSED [1.5409s] [ 49%] 2025-12-04T12:25:43.3103854Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_float64 PASSED [0.0041s] [ 49%] 2025-12-04T12:25:43.3103960Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_2d_cuda_int64 PASSED [1.5622s] [ 49%] 2025-12-04T12:25:43.3104072Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_complex128 PASSED [0.0043s] [ 49%] 2025-12-04T12:25:43.3104182Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_complex64 PASSED [1.5474s] [ 49%] 2025-12-04T12:25:43.3104287Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_float16 PASSED [0.0042s] [ 49%] 2025-12-04T12:25:43.3104395Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_float64 PASSED [1.5499s] [ 49%] 2025-12-04T12:25:43.3104498Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_int64 PASSED [0.0041s] [ 49%] 2025-12-04T12:25:43.3104601Z test_meta.py::TestMetaCUDA::test_meta_outplace_atleast_3d_cuda_uint8 PASSED [1.5305s] [ 49%] 2025-12-04T12:25:43.3104720Z test_meta.py::TestMetaCUDA::test_meta_outplace_baddbmm_cuda_complex128 PASSED [0.0162s] [ 49%] 2025-12-04T12:25:43.3104824Z test_meta.py::TestMetaCUDA::test_meta_outplace_baddbmm_cuda_float16 PASSED [1.5258s] [ 49%] 2025-12-04T12:25:43.3104931Z test_meta.py::TestMetaCUDA::test_meta_outplace_bernoulli_cuda_float64 PASSED [0.0077s] [ 49%] 2025-12-04T12:25:43.3105037Z test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_bfloat16 PASSED [1.5372s] [ 49%] 2025-12-04T12:25:43.3105146Z test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_complex128 PASSED [0.0054s] [ 49%] 2025-12-04T12:25:43.3105263Z test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_complex32 PASSED [0.0038s] [ 49%] 2025-12-04T12:25:43.3105375Z test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_float16 PASSED [1.5378s] [ 49%] 2025-12-04T12:25:43.3105476Z test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_int64 PASSED [0.0053s] [ 49%] 2025-12-04T12:25:43.3105576Z test_meta.py::TestMetaCUDA::test_meta_outplace_bfloat16_cuda_int8 PASSED [0.0038s] [ 49%] 2025-12-04T12:25:43.3105677Z test_meta.py::TestMetaCUDA::test_meta_outplace_bincount_cuda_int16 PASSED [0.0138s] [ 49%] 2025-12-04T12:25:43.3105791Z test_meta.py::TestMetaCUDA::test_meta_outplace_bincount_cuda_uint8 PASSED [0.0070s] [ 49%] 2025-12-04T12:25:43.3105899Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_and_cuda_int64 PASSED [0.0085s] [ 49%] 2025-12-04T12:25:43.3106014Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_left_shift_cuda_int64 PASSED [0.0225s] [ 49%] 2025-12-04T12:25:43.3106129Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_left_shift_cuda_uint8 PASSED [0.0084s] [ 49%] 2025-12-04T12:25:43.3106234Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_not_cuda_bool PASSED [0.0037s] [ 49%] 2025-12-04T12:25:43.3106337Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_or_cuda_int64 PASSED [0.0082s] [ 49%] 2025-12-04T12:25:43.3106455Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_right_shift_cuda_int16 PASSED [0.0084s] [ 49%] 2025-12-04T12:25:43.3106563Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_xor_cuda_int16 PASSED [0.0082s] [ 49%] 2025-12-04T12:25:43.3106670Z test_meta.py::TestMetaCUDA::test_meta_outplace_bitwise_xor_cuda_int32 PASSED [0.0081s] [ 49%] 2025-12-04T12:25:43.3106772Z test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_int32 PASSED [1.5448s] [ 49%] 2025-12-04T12:25:43.3106877Z test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_int64 PASSED [0.0066s] [ 49%] 2025-12-04T12:25:43.3106979Z test_meta.py::TestMetaCUDA::test_meta_outplace_block_diag_cuda_uint8 PASSED [0.0048s] [ 49%] 2025-12-04T12:25:43.3107078Z test_meta.py::TestMetaCUDA::test_meta_outplace_bmm_cuda_float16 PASSED [1.5380s] [ 50%] 2025-12-04T12:25:43.3107176Z test_meta.py::TestMetaCUDA::test_meta_outplace_bmm_cuda_float32 PASSED [0.0045s] [ 50%] 2025-12-04T12:25:43.3107277Z test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_complex64 PASSED [0.0039s] [ 50%] 2025-12-04T12:25:43.3107376Z test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_float64 PASSED [1.5124s] [ 50%] 2025-12-04T12:25:43.3107473Z test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_int16 PASSED [0.0052s] [ 50%] 2025-12-04T12:25:43.3107568Z test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_int32 PASSED [0.0036s] [ 50%] 2025-12-04T12:25:43.3107664Z test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_int64 PASSED [1.5162s] [ 50%] 2025-12-04T12:25:43.3107760Z test_meta.py::TestMetaCUDA::test_meta_outplace_bool_cuda_uint8 PASSED [0.0051s] [ 50%] 2025-12-04T12:25:43.3107876Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_shapes_cuda_float32 PASSED [0.0032s] [ 50%] 2025-12-04T12:25:43.3107995Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_tensors_cuda_bfloat16 PASSED [1.5351s] [ 50%] 2025-12-04T12:25:43.3108155Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_complex64 PASSED [0.0053s] [ 50%] 2025-12-04T12:25:43.3108278Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_int32 PASSED [0.0040s] [ 50%] 2025-12-04T12:25:43.3108388Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_int64 PASSED [1.5347s] [ 50%] 2025-12-04T12:25:43.3108496Z test_meta.py::TestMetaCUDA::test_meta_outplace_broadcast_to_cuda_int8 PASSED [0.0057s] [ 50%] 2025-12-04T12:25:43.3108603Z test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_bfloat16 PASSED [0.0197s] [ 50%] 2025-12-04T12:25:43.3108709Z test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_float16 PASSED [0.0151s] [ 50%] 2025-12-04T12:25:43.3108814Z test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_int16 PASSED [0.0150s] [ 50%] 2025-12-04T12:25:43.3108927Z test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_int32 PASSED [0.0150s] [ 50%] 2025-12-04T12:25:43.3109044Z test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_int64 PASSED [0.0150s] [ 50%] 2025-12-04T12:25:43.3109144Z test_meta.py::TestMetaCUDA::test_meta_outplace_bucketize_cuda_int8 PASSED [0.0151s] [ 50%] 2025-12-04T12:25:43.3109244Z test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_bfloat16 PASSED [0.0034s] [ 50%] 2025-12-04T12:25:43.3109358Z test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_complex64 PASSED [1.5644s] [ 50%] 2025-12-04T12:25:43.3109456Z test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_float16 PASSED [0.0049s] [ 50%] 2025-12-04T12:25:43.3109555Z test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_int32 PASSED [0.0036s] [ 50%] 2025-12-04T12:25:43.3109650Z test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_int64 PASSED [1.5669s] [ 50%] 2025-12-04T12:25:43.3109745Z test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_int8 PASSED [0.0049s] [ 50%] 2025-12-04T12:25:43.3109841Z test_meta.py::TestMetaCUDA::test_meta_outplace_byte_cuda_uint8 PASSED [0.0036s] [ 50%] 2025-12-04T12:25:43.3109954Z test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_bool PASSED [1.5916s] [ 50%] 2025-12-04T12:25:43.3110065Z test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_int64 PASSED [0.0038s] [ 50%] 2025-12-04T12:25:43.3110178Z test_meta.py::TestMetaCUDA::test_meta_outplace_cartesian_prod_cuda_uint8 PASSED [1.5228s] [ 50%] 2025-12-04T12:25:43.3110278Z test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_complex32 PASSED [0.0103s] [ 50%] 2025-12-04T12:25:43.3110373Z test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_int16 PASSED [0.0081s] [ 50%] 2025-12-04T12:25:43.3110466Z test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_int8 PASSED [0.0079s] [ 50%] 2025-12-04T12:25:43.3110562Z test_meta.py::TestMetaCUDA::test_meta_outplace_cat_cuda_uint8 PASSED [1.5403s] [ 50%] 2025-12-04T12:25:43.3110666Z test_meta.py::TestMetaCUDA::test_meta_outplace_cdouble_cuda_bfloat16 PASSED [0.0055s] [ 50%] 2025-12-04T12:25:43.3110770Z test_meta.py::TestMetaCUDA::test_meta_outplace_cdouble_cuda_float16 PASSED [0.0038s] [ 50%] 2025-12-04T12:25:43.3110867Z test_meta.py::TestMetaCUDA::test_meta_outplace_cdouble_cuda_int8 PASSED [1.5223s] [ 50%] 2025-12-04T12:25:43.3110968Z test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_bfloat16 PASSED [0.0046s] [ 51%] 2025-12-04T12:25:43.3111065Z test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_int16 PASSED [1.5301s] [ 51%] 2025-12-04T12:25:43.3111161Z test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_int64 PASSED [0.0045s] [ 51%] 2025-12-04T12:25:43.3111255Z test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_int8 PASSED [1.5235s] [ 51%] 2025-12-04T12:25:43.3111351Z test_meta.py::TestMetaCUDA::test_meta_outplace_ceil_cuda_uint8 PASSED [0.0046s] [ 51%] 2025-12-04T12:25:43.3111451Z test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_float16 PASSED [1.5462s] [ 51%] 2025-12-04T12:25:43.3111553Z test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_float32 PASSED [0.0051s] [ 51%] 2025-12-04T12:25:43.3111650Z test_meta.py::TestMetaCUDA::test_meta_outplace_cfloat_cuda_uint8 PASSED [0.0037s] [ 51%] 2025-12-04T12:25:43.3111754Z test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_complex32 PASSED [1.5303s] [ 51%] 2025-12-04T12:25:43.3111874Z test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_complex64 PASSED [0.0050s] [ 51%] 2025-12-04T12:25:43.3111974Z test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_float16 PASSED [0.0036s] [ 51%] 2025-12-04T12:25:43.3112072Z test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_float32 PASSED [1.5606s] [ 51%] 2025-12-04T12:25:43.3112169Z test_meta.py::TestMetaCUDA::test_meta_outplace_chalf_cuda_int64 PASSED [0.0052s] [ 51%] 2025-12-04T12:25:43.3112266Z test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_float32 PASSED [0.0037s] [ 51%] 2025-12-04T12:25:43.3112373Z test_meta.py::TestMetaCUDA::test_meta_outplace_char_cuda_float64 PASSED [1.5213s] [ 51%] 2025-12-04T12:25:43.3112494Z test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_cuda_complex128 PASSED [0.0248s] [ 51%] 2025-12-04T12:25:43.3112602Z test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_cuda_complex64 PASSED [0.0142s] [ 51%] 2025-12-04T12:25:43.3112721Z test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_solve_cuda_complex128 PASSED [0.1069s] [ 51%] 2025-12-04T12:25:43.3112846Z test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_solve_cuda_complex64 PASSED [0.0968s] [ 51%] 2025-12-04T12:25:43.3112960Z test_meta.py::TestMetaCUDA::test_meta_outplace_cholesky_solve_cuda_float32 PASSED [2.9215s] [ 51%] 2025-12-04T12:25:43.3113063Z test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_complex128 PASSED [1.5626s] [ 51%] 2025-12-04T12:25:43.3113166Z test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_complex32 PASSED [0.0047s] [ 51%] 2025-12-04T12:25:43.3113265Z test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_float64 PASSED [1.5455s] [ 51%] 2025-12-04T12:25:43.3113363Z test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_int32 PASSED [0.0048s] [ 51%] 2025-12-04T12:25:43.3113458Z test_meta.py::TestMetaCUDA::test_meta_outplace_chunk_cuda_int8 PASSED [1.5290s] [ 51%] 2025-12-04T12:25:43.3113558Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_cuda_float16 PASSED [0.0135s] [ 51%] 2025-12-04T12:25:43.3113655Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_cuda_float32 PASSED [1.5258s] [ 51%] 2025-12-04T12:25:43.3113752Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_cuda_int8 PASSED [0.0137s] [ 51%] 2025-12-04T12:25:43.3113858Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_bfloat16 PASSED [0.0119s] [ 51%] 2025-12-04T12:25:43.3113965Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_max_cuda_float64 PASSED [0.0114s] [ 51%] 2025-12-04T12:25:43.3114072Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_min_cuda_float64 PASSED [0.0113s] [ 51%] 2025-12-04T12:25:43.3114176Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_min_cuda_int64 PASSED [0.0113s] [ 51%] 2025-12-04T12:25:43.3114276Z test_meta.py::TestMetaCUDA::test_meta_outplace_clamp_min_cuda_int8 PASSED [0.0114s] [ 51%] 2025-12-04T12:25:43.3114379Z test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_complex128 PASSED [1.5217s] [ 51%] 2025-12-04T12:25:43.3114483Z test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_complex32 PASSED [0.0043s] [ 51%] 2025-12-04T12:25:43.3114581Z test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_float64 PASSED [1.5309s] [ 51%] 2025-12-04T12:25:43.3114678Z test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_int16 PASSED [0.0042s] [ 52%] 2025-12-04T12:25:43.3114775Z test_meta.py::TestMetaCUDA::test_meta_outplace_clone_cuda_uint8 PASSED [1.5281s] [ 52%] 2025-12-04T12:25:43.3114881Z test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_bool PASSED [0.0062s] [ 52%] 2025-12-04T12:25:43.3114994Z test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_complex64 PASSED [0.0044s] [ 52%] 2025-12-04T12:25:43.3115105Z test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_float64 PASSED [0.0041s] [ 52%] 2025-12-04T12:25:43.3115212Z test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_int16 PASSED [1.5614s] [ 52%] 2025-12-04T12:25:43.3115329Z test_meta.py::TestMetaCUDA::test_meta_outplace_column_stack_cuda_int8 PASSED [0.0058s] [ 52%] 2025-12-04T12:25:43.3115436Z test_meta.py::TestMetaCUDA::test_meta_outplace_combinations_cuda_int32 PASSED [0.0133s] [ 52%] 2025-12-04T12:25:43.3115544Z test_meta.py::TestMetaCUDA::test_meta_outplace_combinations_cuda_uint8 PASSED [1.5453s] [ 52%] 2025-12-04T12:25:43.3115647Z test_meta.py::TestMetaCUDA::test_meta_outplace_complex_cuda_float64 PASSED [0.0154s] [ 52%] 2025-12-04T12:25:43.3115745Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_bfloat16 PASSED [1.5344s] [ 52%] 2025-12-04T12:25:43.3115839Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_bool PASSED [0.0043s] [ 52%] 2025-12-04T12:25:43.3115949Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_complex32 PASSED [1.5174s] [ 52%] 2025-12-04T12:25:43.3116059Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_complex64 PASSED [0.0047s] [ 52%] 2025-12-04T12:25:43.3116157Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_int16 PASSED [1.5198s] [ 52%] 2025-12-04T12:25:43.3116256Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_int64 PASSED [0.0044s] [ 52%] 2025-12-04T12:25:43.3116363Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_cuda_uint8 PASSED [1.5638s] [ 52%] 2025-12-04T12:25:43.3116470Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_bool PASSED [0.0043s] [ 52%] 2025-12-04T12:25:43.3116585Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_complex128 PASSED [1.5321s] [ 52%] 2025-12-04T12:25:43.3116695Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_float16 PASSED [0.0040s] [ 52%] 2025-12-04T12:25:43.3116805Z test_meta.py::TestMetaCUDA::test_meta_outplace_conj_physical_cuda_int32 PASSED [1.5325s] [ 52%] 2025-12-04T12:25:43.3116922Z test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_complex128 PASSED [0.0174s] [ 52%] 2025-12-04T12:25:43.3117039Z test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_complex64 PASSED [1.5594s] [ 52%] 2025-12-04T12:25:43.3117154Z test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_float16 PASSED [0.0173s] [ 52%] 2025-12-04T12:25:43.3117267Z test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_float32 PASSED [1.5635s] [ 52%] 2025-12-04T12:25:43.3117380Z test_meta.py::TestMetaCUDA::test_meta_outplace_constant_pad_nd_cuda_int64 PASSED [0.0173s] [ 52%] 2025-12-04T12:25:43.3117490Z test_meta.py::TestMetaCUDA::test_meta_outplace_contiguous_cuda_complex32 PASSED [1.5347s] [ 52%] 2025-12-04T12:25:43.3117598Z test_meta.py::TestMetaCUDA::test_meta_outplace_contiguous_cuda_float32 PASSED [0.0041s] [ 52%] 2025-12-04T12:25:43.3117703Z test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_float16 PASSED [0.0194s] [ 52%] 2025-12-04T12:25:43.3117805Z test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_int16 PASSED [0.0112s] [ 52%] 2025-12-04T12:25:43.3117904Z test_meta.py::TestMetaCUDA::test_meta_outplace_copysign_cuda_int64 PASSED [0.0111s] [ 52%] 2025-12-04T12:25:43.3118004Z test_meta.py::TestMetaCUDA::test_meta_outplace_corrcoef_cuda_int32 PASSED [0.0053s] [ 52%] 2025-12-04T12:25:43.3118158Z test_meta.py::TestMetaCUDA::test_meta_outplace_corrcoef_cuda_int64 PASSED [1.5240s] [ 52%] 2025-12-04T12:25:43.3118255Z test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_bool PASSED [0.0057s] [ 52%] 2025-12-04T12:25:43.3118356Z test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_complex128 PASSED [0.0057s] [ 52%] 2025-12-04T12:25:43.3118455Z test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_complex32 PASSED [1.7401s] [ 52%] 2025-12-04T12:25:43.3118553Z test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_complex64 PASSED [0.2159s] [ 53%] 2025-12-04T12:25:43.3118651Z test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_float32 PASSED [0.0041s] [ 53%] 2025-12-04T12:25:43.3118746Z test_meta.py::TestMetaCUDA::test_meta_outplace_cos_cuda_int16 PASSED [1.5332s] [ 53%] 2025-12-04T12:25:43.3118844Z test_meta.py::TestMetaCUDA::test_meta_outplace_cosh_cuda_bfloat16 PASSED [0.0058s] [ 53%] 2025-12-04T12:25:43.3118967Z test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_bool PASSED [0.0096s] [ 53%] 2025-12-04T12:25:43.3119084Z test_meta.py::TestMetaCUDA::test_meta_outplace_count_nonzero_cuda_complex128 PASSED [0.0094s] [ 53%] 2025-12-04T12:25:43.3119183Z test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_complex128 PASSED [1.5566s] [ 53%] 2025-12-04T12:25:43.3119281Z test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_complex64 PASSED [0.0298s] [ 53%] 2025-12-04T12:25:43.3119377Z test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_float64 PASSED [0.0857s] [ 53%] 2025-12-04T12:25:43.3119486Z test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_int16 PASSED [1.5719s] [ 53%] 2025-12-04T12:25:43.3119598Z test_meta.py::TestMetaCUDA::test_meta_outplace_cov_cuda_uint8 PASSED [0.0307s] [ 53%] 2025-12-04T12:25:43.3119701Z test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_complex128 PASSED [0.0088s] [ 53%] 2025-12-04T12:25:43.3119801Z test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_float32 PASSED [1.5420s] [ 53%] 2025-12-04T12:25:43.3119897Z test_meta.py::TestMetaCUDA::test_meta_outplace_cross_cuda_uint8 PASSED [0.0058s] [ 53%] 2025-12-04T12:25:43.3120006Z test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_bool PASSED [0.0085s] [ 53%] 2025-12-04T12:25:43.3120106Z test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_float32 PASSED [1.5501s] [ 53%] 2025-12-04T12:25:43.3120204Z test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_int16 PASSED [0.0048s] [ 53%] 2025-12-04T12:25:43.3120302Z test_meta.py::TestMetaCUDA::test_meta_outplace_cummax_cuda_int8 PASSED [1.5577s] [ 53%] 2025-12-04T12:25:43.3120402Z test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_float32 PASSED [0.0049s] [ 53%] 2025-12-04T12:25:43.3120499Z test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_int32 PASSED [1.5509s] [ 53%] 2025-12-04T12:25:43.3120596Z test_meta.py::TestMetaCUDA::test_meta_outplace_cummin_cuda_int8 PASSED [0.0049s] [ 53%] 2025-12-04T12:25:43.3120699Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_float32 PASSED [0.0233s] [ 53%] 2025-12-04T12:25:43.3120803Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_float64 PASSED [1.5461s] [ 53%] 2025-12-04T12:25:43.3120900Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumprod_cuda_int8 PASSED [0.0128s] [ 53%] 2025-12-04T12:25:43.3121009Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_complex128 PASSED [0.0056s] [ 53%] 2025-12-04T12:25:43.3121107Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_float16 PASSED [0.0058s] [ 53%] 2025-12-04T12:25:43.3121205Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_int32 PASSED [0.0053s] [ 53%] 2025-12-04T12:25:43.3121304Z test_meta.py::TestMetaCUDA::test_meta_outplace_cumsum_cuda_int8 PASSED [0.0052s] [ 53%] 2025-12-04T12:25:43.3121402Z test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_bool PASSED [1.5451s] [ 53%] 2025-12-04T12:25:43.3121503Z test_meta.py::TestMetaCUDA::test_meta_outplace_deg2rad_cuda_float64 PASSED [0.0049s] [ 53%] 2025-12-04T12:25:43.3121602Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_bfloat16 PASSED [0.0108s] [ 53%] 2025-12-04T12:25:43.3121698Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_float16 PASSED [0.0096s] [ 53%] 2025-12-04T12:25:43.3121796Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_float32 PASSED [1.5388s] [ 53%] 2025-12-04T12:25:43.3121891Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_float64 PASSED [0.0122s] [ 53%] 2025-12-04T12:25:43.3121987Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_int16 PASSED [0.0100s] [ 53%] 2025-12-04T12:25:43.3122080Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_cuda_int8 PASSED [0.0096s] [ 53%] 2025-12-04T12:25:43.3122184Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_bool PASSED [1.5564s] [ 53%] 2025-12-04T12:25:43.3122294Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_complex128 PASSED [0.0128s] [ 54%] 2025-12-04T12:25:43.3122417Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_complex64 PASSED [0.0110s] [ 54%] 2025-12-04T12:25:43.3122522Z test_meta.py::TestMetaCUDA::test_meta_outplace_diag_embed_cuda_int64 PASSED [0.0104s] [ 54%] 2025-12-04T12:25:43.3122633Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_complex64 PASSED [0.0049s] [ 54%] 2025-12-04T12:25:43.3122738Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagflat_cuda_float32 PASSED [0.0048s] [ 54%] 2025-12-04T12:25:43.3122853Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_bfloat16 PASSED [0.0087s] [ 54%] 2025-12-04T12:25:43.3122962Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_int16 PASSED [0.0087s] [ 54%] 2025-12-04T12:25:43.3123084Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_int64 PASSED [0.0086s] [ 54%] 2025-12-04T12:25:43.3123201Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_copy_cuda_int8 PASSED [0.0087s] [ 54%] 2025-12-04T12:25:43.3123307Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_bfloat16 PASSED [0.0058s] [ 54%] 2025-12-04T12:25:43.3123418Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_complex128 PASSED [0.0057s] [ 54%] 2025-12-04T12:25:43.3123544Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_complex64 PASSED [0.0058s] [ 54%] 2025-12-04T12:25:43.3123649Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_float64 PASSED [0.0057s] [ 54%] 2025-12-04T12:25:43.3123749Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_int32 PASSED [0.0057s] [ 54%] 2025-12-04T12:25:43.3123849Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_cuda_uint8 PASSED [0.0057s] [ 54%] 2025-12-04T12:25:43.3123965Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_bfloat16 PASSED [0.0071s] [ 54%] 2025-12-04T12:25:43.3124087Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_complex128 PASSED [0.0072s] [ 54%] 2025-12-04T12:25:43.3124199Z test_meta.py::TestMetaCUDA::test_meta_outplace_diagonal_scatter_cuda_int8 PASSED [0.0071s] [ 54%] 2025-12-04T12:25:43.3124303Z test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_complex128 PASSED [0.0623s] [ 54%] 2025-12-04T12:25:43.3124401Z test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_float16 PASSED [0.0636s] [ 54%] 2025-12-04T12:25:43.3124500Z test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_int32 PASSED [0.0615s] [ 54%] 2025-12-04T12:25:43.3124596Z test_meta.py::TestMetaCUDA::test_meta_outplace_diff_cuda_uint8 PASSED [0.0614s] [ 54%] 2025-12-04T12:25:43.3124702Z test_meta.py::TestMetaCUDA::test_meta_outplace_digamma_cuda_bfloat16 PASSED [1.5565s] [ 54%] 2025-12-04T12:25:43.3124804Z test_meta.py::TestMetaCUDA::test_meta_outplace_digamma_cuda_float64 PASSED [0.0070s] [ 54%] 2025-12-04T12:25:43.3124903Z test_meta.py::TestMetaCUDA::test_meta_outplace_digamma_cuda_int64 PASSED [0.0051s] [ 54%] 2025-12-04T12:25:43.3125002Z test_meta.py::TestMetaCUDA::test_meta_outplace_dist_cuda_complex64 PASSED [0.0323s] [ 54%] 2025-12-04T12:25:43.3125100Z test_meta.py::TestMetaCUDA::test_meta_outplace_dist_cuda_float16 PASSED [0.0303s] [ 54%] 2025-12-04T12:25:43.3125196Z test_meta.py::TestMetaCUDA::test_meta_outplace_dist_cuda_float64 PASSED [0.0298s] [ 54%] 2025-12-04T12:25:43.3125312Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_int16 PASSED [0.0159s] [ 54%] 2025-12-04T12:25:43.3125425Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_int32 PASSED [0.0158s] [ 54%] 2025-12-04T12:25:43.3125537Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_int64 PASSED [0.0156s] [ 54%] 2025-12-04T12:25:43.3125650Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_floor_rounding_cuda_int8 PASSED [0.0156s] [ 54%] 2025-12-04T12:25:43.3125770Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_bfloat16 PASSED [0.0089s] [ 54%] 2025-12-04T12:25:43.3125890Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_complex32 PASSED [0.0110s] [ 54%] 2025-12-04T12:25:43.3126019Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_no_rounding_mode_cuda_float64 PASSED [0.0088s] [ 54%] 2025-12-04T12:25:43.3126137Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_trunc_rounding_cuda_float64 PASSED [0.0094s] [ 54%] 2025-12-04T12:25:43.3126248Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_trunc_rounding_cuda_int16 PASSED [0.0088s] [ 55%] 2025-12-04T12:25:43.3126361Z test_meta.py::TestMetaCUDA::test_meta_outplace_div_trunc_rounding_cuda_int64 PASSED [0.0088s] [ 55%] 2025-12-04T12:25:43.3126461Z test_meta.py::TestMetaCUDA::test_meta_outplace_dot_cuda_complex64 PASSED [1.5521s] [ 55%] 2025-12-04T12:25:43.3126570Z test_meta.py::TestMetaCUDA::test_meta_outplace_dot_cuda_float16 PASSED [0.0049s] [ 55%] 2025-12-04T12:25:43.3126683Z test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_bfloat16 PASSED [0.0039s] [ 55%] 2025-12-04T12:25:43.3126781Z test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_bool PASSED [1.5308s] [ 55%] 2025-12-04T12:25:43.3126885Z test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_complex32 PASSED [0.0054s] [ 55%] 2025-12-04T12:25:43.3126985Z test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_float32 PASSED [0.0037s] [ 55%] 2025-12-04T12:25:43.3127096Z test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_float64 PASSED [1.5575s] [ 55%] 2025-12-04T12:25:43.3127194Z test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_int16 PASSED [0.0054s] [ 55%] 2025-12-04T12:25:43.3127290Z test_meta.py::TestMetaCUDA::test_meta_outplace_double_cuda_uint8 PASSED [0.0037s] [ 55%] 2025-12-04T12:25:43.3127393Z test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_bfloat16 PASSED [1.5409s] [ 55%] 2025-12-04T12:25:43.3127491Z test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_int16 PASSED [0.0051s] [ 55%] 2025-12-04T12:25:43.3127588Z test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_int64 PASSED [1.5452s] [ 55%] 2025-12-04T12:25:43.3127684Z test_meta.py::TestMetaCUDA::test_meta_outplace_dsplit_cuda_uint8 PASSED [0.0043s] [ 55%] 2025-12-04T12:25:43.3127790Z test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_complex32 PASSED [0.0044s] [ 55%] 2025-12-04T12:25:43.3127890Z test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_float16 PASSED [1.5773s] [ 55%] 2025-12-04T12:25:43.3127986Z test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_int32 PASSED [0.0058s] [ 55%] 2025-12-04T12:25:43.3128114Z test_meta.py::TestMetaCUDA::test_meta_outplace_dstack_cuda_uint8 PASSED [0.0041s] [ 55%] 2025-12-04T12:25:43.3128216Z test_meta.py::TestMetaCUDA::test_meta_outplace_einsum_cuda_bfloat16 PASSED [2.4941s] [ 55%] 2025-12-04T12:25:43.3128323Z test_meta.py::TestMetaCUDA::test_meta_outplace_einsum_cuda_complex128 PASSED [1.5979s] [ 55%] 2025-12-04T12:25:43.3128427Z test_meta.py::TestMetaCUDA::test_meta_outplace_einsum_cuda_complex64 PASSED [1.5994s] [ 55%] 2025-12-04T12:25:43.3128527Z test_meta.py::TestMetaCUDA::test_meta_outplace_einsum_cuda_float16 PASSED [1.9046s] [ 55%] 2025-12-04T12:25:43.3128627Z test_meta.py::TestMetaCUDA::test_meta_outplace_einsum_cuda_float32 PASSED [2.0119s] [ 55%] 2025-12-04T12:25:43.3128729Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_float16 PASSED [0.0040s] [ 55%] 2025-12-04T12:25:43.3128828Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_float32 PASSED [0.0031s] [ 55%] 2025-12-04T12:25:43.3128926Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_cuda_int8 PASSED [0.0027s] [ 55%] 2025-12-04T12:25:43.3129035Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_bfloat16 PASSED [1.5572s] [ 55%] 2025-12-04T12:25:43.3129147Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_complex128 PASSED [0.0073s] [ 55%] 2025-12-04T12:25:43.3129258Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_complex64 PASSED [0.0055s] [ 55%] 2025-12-04T12:25:43.3129368Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_float64 PASSED [0.0052s] [ 55%] 2025-12-04T12:25:43.3129470Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_int32 PASSED [1.5549s] [ 55%] 2025-12-04T12:25:43.3129589Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_like_cuda_int64 PASSED [0.0071s] [ 55%] 2025-12-04T12:25:43.3129706Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_complex128 PASSED [0.0076s] [ 55%] 2025-12-04T12:25:43.3129817Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_permuted_cuda_int64 PASSED [0.0074s] [ 55%] 2025-12-04T12:25:43.3129929Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_bfloat16 PASSED [0.0037s] [ 55%] 2025-12-04T12:25:43.3130046Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_complex128 PASSED [1.5203s] [ 56%] 2025-12-04T12:25:43.3130169Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_float64 PASSED [0.0055s] [ 56%] 2025-12-04T12:25:43.3130294Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_int16 PASSED [0.0040s] [ 56%] 2025-12-04T12:25:43.3130401Z test_meta.py::TestMetaCUDA::test_meta_outplace_empty_strided_cuda_int8 PASSED [1.5624s] [ 56%] 2025-12-04T12:25:43.3130502Z test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_complex128 PASSED [0.0114s] [ 56%] 2025-12-04T12:25:43.3130597Z test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_int16 PASSED [0.0092s] [ 56%] 2025-12-04T12:25:43.3130704Z test_meta.py::TestMetaCUDA::test_meta_outplace_eq_cuda_int64 PASSED [0.0089s] [ 56%] 2025-12-04T12:25:43.3130802Z test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_bool PASSED [1.5789s] [ 56%] 2025-12-04T12:25:43.3130906Z test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_complex64 PASSED [0.0050s] [ 56%] 2025-12-04T12:25:43.3131004Z test_meta.py::TestMetaCUDA::test_meta_outplace_equal_cuda_float64 PASSED [1.5425s] [ 56%] 2025-12-04T12:25:43.3131102Z test_meta.py::TestMetaCUDA::test_meta_outplace_erf_cuda_float16 PASSED [0.0046s] [ 56%] 2025-12-04T12:25:43.3131200Z test_meta.py::TestMetaCUDA::test_meta_outplace_erf_cuda_float32 PASSED [1.5396s] [ 56%] 2025-12-04T12:25:43.3131295Z test_meta.py::TestMetaCUDA::test_meta_outplace_erf_cuda_uint8 PASSED [0.0046s] [ 56%] 2025-12-04T12:25:43.3131394Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfc_cuda_float32 PASSED [0.0055s] [ 56%] 2025-12-04T12:25:43.3131497Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_bfloat16 PASSED [1.5553s] [ 56%] 2025-12-04T12:25:43.3131595Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_bool PASSED [0.0062s] [ 56%] 2025-12-04T12:25:43.3131694Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_float64 PASSED [1.5384s] [ 56%] 2025-12-04T12:25:43.3131791Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_int64 PASSED [0.0045s] [ 56%] 2025-12-04T12:25:43.3131887Z test_meta.py::TestMetaCUDA::test_meta_outplace_erfinv_cuda_int8 PASSED [1.5554s] [ 56%] 2025-12-04T12:25:43.3131987Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_complex64 PASSED [0.2047s] [ 56%] 2025-12-04T12:25:43.3132083Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_float16 PASSED [0.1163s] [ 56%] 2025-12-04T12:25:43.3132183Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_float64 PASSED [1.5492s] [ 56%] 2025-12-04T12:25:43.3132280Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_int16 PASSED [0.0076s] [ 56%] 2025-12-04T12:25:43.3132379Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp2_cuda_uint8 PASSED [0.0040s] [ 56%] 2025-12-04T12:25:43.3132474Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_bfloat16 PASSED [1.5423s] [ 56%] 2025-12-04T12:25:43.3132575Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_complex128 PASSED [0.0075s] [ 56%] 2025-12-04T12:25:43.3132670Z test_meta.py::TestMetaCUDA::test_meta_outplace_exp_cuda_int32 PASSED [0.0040s] [ 56%] 2025-12-04T12:25:43.3132780Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_bfloat16 PASSED [1.5375s] [ 56%] 2025-12-04T12:25:43.3132879Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_bool PASSED [0.0047s] [ 56%] 2025-12-04T12:25:43.3132987Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_float16 PASSED [1.5355s] [ 56%] 2025-12-04T12:25:43.3133100Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_int64 PASSED [0.0048s] [ 56%] 2025-12-04T12:25:43.3133204Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_as_cuda_uint8 PASSED [1.5410s] [ 56%] 2025-12-04T12:25:43.3133313Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_bfloat16 PASSED [0.0086s] [ 56%] 2025-12-04T12:25:43.3133419Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_bool PASSED [0.0065s] [ 56%] 2025-12-04T12:25:43.3133529Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_float64 PASSED [0.0063s] [ 56%] 2025-12-04T12:25:43.3133647Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_int16 PASSED [0.0063s] [ 56%] 2025-12-04T12:25:43.3133764Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_int8 PASSED [0.0063s] [ 56%] 2025-12-04T12:25:43.3133870Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_copy_cuda_uint8 PASSED [0.0063s] [ 57%] 2025-12-04T12:25:43.3133970Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_bool PASSED [0.0044s] [ 57%] 2025-12-04T12:25:43.3134072Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_float16 PASSED [0.0044s] [ 57%] 2025-12-04T12:25:43.3134183Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_int16 PASSED [1.5618s] [ 57%] 2025-12-04T12:25:43.3134280Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_int64 PASSED [0.0061s] [ 57%] 2025-12-04T12:25:43.3134378Z test_meta.py::TestMetaCUDA::test_meta_outplace_expand_cuda_int8 PASSED [0.0046s] [ 57%] 2025-12-04T12:25:43.3134473Z test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_bool PASSED [1.5623s] [ 57%] 2025-12-04T12:25:43.3134578Z test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_complex64 PASSED [0.0046s] [ 57%] 2025-12-04T12:25:43.3134675Z test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_int16 PASSED [1.5329s] [ 57%] 2025-12-04T12:25:43.3134772Z test_meta.py::TestMetaCUDA::test_meta_outplace_expm1_cuda_int8 PASSED [0.0046s] [ 57%] 2025-12-04T12:25:43.3134887Z test_meta.py::TestMetaCUDA::test_meta_outplace_exponential_cuda_float32 PASSED [0.0096s] [ 57%] 2025-12-04T12:25:43.3134987Z test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_bfloat16 PASSED [0.0385s] [ 57%] 2025-12-04T12:25:43.3135084Z test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_float16 PASSED [0.0381s] [ 57%] 2025-12-04T12:25:43.3135195Z test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_float8_e4m3fnuz PASSED [0.0379s] [ 57%] 2025-12-04T12:25:43.3135302Z test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_float8_e5m2fnuz PASSED [1.5937s] [ 57%] 2025-12-04T12:25:43.3135398Z test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_int64 PASSED [0.0410s] [ 57%] 2025-12-04T12:25:43.3135491Z test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_int8 PASSED [1.5859s] [ 57%] 2025-12-04T12:25:43.3135588Z test_meta.py::TestMetaCUDA::test_meta_outplace_eye_cuda_uint8 PASSED [0.0404s] [ 57%] 2025-12-04T12:25:43.3135694Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_complex128 PASSED [3.9581s] [ 57%] 2025-12-04T12:25:43.3135796Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_int16 PASSED [2.4071s] [ 57%] 2025-12-04T12:25:43.3135895Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_int64 PASSED [0.0183s] [ 57%] 2025-12-04T12:25:43.3135995Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft2_cuda_uint8 PASSED [0.0141s] [ 57%] 2025-12-04T12:25:43.3136102Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_complex128 PASSED [2.1227s] [ 57%] 2025-12-04T12:25:43.3136208Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_complex64 PASSED [2.1523s] [ 57%] 2025-12-04T12:25:43.3136311Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_float16 PASSED [1.4834s] [ 57%] 2025-12-04T12:25:43.3136408Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_int8 PASSED [0.7841s] [ 57%] 2025-12-04T12:25:43.3136510Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fft_cuda_uint8 PASSED [0.0205s] [ 57%] 2025-12-04T12:25:43.3136633Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftshift_cuda_bfloat16 PASSED [0.0164s] [ 57%] 2025-12-04T12:25:43.3136745Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftshift_cuda_float16 PASSED [1.4916s] [ 57%] 2025-12-04T12:25:43.3136850Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_fftshift_cuda_int64 PASSED [0.0069s] [ 57%] 2025-12-04T12:25:43.3136956Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft2_cuda_float64 PASSED [1.4969s] [ 57%] 2025-12-04T12:25:43.3137055Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_bool PASSED [1.3685s] [ 57%] 2025-12-04T12:25:43.3137169Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_float32 PASSED [0.0214s] [ 57%] 2025-12-04T12:25:43.3137287Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_float64 PASSED [0.4862s] [ 57%] 2025-12-04T12:25:43.3137387Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_int32 PASSED [0.0213s] [ 57%] 2025-12-04T12:25:43.3137485Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_int8 PASSED [0.0135s] [ 57%] 2025-12-04T12:25:43.3137584Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfft_cuda_uint8 PASSED [0.0084s] [ 58%] 2025-12-04T12:25:43.3137695Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_bool PASSED [0.5753s] [ 58%] 2025-12-04T12:25:43.3137805Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_complex32 PASSED [2.3057s] [ 58%] 2025-12-04T12:25:43.3137912Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_complex64 PASSED [0.0262s] [ 58%] 2025-12-04T12:25:43.3138018Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_float16 PASSED [0.0159s] [ 58%] 2025-12-04T12:25:43.3138159Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_hfftn_cuda_int64 PASSED [0.0072s] [ 58%] 2025-12-04T12:25:43.3138268Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft2_cuda_complex32 PASSED [2.2148s] [ 58%] 2025-12-04T12:25:43.3138373Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft2_cuda_float64 PASSED [0.0192s] [ 58%] 2025-12-04T12:25:43.3138474Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft2_cuda_int8 PASSED [0.0139s] [ 58%] 2025-12-04T12:25:43.3138574Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_bool PASSED [0.0117s] [ 58%] 2025-12-04T12:25:43.3138680Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_complex32 PASSED [1.2310s] [ 58%] 2025-12-04T12:25:43.3138784Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_float32 PASSED [0.0077s] [ 58%] 2025-12-04T12:25:43.3138889Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_float64 PASSED [0.7553s] [ 58%] 2025-12-04T12:25:43.3138991Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_int16 PASSED [0.0079s] [ 58%] 2025-12-04T12:25:43.3139090Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_int64 PASSED [0.0073s] [ 58%] 2025-12-04T12:25:43.3139191Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifft_cuda_uint8 PASSED [0.0072s] [ 58%] 2025-12-04T12:25:43.3139298Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_complex32 PASSED [0.0067s] [ 58%] 2025-12-04T12:25:43.3139408Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_complex64 PASSED [0.5942s] [ 58%] 2025-12-04T12:25:43.3139514Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_float16 PASSED [0.0208s] [ 58%] 2025-12-04T12:25:43.3139622Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftn_cuda_float64 PASSED [0.5896s] [ 58%] 2025-12-04T12:25:43.3139735Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_bfloat16 PASSED [0.0155s] [ 58%] 2025-12-04T12:25:43.3139849Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_float16 PASSED [0.0107s] [ 58%] 2025-12-04T12:25:43.3139956Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_int32 PASSED [0.9204s] [ 58%] 2025-12-04T12:25:43.3140065Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_int8 PASSED [0.0062s] [ 58%] 2025-12-04T12:25:43.3140171Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ifftshift_cuda_uint8 PASSED [0.0047s] [ 58%] 2025-12-04T12:25:43.3140291Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft2_cuda_bool PASSED [0.3196s] [ 58%] 2025-12-04T12:25:43.3140397Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft2_cuda_float16 PASSED [0.3256s] [ 58%] 2025-12-04T12:25:43.3140500Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft2_cuda_int16 PASSED [0.0233s] [ 58%] 2025-12-04T12:25:43.3140602Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft2_cuda_int64 PASSED [0.0163s] [ 58%] 2025-12-04T12:25:43.3140707Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft_cuda_int32 PASSED [0.0108s] [ 58%] 2025-12-04T12:25:43.3140820Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft_cuda_int64 PASSED [0.0071s] [ 58%] 2025-12-04T12:25:43.3140938Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfft_cuda_uint8 PASSED [0.0071s] [ 58%] 2025-12-04T12:25:43.3141040Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfftn_cuda_bool PASSED [0.0081s] [ 58%] 2025-12-04T12:25:43.3141148Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfftn_cuda_int16 PASSED [0.0081s] [ 58%] 2025-12-04T12:25:43.3141266Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_ihfftn_cuda_int64 PASSED [0.0082s] [ 58%] 2025-12-04T12:25:43.3141378Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_complex128 PASSED [0.4490s] [ 58%] 2025-12-04T12:25:43.3141482Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_int16 PASSED [0.4584s] [ 59%] 2025-12-04T12:25:43.3141584Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_int64 PASSED [0.0191s] [ 59%] 2025-12-04T12:25:43.3141685Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft2_cuda_int8 PASSED [0.0133s] [ 59%] 2025-12-04T12:25:43.3141787Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft_cuda_bool PASSED [0.0107s] [ 59%] 2025-12-04T12:25:43.3141890Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfft_cuda_int8 PASSED [0.0062s] [ 59%] 2025-12-04T12:25:43.3141997Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_float64 PASSED [0.0069s] [ 59%] 2025-12-04T12:25:43.3142101Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_int16 PASSED [0.0069s] [ 59%] 2025-12-04T12:25:43.3142204Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_int32 PASSED [0.0069s] [ 59%] 2025-12-04T12:25:43.3142306Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_int64 PASSED [0.0068s] [ 59%] 2025-12-04T12:25:43.3142407Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_int8 PASSED [0.0069s] [ 59%] 2025-12-04T12:25:43.3142511Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_irfftn_cuda_uint8 PASSED [0.0069s] [ 59%] 2025-12-04T12:25:43.3142611Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_bool PASSED [0.4437s] [ 59%] 2025-12-04T12:25:43.3142719Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_float16 PASSED [0.7876s] [ 59%] 2025-12-04T12:25:43.3142824Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_float32 PASSED [0.0191s] [ 59%] 2025-12-04T12:25:43.3142929Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_int16 PASSED [0.0136s] [ 59%] 2025-12-04T12:25:43.3143030Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft2_cuda_int32 PASSED [0.0090s] [ 59%] 2025-12-04T12:25:43.3143135Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft_cuda_float64 PASSED [0.0272s] [ 59%] 2025-12-04T12:25:43.3143234Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft_cuda_int16 PASSED [0.0063s] [ 59%] 2025-12-04T12:25:43.3143336Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfft_cuda_int64 PASSED [0.0061s] [ 59%] 2025-12-04T12:25:43.3143435Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfftn_cuda_bool PASSED [0.0067s] [ 59%] 2025-12-04T12:25:43.3143538Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfftn_cuda_int64 PASSED [0.0067s] [ 59%] 2025-12-04T12:25:43.3143636Z test_meta.py::TestMetaCUDA::test_meta_outplace_fft_rfftn_cuda_int8 PASSED [0.0067s] [ 59%] 2025-12-04T12:25:43.3143749Z test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_complex32 PASSED [0.9454s] [ 59%] 2025-12-04T12:25:43.3143846Z test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_float16 PASSED [0.0046s] [ 59%] 2025-12-04T12:25:43.3143946Z test_meta.py::TestMetaCUDA::test_meta_outplace_fill_cuda_uint8 PASSED [0.9356s] [ 59%] 2025-12-04T12:25:43.3144043Z test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_bool PASSED [0.0053s] [ 59%] 2025-12-04T12:25:43.3144145Z test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_float64 PASSED [0.0039s] [ 59%] 2025-12-04T12:25:43.3144245Z test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_int16 PASSED [0.9322s] [ 59%] 2025-12-04T12:25:43.3144357Z test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_int8 PASSED [0.0055s] [ 59%] 2025-12-04T12:25:43.3144470Z test_meta.py::TestMetaCUDA::test_meta_outplace_flatten_cuda_uint8 PASSED [0.0040s] [ 59%] 2025-12-04T12:25:43.3144569Z test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_complex64 PASSED [0.0055s] [ 59%] 2025-12-04T12:25:43.3144669Z test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_float16 PASSED [0.0051s] [ 59%] 2025-12-04T12:25:43.3144776Z test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_float32 PASSED [0.0049s] [ 59%] 2025-12-04T12:25:43.3144875Z test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_float64 PASSED [0.0050s] [ 59%] 2025-12-04T12:25:43.3144972Z test_meta.py::TestMetaCUDA::test_meta_outplace_flip_cuda_int32 PASSED [0.0050s] [ 59%] 2025-12-04T12:25:43.3145068Z test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_int32 PASSED [0.0027s] [ 59%] 2025-12-04T12:25:43.3145165Z test_meta.py::TestMetaCUDA::test_meta_outplace_fliplr_cuda_uint8 PASSED [0.0025s] [ 59%] 2025-12-04T12:25:43.3145269Z test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_bfloat16 PASSED [0.0027s] [ 60%] 2025-12-04T12:25:43.3145372Z test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_complex64 PASSED [0.0027s] [ 60%] 2025-12-04T12:25:43.3145472Z test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_float64 PASSED [0.0025s] [ 60%] 2025-12-04T12:25:43.3145569Z test_meta.py::TestMetaCUDA::test_meta_outplace_flipud_cuda_uint8 PASSED [0.0027s] [ 60%] 2025-12-04T12:25:43.3145672Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_bfloat16 PASSED [0.0035s] [ 60%] 2025-12-04T12:25:43.3145778Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_complex128 PASSED [0.9352s] [ 60%] 2025-12-04T12:25:43.3145879Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_cuda_float32 PASSED [0.0051s] [ 60%] 2025-12-04T12:25:43.3145987Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_float64 PASSED [0.0090s] [ 60%] 2025-12-04T12:25:43.3146093Z test_meta.py::TestMetaCUDA::test_meta_outplace_float_power_cuda_int8 PASSED [0.0087s] [ 60%] 2025-12-04T12:25:43.3146191Z test_meta.py::TestMetaCUDA::test_meta_outplace_floor_cuda_float32 PASSED [0.0027s] [ 60%] 2025-12-04T12:25:43.3146300Z test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_int32 PASSED [0.0160s] [ 60%] 2025-12-04T12:25:43.3146409Z test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_int64 PASSED [0.0162s] [ 60%] 2025-12-04T12:25:43.3146520Z test_meta.py::TestMetaCUDA::test_meta_outplace_floor_divide_cuda_uint8 PASSED [0.0098s] [ 60%] 2025-12-04T12:25:43.3146614Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmax_cuda_int8 PASSED [0.0085s] [ 60%] 2025-12-04T12:25:43.3146713Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmax_cuda_uint8 PASSED [0.0082s] [ 60%] 2025-12-04T12:25:43.3146812Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmin_cuda_bfloat16 PASSED [0.0084s] [ 60%] 2025-12-04T12:25:43.3146910Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmin_cuda_bool PASSED [0.0081s] [ 60%] 2025-12-04T12:25:43.3147009Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmin_cuda_float16 PASSED [0.0083s] [ 60%] 2025-12-04T12:25:43.3147105Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmin_cuda_uint8 PASSED [0.0082s] [ 60%] 2025-12-04T12:25:43.3147204Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_bfloat16 PASSED [0.0089s] [ 60%] 2025-12-04T12:25:43.3147314Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_float16 PASSED [0.0088s] [ 60%] 2025-12-04T12:25:43.3147413Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_float32 PASSED [0.0087s] [ 60%] 2025-12-04T12:25:43.3147509Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_float64 PASSED [0.0088s] [ 60%] 2025-12-04T12:25:43.3147605Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_int16 PASSED [0.0087s] [ 60%] 2025-12-04T12:25:43.3147700Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_int32 PASSED [0.0087s] [ 60%] 2025-12-04T12:25:43.3147807Z test_meta.py::TestMetaCUDA::test_meta_outplace_fmod_cuda_uint8 PASSED [0.0087s] [ 60%] 2025-12-04T12:25:43.3147917Z test_meta.py::TestMetaCUDA::test_meta_outplace_frexp_cuda_float32 PASSED [0.9293s] [ 60%] 2025-12-04T12:25:43.3148015Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_bfloat16 PASSED [0.0053s] [ 60%] 2025-12-04T12:25:43.3148190Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_bool PASSED [0.0037s] [ 60%] 2025-12-04T12:25:43.3148291Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_complex64 PASSED [0.9282s] [ 60%] 2025-12-04T12:25:43.3148401Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_float16 PASSED [0.0053s] [ 60%] 2025-12-04T12:25:43.3148500Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_float64 PASSED [0.0039s] [ 60%] 2025-12-04T12:25:43.3148593Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_int16 PASSED [0.9460s] [ 60%] 2025-12-04T12:25:43.3148688Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_int8 PASSED [0.0054s] [ 60%] 2025-12-04T12:25:43.3148784Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_cuda_uint8 PASSED [0.0039s] [ 60%] 2025-12-04T12:25:43.3148896Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_complex128 PASSED [0.0085s] [ 60%] 2025-12-04T12:25:43.3149004Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_complex64 PASSED [0.9401s] [ 61%] 2025-12-04T12:25:43.3149114Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_float32 PASSED [0.0081s] [ 61%] 2025-12-04T12:25:43.3149221Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_float64 PASSED [0.0061s] [ 61%] 2025-12-04T12:25:43.3149327Z test_meta.py::TestMetaCUDA::test_meta_outplace_full_like_cuda_int16 PASSED [0.0056s] [ 61%] 2025-12-04T12:25:43.3149427Z test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_bool PASSED [0.0068s] [ 61%] 2025-12-04T12:25:43.3149524Z test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_int32 PASSED [0.0065s] [ 61%] 2025-12-04T12:25:43.3149622Z test_meta.py::TestMetaCUDA::test_meta_outplace_gather_cuda_uint8 PASSED [0.0065s] [ 61%] 2025-12-04T12:25:43.3149719Z test_meta.py::TestMetaCUDA::test_meta_outplace_gcd_cuda_int16 PASSED [0.0083s] [ 61%] 2025-12-04T12:25:43.3149811Z test_meta.py::TestMetaCUDA::test_meta_outplace_gcd_cuda_int32 PASSED [0.0099s] [ 61%] 2025-12-04T12:25:43.3149906Z test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_bool PASSED [0.0081s] [ 61%] 2025-12-04T12:25:43.3149999Z test_meta.py::TestMetaCUDA::test_meta_outplace_ge_cuda_int64 PASSED [0.0082s] [ 61%] 2025-12-04T12:25:43.3150106Z test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_float32 PASSED [0.0046s] [ 61%] 2025-12-04T12:25:43.3150213Z test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_float64 PASSED [0.0045s] [ 61%] 2025-12-04T12:25:43.3150313Z test_meta.py::TestMetaCUDA::test_meta_outplace_geometric_cuda_int8 PASSED [0.0045s] [ 61%] 2025-12-04T12:25:43.3150419Z test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_bfloat16 PASSED [0.0189s] [ 61%] 2025-12-04T12:25:43.3150523Z test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_float16 PASSED [0.9480s] [ 61%] 2025-12-04T12:25:43.3150628Z test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_float32 PASSED [0.0208s] [ 61%] 2025-12-04T12:25:43.3150731Z test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_float64 PASSED [0.9677s] [ 61%] 2025-12-04T12:25:43.3150846Z test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_int32 PASSED [0.0207s] [ 61%] 2025-12-04T12:25:43.3150945Z test_meta.py::TestMetaCUDA::test_meta_outplace_gradient_cuda_int8 PASSED [0.9571s] [ 61%] 2025-12-04T12:25:43.3151060Z test_meta.py::TestMetaCUDA::test_meta_outplace_grid_sampler_2d_cuda_float32 PASSED [0.3042s] [ 61%] 2025-12-04T12:25:43.3151174Z test_meta.py::TestMetaCUDA::test_meta_outplace_grid_sampler_2d_cuda_float64 PASSED [0.2947s] [ 61%] 2025-12-04T12:25:43.3151309Z test_meta.py::TestMetaCUDA::test_meta_outplace_grid_sampler_3d_cuda_bfloat16 SKIPPED [0.0002s] (Skipped!) [ 61%] 2025-12-04T12:25:43.3151424Z test_meta.py::TestMetaCUDA::test_meta_outplace_gt_cuda_int32 PASSED [0.0085s] [ 61%] 2025-12-04T12:25:43.3151538Z test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_bfloat16 PASSED [0.9397s] [ 61%] 2025-12-04T12:25:43.3151635Z test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_int16 PASSED [0.0052s] [ 61%] 2025-12-04T12:25:43.3151734Z test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_int32 PASSED [0.0037s] [ 61%] 2025-12-04T12:25:43.3151827Z test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_int8 PASSED [0.9411s] [ 61%] 2025-12-04T12:25:43.3151935Z test_meta.py::TestMetaCUDA::test_meta_outplace_half_cuda_uint8 PASSED [0.0056s] [ 61%] 2025-12-04T12:25:43.3152045Z test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_bfloat16 PASSED [0.0107s] [ 61%] 2025-12-04T12:25:43.3152151Z test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_bool PASSED [0.0104s] [ 61%] 2025-12-04T12:25:43.3152259Z test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_float32 PASSED [0.0111s] [ 61%] 2025-12-04T12:25:43.3152368Z test_meta.py::TestMetaCUDA::test_meta_outplace_hash_tensor_cuda_float64 PASSED [0.9385s] [ 61%] 2025-12-04T12:25:43.3152477Z test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_bfloat16 PASSED [0.0167s] [ 61%] 2025-12-04T12:25:43.3152585Z test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_float32 PASSED [0.0145s] [ 61%] 2025-12-04T12:25:43.3152694Z test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_float64 PASSED [0.0142s] [ 61%] 2025-12-04T12:25:43.3152798Z test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_int16 PASSED [0.0143s] [ 62%] 2025-12-04T12:25:43.3152901Z test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_int32 PASSED [0.0143s] [ 62%] 2025-12-04T12:25:43.3153003Z test_meta.py::TestMetaCUDA::test_meta_outplace_heaviside_cuda_uint8 PASSED [0.0142s] [ 62%] 2025-12-04T12:25:43.3153102Z test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_int16 PASSED [0.0463s] [ 62%] 2025-12-04T12:25:43.3153199Z test_meta.py::TestMetaCUDA::test_meta_outplace_histc_cuda_int32 PASSED [0.0459s] [ 62%] 2025-12-04T12:25:43.3153299Z test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_bool PASSED [0.0027s] [ 62%] 2025-12-04T12:25:43.3153405Z test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_complex128 PASSED [0.9393s] [ 62%] 2025-12-04T12:25:43.3153506Z test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_int32 PASSED [0.0041s] [ 62%] 2025-12-04T12:25:43.3153604Z test_meta.py::TestMetaCUDA::test_meta_outplace_hsplit_cuda_uint8 PASSED [0.9292s] [ 62%] 2025-12-04T12:25:43.3153708Z test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_bfloat16 PASSED [0.0054s] [ 62%] 2025-12-04T12:25:43.3153815Z test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_complex128 PASSED [0.0041s] [ 62%] 2025-12-04T12:25:43.3153923Z test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_complex32 PASSED [0.0039s] [ 62%] 2025-12-04T12:25:43.3154023Z test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_float16 PASSED [0.9315s] [ 62%] 2025-12-04T12:25:43.3154122Z test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_int8 PASSED [0.0056s] [ 62%] 2025-12-04T12:25:43.3154220Z test_meta.py::TestMetaCUDA::test_meta_outplace_hstack_cuda_uint8 PASSED [0.0041s] [ 62%] 2025-12-04T12:25:43.3154320Z test_meta.py::TestMetaCUDA::test_meta_outplace_hypot_cuda_float16 PASSED [0.0087s] [ 62%] 2025-12-04T12:25:43.3154431Z test_meta.py::TestMetaCUDA::test_meta_outplace_hypot_cuda_float64 PASSED [0.0082s] [ 62%] 2025-12-04T12:25:43.3154525Z test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_bool PASSED [0.9381s] [ 62%] 2025-12-04T12:25:43.3154618Z test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_int16 PASSED [0.0048s] [ 62%] 2025-12-04T12:25:43.3154711Z test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_int32 PASSED [0.0034s] [ 62%] 2025-12-04T12:25:43.3154802Z test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_int8 PASSED [0.9411s] [ 62%] 2025-12-04T12:25:43.3154895Z test_meta.py::TestMetaCUDA::test_meta_outplace_i0_cuda_uint8 PASSED [0.0051s] [ 62%] 2025-12-04T12:25:43.3155010Z test_meta.py::TestMetaCUDA::test_meta_outplace_igammac_cuda_float32 PASSED [0.0159s] [ 62%] 2025-12-04T12:25:43.3155125Z test_meta.py::TestMetaCUDA::test_meta_outplace_igammac_cuda_float64 PASSED [0.0083s] [ 62%] 2025-12-04T12:25:43.3155227Z test_meta.py::TestMetaCUDA::test_meta_outplace_imag_cuda_complex128 PASSED [0.0029s] [ 62%] 2025-12-04T12:25:43.3155329Z test_meta.py::TestMetaCUDA::test_meta_outplace_imag_cuda_complex64 PASSED [0.9440s] [ 62%] 2025-12-04T12:25:43.3155452Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_bfloat16 PASSED [0.0124s] [ 62%] 2025-12-04T12:25:43.3155552Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_bool PASSED [0.0070s] [ 62%] 2025-12-04T12:25:43.3155656Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_int16 PASSED [0.0096s] [ 62%] 2025-12-04T12:25:43.3155756Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_int32 PASSED [0.0095s] [ 62%] 2025-12-04T12:25:43.3155858Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_int64 PASSED [0.9317s] [ 62%] 2025-12-04T12:25:43.3155958Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_add_cuda_int8 PASSED [0.0120s] [ 62%] 2025-12-04T12:25:43.3156067Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_bfloat16 PASSED [0.0046s] [ 62%] 2025-12-04T12:25:43.3156170Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_bool PASSED [0.0043s] [ 62%] 2025-12-04T12:25:43.3156283Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_complex32 PASSED [0.0043s] [ 62%] 2025-12-04T12:25:43.3156391Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_float64 PASSED [0.0043s] [ 62%] 2025-12-04T12:25:43.3156495Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_copy_cuda_int32 PASSED [0.0042s] [ 62%] 2025-12-04T12:25:43.3156603Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_bfloat16 PASSED [0.9337s] [ 63%] 2025-12-04T12:25:43.3156705Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_bool PASSED [0.0069s] [ 63%] 2025-12-04T12:25:43.3156816Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_complex128 PASSED [0.0050s] [ 63%] 2025-12-04T12:25:43.3156925Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_complex64 PASSED [0.0048s] [ 63%] 2025-12-04T12:25:43.3157033Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_float16 PASSED [0.9422s] [ 63%] 2025-12-04T12:25:43.3157141Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_float64 PASSED [0.0065s] [ 63%] 2025-12-04T12:25:43.3157242Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_fill_cuda_int8 PASSED [0.0048s] [ 63%] 2025-12-04T12:25:43.3157351Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_complex64 PASSED [0.0044s] [ 63%] 2025-12-04T12:25:43.3157455Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_float16 PASSED [0.9291s] [ 63%] 2025-12-04T12:25:43.3157563Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_float64 PASSED [0.0065s] [ 63%] 2025-12-04T12:25:43.3157666Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_int32 PASSED [0.0045s] [ 63%] 2025-12-04T12:25:43.3157769Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_int64 PASSED [0.9460s] [ 63%] 2025-12-04T12:25:43.3157870Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_put_cuda_uint8 PASSED [0.0064s] [ 63%] 2025-12-04T12:25:43.3157998Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_float16 PASSED [0.0076s] [ 63%] 2025-12-04T12:25:43.3158155Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_int16 PASSED [0.9367s] [ 63%] 2025-12-04T12:25:43.3158270Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amax_cuda_uint8 PASSED [0.0093s] [ 63%] 2025-12-04T12:25:43.3158385Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_float16 PASSED [0.0076s] [ 63%] 2025-12-04T12:25:43.3158500Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_float64 PASSED [0.0074s] [ 63%] 2025-12-04T12:25:43.3158629Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_int64 PASSED [0.9337s] [ 63%] 2025-12-04T12:25:43.3158755Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_amin_cuda_int8 PASSED [0.0089s] [ 63%] 2025-12-04T12:25:43.3158875Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_mean_cuda_bfloat16 PASSED [0.0082s] [ 63%] 2025-12-04T12:25:43.3158993Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_mean_cuda_float16 PASSED [0.0079s] [ 63%] 2025-12-04T12:25:43.3159126Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_bfloat16 PASSED [0.9471s] [ 63%] 2025-12-04T12:25:43.3159239Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_float16 PASSED [0.0092s] [ 63%] 2025-12-04T12:25:43.3159354Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_float32 PASSED [0.0076s] [ 63%] 2025-12-04T12:25:43.3159466Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_int16 PASSED [0.0073s] [ 63%] 2025-12-04T12:25:43.3159586Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_int64 PASSED [0.9303s] [ 63%] 2025-12-04T12:25:43.3159699Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_int8 PASSED [0.0092s] [ 63%] 2025-12-04T12:25:43.3159812Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_reduce_prod_cuda_uint8 PASSED [0.0074s] [ 63%] 2025-12-04T12:25:43.3159923Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_bfloat16 PASSED [0.0043s] [ 63%] 2025-12-04T12:25:43.3160038Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_complex128 PASSED [0.0040s] [ 63%] 2025-12-04T12:25:43.3160149Z test_meta.py::TestMetaCUDA::test_meta_outplace_index_select_cuda_complex64 PASSED [0.9310s] [ 63%] 2025-12-04T12:25:43.3160251Z test_meta.py::TestMetaCUDA::test_meta_outplace_inner_cuda_float64 PASSED [0.0041s] [ 63%] 2025-12-04T12:25:43.3160350Z test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_bfloat16 PASSED [0.9373s] [ 63%] 2025-12-04T12:25:43.3160446Z test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_bool PASSED [0.0046s] [ 63%] 2025-12-04T12:25:43.3160547Z test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_complex64 PASSED [0.0036s] [ 63%] 2025-12-04T12:25:43.3160646Z test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_float64 PASSED [0.9313s] [ 64%] 2025-12-04T12:25:43.3160743Z test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_int16 PASSED [0.0052s] [ 64%] 2025-12-04T12:25:43.3160839Z test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_int64 PASSED [0.0037s] [ 64%] 2025-12-04T12:25:43.3160933Z test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_int8 PASSED [0.9279s] [ 64%] 2025-12-04T12:25:43.3161028Z test_meta.py::TestMetaCUDA::test_meta_outplace_int_cuda_uint8 PASSED [0.0052s] [ 64%] 2025-12-04T12:25:43.3161133Z test_meta.py::TestMetaCUDA::test_meta_outplace_isclose_cuda_float32 PASSED [0.0202s] [ 64%] 2025-12-04T12:25:43.3161233Z test_meta.py::TestMetaCUDA::test_meta_outplace_isclose_cuda_int64 PASSED [0.0184s] [ 64%] 2025-12-04T12:25:43.3161331Z test_meta.py::TestMetaCUDA::test_meta_outplace_isclose_cuda_int8 PASSED [0.9398s] [ 64%] 2025-12-04T12:25:43.3161429Z test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_bool PASSED [0.0043s] [ 64%] 2025-12-04T12:25:43.3161535Z test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_float32 PASSED [0.0039s] [ 64%] 2025-12-04T12:25:43.3161648Z test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_int16 PASSED [0.9433s] [ 64%] 2025-12-04T12:25:43.3161750Z test_meta.py::TestMetaCUDA::test_meta_outplace_isfinite_cuda_uint8 PASSED [0.0044s] [ 64%] 2025-12-04T12:25:43.3161847Z test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_float16 PASSED [0.0356s] [ 64%] 2025-12-04T12:25:43.3161945Z test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_int32 PASSED [0.9297s] [ 64%] 2025-12-04T12:25:43.3162039Z test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_int8 PASSED [0.0056s] [ 64%] 2025-12-04T12:25:43.3162145Z test_meta.py::TestMetaCUDA::test_meta_outplace_isin_cuda_uint8 PASSED [0.0040s] [ 64%] 2025-12-04T12:25:43.3162253Z test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_float16 PASSED [0.9442s] [ 64%] 2025-12-04T12:25:43.3162352Z test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_float32 PASSED [0.0042s] [ 64%] 2025-12-04T12:25:43.3162450Z test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_int16 PASSED [0.9456s] [ 64%] 2025-12-04T12:25:43.3162548Z test_meta.py::TestMetaCUDA::test_meta_outplace_isinf_cuda_int32 PASSED [0.0040s] [ 64%] 2025-12-04T12:25:43.3162657Z test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_bool PASSED [0.9370s] [ 64%] 2025-12-04T12:25:43.3162762Z test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_complex128 PASSED [0.0039s] [ 64%] 2025-12-04T12:25:43.3162860Z test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_float32 PASSED [0.9354s] [ 64%] 2025-12-04T12:25:43.3162958Z test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_float64 PASSED [0.0040s] [ 64%] 2025-12-04T12:25:43.3163056Z test_meta.py::TestMetaCUDA::test_meta_outplace_isnan_cuda_int64 PASSED [0.9305s] [ 64%] 2025-12-04T12:25:43.3163157Z test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_bool PASSED [0.0044s] [ 64%] 2025-12-04T12:25:43.3163257Z test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_int32 PASSED [0.9467s] [ 64%] 2025-12-04T12:25:43.3163359Z test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_int64 PASSED [0.0044s] [ 64%] 2025-12-04T12:25:43.3163458Z test_meta.py::TestMetaCUDA::test_meta_outplace_isneginf_cuda_int8 PASSED [0.9564s] [ 64%] 2025-12-04T12:25:43.3163562Z test_meta.py::TestMetaCUDA::test_meta_outplace_isposinf_cuda_float32 PASSED [0.0044s] [ 64%] 2025-12-04T12:25:43.3163664Z test_meta.py::TestMetaCUDA::test_meta_outplace_isposinf_cuda_int32 PASSED [0.9253s] [ 64%] 2025-12-04T12:25:43.3163764Z test_meta.py::TestMetaCUDA::test_meta_outplace_isposinf_cuda_uint8 PASSED [0.0040s] [ 64%] 2025-12-04T12:25:43.3163869Z test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_bfloat16 PASSED [0.9312s] [ 64%] 2025-12-04T12:25:43.3163977Z test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_complex128 PASSED [0.0053s] [ 64%] 2025-12-04T12:25:43.3164076Z test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_float16 PASSED [0.9343s] [ 64%] 2025-12-04T12:25:43.3164173Z test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_int16 PASSED [0.0049s] [ 64%] 2025-12-04T12:25:43.3164270Z test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_int32 PASSED [0.9304s] [ 65%] 2025-12-04T12:25:43.3164368Z test_meta.py::TestMetaCUDA::test_meta_outplace_isreal_cuda_int64 PASSED [0.0049s] [ 65%] 2025-12-04T12:25:43.3164464Z test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_bool PASSED [0.9420s] [ 65%] 2025-12-04T12:25:43.3164559Z test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_int64 PASSED [0.0053s] [ 65%] 2025-12-04T12:25:43.3164656Z test_meta.py::TestMetaCUDA::test_meta_outplace_item_cuda_uint8 PASSED [0.9464s] [ 65%] 2025-12-04T12:25:43.3164793Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_complex128 PASSED [0.0084s] [ 65%] 2025-12-04T12:25:43.3164929Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_complex64 PASSED [0.0056s] [ 65%] 2025-12-04T12:25:43.3165054Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_int8 PASSED [0.0046s] [ 65%] 2025-12-04T12:25:43.3165200Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_2inputs_2outputs_cuda_uint8 PASSED [0.0045s] [ 65%] 2025-12-04T12:25:43.3165341Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_complex128 PASSED [0.3212s] [ 65%] 2025-12-04T12:25:43.3165485Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_complex64 PASSED [0.0063s] [ 65%] 2025-12-04T12:25:43.3165620Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_4inputs_with_extra_args_cuda_int64 PASSED [0.0047s] [ 65%] 2025-12-04T12:25:43.3165745Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_bool PASSED [0.0045s] [ 65%] 2025-12-04T12:25:43.3165878Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_complex128 PASSED [0.0050s] [ 65%] 2025-12-04T12:25:43.3165999Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_complex64 PASSED [0.0051s] [ 65%] 2025-12-04T12:25:43.3166114Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_float64 PASSED [0.0045s] [ 65%] 2025-12-04T12:25:43.3166241Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_int32 PASSED [0.0043s] [ 65%] 2025-12-04T12:25:43.3166355Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_int64 PASSED [0.2567s] [ 65%] 2025-12-04T12:25:43.3166465Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_cuda_int8 PASSED [0.0050s] [ 65%] 2025-12-04T12:25:43.3166596Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_bool PASSED [0.2478s] [ 65%] 2025-12-04T12:25:43.3166730Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_float16 PASSED [0.2895s] [ 65%] 2025-12-04T12:25:43.3166866Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_float32 PASSED [0.0056s] [ 65%] 2025-12-04T12:25:43.3166999Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_int16 PASSED [0.0044s] [ 65%] 2025-12-04T12:25:43.3167134Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_int32 PASSED [0.0045s] [ 65%] 2025-12-04T12:25:43.3167265Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_binary_return_by_ref_cuda_uint8 PASSED [0.0043s] [ 65%] 2025-12-04T12:25:43.3167383Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_bfloat16 PASSED [0.9371s] [ 65%] 2025-12-04T12:25:43.3167495Z test_meta.py::TestMetaCUDA::test_meta_outplace_jiterator_unary_cuda_int16 PASSED [0.0048s] [ 65%] 2025-12-04T12:25:43.3167602Z test_meta.py::TestMetaCUDA::test_meta_outplace_kthvalue_cuda_float32 PASSED [0.0173s] [ 65%] 2025-12-04T12:25:43.3167704Z test_meta.py::TestMetaCUDA::test_meta_outplace_kthvalue_cuda_uint8 PASSED [0.0052s] [ 65%] 2025-12-04T12:25:43.3167800Z test_meta.py::TestMetaCUDA::test_meta_outplace_lcm_cuda_int32 PASSED [0.0146s] [ 65%] 2025-12-04T12:25:43.3167896Z test_meta.py::TestMetaCUDA::test_meta_outplace_lcm_cuda_int8 PASSED [0.0161s] [ 65%] 2025-12-04T12:25:43.3167994Z test_meta.py::TestMetaCUDA::test_meta_outplace_lcm_cuda_uint8 PASSED [0.0140s] [ 65%] 2025-12-04T12:25:43.3168166Z test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_bfloat16 PASSED [0.0096s] [ 65%] 2025-12-04T12:25:43.3168275Z test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_complex64 PASSED [0.0094s] [ 65%] 2025-12-04T12:25:43.3168371Z test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_int8 PASSED [0.0096s] [ 65%] 2025-12-04T12:25:43.3168470Z test_meta.py::TestMetaCUDA::test_meta_outplace_ldexp_cuda_uint8 PASSED [0.0096s] [ 65%] 2025-12-04T12:25:43.3168569Z test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_bfloat16 PASSED [0.0082s] [ 65%] 2025-12-04T12:25:43.3168663Z test_meta.py::TestMetaCUDA::test_meta_outplace_le_cuda_int32 PASSED [0.0081s] [ 66%] 2025-12-04T12:25:43.3168769Z test_meta.py::TestMetaCUDA::test_meta_outplace_lgamma_cuda_bfloat16 PASSED [0.9304s] [ 66%] 2025-12-04T12:25:43.3168885Z test_meta.py::TestMetaCUDA::test_meta_outplace_lgamma_cuda_float16 PASSED [0.0066s] [ 66%] 2025-12-04T12:25:43.3168989Z test_meta.py::TestMetaCUDA::test_meta_outplace_lgamma_cuda_float32 PASSED [0.0038s] [ 66%] 2025-12-04T12:25:43.3169114Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cholesky_ex_cuda_complex128 PASSED [0.0083s] [ 66%] 2025-12-04T12:25:43.3169227Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cond_cuda_float32 PASSED [0.9285s] [ 66%] 2025-12-04T12:25:43.3169342Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cross_cuda_complex128 PASSED [0.0057s] [ 66%] 2025-12-04T12:25:43.3169461Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_cross_cuda_int8 PASSED [0.0042s] [ 66%] 2025-12-04T12:25:43.3169583Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_det_cuda_complex128 PASSED [0.0960s] [ 66%] 2025-12-04T12:25:43.3169694Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_det_cuda_complex64 PASSED [0.0550s] [ 66%] 2025-12-04T12:25:43.3169811Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_complex32 PASSED [0.0063s] [ 66%] 2025-12-04T12:25:43.3169930Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_complex64 PASSED [0.0058s] [ 66%] 2025-12-04T12:25:43.3170056Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_int32 PASSED [0.0057s] [ 66%] 2025-12-04T12:25:43.3170170Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_int64 PASSED [0.0057s] [ 66%] 2025-12-04T12:25:43.3170278Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_diagonal_cuda_uint8 PASSED [0.0057s] [ 66%] 2025-12-04T12:25:43.3170390Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eig_cuda_complex128 PASSED [0.0446s] [ 66%] 2025-12-04T12:25:43.3170500Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eig_cuda_float32 PASSED [0.0455s] [ 66%] 2025-12-04T12:25:43.3170614Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigh_cuda_complex128 PASSED [0.0359s] [ 66%] 2025-12-04T12:25:43.3170725Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigh_cuda_float32 PASSED [0.0264s] [ 66%] 2025-12-04T12:25:43.3170842Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvals_cuda_complex128 PASSED [0.0103s] [ 66%] 2025-12-04T12:25:43.3170961Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_eigvalsh_cuda_complex128 PASSED [0.0089s] [ 66%] 2025-12-04T12:25:43.3171073Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_cuda_complex128 PASSED [0.0124s] [ 66%] 2025-12-04T12:25:43.3171181Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_cuda_float32 PASSED [0.0519s] [ 66%] 2025-12-04T12:25:43.3171296Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_ex_cuda_complex64 PASSED [0.0071s] [ 66%] 2025-12-04T12:25:43.3171407Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_inv_ex_cuda_float64 PASSED [0.9701s] [ 66%] 2025-12-04T12:25:43.3171524Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_factor_cuda_float64 PASSED [0.0121s] [ 66%] 2025-12-04T12:25:43.3171651Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_factor_ex_cuda_complex64 PASSED [0.2004s] [ 66%] 2025-12-04T12:25:43.3171771Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_factor_ex_cuda_float32 PASSED [0.0046s] [ 66%] 2025-12-04T12:25:43.3171981Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_solve_cuda_complex64 SKIPPED [0.0008s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 66%] 2025-12-04T12:25:43.3172181Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_ldl_solve_cuda_float64 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 66%] 2025-12-04T12:25:43.3172296Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lstsq_cuda_float64 PASSED [0.1335s] [ 66%] 2025-12-04T12:25:43.3172425Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lstsq_grad_oriented_cuda_float32 PASSED [1.0280s] [ 66%] 2025-12-04T12:25:43.3172555Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lstsq_grad_oriented_cuda_float64 PASSED [0.1383s] [ 66%] 2025-12-04T12:25:43.3172677Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_cuda_complex64 PASSED [0.0573s] [ 66%] 2025-12-04T12:25:43.3172787Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_cuda_float64 PASSED [0.0164s] [ 66%] 2025-12-04T12:25:43.3172901Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_factor_cuda_float32 PASSED [0.0714s] [ 66%] 2025-12-04T12:25:43.3173014Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_factor_cuda_float64 PASSED [0.0344s] [ 67%] 2025-12-04T12:25:43.3173137Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_factor_ex_cuda_complex64 PASSED [0.0152s] [ 67%] 2025-12-04T12:25:43.3173269Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_factor_ex_cuda_float64 PASSED [0.0145s] [ 67%] 2025-12-04T12:25:43.3173394Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_lu_solve_cuda_float64 PASSED [0.0796s] [ 67%] 2025-12-04T12:25:43.3173518Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_norm_cuda_bfloat16 PASSED [0.0383s] [ 67%] 2025-12-04T12:25:43.3173641Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_norm_cuda_complex128 PASSED [0.9040s] [ 67%] 2025-12-04T12:25:43.3173771Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_power_cuda_float64 PASSED [0.1013s] [ 67%] 2025-12-04T12:25:43.3173893Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_rank_cuda_complex128 PASSED [0.0431s] [ 67%] 2025-12-04T12:25:43.3174010Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_matrix_rank_cuda_complex64 PASSED [0.9185s] [ 67%] 2025-12-04T12:25:43.3174130Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_multi_dot_cuda_complex128 PASSED [0.0094s] [ 67%] 2025-12-04T12:25:43.3174242Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_norm_cuda_float16 PASSED [0.0681s] [ 67%] 2025-12-04T12:25:43.3174352Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_norm_cuda_float64 PASSED [0.0912s] [ 67%] 2025-12-04T12:25:43.3174491Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_norm_subgradients_at_zero_cuda_bfloat16 PASSED [0.0526s] [ 67%] 2025-12-04T12:25:43.3174631Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_norm_subgradients_at_zero_cuda_float16 PASSED [0.0525s] [ 67%] 2025-12-04T12:25:43.3174742Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_cuda_float32 PASSED [0.0366s] [ 67%] 2025-12-04T12:25:43.3174852Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_cuda_float64 PASSED [0.0346s] [ 67%] 2025-12-04T12:25:43.3174975Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_hermitian_cuda_complex64 PASSED [0.0199s] [ 67%] 2025-12-04T12:25:43.3175182Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_singular_cuda_complex128 SKIPPED [0.0006s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 67%] 2025-12-04T12:25:43.3175380Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_pinv_singular_cuda_float32 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 67%] 2025-12-04T12:25:43.3175493Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_qr_cuda_complex64 PASSED [0.0354s] [ 67%] 2025-12-04T12:25:43.3175601Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_qr_cuda_float32 PASSED [0.0268s] [ 67%] 2025-12-04T12:25:43.3175721Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_slogdet_cuda_complex64 PASSED [0.1618s] [ 67%] 2025-12-04T12:25:43.3175836Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_slogdet_cuda_float64 PASSED [0.0090s] [ 67%] 2025-12-04T12:25:43.3175949Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_solve_cuda_complex128 PASSED [0.0269s] [ 67%] 2025-12-04T12:25:43.3176078Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_solve_triangular_cuda_float64 PASSED [0.1256s] [ 67%] 2025-12-04T12:25:43.3176190Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svd_cuda_complex128 PASSED [0.9402s] [ 67%] 2025-12-04T12:25:43.3176306Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svdvals_cuda_complex128 PASSED [0.0255s] [ 67%] 2025-12-04T12:25:43.3176430Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_svdvals_cuda_float64 PASSED [0.0229s] [ 67%] 2025-12-04T12:25:43.3176547Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorinv_cuda_float32 PASSED [0.0058s] [ 67%] 2025-12-04T12:25:43.3176661Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorinv_cuda_float64 PASSED [0.9025s] [ 67%] 2025-12-04T12:25:43.3176787Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorsolve_cuda_complex128 PASSED [0.0135s] [ 67%] 2025-12-04T12:25:43.3176903Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_tensorsolve_cuda_float32 PASSED [0.0063s] [ 67%] 2025-12-04T12:25:43.3177028Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vander_cuda_complex64 PASSED [0.0088s] [ 67%] 2025-12-04T12:25:43.3177149Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vander_cuda_int16 PASSED [0.0081s] [ 67%] 2025-12-04T12:25:43.3177257Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vander_cuda_int64 PASSED [0.0079s] [ 67%] 2025-12-04T12:25:43.3177367Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vander_cuda_int8 PASSED [0.0079s] [ 67%] 2025-12-04T12:25:43.3177499Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vecdot_cuda_bfloat16 PASSED [0.8971s] [ 68%] 2025-12-04T12:25:43.3177612Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vecdot_cuda_complex128 PASSED [0.0358s] [ 68%] 2025-12-04T12:25:43.3177726Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vecdot_cuda_float16 PASSED [0.9592s] [ 68%] 2025-12-04T12:25:43.3177837Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vecdot_cuda_float64 PASSED [0.0104s] [ 68%] 2025-12-04T12:25:43.3177963Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vector_norm_cuda_complex128 PASSED [0.1238s] [ 68%] 2025-12-04T12:25:43.3178155Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vector_norm_cuda_complex64 PASSED [0.1227s] [ 68%] 2025-12-04T12:25:43.3178274Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vector_norm_cuda_float32 PASSED [0.1190s] [ 68%] 2025-12-04T12:25:43.3178392Z test_meta.py::TestMetaCUDA::test_meta_outplace_linalg_vector_norm_cuda_float64 PASSED [0.1189s] [ 68%] 2025-12-04T12:25:43.3178502Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_complex128 PASSED [0.0193s] [ 68%] 2025-12-04T12:25:43.3178610Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_complex64 PASSED [0.0188s] [ 68%] 2025-12-04T12:25:43.3178715Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_float16 PASSED [0.0189s] [ 68%] 2025-12-04T12:25:43.3178816Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_cuda_int16 PASSED [0.0190s] [ 68%] 2025-12-04T12:25:43.3178948Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_complex128 PASSED [0.0986s] [ 68%] 2025-12-04T12:25:43.3179080Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_complex64 PASSED [0.0987s] [ 68%] 2025-12-04T12:25:43.3179204Z test_meta.py::TestMetaCUDA::test_meta_outplace_linspace_tensor_overload_cuda_int64 PASSED [0.0998s] [ 68%] 2025-12-04T12:25:43.3179307Z test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_bfloat16 PASSED [0.9408s] [ 68%] 2025-12-04T12:25:43.3179412Z test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_complex128 PASSED [0.0071s] [ 68%] 2025-12-04T12:25:43.3179514Z test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_float64 PASSED [0.0038s] [ 68%] 2025-12-04T12:25:43.3179612Z test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_int16 PASSED [0.9306s] [ 68%] 2025-12-04T12:25:43.3179708Z test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_int8 PASSED [0.0057s] [ 68%] 2025-12-04T12:25:43.3179806Z test_meta.py::TestMetaCUDA::test_meta_outplace_log10_cuda_uint8 PASSED [0.0043s] [ 68%] 2025-12-04T12:25:43.3179911Z test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_bfloat16 PASSED [0.9511s] [ 68%] 2025-12-04T12:25:43.3180009Z test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_bool PASSED [0.0043s] [ 68%] 2025-12-04T12:25:43.3180114Z test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_complex64 PASSED [0.9326s] [ 68%] 2025-12-04T12:25:43.3180239Z test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_float16 PASSED [0.0042s] [ 68%] 2025-12-04T12:25:43.3180341Z test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_int16 PASSED [0.9266s] [ 68%] 2025-12-04T12:25:43.3180436Z test_meta.py::TestMetaCUDA::test_meta_outplace_log1p_cuda_int32 PASSED [0.0042s] [ 68%] 2025-12-04T12:25:43.3180539Z test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_bfloat16 PASSED [0.0040s] [ 68%] 2025-12-04T12:25:43.3180633Z test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_bool PASSED [0.9398s] [ 68%] 2025-12-04T12:25:43.3180746Z test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_float32 PASSED [0.0054s] [ 68%] 2025-12-04T12:25:43.3180857Z test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_float64 PASSED [0.0038s] [ 68%] 2025-12-04T12:25:43.3180953Z test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_int16 PASSED [0.9402s] [ 68%] 2025-12-04T12:25:43.3181050Z test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_int64 PASSED [0.0056s] [ 68%] 2025-12-04T12:25:43.3181146Z test_meta.py::TestMetaCUDA::test_meta_outplace_log2_cuda_uint8 PASSED [0.0040s] [ 68%] 2025-12-04T12:25:43.3181259Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_bfloat16 PASSED [0.9355s] [ 68%] 2025-12-04T12:25:43.3181360Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_complex128 PASSED [0.0071s] [ 68%] 2025-12-04T12:25:43.3181460Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_complex32 PASSED [0.0050s] [ 68%] 2025-12-04T12:25:43.3181556Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_float64 PASSED [0.9357s] [ 69%] 2025-12-04T12:25:43.3181654Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_cuda_int16 PASSED [0.0057s] [ 69%] 2025-12-04T12:25:43.3181763Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_normal_cuda_bfloat16 PASSED [0.0053s] [ 69%] 2025-12-04T12:25:43.3181872Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_normal_cuda_float32 PASSED [0.0048s] [ 69%] 2025-12-04T12:25:43.3181980Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_normal_cuda_float64 PASSED [0.0047s] [ 69%] 2025-12-04T12:25:43.3182093Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_cuda_bfloat16 PASSED [0.0088s] [ 69%] 2025-12-04T12:25:43.3182212Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_bool PASSED [0.0087s] [ 69%] 2025-12-04T12:25:43.3182332Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_int64 PASSED [0.0087s] [ 69%] 2025-12-04T12:25:43.3182453Z test_meta.py::TestMetaCUDA::test_meta_outplace_log_softmax_with_dtype_cuda_uint8 PASSED [0.0087s] [ 69%] 2025-12-04T12:25:43.3182566Z test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp2_cuda_bfloat16 PASSED [0.9471s] [ 69%] 2025-12-04T12:25:43.3182674Z test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp2_cuda_float64 PASSED [0.0060s] [ 69%] 2025-12-04T12:25:43.3182786Z test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp_cuda_complex64 PASSED [0.5683s] [ 69%] 2025-12-04T12:25:43.3182894Z test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp_cuda_float16 PASSED [0.0221s] [ 69%] 2025-12-04T12:25:43.3183004Z test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp_cuda_float32 PASSED [0.0218s] [ 69%] 2025-12-04T12:25:43.3183110Z test_meta.py::TestMetaCUDA::test_meta_outplace_logaddexp_cuda_float64 PASSED [0.0217s] [ 69%] 2025-12-04T12:25:43.3183224Z test_meta.py::TestMetaCUDA::test_meta_outplace_logcumsumexp_cuda_bfloat16 PASSED [0.0178s] [ 69%] 2025-12-04T12:25:43.3183334Z test_meta.py::TestMetaCUDA::test_meta_outplace_logcumsumexp_cuda_float32 PASSED [0.0048s] [ 69%] 2025-12-04T12:25:43.3183445Z test_meta.py::TestMetaCUDA::test_meta_outplace_logcumsumexp_cuda_float64 PASSED [0.0048s] [ 69%] 2025-12-04T12:25:43.3183551Z test_meta.py::TestMetaCUDA::test_meta_outplace_logdet_cuda_complex64 PASSED [0.0113s] [ 69%] 2025-12-04T12:25:43.3183662Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_bfloat16 PASSED [0.0109s] [ 69%] 2025-12-04T12:25:43.3183781Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_bool PASSED [0.0088s] [ 69%] 2025-12-04T12:25:43.3183892Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_complex64 PASSED [0.3919s] [ 69%] 2025-12-04T12:25:43.3183998Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_and_cuda_int8 PASSED [0.0109s] [ 69%] 2025-12-04T12:25:43.3184102Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_not_cuda_bool PASSED [0.9390s] [ 69%] 2025-12-04T12:25:43.3184212Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_or_cuda_bfloat16 PASSED [0.0125s] [ 69%] 2025-12-04T12:25:43.3184325Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_or_cuda_bool PASSED [0.0088s] [ 69%] 2025-12-04T12:25:43.3184444Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_or_cuda_float64 PASSED [0.0107s] [ 69%] 2025-12-04T12:25:43.3184547Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_or_cuda_int64 PASSED [0.0106s] [ 69%] 2025-12-04T12:25:43.3184654Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_bool PASSED [0.0088s] [ 69%] 2025-12-04T12:25:43.3184762Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_float16 PASSED [0.0107s] [ 69%] 2025-12-04T12:25:43.3184881Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_int16 PASSED [0.0105s] [ 69%] 2025-12-04T12:25:43.3184983Z test_meta.py::TestMetaCUDA::test_meta_outplace_logical_xor_cuda_int8 PASSED [0.0106s] [ 69%] 2025-12-04T12:25:43.3185087Z test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_bfloat16 PASSED [0.0071s] [ 69%] 2025-12-04T12:25:43.3185186Z test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_float16 PASSED [0.0069s] [ 69%] 2025-12-04T12:25:43.3185286Z test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_float32 PASSED [0.0068s] [ 69%] 2025-12-04T12:25:43.3185385Z test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_float64 PASSED [0.0068s] [ 69%] 2025-12-04T12:25:43.3185485Z test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_int16 PASSED [0.0069s] [ 70%] 2025-12-04T12:25:43.3185582Z test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_int32 PASSED [0.0068s] [ 70%] 2025-12-04T12:25:43.3185682Z test_meta.py::TestMetaCUDA::test_meta_outplace_logit_cuda_int8 PASSED [0.0069s] [ 70%] 2025-12-04T12:25:43.3185788Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_bfloat16 PASSED [0.1117s] [ 70%] 2025-12-04T12:25:43.3185899Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_complex64 PASSED [0.1263s] [ 70%] 2025-12-04T12:25:43.3186002Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_float16 PASSED [0.1261s] [ 70%] 2025-12-04T12:25:43.3186107Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_float32 PASSED [0.1338s] [ 70%] 2025-12-04T12:25:43.3186213Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_float64 PASSED [0.1187s] [ 70%] 2025-12-04T12:25:43.3186313Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_cuda_int8 PASSED [0.0431s] [ 70%] 2025-12-04T12:25:43.3186445Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_bfloat16 PASSED [0.6418s] [ 70%] 2025-12-04T12:25:43.3186577Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_complex128 PASSED [0.6410s] [ 70%] 2025-12-04T12:25:43.3186701Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_int8 PASSED [0.2432s] [ 70%] 2025-12-04T12:25:43.3186823Z test_meta.py::TestMetaCUDA::test_meta_outplace_logspace_tensor_overload_cuda_uint8 PASSED [0.1959s] [ 70%] 2025-12-04T12:25:43.3186928Z test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_bool PASSED [0.0105s] [ 70%] 2025-12-04T12:25:43.3187035Z test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_float32 PASSED [0.0224s] [ 70%] 2025-12-04T12:25:43.3187140Z test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_int16 PASSED [0.9560s] [ 70%] 2025-12-04T12:25:43.3187243Z test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_int32 PASSED [0.0115s] [ 70%] 2025-12-04T12:25:43.3187356Z test_meta.py::TestMetaCUDA::test_meta_outplace_logsumexp_cuda_int8 PASSED [0.0092s] [ 70%] 2025-12-04T12:25:43.3187455Z test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_bfloat16 PASSED [0.0035s] [ 70%] 2025-12-04T12:25:43.3187560Z test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_complex64 PASSED [0.9297s] [ 70%] 2025-12-04T12:25:43.3187656Z test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_int16 PASSED [0.0049s] [ 70%] 2025-12-04T12:25:43.3187755Z test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_int32 PASSED [0.0036s] [ 70%] 2025-12-04T12:25:43.3187849Z test_meta.py::TestMetaCUDA::test_meta_outplace_long_cuda_int8 PASSED [0.9362s] [ 70%] 2025-12-04T12:25:43.3187960Z test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_bfloat16 PASSED [0.0105s] [ 70%] 2025-12-04T12:25:43.3188115Z test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_float16 PASSED [0.0086s] [ 70%] 2025-12-04T12:25:43.3188213Z test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_float64 PASSED [0.0083s] [ 70%] 2025-12-04T12:25:43.3188309Z test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_int32 PASSED [0.0082s] [ 70%] 2025-12-04T12:25:43.3188405Z test_meta.py::TestMetaCUDA::test_meta_outplace_lt_cuda_int64 PASSED [0.0083s] [ 70%] 2025-12-04T12:25:43.3188522Z test_meta.py::TestMetaCUDA::test_meta_outplace_lu_cuda_complex128 PASSED [0.1068s] [ 70%] 2025-12-04T12:25:43.3188618Z test_meta.py::TestMetaCUDA::test_meta_outplace_lu_cuda_float64 PASSED [0.0248s] [ 70%] 2025-12-04T12:25:43.3188726Z test_meta.py::TestMetaCUDA::test_meta_outplace_lu_solve_cuda_complex64 PASSED [0.0278s] [ 70%] 2025-12-04T12:25:43.3188830Z test_meta.py::TestMetaCUDA::test_meta_outplace_lu_solve_cuda_float32 PASSED [0.0283s] [ 70%] 2025-12-04T12:25:43.3188935Z test_meta.py::TestMetaCUDA::test_meta_outplace_lu_unpack_cuda_float64 PASSED [0.0206s] [ 70%] 2025-12-04T12:25:43.3189034Z test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_complex32 PASSED [0.0035s] [ 70%] 2025-12-04T12:25:43.3189129Z test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_float32 PASSED [0.9486s] [ 70%] 2025-12-04T12:25:43.3189225Z test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_float64 PASSED [0.0052s] [ 70%] 2025-12-04T12:25:43.3189320Z test_meta.py::TestMetaCUDA::test_meta_outplace_mH_cuda_int16 PASSED [0.0038s] [ 71%] 2025-12-04T12:25:43.3189419Z test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_complex128 PASSED [0.9460s] [ 71%] 2025-12-04T12:25:43.3189515Z test_meta.py::TestMetaCUDA::test_meta_outplace_mT_cuda_int8 PASSED [0.0049s] [ 71%] 2025-12-04T12:25:43.3189623Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_float16 PASSED [0.1979s] [ 71%] 2025-12-04T12:25:43.3189731Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_int16 PASSED [0.1659s] [ 71%] 2025-12-04T12:25:43.3189837Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_int64 PASSED [0.1650s] [ 71%] 2025-12-04T12:25:43.3189942Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amax_cuda_uint8 PASSED [0.1644s] [ 71%] 2025-12-04T12:25:43.3190050Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_float16 PASSED [0.1960s] [ 71%] 2025-12-04T12:25:43.3190161Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_amin_cuda_float32 PASSED [0.1890s] [ 71%] 2025-12-04T12:25:43.3190273Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_float16 PASSED [0.1109s] [ 71%] 2025-12-04T12:25:43.3190383Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_int16 PASSED [0.0949s] [ 71%] 2025-12-04T12:25:43.3190491Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_int32 PASSED [0.0966s] [ 71%] 2025-12-04T12:25:43.3190602Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_int8 PASSED [0.0956s] [ 71%] 2025-12-04T12:25:43.3190709Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmax_cuda_uint8 PASSED [0.0952s] [ 71%] 2025-12-04T12:25:43.3190820Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmin_cuda_float16 PASSED [0.1098s] [ 71%] 2025-12-04T12:25:43.3190940Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmin_cuda_int64 PASSED [0.0951s] [ 71%] 2025-12-04T12:25:43.3191050Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmin_cuda_int8 PASSED [0.0950s] [ 71%] 2025-12-04T12:25:43.3191158Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_argmin_cuda_uint8 PASSED [0.0952s] [ 71%] 2025-12-04T12:25:43.3191274Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_float16 PASSED [0.0491s] [ 71%] 2025-12-04T12:25:43.3191383Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_int16 PASSED [0.0476s] [ 71%] 2025-12-04T12:25:43.3191494Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumprod_cuda_int64 PASSED [0.0476s] [ 71%] 2025-12-04T12:25:43.3191614Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_cumsum_cuda_int32 PASSED [0.0494s] [ 71%] 2025-12-04T12:25:43.3191742Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_int16 PASSED [0.0070s] [ 71%] 2025-12-04T12:25:43.3191849Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_fill_cuda_uint8 PASSED [0.0065s] [ 71%] 2025-12-04T12:25:43.3191967Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_log_softmax_cuda_float32 PASSED [0.0546s] [ 71%] 2025-12-04T12:25:43.3192092Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logaddexp_cuda_float32 PASSED [0.0507s] [ 71%] 2025-12-04T12:25:43.3192207Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logsumexp_cuda_float16 PASSED [0.2390s] [ 71%] 2025-12-04T12:25:43.3192323Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logsumexp_cuda_int16 PASSED [0.2104s] [ 71%] 2025-12-04T12:25:43.3192433Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_logsumexp_cuda_int8 PASSED [0.2122s] [ 71%] 2025-12-04T12:25:43.3192549Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_mean_cuda_complex128 PASSED [0.2422s] [ 71%] 2025-12-04T12:25:43.3192658Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_mean_cuda_float16 PASSED [0.2940s] [ 71%] 2025-12-04T12:25:43.3192768Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_norm_cuda_float32 PASSED [0.9387s] [ 71%] 2025-12-04T12:25:43.3192891Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_normalize_cuda_complex128 PASSED [0.0463s] [ 71%] 2025-12-04T12:25:43.3193000Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_bool PASSED [0.1792s] [ 71%] 2025-12-04T12:25:43.3193108Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_float16 PASSED [0.2205s] [ 71%] 2025-12-04T12:25:43.3193218Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_float64 PASSED [0.2076s] [ 71%] 2025-12-04T12:25:43.3193324Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_int16 PASSED [0.1771s] [ 71%] 2025-12-04T12:25:43.3193435Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_prod_cuda_int64 PASSED [0.1758s] [ 72%] 2025-12-04T12:25:43.3193550Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_bfloat16 PASSED [0.0044s] [ 72%] 2025-12-04T12:25:43.3193660Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_bool PASSED [0.9479s] [ 72%] 2025-12-04T12:25:43.3193777Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_complex128 PASSED [0.0061s] [ 72%] 2025-12-04T12:25:43.3193891Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_float16 PASSED [0.0043s] [ 72%] 2025-12-04T12:25:43.3194002Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_float32 PASSED [0.9557s] [ 72%] 2025-12-04T12:25:43.3194114Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_float64 PASSED [0.0063s] [ 72%] 2025-12-04T12:25:43.3194379Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_scatter_cuda_int64 PASSED [0.0055s] [ 72%] 2025-12-04T12:25:43.3194491Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_select_cuda_bool PASSED [0.0077s] [ 72%] 2025-12-04T12:25:43.3194605Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_select_cuda_complex64 PASSED [0.0068s] [ 72%] 2025-12-04T12:25:43.3194716Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_select_cuda_float32 PASSED [0.0066s] [ 72%] 2025-12-04T12:25:43.3194846Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_softmax_cuda_float16 PASSED [0.0478s] [ 72%] 2025-12-04T12:25:43.3194961Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_softmin_cuda_bfloat16 PASSED [0.0500s] [ 72%] 2025-12-04T12:25:43.3195073Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_bfloat16 PASSED [0.3889s] [ 72%] 2025-12-04T12:25:43.3195175Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_std_cuda_int8 PASSED [0.3866s] [ 72%] 2025-12-04T12:25:43.3195281Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_sum_cuda_int16 PASSED [0.1626s] [ 72%] 2025-12-04T12:25:43.3195398Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_bfloat16 PASSED [0.3847s] [ 72%] 2025-12-04T12:25:43.3195519Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_float64 PASSED [0.3787s] [ 72%] 2025-12-04T12:25:43.3195621Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_int16 PASSED [0.3843s] [ 72%] 2025-12-04T12:25:43.3195728Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_int64 PASSED [0.3833s] [ 72%] 2025-12-04T12:25:43.3195842Z test_meta.py::TestMetaCUDA::test_meta_outplace_masked_var_cuda_uint8 PASSED [0.3818s] [ 72%] 2025-12-04T12:25:43.3195946Z test_meta.py::TestMetaCUDA::test_meta_outplace_matmul_cuda_complex64 PASSED [0.9532s] [ 72%] 2025-12-04T12:25:43.3196048Z test_meta.py::TestMetaCUDA::test_meta_outplace_matmul_cuda_float16 PASSED [0.0065s] [ 72%] 2025-12-04T12:25:43.3196149Z test_meta.py::TestMetaCUDA::test_meta_outplace_matmul_cuda_float32 PASSED [0.9306s] [ 72%] 2025-12-04T12:25:43.3196258Z test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_bfloat16 PASSED [0.0126s] [ 72%] 2025-12-04T12:25:43.3196366Z test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_float32 PASSED [0.9446s] [ 72%] 2025-12-04T12:25:43.3196471Z test_meta.py::TestMetaCUDA::test_meta_outplace_matrix_exp_cuda_float64 PASSED [0.0068s] [ 72%] 2025-12-04T12:25:43.3196580Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_float16 PASSED [0.0093s] [ 72%] 2025-12-04T12:25:43.3196687Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_float32 PASSED [0.0083s] [ 72%] 2025-12-04T12:25:43.3196794Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_int16 PASSED [0.0082s] [ 72%] 2025-12-04T12:25:43.3196897Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_int32 PASSED [0.0082s] [ 72%] 2025-12-04T12:25:43.3197001Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_binary_cuda_int8 PASSED [0.0082s] [ 72%] 2025-12-04T12:25:43.3197138Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_pool2d_with_indices_backward_cuda_float16 PASSED [1.0214s] [ 72%] 2025-12-04T12:25:43.3197255Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_bool PASSED [0.9397s] [ 72%] 2025-12-04T12:25:43.3197376Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_float64 PASSED [0.0047s] [ 72%] 2025-12-04T12:25:43.3197493Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_int32 PASSED [0.0034s] [ 72%] 2025-12-04T12:25:43.3197611Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_no_dim_cuda_int64 PASSED [0.9525s] [ 73%] 2025-12-04T12:25:43.3197735Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_bfloat16 PASSED [0.0050s] [ 73%] 2025-12-04T12:25:43.3197851Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_bool PASSED [0.9370s] [ 73%] 2025-12-04T12:25:43.3197973Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_float64 PASSED [0.0050s] [ 73%] 2025-12-04T12:25:43.3198152Z test_meta.py::TestMetaCUDA::test_meta_outplace_max_reduction_with_dim_cuda_int16 PASSED [0.9460s] [ 73%] 2025-12-04T12:25:43.3198257Z test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_bfloat16 PASSED [0.0102s] [ 73%] 2025-12-04T12:25:43.3198357Z test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_bool PASSED [0.0083s] [ 73%] 2025-12-04T12:25:43.3198477Z test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_float16 PASSED [0.0083s] [ 73%] 2025-12-04T12:25:43.3198582Z test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_float32 PASSED [0.0082s] [ 73%] 2025-12-04T12:25:43.3198681Z test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_int32 PASSED [0.0083s] [ 73%] 2025-12-04T12:25:43.3198783Z test_meta.py::TestMetaCUDA::test_meta_outplace_maximum_cuda_int8 PASSED [0.0081s] [ 73%] 2025-12-04T12:25:43.3198884Z test_meta.py::TestMetaCUDA::test_meta_outplace_mean_cuda_complex64 PASSED [0.0212s] [ 73%] 2025-12-04T12:25:43.3198987Z test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_bfloat16 PASSED [0.0061s] [ 73%] 2025-12-04T12:25:43.3199097Z test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_int32 PASSED [0.0058s] [ 73%] 2025-12-04T12:25:43.3199210Z test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_int8 PASSED [0.0060s] [ 73%] 2025-12-04T12:25:43.3199308Z test_meta.py::TestMetaCUDA::test_meta_outplace_median_cuda_uint8 PASSED [0.0058s] [ 73%] 2025-12-04T12:25:43.3199435Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_bool PASSED [0.0055s] [ 73%] 2025-12-04T12:25:43.3199572Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_list_of_tensors_cuda_int16 PASSED [0.0054s] [ 73%] 2025-12-04T12:25:43.3199708Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_complex128 PASSED [0.0054s] [ 73%] 2025-12-04T12:25:43.3199837Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_float64 PASSED [0.0054s] [ 73%] 2025-12-04T12:25:43.3199961Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_int32 PASSED [0.0054s] [ 73%] 2025-12-04T12:25:43.3200086Z test_meta.py::TestMetaCUDA::test_meta_outplace_meshgrid_variadic_tensors_cuda_int64 PASSED [0.0054s] [ 73%] 2025-12-04T12:25:43.3200194Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_binary_cuda_float16 PASSED [0.0084s] [ 73%] 2025-12-04T12:25:43.3200302Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_binary_cuda_float32 PASSED [0.0082s] [ 73%] 2025-12-04T12:25:43.3200404Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_binary_cuda_int8 PASSED [0.0082s] [ 73%] 2025-12-04T12:25:43.3200526Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_no_dim_cuda_bfloat16 PASSED [0.9434s] [ 73%] 2025-12-04T12:25:43.3200642Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_no_dim_cuda_int32 PASSED [0.0047s] [ 73%] 2025-12-04T12:25:43.3200759Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_no_dim_cuda_int8 PASSED [0.0033s] [ 73%] 2025-12-04T12:25:43.3200882Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_float16 PASSED [0.9609s] [ 73%] 2025-12-04T12:25:43.3201006Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_float32 PASSED [0.0050s] [ 73%] 2025-12-04T12:25:43.3201122Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_int16 PASSED [0.9501s] [ 73%] 2025-12-04T12:25:43.3201242Z test_meta.py::TestMetaCUDA::test_meta_outplace_min_reduction_with_dim_cuda_int32 PASSED [0.0051s] [ 73%] 2025-12-04T12:25:43.3201342Z test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_bool PASSED [0.0087s] [ 73%] 2025-12-04T12:25:43.3201443Z test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_int16 PASSED [0.0083s] [ 73%] 2025-12-04T12:25:43.3201543Z test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_int32 PASSED [0.0083s] [ 73%] 2025-12-04T12:25:43.3201640Z test_meta.py::TestMetaCUDA::test_meta_outplace_minimum_cuda_int8 PASSED [0.0083s] [ 73%] 2025-12-04T12:25:43.3201736Z test_meta.py::TestMetaCUDA::test_meta_outplace_mm_cuda_bfloat16 PASSED [0.9920s] [ 74%] 2025-12-04T12:25:43.3201836Z test_meta.py::TestMetaCUDA::test_meta_outplace_mm_cuda_complex64 PASSED [0.0070s] [ 74%] 2025-12-04T12:25:43.3201934Z test_meta.py::TestMetaCUDA::test_meta_outplace_mm_cuda_float16 PASSED [0.0044s] [ 74%] 2025-12-04T12:25:43.3202032Z test_meta.py::TestMetaCUDA::test_meta_outplace_mm_cuda_float32 PASSED [0.0042s] [ 74%] 2025-12-04T12:25:43.3202140Z test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_bfloat16 PASSED [1.1324s] [ 74%] 2025-12-04T12:25:43.3202238Z test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_bool PASSED [0.0065s] [ 74%] 2025-12-04T12:25:43.3202335Z test_meta.py::TestMetaCUDA::test_meta_outplace_mode_cuda_float16 PASSED [0.0049s] [ 74%] 2025-12-04T12:25:43.3202443Z test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_complex128 PASSED [0.9569s] [ 74%] 2025-12-04T12:25:43.3202550Z test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_complex32 PASSED [0.0043s] [ 74%] 2025-12-04T12:25:43.3202663Z test_meta.py::TestMetaCUDA::test_meta_outplace_movedim_cuda_float16 PASSED [0.9305s] [ 74%] 2025-12-04T12:25:43.3202775Z test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_bool PASSED [0.0052s] [ 74%] 2025-12-04T12:25:43.3202872Z test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_float64 PASSED [0.0036s] [ 74%] 2025-12-04T12:25:43.3202971Z test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_int8 PASSED [0.9530s] [ 74%] 2025-12-04T12:25:43.3203068Z test_meta.py::TestMetaCUDA::test_meta_outplace_msort_cuda_uint8 PASSED [0.0053s] [ 74%] 2025-12-04T12:25:43.3203176Z test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_bfloat16 PASSED [0.0091s] [ 74%] 2025-12-04T12:25:43.3203270Z test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_bool PASSED [0.0081s] [ 74%] 2025-12-04T12:25:43.3203373Z test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_complex128 PASSED [0.0082s] [ 74%] 2025-12-04T12:25:43.3203474Z test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_complex64 PASSED [0.0082s] [ 74%] 2025-12-04T12:25:43.3203571Z test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_int32 PASSED [0.0082s] [ 74%] 2025-12-04T12:25:43.3203666Z test_meta.py::TestMetaCUDA::test_meta_outplace_mul_cuda_int8 PASSED [0.0081s] [ 74%] 2025-12-04T12:25:43.3203779Z test_meta.py::TestMetaCUDA::test_meta_outplace_multinomial_cuda_float32 PASSED [0.0281s] [ 74%] 2025-12-04T12:25:43.3203890Z test_meta.py::TestMetaCUDA::test_meta_outplace_multinomial_cuda_float64 PASSED [0.9366s] [ 74%] 2025-12-04T12:25:43.3203992Z test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_complex128 PASSED [0.0046s] [ 74%] 2025-12-04T12:25:43.3204090Z test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_complex64 PASSED [0.0031s] [ 74%] 2025-12-04T12:25:43.3204186Z test_meta.py::TestMetaCUDA::test_meta_outplace_mv_cuda_float16 PASSED [0.9483s] [ 74%] 2025-12-04T12:25:43.3204311Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float16 PASSED [0.0160s] [ 74%] 2025-12-04T12:25:43.3204436Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float32 PASSED [0.0137s] [ 74%] 2025-12-04T12:25:43.3204559Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_float64 PASSED [0.0135s] [ 74%] 2025-12-04T12:25:43.3204677Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_1_cuda_int8 PASSED [0.0135s] [ 74%] 2025-12-04T12:25:43.3204800Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_float32 PASSED [0.0135s] [ 74%] 2025-12-04T12:25:43.3204920Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_int64 PASSED [0.0135s] [ 74%] 2025-12-04T12:25:43.3205040Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_3_cuda_uint8 PASSED [0.9471s] [ 74%] 2025-12-04T12:25:43.3205157Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int64 PASSED [0.0155s] [ 74%] 2025-12-04T12:25:43.3205275Z test_meta.py::TestMetaCUDA::test_meta_outplace_mvlgamma_mvlgamma_p_5_cuda_int8 PASSED [0.0137s] [ 74%] 2025-12-04T12:25:43.3205386Z test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_float64 PASSED [0.9263s] [ 74%] 2025-12-04T12:25:43.3205494Z test_meta.py::TestMetaCUDA::test_meta_outplace_nan_to_num_cuda_uint8 PASSED [0.0051s] [ 74%] 2025-12-04T12:25:43.3205598Z test_meta.py::TestMetaCUDA::test_meta_outplace_nanmean_cuda_float32 PASSED [0.9308s] [ 75%] 2025-12-04T12:25:43.3205717Z test_meta.py::TestMetaCUDA::test_meta_outplace_nanmedian_cuda_float16 PASSED [0.0081s] [ 75%] 2025-12-04T12:25:43.3205829Z test_meta.py::TestMetaCUDA::test_meta_outplace_nanquantile_cuda_float32 PASSED [0.0322s] [ 75%] 2025-12-04T12:25:43.3207400Z test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_bool PASSED [0.0130s] [ 75%] 2025-12-04T12:25:43.3207506Z test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_float64 PASSED [0.0171s] [ 75%] 2025-12-04T12:25:43.3207607Z test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_int16 PASSED [0.0127s] [ 75%] 2025-12-04T12:25:43.3207705Z test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_int32 PASSED [0.0128s] [ 75%] 2025-12-04T12:25:43.3207829Z test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_int64 PASSED [0.0124s] [ 75%] 2025-12-04T12:25:43.3207939Z test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_int8 PASSED [0.0126s] [ 75%] 2025-12-04T12:25:43.3208038Z test_meta.py::TestMetaCUDA::test_meta_outplace_nansum_cuda_uint8 PASSED [0.0128s] [ 75%] 2025-12-04T12:25:43.3208276Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_bool XFAIL [0.0030s] [ 75%] 2025-12-04T12:25:43.3208403Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_float32 XFAIL [0.9442s] [ 75%] 2025-12-04T12:25:43.3208508Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_int16 XFAIL [0.9335s] [ 75%] 2025-12-04T12:25:43.3208612Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_int32 XFAIL [0.9478s] [ 75%] 2025-12-04T12:25:43.3208716Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_copy_cuda_uint8 XFAIL [0.9268s] [ 75%] 2025-12-04T12:25:43.3208818Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_bool PASSED [1.8723s] [ 75%] 2025-12-04T12:25:43.3208921Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_complex64 PASSED [0.0046s] [ 75%] 2025-12-04T12:25:43.3209024Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_float16 PASSED [0.9379s] [ 75%] 2025-12-04T12:25:43.3209124Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_int8 PASSED [0.0047s] [ 75%] 2025-12-04T12:25:43.3209223Z test_meta.py::TestMetaCUDA::test_meta_outplace_narrow_cuda_uint8 PASSED [0.9364s] [ 75%] 2025-12-04T12:25:43.3209342Z test_meta.py::TestMetaCUDA::test_meta_outplace_native_batch_norm_cuda_bfloat16 PASSED [0.0152s] [ 75%] 2025-12-04T12:25:43.3209460Z test_meta.py::TestMetaCUDA::test_meta_outplace_native_batch_norm_cuda_float16 PASSED [0.0132s] [ 75%] 2025-12-04T12:25:43.3209576Z test_meta.py::TestMetaCUDA::test_meta_outplace_native_batch_norm_cuda_float64 PASSED [0.0128s] [ 75%] 2025-12-04T12:25:43.3209694Z test_meta.py::TestMetaCUDA::test_meta_outplace_native_layer_norm_cuda_bfloat16 PASSED [0.0236s] [ 75%] 2025-12-04T12:25:43.3209792Z test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_bool PASSED [0.0082s] [ 75%] 2025-12-04T12:25:43.3209889Z test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_float32 PASSED [0.0082s] [ 75%] 2025-12-04T12:25:43.3209985Z test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_int16 PASSED [0.0082s] [ 75%] 2025-12-04T12:25:43.3210079Z test_meta.py::TestMetaCUDA::test_meta_outplace_ne_cuda_int64 PASSED [0.0084s] [ 75%] 2025-12-04T12:25:43.3210179Z test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_bfloat16 PASSED [0.0033s] [ 75%] 2025-12-04T12:25:43.3210279Z test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_complex128 PASSED [0.9381s] [ 75%] 2025-12-04T12:25:43.3210380Z test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_complex64 PASSED [0.0060s] [ 75%] 2025-12-04T12:25:43.3210474Z test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_int32 PASSED [0.9438s] [ 75%] 2025-12-04T12:25:43.3210570Z test_meta.py::TestMetaCUDA::test_meta_outplace_neg_cuda_uint8 PASSED [0.0042s] [ 75%] 2025-12-04T12:25:43.3210673Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_bool PASSED [0.0046s] [ 75%] 2025-12-04T12:25:43.3210780Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_float16 PASSED [0.9370s] [ 75%] 2025-12-04T12:25:43.3210900Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_cuda_uint8 PASSED [0.0060s] [ 75%] 2025-12-04T12:25:43.3211017Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_float16 PASSED [0.0046s] [ 75%] 2025-12-04T12:25:43.3211132Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_empty_strided_cuda_float64 PASSED [0.0043s] [ 76%] 2025-12-04T12:25:43.3211234Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_bool PASSED [0.9418s] [ 76%] 2025-12-04T12:25:43.3211342Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_complex64 PASSED [0.0065s] [ 76%] 2025-12-04T12:25:43.3211456Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_int32 PASSED [0.0046s] [ 76%] 2025-12-04T12:25:43.3211571Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_full_cuda_uint8 PASSED [0.0044s] [ 76%] 2025-12-04T12:25:43.3211678Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_complex32 PASSED [0.9339s] [ 76%] 2025-12-04T12:25:43.3211786Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_complex64 PASSED [0.0063s] [ 76%] 2025-12-04T12:25:43.3211890Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_float16 PASSED [0.0046s] [ 76%] 2025-12-04T12:25:43.3212002Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_ones_cuda_uint8 PASSED [0.0044s] [ 76%] 2025-12-04T12:25:43.3212109Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_bfloat16 PASSED [0.9502s] [ 76%] 2025-12-04T12:25:43.3212209Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_bool PASSED [0.0063s] [ 76%] 2025-12-04T12:25:43.3212312Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_int16 PASSED [0.0047s] [ 76%] 2025-12-04T12:25:43.3212417Z test_meta.py::TestMetaCUDA::test_meta_outplace_new_zeros_cuda_uint8 PASSED [0.0045s] [ 76%] 2025-12-04T12:25:43.3212560Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_bfloat16 PASSED [0.9467s] [ 76%] 2025-12-04T12:25:43.3212699Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_avg_pool1d_cuda_float32 PASSED [0.0061s] [ 76%] 2025-12-04T12:25:43.3212838Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool1d_cuda_float32 PASSED [0.0103s] [ 76%] 2025-12-04T12:25:43.3212979Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_bfloat16 PASSED [0.0092s] [ 76%] 2025-12-04T12:25:43.3213115Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool2d_cuda_float32 PASSED [0.0091s] [ 76%] 2025-12-04T12:25:43.3213252Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_adaptive_max_pool3d_cuda_float16 PASSED [0.0128s] [ 76%] 2025-12-04T12:25:43.3213386Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_alpha_dropout_cuda_float64 PASSED [0.0125s] [ 76%] 2025-12-04T12:25:43.3213514Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_avg_pool2d_cuda_float64 PASSED [0.0105s] [ 76%] 2025-12-04T12:25:43.3213638Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_batch_norm_cuda_float16 PASSED [0.0138s] [ 76%] 2025-12-04T12:25:43.3213766Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_batch_norm_cuda_float32 PASSED [0.9667s] [ 76%] 2025-12-04T12:25:43.3213912Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_batch_norm_without_cudnn_cuda_float32 PASSED [0.9384s] [ 76%] 2025-12-04T12:25:43.3214040Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_bilinear_cuda_bfloat16 PASSED [0.4513s] [ 76%] 2025-12-04T12:25:43.3214163Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_bilinear_cuda_float16 PASSED [0.4797s] [ 76%] 2025-12-04T12:25:43.3214307Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_cuda_bfloat16 PASSED [0.9545s] [ 76%] 2025-12-04T12:25:43.3214446Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_cuda_float16 PASSED [0.0053s] [ 76%] 2025-12-04T12:25:43.3214588Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_cuda_float64 PASSED [0.9421s] [ 76%] 2025-12-04T12:25:43.3214768Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16 PASSED [0.0394s] [ 76%] 2025-12-04T12:25:43.3214888Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_celu_cuda_bfloat16 PASSED [0.0037s] [ 76%] 2025-12-04T12:25:43.3215021Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_float32 PASSED [0.9328s] [ 76%] 2025-12-04T12:25:43.3215238Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_channel_shuffle_cuda_uint8 PASSED [0.0050s] [ 76%] 2025-12-04T12:25:43.3215374Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv1d_cuda_bfloat16 PASSED [0.1136s] [ 76%] 2025-12-04T12:25:43.3215507Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv1d_cuda_complex64 PASSED [0.0247s] [ 76%] 2025-12-04T12:25:43.3215626Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv1d_cuda_float64 PASSED [0.0068s] [ 76%] 2025-12-04T12:25:43.3215950Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_bfloat16 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 26400, provided ptr: 0x71bf3a603a00 size: 5888 2025-12-04T12:25:43.3216147Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 26400, provided ptr: 0x71bf3a603a00 size: 5888 2025-12-04T12:25:43.3216348Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 168960, provided ptr: 0x71bf3a603600 size: 6656 2025-12-04T12:25:43.3216538Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 168960, provided ptr: 0x71bf3a603600 size: 6656 2025-12-04T12:25:43.3216584Z PASSED [0.0882s] [ 77%] 2025-12-04T12:25:43.3216711Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv3d_cuda_float64 PASSED [0.0161s] [ 77%] 2025-12-04T12:25:43.3216850Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose1d_cuda_float64 PASSED [0.9333s] [ 77%] 2025-12-04T12:25:43.3216987Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose2d_cuda_bfloat16 PASSED [0.4611s] [ 77%] 2025-12-04T12:25:43.3217122Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose2d_cuda_float16 PASSED [1.4702s] [ 77%] 2025-12-04T12:25:43.3217255Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose2d_cuda_float32 PASSED [0.0113s] [ 77%] 2025-12-04T12:25:43.3217389Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose3d_cuda_bfloat16 PASSED [0.0374s] [ 77%] 2025-12-04T12:25:43.3217521Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose3d_cuda_float16 PASSED [0.0093s] [ 77%] 2025-12-04T12:25:43.3217656Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_conv_transpose3d_cuda_float32 PASSED [0.0091s] [ 77%] 2025-12-04T12:25:43.3217797Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_embedding_loss_cuda_int32 PASSED [0.0112s] [ 77%] 2025-12-04T12:25:43.3217940Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cosine_similarity_cuda_bfloat16 PASSED [0.0159s] [ 77%] 2025-12-04T12:25:43.3218132Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cross_entropy_cuda_bfloat16 PASSED [0.9401s] [ 77%] 2025-12-04T12:25:43.3218265Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_cross_entropy_cuda_float16 PASSED [0.0073s] [ 77%] 2025-12-04T12:25:43.3218391Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout3d_cuda_bfloat16 PASSED [0.0087s] [ 77%] 2025-12-04T12:25:43.3218519Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout3d_cuda_float16 PASSED [0.0078s] [ 77%] 2025-12-04T12:25:43.3218640Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_dropout_cuda_float32 PASSED [0.0225s] [ 77%] 2025-12-04T12:25:43.3218755Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_elu_cuda_float16 PASSED [0.9523s] [ 77%] 2025-12-04T12:25:43.3218901Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_embedding_bag_cuda_bfloat16 PASSED [0.0364s] [ 77%] 2025-12-04T12:25:43.3219031Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_embedding_bag_cuda_float32 PASSED [0.9584s] [ 77%] 2025-12-04T12:25:43.3219156Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_embedding_cuda_bfloat16 PASSED [0.0081s] [ 77%] 2025-12-04T12:25:43.3219312Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_float32 PASSED [0.0069s] [ 77%] 2025-12-04T12:25:43.3219481Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_with_train_cuda_float64 PASSED [0.0065s] [ 77%] 2025-12-04T12:25:43.3219661Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_feature_alpha_dropout_without_train_cuda_complex128 PASSED [0.0070s] [ 77%] 2025-12-04T12:25:43.3219799Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_gaussian_nll_loss_cuda_bfloat16 PASSED [0.2018s] [ 77%] 2025-12-04T12:25:43.3219937Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float16 PASSED [0.2020s] [ 77%] 2025-12-04T12:25:43.3220087Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_gaussian_nll_loss_cuda_float32 PASSED [0.2062s] [ 77%] 2025-12-04T12:25:43.3220209Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_gelu_cuda_bfloat16 PASSED [0.0293s] [ 77%] 2025-12-04T12:25:43.3220338Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_grid_sample_cuda_bfloat16 PASSED [0.3649s] [ 77%] 2025-12-04T12:25:43.3220463Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_grid_sample_cuda_float16 PASSED [0.3435s] [ 77%] 2025-12-04T12:25:43.3220591Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_grid_sample_cuda_float32 PASSED [1.2442s] [ 77%] 2025-12-04T12:25:43.3220715Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_group_norm_cuda_float16 PASSED [0.0570s] [ 77%] 2025-12-04T12:25:43.3220845Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardsigmoid_cuda_float16 PASSED [0.9386s] [ 77%] 2025-12-04T12:25:43.3220971Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardsigmoid_cuda_float64 PASSED [0.0062s] [ 77%] 2025-12-04T12:25:43.3221099Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardswish_cuda_bfloat16 PASSED [0.0136s] [ 77%] 2025-12-04T12:25:43.3221222Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardtanh_cuda_bfloat16 PASSED [0.9522s] [ 77%] 2025-12-04T12:25:43.3221346Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardtanh_cuda_float32 PASSED [0.0075s] [ 77%] 2025-12-04T12:25:43.3221468Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardtanh_cuda_float64 PASSED [0.9351s] [ 78%] 2025-12-04T12:25:43.3221587Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardtanh_cuda_int16 PASSED [0.0074s] [ 78%] 2025-12-04T12:25:43.3221707Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hardtanh_cuda_int64 PASSED [0.9540s] [ 78%] 2025-12-04T12:25:43.3221849Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hinge_embedding_loss_cuda_bfloat16 PASSED [0.0147s] [ 78%] 2025-12-04T12:25:43.3221990Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_hinge_embedding_loss_cuda_float16 PASSED [0.0127s] [ 78%] 2025-12-04T12:25:43.3222115Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_huber_loss_cuda_float16 PASSED [0.0097s] [ 78%] 2025-12-04T12:25:43.3222245Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_instance_norm_cuda_bfloat16 PASSED [0.0153s] [ 78%] 2025-12-04T12:25:43.3222376Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_instance_norm_cuda_float32 PASSED [0.0185s] [ 78%] 2025-12-04T12:25:43.3222519Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_bicubic_cuda_bfloat16 PASSED [0.4226s] [ 78%] 2025-12-04T12:25:43.3222677Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_bicubic_cuda_float16 PASSED [0.9464s] [ 78%] 2025-12-04T12:25:43.3222830Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_bfloat16 PASSED [0.0281s] [ 78%] 2025-12-04T12:25:43.3222976Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_nearest-exact_cuda_uint8 PASSED [1.0213s] [ 78%] 2025-12-04T12:25:43.3223116Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_nearest_cuda_bfloat16 PASSED [0.0106s] [ 78%] 2025-12-04T12:25:43.3223254Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_nearest_cuda_float16 PASSED [1.5023s] [ 78%] 2025-12-04T12:25:43.3223407Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_interpolate_trilinear_cuda_float64 PASSED [0.1304s] [ 78%] 2025-12-04T12:25:43.3223543Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_kl_div_cuda_bfloat16 PASSED [0.0142s] [ 78%] 2025-12-04T12:25:43.3223664Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_kl_div_cuda_float64 PASSED [0.0132s] [ 78%] 2025-12-04T12:25:43.3223788Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_bfloat16 PASSED [1.4922s] [ 78%] 2025-12-04T12:25:43.3223920Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_complex64 PASSED [0.0091s] [ 78%] 2025-12-04T12:25:43.3224040Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_float16 PASSED [0.0056s] [ 78%] 2025-12-04T12:25:43.3224157Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_l1_loss_cuda_float32 PASSED [0.0052s] [ 78%] 2025-12-04T12:25:43.3224282Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_layer_norm_cuda_float32 PASSED [0.0073s] [ 78%] 2025-12-04T12:25:43.3224406Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_leaky_relu_cuda_float16 PASSED [0.0065s] [ 78%] 2025-12-04T12:25:43.3224529Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_linear_cuda_complex64 PASSED [0.0173s] [ 78%] 2025-12-04T12:25:43.3224649Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_linear_cuda_float32 PASSED [0.0172s] [ 78%] 2025-12-04T12:25:43.3224792Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_local_response_norm_cuda_float16 PASSED [0.0042s] [ 78%] 2025-12-04T12:25:43.3224929Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_local_response_norm_cuda_float64 PASSED [0.0042s] [ 78%] 2025-12-04T12:25:43.3225068Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_margin_ranking_loss_cuda_float16 PASSED [0.0229s] [ 78%] 2025-12-04T12:25:43.3225203Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_margin_ranking_loss_cuda_int64 PASSED [1.5281s] [ 78%] 2025-12-04T12:25:43.3225328Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool1d_cuda_float32 PASSED [0.2599s] [ 78%] 2025-12-04T12:25:43.3225450Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool1d_cuda_float64 PASSED [0.2588s] [ 78%] 2025-12-04T12:25:43.3225577Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool2d_cuda_bfloat16 PASSED [0.4309s] [ 78%] 2025-12-04T12:25:43.3225704Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool3d_cuda_bfloat16 PASSED [0.1856s] [ 78%] 2025-12-04T12:25:43.3225830Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_pool3d_cuda_float64 PASSED [0.1799s] [ 78%] 2025-12-04T12:25:43.3225957Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_cuda_bfloat16 PASSED [0.1123s] [ 78%] 2025-12-04T12:25:43.3226082Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_cuda_float64 PASSED [0.1057s] [ 78%] 2025-12-04T12:25:43.3226218Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool1d_grad_cuda_bfloat16 PASSED [0.0259s] [ 78%] 2025-12-04T12:25:43.3226346Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool2d_cuda_bfloat16 PASSED [0.1917s] [ 79%] 2025-12-04T12:25:43.3226485Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool2d_cuda_float64 PASSED [0.1927s] [ 79%] 2025-12-04T12:25:43.3226618Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool2d_grad_cuda_float32 PASSED [0.0537s] [ 79%] 2025-12-04T12:25:43.3226754Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool2d_grad_cuda_float64 PASSED [0.0537s] [ 79%] 2025-12-04T12:25:43.3226881Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool3d_cuda_bfloat16 PASSED [0.0647s] [ 79%] 2025-12-04T12:25:43.3227007Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool3d_cuda_float16 PASSED [0.0638s] [ 79%] 2025-12-04T12:25:43.3227152Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool3d_grad_cuda_bfloat16 PASSED [0.0199s] [ 79%] 2025-12-04T12:25:43.3227299Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_max_unpool3d_grad_cuda_float16 PASSED [0.0203s] [ 79%] 2025-12-04T12:25:43.3227416Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mish_cuda_float16 PASSED [0.0041s] [ 79%] 2025-12-04T12:25:43.3227533Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mish_cuda_float32 PASSED [1.5016s] [ 79%] 2025-12-04T12:25:43.3227668Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mse_loss_cuda_bfloat16 PASSED [0.0081s] [ 79%] 2025-12-04T12:25:43.3227792Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_mse_loss_cuda_float16 PASSED [1.4960s] [ 79%] 2025-12-04T12:25:43.3227943Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multi_head_attention_forward_cuda_bfloat16 PASSED [0.4785s] [ 79%] 2025-12-04T12:25:43.3228139Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multi_head_attention_forward_cuda_float32 PASSED [0.4663s] [ 79%] 2025-12-04T12:25:43.3228289Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multi_head_attention_forward_cuda_float64 PASSED [0.4632s] [ 79%] 2025-12-04T12:25:43.3228426Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multi_margin_loss_cuda_float16 PASSED [0.0186s] [ 79%] 2025-12-04T12:25:43.3228562Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multi_margin_loss_cuda_float32 PASSED [0.0120s] [ 79%] 2025-12-04T12:25:43.3228706Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_multilabel_margin_loss_cuda_bfloat16 PASSED [0.0158s] [ 79%] 2025-12-04T12:25:43.3228829Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_nll_loss_cuda_float16 PASSED [1.5133s] [ 79%] 2025-12-04T12:25:43.3228950Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_nll_loss_cuda_float32 PASSED [0.0139s] [ 79%] 2025-12-04T12:25:43.3229085Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_normalize_cuda_complex128 PASSED [0.0171s] [ 79%] 2025-12-04T12:25:43.3229208Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_normalize_cuda_float16 PASSED [0.0158s] [ 79%] 2025-12-04T12:25:43.3229331Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_normalize_cuda_float32 PASSED [0.0151s] [ 79%] 2025-12-04T12:25:43.3229463Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_complex128 PASSED [0.0046s] [ 79%] 2025-12-04T12:25:43.3229591Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_float64 PASSED [1.4984s] [ 79%] 2025-12-04T12:25:43.3229713Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_circular_cuda_int32 PASSED [0.0066s] [ 79%] 2025-12-04T12:25:43.3229843Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_constant_cuda_complex64 PASSED [0.0162s] [ 79%] 2025-12-04T12:25:43.3229965Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_constant_cuda_int64 PASSED [0.0156s] [ 79%] 2025-12-04T12:25:43.3230095Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_complex128 PASSED [0.0138s] [ 79%] 2025-12-04T12:25:43.3230225Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_complex64 PASSED [0.0055s] [ 79%] 2025-12-04T12:25:43.3230368Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_reflect_cuda_float64 PASSED [1.5022s] [ 79%] 2025-12-04T12:25:43.3230503Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_complex128 PASSED [0.0157s] [ 79%] 2025-12-04T12:25:43.3230629Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_float16 PASSED [0.0058s] [ 79%] 2025-12-04T12:25:43.3230758Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_float32 PASSED [0.0057s] [ 79%] 2025-12-04T12:25:43.3230883Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_cuda_int32 PASSED [0.0055s] [ 79%] 2025-12-04T12:25:43.3231038Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pad_replicate_negative_cuda_float32 PASSED [1.4935s] [ 79%] 2025-12-04T12:25:43.3231189Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_bfloat16 PASSED [0.0094s] [ 80%] 2025-12-04T12:25:43.3231332Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_complex64 PASSED [1.4883s] [ 80%] 2025-12-04T12:25:43.3231479Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_float16 PASSED [0.0094s] [ 80%] 2025-12-04T12:25:43.3231615Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_float32 PASSED [1.5020s] [ 80%] 2025-12-04T12:25:43.3231750Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_float64 PASSED [0.0089s] [ 80%] 2025-12-04T12:25:43.3231884Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_int16 PASSED [1.4932s] [ 80%] 2025-12-04T12:25:43.3232015Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_int64 PASSED [0.0089s] [ 80%] 2025-12-04T12:25:43.3232149Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pairwise_distance_cuda_int8 PASSED [1.5015s] [ 80%] 2025-12-04T12:25:43.3232269Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pdist_cuda_float32 PASSED [0.0137s] [ 80%] 2025-12-04T12:25:43.3232405Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_complex128 PASSED [0.0040s] [ 80%] 2025-12-04T12:25:43.3232534Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_float32 PASSED [0.0035s] [ 80%] 2025-12-04T12:25:43.3232663Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_int64 PASSED [0.0035s] [ 80%] 2025-12-04T12:25:43.3232789Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_int8 PASSED [0.0035s] [ 80%] 2025-12-04T12:25:43.3232918Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_shuffle_cuda_uint8 PASSED [0.0033s] [ 80%] 2025-12-04T12:25:43.3233053Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_bfloat16 PASSED [0.0036s] [ 80%] 2025-12-04T12:25:43.3233179Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_bool PASSED [0.0036s] [ 80%] 2025-12-04T12:25:43.3233317Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_complex64 PASSED [0.0034s] [ 80%] 2025-12-04T12:25:43.3233451Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_float16 PASSED [0.0037s] [ 80%] 2025-12-04T12:25:43.3233585Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_float64 PASSED [0.0036s] [ 80%] 2025-12-04T12:25:43.3233711Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_pixel_unshuffle_cuda_int8 PASSED [0.0034s] [ 80%] 2025-12-04T12:25:43.3233847Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_bfloat16 PASSED [0.0623s] [ 80%] 2025-12-04T12:25:43.3233978Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_float16 PASSED [0.0614s] [ 80%] 2025-12-04T12:25:43.3234110Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_float32 PASSED [0.0588s] [ 80%] 2025-12-04T12:25:43.3234250Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_int64 PASSED [0.0599s] [ 80%] 2025-12-04T12:25:43.3234381Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_poisson_nll_loss_cuda_int8 PASSED [1.5459s] [ 80%] 2025-12-04T12:25:43.3234501Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_prelu_cuda_bfloat16 PASSED [0.0269s] [ 80%] 2025-12-04T12:25:43.3234622Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_prelu_cuda_float32 PASSED [1.5230s] [ 80%] 2025-12-04T12:25:43.3234737Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_int16 PASSED [0.0061s] [ 80%] 2025-12-04T12:25:43.3234863Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_int64 PASSED [0.0043s] [ 80%] 2025-12-04T12:25:43.3234987Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu6_cuda_uint8 PASSED [0.0041s] [ 80%] 2025-12-04T12:25:43.3235103Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_bfloat16 PASSED [1.4950s] [ 80%] 2025-12-04T12:25:43.3235224Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_float64 PASSED [0.0059s] [ 80%] 2025-12-04T12:25:43.3235354Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_int64 PASSED [0.0042s] [ 80%] 2025-12-04T12:25:43.3235474Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_relu_cuda_int8 PASSED [1.4871s] [ 80%] 2025-12-04T12:25:43.3235602Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_rms_norm_cuda_complex128 PASSED [0.0108s] [ 80%] 2025-12-04T12:25:43.3235727Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_rms_norm_cuda_float16 PASSED [1.4964s] [ 80%] 2025-12-04T12:25:43.3235849Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_rms_norm_cuda_float64 PASSED [0.0084s] [ 81%] 2025-12-04T12:25:43.3235966Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_rrelu_cuda_float64 PASSED [1.5062s] [ 81%] 2025-12-04T12:25:43.3236119Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_bfloat16 PASSED [0.0583s] [ 81%] 2025-12-04T12:25:43.3236270Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float16 PASSED [0.0532s] [ 81%] 2025-12-04T12:25:43.3236418Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_scaled_dot_product_attention_cuda_float32 PASSED [0.0493s] [ 81%] 2025-12-04T12:25:43.3236536Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_selu_cuda_bfloat16 PASSED [1.4932s] [ 81%] 2025-12-04T12:25:43.3236651Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_selu_cuda_float32 PASSED [0.0062s] [ 81%] 2025-12-04T12:25:43.3236784Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_silu_complex_cuda_complex64 PASSED [0.0035s] [ 81%] 2025-12-04T12:25:43.3236913Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_smooth_l1_loss_cuda_float16 PASSED [0.0090s] [ 81%] 2025-12-04T12:25:43.3237048Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_soft_margin_loss_cuda_float32 PASSED [1.5114s] [ 81%] 2025-12-04T12:25:43.3237169Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_cuda_float32 PASSED [0.0078s] [ 81%] 2025-12-04T12:25:43.3237308Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_complex64 PASSED [0.0052s] [ 81%] 2025-12-04T12:25:43.3237446Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_float32 PASSED [0.0048s] [ 81%] 2025-12-04T12:25:43.3237582Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softmin_with_dtype_cuda_float64 PASSED [1.5567s] [ 81%] 2025-12-04T12:25:43.3237709Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softplus_cuda_float16 PASSED [0.0197s] [ 81%] 2025-12-04T12:25:43.3237831Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softplus_cuda_float64 PASSED [1.5223s] [ 81%] 2025-12-04T12:25:43.3237957Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softshrink_cuda_float16 PASSED [0.0186s] [ 81%] 2025-12-04T12:25:43.3238125Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softshrink_cuda_float32 PASSED [0.0079s] [ 81%] 2025-12-04T12:25:43.3238251Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_bfloat16 PASSED [0.0037s] [ 81%] 2025-12-04T12:25:43.3238377Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_complex128 PASSED [1.5113s] [ 81%] 2025-12-04T12:25:43.3238503Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_complex64 PASSED [0.0057s] [ 81%] 2025-12-04T12:25:43.3238639Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_float16 PASSED [0.0040s] [ 81%] 2025-12-04T12:25:43.3238776Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_float32 PASSED [1.5204s] [ 81%] 2025-12-04T12:25:43.3238894Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_int16 PASSED [0.0058s] [ 81%] 2025-12-04T12:25:43.3239013Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_int32 PASSED [0.0039s] [ 81%] 2025-12-04T12:25:43.3239141Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_int8 PASSED [1.4990s] [ 81%] 2025-12-04T12:25:43.3239259Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_softsign_cuda_uint8 PASSED [0.0058s] [ 81%] 2025-12-04T12:25:43.3239388Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_complex64 PASSED [1.5591s] [ 81%] 2025-12-04T12:25:43.3239514Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_float32 PASSED [0.0053s] [ 81%] 2025-12-04T12:25:43.3239636Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_int16 PASSED [1.5000s] [ 81%] 2025-12-04T12:25:43.3239757Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_int32 PASSED [0.0053s] [ 81%] 2025-12-04T12:25:43.3239888Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_tanhshrink_cuda_uint8 PASSED [1.5011s] [ 81%] 2025-12-04T12:25:43.3240013Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_threshold_cuda_float16 PASSED [0.0067s] [ 81%] 2025-12-04T12:25:43.3240137Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_threshold_cuda_int16 PASSED [0.0046s] [ 81%] 2025-12-04T12:25:43.3240258Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_threshold_cuda_int64 PASSED [1.5043s] [ 81%] 2025-12-04T12:25:43.3240398Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_loss_cuda_float16 PASSED [0.0141s] [ 81%] 2025-12-04T12:25:43.3240537Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_loss_cuda_float64 PASSED [1.5164s] [ 81%] 2025-12-04T12:25:43.3240699Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex128 PASSED [0.0152s] [ 82%] 2025-12-04T12:25:43.3240856Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_complex64 PASSED [0.0126s] [ 82%] 2025-12-04T12:25:43.3241016Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_float32 PASSED [0.0118s] [ 82%] 2025-12-04T12:25:43.3241169Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_triplet_margin_with_distance_loss_cuda_int8 PASSED [0.0118s] [ 82%] 2025-12-04T12:25:43.3241292Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_unfold_cuda_bfloat16 PASSED [0.0919s] [ 82%] 2025-12-04T12:25:43.3241407Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_unfold_cuda_bool PASSED [0.0970s] [ 82%] 2025-12-04T12:25:43.3241549Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_upsample_bilinear_cuda_bfloat16 PASSED [1.5291s] [ 82%] 2025-12-04T12:25:43.3241685Z test_meta.py::TestMetaCUDA::test_meta_outplace_nn_functional_upsample_nearest_cuda_float64 PASSED [0.0082s] [ 82%] 2025-12-04T12:25:43.3241794Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_bfloat16 PASSED [0.0118s] [ 82%] 2025-12-04T12:25:43.3241919Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_complex128 PASSED [0.0110s] [ 82%] 2025-12-04T12:25:43.3242024Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_float16 PASSED [0.0109s] [ 82%] 2025-12-04T12:25:43.3242129Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_float32 PASSED [0.0108s] [ 82%] 2025-12-04T12:25:43.3242229Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_int32 PASSED [0.0108s] [ 82%] 2025-12-04T12:25:43.3242328Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_cuda_int8 PASSED [0.0107s] [ 82%] 2025-12-04T12:25:43.3242484Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_complex128 SKIPPED [0.0006s] (Only runs on cpu) [ 82%] 2025-12-04T12:25:43.3242632Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_int16 SKIPPED [0.0006s] (Only runs on cpu) [ 82%] 2025-12-04T12:25:43.3242769Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_int32 SKIPPED [0.0005s] (Only runs on cpu) [ 82%] 2025-12-04T12:25:43.3242907Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_int64 SKIPPED [0.0006s] (Only runs on cpu) [ 82%] 2025-12-04T12:25:43.3243053Z test_meta.py::TestMetaCUDA::test_meta_outplace_nonzero_static_cuda_uint8 SKIPPED [0.0007s] (Only runs on cpu) [ 82%] 2025-12-04T12:25:43.3243157Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_cuda_bfloat16 PASSED [0.0363s] [ 82%] 2025-12-04T12:25:43.3243255Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_cuda_float16 PASSED [0.0357s] [ 82%] 2025-12-04T12:25:43.3243355Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_cuda_float32 PASSED [0.0352s] [ 82%] 2025-12-04T12:25:43.3243454Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_cuda_float64 PASSED [1.5445s] [ 82%] 2025-12-04T12:25:43.3243565Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_fro_cuda_complex128 PASSED [0.0072s] [ 82%] 2025-12-04T12:25:43.3243670Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_fro_cuda_float64 PASSED [0.0050s] [ 82%] 2025-12-04T12:25:43.3243779Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_inf_cuda_complex64 PASSED [0.0047s] [ 82%] 2025-12-04T12:25:43.3243884Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_inf_cuda_float16 PASSED [0.0048s] [ 82%] 2025-12-04T12:25:43.3243988Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_inf_cuda_float64 PASSED [0.0046s] [ 82%] 2025-12-04T12:25:43.3244097Z test_meta.py::TestMetaCUDA::test_meta_outplace_norm_nuc_cuda_complex128 PASSED [1.5259s] [ 82%] 2025-12-04T12:25:43.3244203Z test_meta.py::TestMetaCUDA::test_meta_outplace_normal_cuda_float16 PASSED [1.5167s] [ 82%] 2025-12-04T12:25:43.3244305Z test_meta.py::TestMetaCUDA::test_meta_outplace_normal_cuda_float64 PASSED [0.0118s] [ 82%] 2025-12-04T12:25:43.3244421Z test_meta.py::TestMetaCUDA::test_meta_outplace_normal_in_place_cuda_bfloat16 PASSED [1.5290s] [ 82%] 2025-12-04T12:25:43.3244534Z test_meta.py::TestMetaCUDA::test_meta_outplace_normal_in_place_cuda_float64 PASSED [0.0061s] [ 82%] 2025-12-04T12:25:43.3244656Z test_meta.py::TestMetaCUDA::test_meta_outplace_normal_number_mean_cuda_bfloat16 PASSED [0.0063s] [ 82%] 2025-12-04T12:25:43.3244776Z test_meta.py::TestMetaCUDA::test_meta_outplace_normal_number_mean_cuda_float64 PASSED [0.0057s] [ 82%] 2025-12-04T12:25:43.3244879Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_bfloat16 PASSED [1.5393s] [ 82%] 2025-12-04T12:25:43.3244975Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_bool PASSED [0.0046s] [ 83%] 2025-12-04T12:25:43.3245074Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_float64 PASSED [1.5215s] [ 83%] 2025-12-04T12:25:43.3245173Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_int16 PASSED [0.0045s] [ 83%] 2025-12-04T12:25:43.3245271Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_int32 PASSED [1.5343s] [ 83%] 2025-12-04T12:25:43.3245366Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_int8 PASSED [0.0047s] [ 83%] 2025-12-04T12:25:43.3245475Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_cuda_uint8 PASSED [1.5187s] [ 83%] 2025-12-04T12:25:43.3245583Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_bfloat16 PASSED [0.0085s] [ 83%] 2025-12-04T12:25:43.3245692Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_complex64 PASSED [0.0064s] [ 83%] 2025-12-04T12:25:43.3245798Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_float64 PASSED [0.0060s] [ 83%] 2025-12-04T12:25:43.3245900Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_int32 PASSED [1.4999s] [ 83%] 2025-12-04T12:25:43.3246002Z test_meta.py::TestMetaCUDA::test_meta_outplace_ones_like_cuda_int64 PASSED [0.0081s] [ 83%] 2025-12-04T12:25:43.3246108Z test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_bool PASSED [0.0034s] [ 83%] 2025-12-04T12:25:43.3246225Z test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_complex64 PASSED [1.5205s] [ 83%] 2025-12-04T12:25:43.3246326Z test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_float16 PASSED [0.0047s] [ 83%] 2025-12-04T12:25:43.3246427Z test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_float32 PASSED [1.5021s] [ 83%] 2025-12-04T12:25:43.3246536Z test_meta.py::TestMetaCUDA::test_meta_outplace_outer_cuda_int16 PASSED [0.0049s] [ 83%] 2025-12-04T12:25:43.3246651Z test_meta.py::TestMetaCUDA::test_meta_outplace_pca_lowrank_cuda_complex128 PASSED [0.1375s] [ 83%] 2025-12-04T12:25:43.3246759Z test_meta.py::TestMetaCUDA::test_meta_outplace_pca_lowrank_cuda_float64 PASSED [0.0741s] [ 83%] 2025-12-04T12:25:43.3246872Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_bfloat16 PASSED [1.5122s] [ 83%] 2025-12-04T12:25:43.3246979Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_bool PASSED [0.0062s] [ 83%] 2025-12-04T12:25:43.3247096Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_complex128 PASSED [0.0045s] [ 83%] 2025-12-04T12:25:43.3247205Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_float16 PASSED [0.0041s] [ 83%] 2025-12-04T12:25:43.3247315Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_copy_cuda_int64 PASSED [0.0041s] [ 83%] 2025-12-04T12:25:43.3247422Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_complex32 PASSED [1.5230s] [ 83%] 2025-12-04T12:25:43.3247527Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_float16 PASSED [0.0047s] [ 83%] 2025-12-04T12:25:43.3247630Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_float64 PASSED [1.5063s] [ 83%] 2025-12-04T12:25:43.3247728Z test_meta.py::TestMetaCUDA::test_meta_outplace_permute_cuda_int32 PASSED [0.0053s] [ 83%] 2025-12-04T12:25:43.3247837Z test_meta.py::TestMetaCUDA::test_meta_outplace_pinverse_cuda_complex64 PASSED [0.0118s] [ 83%] 2025-12-04T12:25:43.3247941Z test_meta.py::TestMetaCUDA::test_meta_outplace_pinverse_cuda_float32 PASSED [0.0096s] [ 83%] 2025-12-04T12:25:43.3248046Z test_meta.py::TestMetaCUDA::test_meta_outplace_pinverse_cuda_float64 PASSED [0.0093s] [ 83%] 2025-12-04T12:25:43.3248216Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_bfloat16 PASSED [0.0070s] [ 83%] 2025-12-04T12:25:43.3248343Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_float32 PASSED [0.0067s] [ 83%] 2025-12-04T12:25:43.3248469Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_0_cuda_float64 PASSED [0.0067s] [ 83%] 2025-12-04T12:25:43.3248598Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_bfloat16 PASSED [1.5061s] [ 83%] 2025-12-04T12:25:43.3248723Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_float16 PASSED [0.0090s] [ 83%] 2025-12-04T12:25:43.3248849Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_float32 PASSED [0.0071s] [ 83%] 2025-12-04T12:25:43.3248971Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_int16 PASSED [0.0093s] [ 84%] 2025-12-04T12:25:43.3249095Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_1_cuda_uint8 PASSED [0.0067s] [ 84%] 2025-12-04T12:25:43.3249237Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_bfloat16 PASSED [0.0067s] [ 84%] 2025-12-04T12:25:43.3249363Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_float16 PASSED [0.0067s] [ 84%] 2025-12-04T12:25:43.3249487Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_2_cuda_float64 PASSED [1.5376s] [ 84%] 2025-12-04T12:25:43.3249611Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_3_cuda_float16 PASSED [0.0085s] [ 84%] 2025-12-04T12:25:43.3249733Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_3_cuda_int8 PASSED [0.0069s] [ 84%] 2025-12-04T12:25:43.3249867Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_bool PASSED [0.0067s] [ 84%] 2025-12-04T12:25:43.3250010Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_float32 PASSED [0.0067s] [ 84%] 2025-12-04T12:25:43.3250133Z test_meta.py::TestMetaCUDA::test_meta_outplace_polygamma_polygamma_n_4_cuda_float64 PASSED [0.0066s] [ 84%] 2025-12-04T12:25:43.3250242Z test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_bfloat16 PASSED [0.0023s] [ 84%] 2025-12-04T12:25:43.3250360Z test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_float64 PASSED [1.5118s] [ 84%] 2025-12-04T12:25:43.3250462Z test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_int16 PASSED [0.0039s] [ 84%] 2025-12-04T12:25:43.3250562Z test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_int32 PASSED [1.4907s] [ 84%] 2025-12-04T12:25:43.3250661Z test_meta.py::TestMetaCUDA::test_meta_outplace_positive_cuda_int8 PASSED [0.0038s] [ 84%] 2025-12-04T12:25:43.3250761Z test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_bfloat16 PASSED [0.0095s] [ 84%] 2025-12-04T12:25:43.3250863Z test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_complex128 PASSED [0.0085s] [ 84%] 2025-12-04T12:25:43.3250961Z test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_complex32 PASSED [0.0084s] [ 84%] 2025-12-04T12:25:43.3251057Z test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_int64 PASSED [0.0082s] [ 84%] 2025-12-04T12:25:43.3251153Z test_meta.py::TestMetaCUDA::test_meta_outplace_pow_cuda_uint8 PASSED [0.0081s] [ 84%] 2025-12-04T12:25:43.3251251Z test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_float64 PASSED [0.0133s] [ 84%] 2025-12-04T12:25:43.3251346Z test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_int16 PASSED [0.0148s] [ 84%] 2025-12-04T12:25:43.3251442Z test_meta.py::TestMetaCUDA::test_meta_outplace_prod_cuda_int32 PASSED [0.2606s] [ 84%] 2025-12-04T12:25:43.3251542Z test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_complex128 PASSED [0.0123s] [ 84%] 2025-12-04T12:25:43.3251642Z test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_float64 PASSED [0.0118s] [ 84%] 2025-12-04T12:25:43.3251736Z test_meta.py::TestMetaCUDA::test_meta_outplace_put_cuda_int32 PASSED [0.0116s] [ 84%] 2025-12-04T12:25:43.3251842Z test_meta.py::TestMetaCUDA::test_meta_outplace_quantile_cuda_float32 PASSED [0.1082s] [ 84%] 2025-12-04T12:25:43.3251946Z test_meta.py::TestMetaCUDA::test_meta_outplace_rad2deg_cuda_bfloat16 PASSED [1.4975s] [ 84%] 2025-12-04T12:25:43.3252052Z test_meta.py::TestMetaCUDA::test_meta_outplace_rad2deg_cuda_float16 PASSED [0.0047s] [ 84%] 2025-12-04T12:25:43.3252153Z test_meta.py::TestMetaCUDA::test_meta_outplace_rad2deg_cuda_float64 PASSED [1.5237s] [ 84%] 2025-12-04T12:25:43.3252253Z test_meta.py::TestMetaCUDA::test_meta_outplace_rad2deg_cuda_int64 PASSED [0.0047s] [ 84%] 2025-12-04T12:25:43.3252350Z test_meta.py::TestMetaCUDA::test_meta_outplace_rad2deg_cuda_int8 PASSED [1.5252s] [ 84%] 2025-12-04T12:25:43.3252458Z test_meta.py::TestMetaCUDA::test_meta_outplace_rand_like_cuda_bfloat16 PASSED [0.0125s] [ 84%] 2025-12-04T12:25:43.3252568Z test_meta.py::TestMetaCUDA::test_meta_outplace_rand_like_cuda_complex64 PASSED [1.5169s] [ 84%] 2025-12-04T12:25:43.3252673Z test_meta.py::TestMetaCUDA::test_meta_outplace_rand_like_cuda_float32 PASSED [0.0124s] [ 84%] 2025-12-04T12:25:43.3252790Z test_meta.py::TestMetaCUDA::test_meta_outplace_randint_cuda_bfloat16 PASSED [1.5120s] [ 84%] 2025-12-04T12:25:43.3252889Z test_meta.py::TestMetaCUDA::test_meta_outplace_randint_cuda_int8 PASSED [0.0104s] [ 84%] 2025-12-04T12:25:43.3253002Z test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_bfloat16 PASSED [0.0154s] [ 85%] 2025-12-04T12:25:43.3253113Z test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_float16 PASSED [0.0149s] [ 85%] 2025-12-04T12:25:43.3253223Z test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_float64 PASSED [1.5234s] [ 85%] 2025-12-04T12:25:43.3253342Z test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_int64 PASSED [0.0161s] [ 85%] 2025-12-04T12:25:43.3253460Z test_meta.py::TestMetaCUDA::test_meta_outplace_randint_like_cuda_uint8 PASSED [0.0145s] [ 85%] 2025-12-04T12:25:43.3253563Z test_meta.py::TestMetaCUDA::test_meta_outplace_randn_cuda_complex32 PASSED [0.0044s] [ 85%] 2025-12-04T12:25:43.3253676Z test_meta.py::TestMetaCUDA::test_meta_outplace_randn_like_cuda_bfloat16 PASSED [0.0101s] [ 85%] 2025-12-04T12:25:43.3253786Z test_meta.py::TestMetaCUDA::test_meta_outplace_randn_like_cuda_complex128 PASSED [0.0101s] [ 85%] 2025-12-04T12:25:43.3253905Z test_meta.py::TestMetaCUDA::test_meta_outplace_randn_like_cuda_float32 PASSED [0.0101s] [ 85%] 2025-12-04T12:25:43.3254001Z test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_bool PASSED [0.0031s] [ 85%] 2025-12-04T12:25:43.3254104Z test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_complex64 PASSED [1.5008s] [ 85%] 2025-12-04T12:25:43.3254202Z test_meta.py::TestMetaCUDA::test_meta_outplace_ravel_cuda_float32 PASSED [0.0047s] [ 85%] 2025-12-04T12:25:43.3254298Z test_meta.py::TestMetaCUDA::test_meta_outplace_real_cuda_bool PASSED [1.5154s] [ 85%] 2025-12-04T12:25:43.3254399Z test_meta.py::TestMetaCUDA::test_meta_outplace_real_cuda_complex32 PASSED [0.0049s] [ 85%] 2025-12-04T12:25:43.3254496Z test_meta.py::TestMetaCUDA::test_meta_outplace_real_cuda_float64 PASSED [1.4986s] [ 85%] 2025-12-04T12:25:43.3254610Z test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_complex64 PASSED [0.0057s] [ 85%] 2025-12-04T12:25:43.3254718Z test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_float32 PASSED [0.0040s] [ 85%] 2025-12-04T12:25:43.3254823Z test_meta.py::TestMetaCUDA::test_meta_outplace_reciprocal_cuda_uint8 PASSED [1.4686s] [ 85%] 2025-12-04T12:25:43.3254930Z test_meta.py::TestMetaCUDA::test_meta_outplace_remainder_cuda_float64 PASSED [0.0112s] [ 85%] 2025-12-04T12:25:43.3255035Z test_meta.py::TestMetaCUDA::test_meta_outplace_remainder_cuda_int16 PASSED [0.0091s] [ 85%] 2025-12-04T12:25:43.3255137Z test_meta.py::TestMetaCUDA::test_meta_outplace_remainder_cuda_uint8 PASSED [0.0089s] [ 85%] 2025-12-04T12:25:43.3255240Z test_meta.py::TestMetaCUDA::test_meta_outplace_renorm_cuda_bfloat16 PASSED [0.0101s] [ 85%] 2025-12-04T12:25:43.3255346Z test_meta.py::TestMetaCUDA::test_meta_outplace_renorm_cuda_complex128 PASSED [0.0085s] [ 85%] 2025-12-04T12:25:43.3255447Z test_meta.py::TestMetaCUDA::test_meta_outplace_renorm_cuda_float32 PASSED [1.5657s] [ 85%] 2025-12-04T12:25:43.3255548Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_bfloat16 PASSED [0.0150s] [ 85%] 2025-12-04T12:25:43.3255649Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_cuda_bool PASSED [0.0135s] [ 85%] 2025-12-04T12:25:43.3255768Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_bfloat16 PASSED [1.6298s] [ 85%] 2025-12-04T12:25:43.3255888Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_float32 PASSED [0.0059s] [ 85%] 2025-12-04T12:25:43.3256006Z test_meta.py::TestMetaCUDA::test_meta_outplace_repeat_interleave_cuda_float64 PASSED [0.0043s] [ 85%] 2025-12-04T12:25:43.3256112Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_bool PASSED [1.6070s] [ 85%] 2025-12-04T12:25:43.3256223Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_complex128 PASSED [0.0050s] [ 85%] 2025-12-04T12:25:43.3256346Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_complex32 PASSED [0.0038s] [ 85%] 2025-12-04T12:25:43.3256454Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_float64 PASSED [1.5861s] [ 85%] 2025-12-04T12:25:43.3256560Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_int16 PASSED [0.0051s] [ 85%] 2025-12-04T12:25:43.3256663Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_int64 PASSED [0.0037s] [ 85%] 2025-12-04T12:25:43.3256766Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_as_cuda_uint8 PASSED [1.4998s] [ 85%] 2025-12-04T12:25:43.3256864Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_bool PASSED [0.0058s] [ 86%] 2025-12-04T12:25:43.3256982Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_complex128 PASSED [0.0042s] [ 86%] 2025-12-04T12:25:43.3257099Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_complex32 PASSED [1.5001s] [ 86%] 2025-12-04T12:25:43.3257202Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_float16 PASSED [0.0060s] [ 86%] 2025-12-04T12:25:43.3257305Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_float32 PASSED [0.0042s] [ 86%] 2025-12-04T12:25:43.3257418Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_int16 PASSED [1.5021s] [ 86%] 2025-12-04T12:25:43.3257516Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_int32 PASSED [0.0058s] [ 86%] 2025-12-04T12:25:43.3257614Z test_meta.py::TestMetaCUDA::test_meta_outplace_reshape_cuda_int8 PASSED [0.0042s] [ 86%] 2025-12-04T12:25:43.3257713Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_bool PASSED [1.5047s] [ 86%] 2025-12-04T12:25:43.3257816Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_float16 PASSED [0.0052s] [ 86%] 2025-12-04T12:25:43.3257919Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_float64 PASSED [0.0037s] [ 86%] 2025-12-04T12:25:43.3258017Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_int32 PASSED [1.5065s] [ 86%] 2025-12-04T12:25:43.3258142Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_int8 PASSED [0.0049s] [ 86%] 2025-12-04T12:25:43.3258239Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize__cuda_uint8 PASSED [0.0034s] [ 86%] 2025-12-04T12:25:43.3258350Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_bfloat16 PASSED [1.5123s] [ 86%] 2025-12-04T12:25:43.3258461Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_complex128 PASSED [0.0052s] [ 86%] 2025-12-04T12:25:43.3258569Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_float16 PASSED [0.0037s] [ 86%] 2025-12-04T12:25:43.3258675Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_float32 PASSED [1.5103s] [ 86%] 2025-12-04T12:25:43.3258779Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_int64 PASSED [0.0054s] [ 86%] 2025-12-04T12:25:43.3258880Z test_meta.py::TestMetaCUDA::test_meta_outplace_resize_as__cuda_int8 PASSED [0.0039s] [ 86%] 2025-12-04T12:25:43.3258991Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_bfloat16 PASSED [1.4987s] [ 86%] 2025-12-04T12:25:43.3259098Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_bool PASSED [0.0041s] [ 86%] 2025-12-04T12:25:43.3259213Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_complex64 PASSED [1.5094s] [ 86%] 2025-12-04T12:25:43.3259318Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_int32 PASSED [0.0043s] [ 86%] 2025-12-04T12:25:43.3259424Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_conj_cuda_int64 PASSED [1.5015s] [ 86%] 2025-12-04T12:25:43.3259533Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_bfloat16 PASSED [0.0044s] [ 86%] 2025-12-04T12:25:43.3259639Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_bool PASSED [1.5205s] [ 86%] 2025-12-04T12:25:43.3259745Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_int16 PASSED [0.0043s] [ 86%] 2025-12-04T12:25:43.3259851Z test_meta.py::TestMetaCUDA::test_meta_outplace_resolve_neg_cuda_int64 PASSED [1.5023s] [ 86%] 2025-12-04T12:25:43.3259973Z test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_complex128 PASSED [0.0104s] [ 86%] 2025-12-04T12:25:43.3260072Z test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_float16 PASSED [1.5122s] [ 86%] 2025-12-04T12:25:43.3260168Z test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_float32 PASSED [0.0103s] [ 86%] 2025-12-04T12:25:43.3260267Z test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_float64 PASSED [1.5189s] [ 86%] 2025-12-04T12:25:43.3260364Z test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_int16 PASSED [0.0106s] [ 86%] 2025-12-04T12:25:43.3260473Z test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_int32 PASSED [1.5071s] [ 86%] 2025-12-04T12:25:43.3260584Z test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_int64 PASSED [0.0110s] [ 86%] 2025-12-04T12:25:43.3260681Z test_meta.py::TestMetaCUDA::test_meta_outplace_roll_cuda_uint8 PASSED [1.5101s] [ 87%] 2025-12-04T12:25:43.3260776Z test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_bool PASSED [0.0143s] [ 87%] 2025-12-04T12:25:43.3260881Z test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_complex128 PASSED [0.0124s] [ 87%] 2025-12-04T12:25:43.3260995Z test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_float32 PASSED [1.5294s] [ 87%] 2025-12-04T12:25:43.3261093Z test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_float64 PASSED [0.0139s] [ 87%] 2025-12-04T12:25:43.3261190Z test_meta.py::TestMetaCUDA::test_meta_outplace_rot90_cuda_int32 PASSED [0.0120s] [ 87%] 2025-12-04T12:25:43.3261287Z test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_int16 PASSED [1.5268s] [ 87%] 2025-12-04T12:25:43.3261385Z test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_int32 PASSED [0.0046s] [ 87%] 2025-12-04T12:25:43.3261481Z test_meta.py::TestMetaCUDA::test_meta_outplace_round_cuda_int8 PASSED [1.6114s] [ 87%] 2025-12-04T12:25:43.3261598Z test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_0_cuda_bfloat16 PASSED [0.0064s] [ 87%] 2025-12-04T12:25:43.3261714Z test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_0_cuda_float16 PASSED [0.0044s] [ 87%] 2025-12-04T12:25:43.3261831Z test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_0_cuda_float32 PASSED [0.0039s] [ 87%] 2025-12-04T12:25:43.3261945Z test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_3_cuda_float16 PASSED [1.5851s] [ 87%] 2025-12-04T12:25:43.3262066Z test_meta.py::TestMetaCUDA::test_meta_outplace_round_decimals_neg_3_cuda_float16 PASSED [0.0061s] [ 87%] 2025-12-04T12:25:43.3262163Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_bool PASSED [0.0041s] [ 87%] 2025-12-04T12:25:43.3262267Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_complex128 PASSED [1.5174s] [ 87%] 2025-12-04T12:25:43.3262372Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_complex32 PASSED [0.2410s] [ 87%] 2025-12-04T12:25:43.3262471Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_float16 PASSED [0.0041s] [ 87%] 2025-12-04T12:25:43.3262570Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsqrt_cuda_float64 PASSED [1.5055s] [ 87%] 2025-12-04T12:25:43.3262668Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_float16 PASSED [0.0094s] [ 87%] 2025-12-04T12:25:43.3262765Z test_meta.py::TestMetaCUDA::test_meta_outplace_rsub_cuda_int16 PASSED [1.4956s] [ 87%] 2025-12-04T12:25:43.3262881Z test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_complex32 PASSED [0.0038s] [ 87%] 2025-12-04T12:25:43.3262995Z test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_complex64 PASSED [1.5101s] [ 87%] 2025-12-04T12:25:43.3263108Z test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_float16 PASSED [0.0038s] [ 87%] 2025-12-04T12:25:43.3263219Z test_meta.py::TestMetaCUDA::test_meta_outplace_scalar_tensor_cuda_int32 PASSED [1.5062s] [ 87%] 2025-12-04T12:25:43.3263328Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_bfloat16 PASSED [0.0103s] [ 87%] 2025-12-04T12:25:43.3263434Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_bool PASSED [0.0081s] [ 87%] 2025-12-04T12:25:43.3263556Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_float32 PASSED [0.0078s] [ 87%] 2025-12-04T12:25:43.3263664Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_int16 PASSED [0.0077s] [ 87%] 2025-12-04T12:25:43.3263769Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_int64 PASSED [1.4996s] [ 87%] 2025-12-04T12:25:43.3263875Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_add_cuda_uint8 PASSED [0.0105s] [ 87%] 2025-12-04T12:25:43.3263972Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_bool PASSED [0.0145s] [ 87%] 2025-12-04T12:25:43.3264092Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_float64 PASSED [0.0192s] [ 87%] 2025-12-04T12:25:43.3264202Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_cuda_int64 PASSED [1.5275s] [ 87%] 2025-12-04T12:25:43.3264324Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amax_cuda_bfloat16 PASSED [0.0195s] [ 87%] 2025-12-04T12:25:43.3264440Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_int32 PASSED [1.5267s] [ 87%] 2025-12-04T12:25:43.3264567Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_int64 PASSED [0.0191s] [ 87%] 2025-12-04T12:25:43.3264681Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_amin_cuda_uint8 PASSED [1.5555s] [ 88%] 2025-12-04T12:25:43.3264801Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_float16 PASSED [0.0214s] [ 88%] 2025-12-04T12:25:43.3264919Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_float32 PASSED [1.5436s] [ 88%] 2025-12-04T12:25:43.3265035Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_int32 PASSED [0.0209s] [ 88%] 2025-12-04T12:25:43.3265151Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_mean_cuda_int64 PASSED [1.5342s] [ 88%] 2025-12-04T12:25:43.3265272Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_prod_cuda_bfloat16 PASSED [0.0183s] [ 88%] 2025-12-04T12:25:43.3265391Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_prod_cuda_uint8 PASSED [1.6027s] [ 88%] 2025-12-04T12:25:43.3265509Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_bool PASSED [0.0193s] [ 88%] 2025-12-04T12:25:43.3265631Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_float64 PASSED [1.5594s] [ 88%] 2025-12-04T12:25:43.3265745Z test_meta.py::TestMetaCUDA::test_meta_outplace_scatter_reduce_sum_cuda_uint8 PASSED [0.0194s] [ 88%] 2025-12-04T12:25:43.3265856Z test_meta.py::TestMetaCUDA::test_meta_outplace_searchsorted_cuda_float64 PASSED [0.1814s] [ 88%] 2025-12-04T12:25:43.3265965Z test_meta.py::TestMetaCUDA::test_meta_outplace_searchsorted_cuda_int16 PASSED [0.1815s] [ 88%] 2025-12-04T12:25:43.3266075Z test_meta.py::TestMetaCUDA::test_meta_outplace_searchsorted_cuda_int8 PASSED [0.1819s] [ 88%] 2025-12-04T12:25:43.3266183Z test_meta.py::TestMetaCUDA::test_meta_outplace_searchsorted_cuda_uint8 PASSED [0.1810s] [ 88%] 2025-12-04T12:25:43.3266283Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_bool PASSED [1.5631s] [ 88%] 2025-12-04T12:25:43.3266384Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_float64 PASSED [0.0054s] [ 88%] 2025-12-04T12:25:43.3266484Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_int32 PASSED [0.0040s] [ 88%] 2025-12-04T12:25:43.3266582Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_cuda_int8 PASSED [1.5465s] [ 88%] 2025-12-04T12:25:43.3266694Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_float32 PASSED [0.0058s] [ 88%] 2025-12-04T12:25:43.3266807Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_float64 PASSED [0.0041s] [ 88%] 2025-12-04T12:25:43.3266917Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_int32 PASSED [1.5658s] [ 88%] 2025-12-04T12:25:43.3267026Z test_meta.py::TestMetaCUDA::test_meta_outplace_select_scatter_cuda_int8 PASSED [0.0074s] [ 88%] 2025-12-04T12:25:43.3267134Z test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_bool PASSED [0.0034s] [ 88%] 2025-12-04T12:25:43.3267234Z test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_complex32 PASSED [1.5393s] [ 88%] 2025-12-04T12:25:43.3267331Z test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_float16 PASSED [0.0045s] [ 88%] 2025-12-04T12:25:43.3267428Z test_meta.py::TestMetaCUDA::test_meta_outplace_sgn_cuda_float32 PASSED [1.5195s] [ 88%] 2025-12-04T12:25:43.3267525Z test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_bool PASSED [0.0052s] [ 88%] 2025-12-04T12:25:43.3267627Z test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_complex64 PASSED [0.0037s] [ 88%] 2025-12-04T12:25:43.3267738Z test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_float32 PASSED [1.5389s] [ 88%] 2025-12-04T12:25:43.3267845Z test_meta.py::TestMetaCUDA::test_meta_outplace_short_cuda_int64 PASSED [0.0057s] [ 88%] 2025-12-04T12:25:43.3267956Z test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_complex128 PASSED [0.0059s] [ 88%] 2025-12-04T12:25:43.3268062Z test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_complex32 PASSED [1.4989s] [ 88%] 2025-12-04T12:25:43.3268226Z test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_complex64 PASSED [0.0069s] [ 88%] 2025-12-04T12:25:43.3268329Z test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_float32 PASSED [0.0038s] [ 88%] 2025-12-04T12:25:43.3268431Z test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_float64 PASSED [1.5048s] [ 88%] 2025-12-04T12:25:43.3268531Z test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_int16 PASSED [0.0054s] [ 88%] 2025-12-04T12:25:43.3268631Z test_meta.py::TestMetaCUDA::test_meta_outplace_sigmoid_cuda_uint8 PASSED [0.0038s] [ 89%] 2025-12-04T12:25:43.3268732Z test_meta.py::TestMetaCUDA::test_meta_outplace_sign_cuda_bfloat16 PASSED [1.5258s] [ 89%] 2025-12-04T12:25:43.3268830Z test_meta.py::TestMetaCUDA::test_meta_outplace_sign_cuda_float16 PASSED [0.0044s] [ 89%] 2025-12-04T12:25:43.3268929Z test_meta.py::TestMetaCUDA::test_meta_outplace_sign_cuda_float64 PASSED [1.5012s] [ 89%] 2025-12-04T12:25:43.3269027Z test_meta.py::TestMetaCUDA::test_meta_outplace_sign_cuda_int16 PASSED [0.0046s] [ 89%] 2025-12-04T12:25:43.3269125Z test_meta.py::TestMetaCUDA::test_meta_outplace_sign_cuda_uint8 PASSED [1.4939s] [ 89%] 2025-12-04T12:25:43.3269250Z test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_bartlett_cuda_float32 PASSED [0.0091s] [ 89%] 2025-12-04T12:25:43.3269375Z test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_blackman_cuda_float32 PASSED [1.5378s] [ 89%] 2025-12-04T12:25:43.3269498Z test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_cosine_cuda_float32 PASSED [0.0074s] [ 89%] 2025-12-04T12:25:43.3269623Z test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_gaussian_cuda_float64 PASSED [1.5417s] [ 89%] 2025-12-04T12:25:43.3269745Z test_meta.py::TestMetaCUDA::test_meta_outplace_signal_windows_nuttall_cuda_float32 PASSED [0.0182s] [ 89%] 2025-12-04T12:25:43.3269847Z test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_bool PASSED [1.5003s] [ 89%] 2025-12-04T12:25:43.3269949Z test_meta.py::TestMetaCUDA::test_meta_outplace_signbit_cuda_float32 PASSED [0.0046s] [ 89%] 2025-12-04T12:25:43.3270052Z test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_complex128 PASSED [1.7072s] [ 89%] 2025-12-04T12:25:43.3270150Z test_meta.py::TestMetaCUDA::test_meta_outplace_sin_cuda_float16 PASSED [0.0056s] [ 89%] 2025-12-04T12:25:43.3270248Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_bool PASSED [0.0079s] [ 89%] 2025-12-04T12:25:43.3270347Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_complex64 PASSED [0.2741s] [ 89%] 2025-12-04T12:25:43.3270446Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_float32 PASSED [0.0054s] [ 89%] 2025-12-04T12:25:43.3270542Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_int32 PASSED [0.0051s] [ 89%] 2025-12-04T12:25:43.3270637Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_int64 PASSED [0.0050s] [ 89%] 2025-12-04T12:25:43.3270747Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinc_cuda_int8 PASSED [0.0049s] [ 89%] 2025-12-04T12:25:43.3270842Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_bool PASSED [1.4964s] [ 89%] 2025-12-04T12:25:43.3270944Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_complex128 PASSED [0.0060s] [ 89%] 2025-12-04T12:25:43.3271045Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_float32 PASSED [1.4986s] [ 89%] 2025-12-04T12:25:43.3271141Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_float64 PASSED [0.0045s] [ 89%] 2025-12-04T12:25:43.3271249Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_int32 PASSED [1.5201s] [ 89%] 2025-12-04T12:25:43.3271358Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_int64 PASSED [0.0047s] [ 89%] 2025-12-04T12:25:43.3271451Z test_meta.py::TestMetaCUDA::test_meta_outplace_sinh_cuda_int8 PASSED [1.5259s] [ 89%] 2025-12-04T12:25:43.3271549Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_bool PASSED [0.0054s] [ 89%] 2025-12-04T12:25:43.3271654Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_cuda_complex64 PASSED [1.5140s] [ 89%] 2025-12-04T12:25:43.3271775Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_int64 PASSED [0.0173s] [ 89%] 2025-12-04T12:25:43.3271884Z test_meta.py::TestMetaCUDA::test_meta_outplace_slice_scatter_cuda_uint8 PASSED [0.0149s] [ 89%] 2025-12-04T12:25:43.3272005Z test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_bfloat16 PASSED [0.0056s] [ 89%] 2025-12-04T12:25:43.3272125Z test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_complex64 PASSED [0.0054s] [ 89%] 2025-12-04T12:25:43.3272243Z test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_float64 PASSED [0.0052s] [ 89%] 2025-12-04T12:25:43.3272357Z test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_int32 PASSED [0.0054s] [ 89%] 2025-12-04T12:25:43.3272471Z test_meta.py::TestMetaCUDA::test_meta_outplace_softmax_with_dtype_cuda_int8 PASSED [0.0054s] [ 90%] 2025-12-04T12:25:43.3272571Z test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_float16 PASSED [0.0138s] [ 90%] 2025-12-04T12:25:43.3272671Z test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_float32 PASSED [1.5131s] [ 90%] 2025-12-04T12:25:43.3272766Z test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_int16 PASSED [0.0163s] [ 90%] 2025-12-04T12:25:43.3272864Z test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_int32 PASSED [0.0140s] [ 90%] 2025-12-04T12:25:43.3272958Z test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_int64 PASSED [1.5273s] [ 90%] 2025-12-04T12:25:43.3273056Z test_meta.py::TestMetaCUDA::test_meta_outplace_sort_cuda_int8 PASSED [0.0164s] [ 90%] 2025-12-04T12:25:43.3273188Z test_meta.py::TestMetaCUDA::test_meta_outplace_sparse_mm_reduce_cuda_float16 SKIPPED [0.0002s] (Skipped!) [ 90%] 2025-12-04T12:25:43.3273332Z test_meta.py::TestMetaCUDA::test_meta_outplace_sparse_sampled_addmm_cuda_complex128 SKIPPED [0.0001s] (Skipped!) [ 90%] 2025-12-04T12:25:43.3273473Z test_meta.py::TestMetaCUDA::test_meta_outplace_sparse_sampled_addmm_cuda_complex64 SKIPPED [0.0001s] (Skipped!) [ 90%] 2025-12-04T12:25:43.3273615Z test_meta.py::TestMetaCUDA::test_meta_outplace_sparse_sampled_addmm_cuda_float64 SKIPPED [0.0001s] (Skipped!) [ 90%] 2025-12-04T12:25:43.3273732Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_airy_ai_cuda_float32 PASSED [1.5132s] [ 90%] 2025-12-04T12:25:43.3273846Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_airy_ai_cuda_float64 PASSED [0.2194s] [ 90%] 2025-12-04T12:25:43.3273959Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_airy_ai_cuda_int16 PASSED [0.0054s] [ 90%] 2025-12-04T12:25:43.3274070Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_airy_ai_cuda_int32 PASSED [1.5160s] [ 90%] 2025-12-04T12:25:43.3274186Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j0_cuda_float32 PASSED [0.0071s] [ 90%] 2025-12-04T12:25:43.3274298Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j0_cuda_int16 PASSED [0.0052s] [ 90%] 2025-12-04T12:25:43.3274433Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j0_cuda_int32 PASSED [1.5302s] [ 90%] 2025-12-04T12:25:43.3274546Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j1_cuda_bool PASSED [0.0073s] [ 90%] 2025-12-04T12:25:43.3274662Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j1_cuda_float32 PASSED [0.0048s] [ 90%] 2025-12-04T12:25:43.3274773Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j1_cuda_int16 PASSED [1.5007s] [ 90%] 2025-12-04T12:25:43.3274887Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_j1_cuda_int64 PASSED [1.5882s] [ 90%] 2025-12-04T12:25:43.3275008Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y0_cuda_bool PASSED [1.6137s] [ 90%] 2025-12-04T12:25:43.3275134Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y0_cuda_float32 PASSED [1.5940s] [ 90%] 2025-12-04T12:25:43.3275248Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y1_cuda_float32 PASSED [1.5795s] [ 90%] 2025-12-04T12:25:43.3275363Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_bessel_y1_cuda_uint8 PASSED [1.5741s] [ 90%] 2025-12-04T12:25:43.3275510Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_t_cuda_float64 PASSED [0.3649s] [ 90%] 2025-12-04T12:25:43.3275643Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_t_cuda_uint8 PASSED [0.0106s] [ 90%] 2025-12-04T12:25:43.3275778Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_float64 PASSED [0.3689s] [ 90%] 2025-12-04T12:25:43.3275913Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_v_cuda_uint8 PASSED [0.0106s] [ 90%] 2025-12-04T12:25:43.3276044Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_bool PASSED [0.0098s] [ 90%] 2025-12-04T12:25:43.3276176Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_chebyshev_polynomial_w_cuda_int16 PASSED [0.0078s] [ 90%] 2025-12-04T12:25:43.3276290Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_int32 PASSED [1.5799s] [ 90%] 2025-12-04T12:25:43.3276401Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_int8 PASSED [0.0069s] [ 90%] 2025-12-04T12:25:43.3276510Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_entr_cuda_uint8 PASSED [0.0049s] [ 90%] 2025-12-04T12:25:43.3276618Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_bool PASSED [0.0056s] [ 90%] 2025-12-04T12:25:43.3276728Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_float64 PASSED [1.5652s] [ 90%] 2025-12-04T12:25:43.3276837Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_int16 PASSED [0.0056s] [ 91%] 2025-12-04T12:25:43.3276947Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_int32 PASSED [0.0040s] [ 91%] 2025-12-04T12:25:43.3277055Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_erfcx_cuda_uint8 PASSED [1.6026s] [ 91%] 2025-12-04T12:25:43.3277188Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_h_cuda_int32 PASSED [0.0131s] [ 91%] 2025-12-04T12:25:43.3277317Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_h_cuda_int8 PASSED [0.0082s] [ 91%] 2025-12-04T12:25:43.3277445Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_h_cuda_uint8 PASSED [0.0080s] [ 91%] 2025-12-04T12:25:43.3277576Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_bool PASSED [0.0098s] [ 91%] 2025-12-04T12:25:43.3277709Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_float32 PASSED [0.0091s] [ 91%] 2025-12-04T12:25:43.3277839Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_int64 PASSED [0.0079s] [ 91%] 2025-12-04T12:25:43.3277972Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_hermite_polynomial_he_cuda_uint8 PASSED [0.0079s] [ 91%] 2025-12-04T12:25:43.3278155Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_float16 PASSED [1.6116s] [ 91%] 2025-12-04T12:25:43.3278263Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_int16 PASSED [0.0071s] [ 91%] 2025-12-04T12:25:43.3278372Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i0e_cuda_uint8 PASSED [0.0037s] [ 91%] 2025-12-04T12:25:43.3278480Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_bfloat16 PASSED [1.5718s] [ 91%] 2025-12-04T12:25:43.3278591Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1_cuda_float64 PASSED [0.0064s] [ 91%] 2025-12-04T12:25:43.3278695Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_i1e_cuda_int16 PASSED [0.0048s] [ 91%] 2025-12-04T12:25:43.3278838Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_laguerre_polynomial_l_cuda_int8 PASSED [0.0102s] [ 91%] 2025-12-04T12:25:43.3278985Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_legendre_polynomial_p_cuda_float64 PASSED [0.0092s] [ 91%] 2025-12-04T12:25:43.3279116Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_legendre_polynomial_p_cuda_int16 PASSED [0.0097s] [ 91%] 2025-12-04T12:25:43.3279244Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_bool PASSED [0.0075s] [ 91%] 2025-12-04T12:25:43.3279360Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_log_ndtr_cuda_float32 PASSED [1.5681s] [ 91%] 2025-12-04T12:25:43.3279485Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_bool PASSED [0.0073s] [ 91%] 2025-12-04T12:25:43.3279613Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i0_cuda_float64 PASSED [0.1532s] [ 91%] 2025-12-04T12:25:43.3279737Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_bool PASSED [0.0054s] [ 91%] 2025-12-04T12:25:43.3279866Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_float32 PASSED [1.5671s] [ 91%] 2025-12-04T12:25:43.3279990Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_int16 PASSED [0.0055s] [ 91%] 2025-12-04T12:25:43.3280115Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_i1_cuda_int8 PASSED [0.0040s] [ 91%] 2025-12-04T12:25:43.3280244Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_int16 PASSED [0.0052s] [ 91%] 2025-12-04T12:25:43.3280373Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k0_cuda_int32 PASSED [1.5140s] [ 91%] 2025-12-04T12:25:43.3280494Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k1_cuda_bool PASSED [0.0105s] [ 91%] 2025-12-04T12:25:43.3280621Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k1_cuda_int16 PASSED [0.0040s] [ 91%] 2025-12-04T12:25:43.3280745Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k1_cuda_int32 PASSED [1.5281s] [ 91%] 2025-12-04T12:25:43.3280870Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k1_cuda_int64 PASSED [0.0059s] [ 91%] 2025-12-04T12:25:43.3280996Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_modified_bessel_k1_cuda_uint8 PASSED [0.0041s] [ 91%] 2025-12-04T12:25:43.3281109Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtr_cuda_bfloat16 PASSED [0.0113s] [ 91%] 2025-12-04T12:25:43.3281217Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtr_cuda_bool PASSED [0.0049s] [ 91%] 2025-12-04T12:25:43.3281327Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtr_cuda_float32 PASSED [0.0046s] [ 92%] 2025-12-04T12:25:43.3281438Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_float64 PASSED [1.6970s] [ 92%] 2025-12-04T12:25:43.3281546Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_int64 PASSED [0.0075s] [ 92%] 2025-12-04T12:25:43.3281657Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_ndtri_cuda_int8 PASSED [0.0041s] [ 92%] 2025-12-04T12:25:43.3281806Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_bfloat16 PASSED [0.0072s] [ 92%] 2025-12-04T12:25:43.3281968Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_float32 PASSED [0.0069s] [ 92%] 2025-12-04T12:25:43.3282116Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_float64 PASSED [0.0068s] [ 92%] 2025-12-04T12:25:43.3282261Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_polygamma_special_polygamma_n_0_cuda_int32 PASSED [0.0068s] [ 92%] 2025-12-04T12:25:43.3282399Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k0_cuda_int16 PASSED [1.5188s] [ 92%] 2025-12-04T12:25:43.3282543Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_scaled_modified_bessel_k0_cuda_int8 PASSED [0.0064s] [ 92%] 2025-12-04T12:25:43.3282698Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_t_cuda_float64 PASSED [0.0106s] [ 92%] 2025-12-04T12:25:43.3282843Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_float32 PASSED [0.0095s] [ 92%] 2025-12-04T12:25:43.3282986Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_u_cuda_int32 PASSED [0.0098s] [ 92%] 2025-12-04T12:25:43.3283142Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int32 PASSED [0.0099s] [ 92%] 2025-12-04T12:25:43.3283286Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_int8 PASSED [0.0080s] [ 92%] 2025-12-04T12:25:43.3283426Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_v_cuda_uint8 PASSED [0.0079s] [ 92%] 2025-12-04T12:25:43.3283572Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_float64 PASSED [0.0091s] [ 92%] 2025-12-04T12:25:43.3283713Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_shifted_chebyshev_polynomial_w_cuda_uint8 PASSED [0.0097s] [ 92%] 2025-12-04T12:25:43.3283841Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_int16 PASSED [1.5103s] [ 92%] 2025-12-04T12:25:43.3283968Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_int64 PASSED [0.0058s] [ 92%] 2025-12-04T12:25:43.3284096Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_spherical_bessel_j0_cuda_int8 PASSED [0.0040s] [ 92%] 2025-12-04T12:25:43.3284209Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_xlog1py_cuda_int64 PASSED [0.0137s] [ 92%] 2025-12-04T12:25:43.3284319Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_zeta_cuda_int16 PASSED [0.0112s] [ 92%] 2025-12-04T12:25:43.3284427Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_zeta_cuda_int32 PASSED [0.0085s] [ 92%] 2025-12-04T12:25:43.3284536Z test_meta.py::TestMetaCUDA::test_meta_outplace_special_zeta_cuda_uint8 PASSED [0.0083s] [ 92%] 2025-12-04T12:25:43.3284641Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_complex128 PASSED [1.5392s] [ 92%] 2025-12-04T12:25:43.3284740Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_int16 PASSED [0.0045s] [ 92%] 2025-12-04T12:25:43.3284837Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_cuda_int8 PASSED [1.5684s] [ 92%] 2025-12-04T12:25:43.3284949Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_int8 PASSED [0.0050s] [ 92%] 2025-12-04T12:25:43.3285060Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_list_args_cuda_uint8 PASSED [1.5251s] [ 92%] 2025-12-04T12:25:43.3285178Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_bool PASSED [0.0057s] [ 92%] 2025-12-04T12:25:43.3285303Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_complex128 PASSED [0.0039s] [ 92%] 2025-12-04T12:25:43.3285430Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_complex32 PASSED [1.5652s] [ 92%] 2025-12-04T12:25:43.3285554Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_copy_cuda_complex64 PASSED [0.0054s] [ 92%] 2025-12-04T12:25:43.3285666Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_bool PASSED [0.0035s] [ 92%] 2025-12-04T12:25:43.3285800Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_complex128 PASSED [1.5502s] [ 92%] 2025-12-04T12:25:43.3285914Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_float16 PASSED [0.0052s] [ 93%] 2025-12-04T12:25:43.3286030Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_float32 PASSED [0.0037s] [ 93%] 2025-12-04T12:25:43.3286142Z test_meta.py::TestMetaCUDA::test_meta_outplace_split_with_sizes_cuda_int64 PASSED [1.5463s] [ 93%] 2025-12-04T12:25:43.3286241Z test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_bool PASSED [0.0055s] [ 93%] 2025-12-04T12:25:43.3286356Z test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_complex64 PASSED [1.5718s] [ 93%] 2025-12-04T12:25:43.3286464Z test_meta.py::TestMetaCUDA::test_meta_outplace_sqrt_cuda_int8 PASSED [0.0047s] [ 93%] 2025-12-04T12:25:43.3286567Z test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_bfloat16 PASSED [0.0049s] [ 93%] 2025-12-04T12:25:43.3286669Z test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_float16 PASSED [0.0041s] [ 93%] 2025-12-04T12:25:43.3286781Z test_meta.py::TestMetaCUDA::test_meta_outplace_square_cuda_int64 PASSED [1.5986s] [ 93%] 2025-12-04T12:25:43.3286896Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_complex64 PASSED [0.0082s] [ 93%] 2025-12-04T12:25:43.3287006Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_float32 PASSED [1.5372s] [ 93%] 2025-12-04T12:25:43.3287112Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_copy_cuda_int8 PASSED [0.0078s] [ 93%] 2025-12-04T12:25:43.3287216Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_cuda_bfloat16 PASSED [1.5613s] [ 93%] 2025-12-04T12:25:43.3287331Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_int16 PASSED [0.0058s] [ 93%] 2025-12-04T12:25:43.3287443Z test_meta.py::TestMetaCUDA::test_meta_outplace_squeeze_multiple_cuda_uint8 PASSED [0.0040s] [ 93%] 2025-12-04T12:25:43.3287551Z test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_complex128 PASSED [0.0087s] [ 93%] 2025-12-04T12:25:43.3287649Z test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_float16 PASSED [0.0134s] [ 93%] 2025-12-04T12:25:43.3287747Z test_meta.py::TestMetaCUDA::test_meta_outplace_stack_cuda_int8 PASSED [0.0092s] [ 93%] 2025-12-04T12:25:43.3287846Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_cuda_float16 PASSED [0.0108s] [ 93%] 2025-12-04T12:25:43.3287963Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_unbiased_cuda_bfloat16 PASSED [0.9998s] [ 93%] 2025-12-04T12:25:43.3288107Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_unbiased_cuda_float16 PASSED [0.0050s] [ 93%] 2025-12-04T12:25:43.3288223Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_mean_unbiased_cuda_float64 PASSED [0.9214s] [ 93%] 2025-12-04T12:25:43.3288335Z test_meta.py::TestMetaCUDA::test_meta_outplace_std_unbiased_cuda_float64 PASSED [0.0047s] [ 93%] 2025-12-04T12:25:43.3288440Z test_meta.py::TestMetaCUDA::test_meta_outplace_stft_cuda_complex128 PASSED [0.6610s] [ 93%] 2025-12-04T12:25:43.3288541Z test_meta.py::TestMetaCUDA::test_meta_outplace_stft_cuda_complex64 PASSED [0.6869s] [ 93%] 2025-12-04T12:25:43.3288640Z test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_bfloat16 PASSED [0.0106s] [ 93%] 2025-12-04T12:25:43.3288743Z test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_complex128 PASSED [0.0098s] [ 93%] 2025-12-04T12:25:43.3288842Z test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_complex64 PASSED [0.0097s] [ 93%] 2025-12-04T12:25:43.3288940Z test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_float16 PASSED [0.0097s] [ 93%] 2025-12-04T12:25:43.3289036Z test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_int16 PASSED [0.9298s] [ 93%] 2025-12-04T12:25:43.3289133Z test_meta.py::TestMetaCUDA::test_meta_outplace_sub_cuda_uint8 PASSED [0.0116s] [ 93%] 2025-12-04T12:25:43.3289233Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_cuda_complex64 PASSED [0.0081s] [ 93%] 2025-12-04T12:25:43.3289346Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_cuda_float16 PASSED [0.0086s] [ 93%] 2025-12-04T12:25:43.3289439Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_cuda_int16 PASSED [0.9265s] [ 93%] 2025-12-04T12:25:43.3289534Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_cuda_int64 PASSED [0.0098s] [ 93%] 2025-12-04T12:25:43.3289644Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_complex64 PASSED [0.9241s] [ 93%] 2025-12-04T12:25:43.3289753Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_float16 PASSED [0.0103s] [ 93%] 2025-12-04T12:25:43.3289860Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_float32 PASSED [0.0070s] [ 94%] 2025-12-04T12:25:43.3289981Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_float64 PASSED [0.0068s] [ 94%] 2025-12-04T12:25:43.3290102Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_int32 PASSED [0.0070s] [ 94%] 2025-12-04T12:25:43.3290208Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_int64 PASSED [0.0068s] [ 94%] 2025-12-04T12:25:43.3290313Z test_meta.py::TestMetaCUDA::test_meta_outplace_sum_to_size_cuda_int8 PASSED [0.0068s] [ 94%] 2025-12-04T12:25:43.3290448Z test_meta.py::TestMetaCUDA::test_meta_outplace_svd_lowrank_cuda_complex128 PASSED [0.0960s] [ 94%] 2025-12-04T12:25:43.3290556Z test_meta.py::TestMetaCUDA::test_meta_outplace_svd_lowrank_cuda_float32 PASSED [0.0995s] [ 94%] 2025-12-04T12:25:43.3290663Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_complex128 PASSED [0.9366s] [ 94%] 2025-12-04T12:25:43.3290763Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_float16 PASSED [0.0053s] [ 94%] 2025-12-04T12:25:43.3290866Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_float64 PASSED [0.0038s] [ 94%] 2025-12-04T12:25:43.3290966Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_int16 PASSED [0.9302s] [ 94%] 2025-12-04T12:25:43.3291064Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_copy_cuda_int8 PASSED [0.0053s] [ 94%] 2025-12-04T12:25:43.3291162Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_bfloat16 PASSED [0.0032s] [ 94%] 2025-12-04T12:25:43.3291256Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_bool PASSED [0.9194s] [ 94%] 2025-12-04T12:25:43.3291355Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_complex128 PASSED [0.0046s] [ 94%] 2025-12-04T12:25:43.3291449Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_float32 PASSED [0.9174s] [ 94%] 2025-12-04T12:25:43.3291543Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_int32 PASSED [0.0046s] [ 94%] 2025-12-04T12:25:43.3291634Z test_meta.py::TestMetaCUDA::test_meta_outplace_t_cuda_int8 PASSED [0.9189s] [ 94%] 2025-12-04T12:25:43.3291753Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_complex128 PASSED [0.0056s] [ 94%] 2025-12-04T12:25:43.3291868Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_float16 PASSED [0.9358s] [ 94%] 2025-12-04T12:25:43.3291980Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_int16 PASSED [0.0055s] [ 94%] 2025-12-04T12:25:43.3292090Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_int32 PASSED [0.9287s] [ 94%] 2025-12-04T12:25:43.3292199Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_along_dim_cuda_int64 PASSED [0.0051s] [ 94%] 2025-12-04T12:25:43.3292299Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_bfloat16 PASSED [0.0081s] [ 94%] 2025-12-04T12:25:43.3292395Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_bool PASSED [0.9383s] [ 94%] 2025-12-04T12:25:43.3292496Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_complex64 PASSED [0.0095s] [ 94%] 2025-12-04T12:25:43.3292592Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_int8 PASSED [0.0077s] [ 94%] 2025-12-04T12:25:43.3292689Z test_meta.py::TestMetaCUDA::test_meta_outplace_take_cuda_uint8 PASSED [0.0075s] [ 94%] 2025-12-04T12:25:43.3292787Z test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_bfloat16 PASSED [0.9256s] [ 94%] 2025-12-04T12:25:43.3292892Z test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_bool PASSED [0.0043s] [ 94%] 2025-12-04T12:25:43.3292994Z test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_complex128 PASSED [0.9264s] [ 94%] 2025-12-04T12:25:43.3293092Z test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_complex32 PASSED [0.0043s] [ 94%] 2025-12-04T12:25:43.3293190Z test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_float16 PASSED [0.9193s] [ 94%] 2025-12-04T12:25:43.3293286Z test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_float32 PASSED [0.0044s] [ 94%] 2025-12-04T12:25:43.3293381Z test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_int16 PASSED [0.9232s] [ 94%] 2025-12-04T12:25:43.3293486Z test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_int64 PASSED [0.0044s] [ 94%] 2025-12-04T12:25:43.3293592Z test_meta.py::TestMetaCUDA::test_meta_outplace_tan_cuda_uint8 PASSED [0.9201s] [ 95%] 2025-12-04T12:25:43.3293694Z test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_complex128 PASSED [0.0045s] [ 95%] 2025-12-04T12:25:43.3293793Z test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_float64 PASSED [0.9066s] [ 95%] 2025-12-04T12:25:43.3293901Z test_meta.py::TestMetaCUDA::test_meta_outplace_tanh_cuda_uint8 PASSED [0.0042s] [ 95%] 2025-12-04T12:25:43.3294009Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_bool PASSED [0.0055s] [ 95%] 2025-12-04T12:25:43.3294117Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_int16 PASSED [0.0052s] [ 95%] 2025-12-04T12:25:43.3294224Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_int32 PASSED [0.0050s] [ 95%] 2025-12-04T12:25:43.3294334Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensor_split_cuda_uint8 PASSED [0.9359s] [ 95%] 2025-12-04T12:25:43.3294444Z test_meta.py::TestMetaCUDA::test_meta_outplace_tensordot_cuda_complex128 PASSED [0.0077s] [ 95%] 2025-12-04T12:25:43.3294544Z test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_bfloat16 PASSED [0.0170s] [ 95%] 2025-12-04T12:25:43.3294643Z test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_float64 PASSED [0.0167s] [ 95%] 2025-12-04T12:25:43.3294742Z test_meta.py::TestMetaCUDA::test_meta_outplace_tile_cuda_int16 PASSED [0.0166s] [ 95%] 2025-12-04T12:25:43.3294854Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_float16 SKIPPED [0.0001s] (Skipped!) [ 95%] 2025-12-04T12:25:43.3294969Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 95%] 2025-12-04T12:25:43.3295080Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_float64 SKIPPED [0.0001s] (Skipped!) [ 95%] 2025-12-04T12:25:43.3295187Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_int16 SKIPPED [0.0001s] (Skipped!) [ 95%] 2025-12-04T12:25:43.3295295Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_cuda_uint8 SKIPPED [0.0001s] (Skipped!) [ 95%] 2025-12-04T12:25:43.3295397Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_bool PASSED [0.9758s] [ 95%] 2025-12-04T12:25:43.3295505Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_complex128 PASSED [0.0104s] [ 95%] 2025-12-04T12:25:43.3295611Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_float32 PASSED [0.9280s] [ 95%] 2025-12-04T12:25:43.3295717Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_float64 PASSED [0.0053s] [ 95%] 2025-12-04T12:25:43.3295820Z test_meta.py::TestMetaCUDA::test_meta_outplace_to_sparse_cuda_int8 PASSED [0.9294s] [ 95%] 2025-12-04T12:25:43.3295915Z test_meta.py::TestMetaCUDA::test_meta_outplace_topk_cuda_int16 PASSED [0.0082s] [ 95%] 2025-12-04T12:25:43.3296080Z test_meta.py::TestMetaCUDA::test_meta_outplace_torch__scaled_mm_v2_cuda_float8_e4m3fn SKIPPED [0.0007s] (Requires CUDA SM >= 8.9) [ 95%] 2025-12-04T12:25:43.3296327Z test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__efficient_attention_forward_cuda_bfloat16 SKIPPED [0.0007s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 95%] 2025-12-04T12:25:43.3296567Z test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__efficient_attention_forward_cuda_float32 SKIPPED [0.0005s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 95%] 2025-12-04T12:25:43.3296722Z test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_bfloat16 PASSED [0.0086s] [ 95%] 2025-12-04T12:25:43.3296860Z test_meta.py::TestMetaCUDA::test_meta_outplace_torch_ops_aten__safe_softmax_default_cuda_float64 PASSED [0.0078s] [ 95%] 2025-12-04T12:25:43.3296964Z test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_bfloat16 PASSED [0.0024s] [ 95%] 2025-12-04T12:25:43.3297069Z test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_complex128 PASSED [0.9095s] [ 95%] 2025-12-04T12:25:43.3297180Z test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_float16 PASSED [0.0039s] [ 95%] 2025-12-04T12:25:43.3297292Z test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_float64 PASSED [0.9187s] [ 95%] 2025-12-04T12:25:43.3297390Z test_meta.py::TestMetaCUDA::test_meta_outplace_trace_cuda_int32 PASSED [0.0040s] [ 95%] 2025-12-04T12:25:43.3297501Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_bool PASSED [0.0066s] [ 95%] 2025-12-04T12:25:43.3297627Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_complex32 PASSED [0.0059s] [ 95%] 2025-12-04T12:25:43.3297743Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_complex64 PASSED [0.0058s] [ 95%] 2025-12-04T12:25:43.3297853Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_int16 PASSED [0.0057s] [ 96%] 2025-12-04T12:25:43.3297961Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_int64 PASSED [0.0057s] [ 96%] 2025-12-04T12:25:43.3298103Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_int8 PASSED [0.0057s] [ 96%] 2025-12-04T12:25:43.3298212Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_copy_cuda_uint8 PASSED [0.0057s] [ 96%] 2025-12-04T12:25:43.3298322Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_bfloat16 PASSED [0.0041s] [ 96%] 2025-12-04T12:25:43.3298425Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_bool PASSED [0.9343s] [ 96%] 2025-12-04T12:25:43.3298538Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_complex128 PASSED [0.0058s] [ 96%] 2025-12-04T12:25:43.3298646Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_complex64 PASSED [0.0044s] [ 96%] 2025-12-04T12:25:43.3298754Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_float32 PASSED [0.0042s] [ 96%] 2025-12-04T12:25:43.3298859Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_int32 PASSED [0.9257s] [ 96%] 2025-12-04T12:25:43.3298964Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_int64 PASSED [0.0061s] [ 96%] 2025-12-04T12:25:43.3299065Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_int8 PASSED [0.0045s] [ 96%] 2025-12-04T12:25:43.3299166Z test_meta.py::TestMetaCUDA::test_meta_outplace_transpose_cuda_uint8 PASSED [0.0044s] [ 96%] 2025-12-04T12:25:43.3299276Z test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_bfloat16 PASSED [0.0083s] [ 96%] 2025-12-04T12:25:43.3299382Z test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_float16 PASSED [0.0078s] [ 96%] 2025-12-04T12:25:43.3299487Z test_meta.py::TestMetaCUDA::test_meta_outplace_trapezoid_cuda_int32 PASSED [0.0076s] [ 96%] 2025-12-04T12:25:43.3299593Z test_meta.py::TestMetaCUDA::test_meta_outplace_trapz_cuda_complex128 PASSED [0.0075s] [ 96%] 2025-12-04T12:25:43.3299693Z test_meta.py::TestMetaCUDA::test_meta_outplace_trapz_cuda_float16 PASSED [0.0077s] [ 96%] 2025-12-04T12:25:43.3299791Z test_meta.py::TestMetaCUDA::test_meta_outplace_trapz_cuda_float64 PASSED [0.0074s] [ 96%] 2025-12-04T12:25:43.3299908Z test_meta.py::TestMetaCUDA::test_meta_outplace_triangular_solve_cuda_float32 PASSED [0.0130s] [ 96%] 2025-12-04T12:25:43.3300003Z test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_bool PASSED [0.0103s] [ 96%] 2025-12-04T12:25:43.3300104Z test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_complex32 PASSED [0.0099s] [ 96%] 2025-12-04T12:25:43.3300217Z test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_float64 PASSED [0.0098s] [ 96%] 2025-12-04T12:25:43.3300315Z test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_int32 PASSED [0.0097s] [ 96%] 2025-12-04T12:25:43.3300409Z test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_int8 PASSED [0.0097s] [ 96%] 2025-12-04T12:25:43.3300508Z test_meta.py::TestMetaCUDA::test_meta_outplace_tril_cuda_uint8 PASSED [0.9198s] [ 96%] 2025-12-04T12:25:43.3300604Z test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_int32 PASSED [0.0121s] [ 96%] 2025-12-04T12:25:43.3300699Z test_meta.py::TestMetaCUDA::test_meta_outplace_triu_cuda_int64 PASSED [0.0101s] [ 96%] 2025-12-04T12:25:43.3300816Z test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_bool PASSED [0.0093s] [ 96%] 2025-12-04T12:25:43.3300943Z test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_complex128 PASSED [0.0090s] [ 96%] 2025-12-04T12:25:43.3301054Z test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_complex32 PASSED [0.0091s] [ 96%] 2025-12-04T12:25:43.3301165Z test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_float16 PASSED [0.0089s] [ 96%] 2025-12-04T12:25:43.3301288Z test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_float64 PASSED [0.0088s] [ 96%] 2025-12-04T12:25:43.3301396Z test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_int16 PASSED [0.0090s] [ 96%] 2025-12-04T12:25:43.3301503Z test_meta.py::TestMetaCUDA::test_meta_outplace_true_divide_cuda_uint8 PASSED [0.0089s] [ 96%] 2025-12-04T12:25:43.3301606Z test_meta.py::TestMetaCUDA::test_meta_outplace_trunc_cuda_bfloat16 PASSED [0.9107s] [ 96%] 2025-12-04T12:25:43.3301706Z test_meta.py::TestMetaCUDA::test_meta_outplace_trunc_cuda_float32 PASSED [0.0041s] [ 96%] 2025-12-04T12:25:43.3301803Z test_meta.py::TestMetaCUDA::test_meta_outplace_trunc_cuda_int32 PASSED [0.9031s] [ 97%] 2025-12-04T12:25:43.3301901Z test_meta.py::TestMetaCUDA::test_meta_outplace_trunc_cuda_int64 PASSED [0.0044s] [ 97%] 2025-12-04T12:25:43.3301999Z test_meta.py::TestMetaCUDA::test_meta_outplace_trunc_cuda_uint8 PASSED [0.8999s] [ 97%] 2025-12-04T12:25:43.3302114Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_bfloat16 PASSED [0.0059s] [ 97%] 2025-12-04T12:25:43.3302221Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_int32 PASSED [0.0073s] [ 97%] 2025-12-04T12:25:43.3302328Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_int64 PASSED [0.9173s] [ 97%] 2025-12-04T12:25:43.3302433Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_copy_cuda_uint8 PASSED [0.0063s] [ 97%] 2025-12-04T12:25:43.3302534Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_bool PASSED [0.0042s] [ 97%] 2025-12-04T12:25:43.3302634Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_float16 PASSED [0.9019s] [ 97%] 2025-12-04T12:25:43.3302734Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_float32 PASSED [0.0056s] [ 97%] 2025-12-04T12:25:43.3302834Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_float64 PASSED [0.0042s] [ 97%] 2025-12-04T12:25:43.3302933Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_int32 PASSED [0.9087s] [ 97%] 2025-12-04T12:25:43.3303031Z test_meta.py::TestMetaCUDA::test_meta_outplace_unbind_cuda_int64 PASSED [0.0056s] [ 97%] 2025-12-04T12:25:43.3303139Z test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_bfloat16 PASSED [0.0051s] [ 97%] 2025-12-04T12:25:43.3303238Z test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_bool PASSED [0.0047s] [ 97%] 2025-12-04T12:25:43.3303348Z test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_complex128 PASSED [0.0047s] [ 97%] 2025-12-04T12:25:43.3303452Z test_meta.py::TestMetaCUDA::test_meta_outplace_unflatten_cuda_int64 PASSED [0.0045s] [ 97%] 2025-12-04T12:25:43.3303558Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_bool PASSED [0.0122s] [ 97%] 2025-12-04T12:25:43.3303671Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_complex128 PASSED [0.0120s] [ 97%] 2025-12-04T12:25:43.3303792Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_float32 PASSED [0.9174s] [ 97%] 2025-12-04T12:25:43.3303898Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_copy_cuda_int32 PASSED [0.0139s] [ 97%] 2025-12-04T12:25:43.3304002Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_complex32 PASSED [0.9122s] [ 97%] 2025-12-04T12:25:43.3304100Z test_meta.py::TestMetaCUDA::test_meta_outplace_unfold_cuda_int16 PASSED [0.0089s] [ 97%] 2025-12-04T12:25:43.3304203Z test_meta.py::TestMetaCUDA::test_meta_outplace_uniform_cuda_bfloat16 PASSED [0.9067s] [ 97%] 2025-12-04T12:25:43.3304318Z test_meta.py::TestMetaCUDA::test_meta_outplace_uniform_cuda_float64 PASSED [0.0056s] [ 97%] 2025-12-04T12:25:43.3304454Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_bfloat16 PASSED [0.0972s] [ 97%] 2025-12-04T12:25:43.3304570Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_bool PASSED [0.0932s] [ 97%] 2025-12-04T12:25:43.3304689Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_float16 PASSED [0.0949s] [ 97%] 2025-12-04T12:25:43.3304818Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_float64 PASSED [0.0947s] [ 97%] 2025-12-04T12:25:43.3304934Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_consecutive_cuda_uint8 PASSED [0.0953s] [ 97%] 2025-12-04T12:25:43.3305039Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_bfloat16 PASSED [0.2061s] [ 97%] 2025-12-04T12:25:43.3305137Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_bool PASSED [0.1911s] [ 97%] 2025-12-04T12:25:43.3305237Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_float64 PASSED [0.1996s] [ 97%] 2025-12-04T12:25:43.3305335Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_int64 PASSED [0.1980s] [ 97%] 2025-12-04T12:25:43.3305435Z test_meta.py::TestMetaCUDA::test_meta_outplace_unique_cuda_uint8 PASSED [0.2010s] [ 97%] 2025-12-04T12:25:43.3305545Z test_meta.py::TestMetaCUDA::test_meta_outplace_unravel_index_cuda_int32 PASSED [0.0183s] [ 97%] 2025-12-04T12:25:43.3305661Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_complex64 PASSED [0.0031s] [ 98%] 2025-12-04T12:25:43.3305768Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_chunk_cuda_int16 PASSED [0.9155s] [ 98%] 2025-12-04T12:25:43.3305883Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_complex128 PASSED [0.0041s] [ 98%] 2025-12-04T12:25:43.3305995Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_complex32 PASSED [0.9066s] [ 98%] 2025-12-04T12:25:43.3306110Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_complex64 PASSED [0.0041s] [ 98%] 2025-12-04T12:25:43.3306220Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_float16 PASSED [0.9126s] [ 98%] 2025-12-04T12:25:43.3306329Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_float64 PASSED [0.0043s] [ 98%] 2025-12-04T12:25:43.3306438Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsafe_split_cuda_uint8 PASSED [0.9198s] [ 98%] 2025-12-04T12:25:43.3306548Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_bool PASSED [0.0081s] [ 98%] 2025-12-04T12:25:43.3306661Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_copy_cuda_float64 PASSED [0.0063s] [ 98%] 2025-12-04T12:25:43.3306770Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_bfloat16 PASSED [0.9631s] [ 98%] 2025-12-04T12:25:43.3306873Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_bool PASSED [0.0060s] [ 98%] 2025-12-04T12:25:43.3306982Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_complex128 PASSED [0.0047s] [ 98%] 2025-12-04T12:25:43.3307093Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_float32 PASSED [0.0044s] [ 98%] 2025-12-04T12:25:43.3307196Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_int16 PASSED [0.9758s] [ 98%] 2025-12-04T12:25:43.3307301Z test_meta.py::TestMetaCUDA::test_meta_outplace_unsqueeze_cuda_int64 PASSED [0.0062s] [ 98%] 2025-12-04T12:25:43.3307411Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_cuda_complex128 PASSED [0.0103s] [ 98%] 2025-12-04T12:25:43.3307510Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_cuda_float16 PASSED [0.0107s] [ 98%] 2025-12-04T12:25:43.3307607Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_cuda_float32 PASSED [0.0090s] [ 98%] 2025-12-04T12:25:43.3307714Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_cuda_bfloat16 PASSED [0.0086s] [ 98%] 2025-12-04T12:25:43.3307822Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_cuda_complex128 PASSED [1.0009s] [ 98%] 2025-12-04T12:25:43.3307938Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_cuda_float16 PASSED [0.0115s] [ 98%] 2025-12-04T12:25:43.3308052Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_cuda_float32 PASSED [0.0080s] [ 98%] 2025-12-04T12:25:43.3308205Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_unbiased_cuda_complex128 PASSED [0.9851s] [ 98%] 2025-12-04T12:25:43.3308320Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_unbiased_cuda_float16 PASSED [0.0051s] [ 98%] 2025-12-04T12:25:43.3308450Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_mean_unbiased_cuda_float32 PASSED [0.9895s] [ 98%] 2025-12-04T12:25:43.3308561Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_unbiased_cuda_bfloat16 PASSED [0.0046s] [ 98%] 2025-12-04T12:25:43.3308676Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_unbiased_cuda_complex128 PASSED [0.9932s] [ 98%] 2025-12-04T12:25:43.3308787Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_unbiased_cuda_complex64 PASSED [0.0048s] [ 98%] 2025-12-04T12:25:43.3308897Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_unbiased_cuda_float32 PASSED [0.9843s] [ 98%] 2025-12-04T12:25:43.3309006Z test_meta.py::TestMetaCUDA::test_meta_outplace_var_unbiased_cuda_float64 PASSED [0.0045s] [ 98%] 2025-12-04T12:25:43.3309107Z test_meta.py::TestMetaCUDA::test_meta_outplace_vdot_cuda_bfloat16 PASSED [0.9919s] [ 98%] 2025-12-04T12:25:43.3309207Z test_meta.py::TestMetaCUDA::test_meta_outplace_vdot_cuda_complex64 PASSED [0.0066s] [ 98%] 2025-12-04T12:25:43.3309306Z test_meta.py::TestMetaCUDA::test_meta_outplace_vdot_cuda_float16 PASSED [0.0034s] [ 98%] 2025-12-04T12:25:43.3309404Z test_meta.py::TestMetaCUDA::test_meta_outplace_vdot_cuda_float32 PASSED [0.9364s] [ 98%] 2025-12-04T12:25:43.3309520Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_complex_cuda_float16 PASSED [0.0037s] [ 98%] 2025-12-04T12:25:43.3309632Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_complex_cuda_float64 PASSED [0.9095s] [ 99%] 2025-12-04T12:25:43.3309733Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_bool PASSED [0.0051s] [ 99%] 2025-12-04T12:25:43.3309840Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_complex32 PASSED [0.0039s] [ 99%] 2025-12-04T12:25:43.3309947Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_complex64 PASSED [0.9055s] [ 99%] 2025-12-04T12:25:43.3310052Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_float64 PASSED [0.0051s] [ 99%] 2025-12-04T12:25:43.3310152Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_cuda_int16 PASSED [0.0039s] [ 99%] 2025-12-04T12:25:43.3310265Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_as_real_cuda_complex64 PASSED [0.9032s] [ 99%] 2025-12-04T12:25:43.3310364Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_bool PASSED [0.0074s] [ 99%] 2025-12-04T12:25:43.3310467Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_copy_cuda_int64 PASSED [0.9058s] [ 99%] 2025-12-04T12:25:43.3310568Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_complex128 PASSED [0.0057s] [ 99%] 2025-12-04T12:25:43.3310666Z test_meta.py::TestMetaCUDA::test_meta_outplace_view_cuda_int64 PASSED [0.0042s] [ 99%] 2025-12-04T12:25:43.3310775Z test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_complex128 PASSED [0.9002s] [ 99%] 2025-12-04T12:25:43.3310880Z test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_complex32 PASSED [0.0041s] [ 99%] 2025-12-04T12:25:43.3310991Z test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_int16 PASSED [0.9132s] [ 99%] 2025-12-04T12:25:43.3311090Z test_meta.py::TestMetaCUDA::test_meta_outplace_vsplit_cuda_int64 PASSED [0.0042s] [ 99%] 2025-12-04T12:25:43.3311195Z test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_complex128 PASSED [0.9132s] [ 99%] 2025-12-04T12:25:43.3311299Z test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_complex32 PASSED [0.0036s] [ 99%] 2025-12-04T12:25:43.3311398Z test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_float16 PASSED [0.9017s] [ 99%] 2025-12-04T12:25:43.3311511Z test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_float64 PASSED [0.0043s] [ 99%] 2025-12-04T12:25:43.3311621Z test_meta.py::TestMetaCUDA::test_meta_outplace_vstack_cuda_int8 PASSED [1.6047s] [ 99%] 2025-12-04T12:25:43.3311721Z test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_bfloat16 PASSED [0.0156s] [ 99%] 2025-12-04T12:25:43.3311818Z test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_float32 PASSED [0.0133s] [ 99%] 2025-12-04T12:25:43.3311918Z test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_int16 PASSED [0.0132s] [ 99%] 2025-12-04T12:25:43.3312026Z test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_int32 PASSED [0.0145s] [ 99%] 2025-12-04T12:25:43.3312123Z test_meta.py::TestMetaCUDA::test_meta_outplace_xlogy_cuda_int64 PASSED [0.0135s] [ 99%] 2025-12-04T12:25:43.3312225Z test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_complex64 PASSED [0.0050s] [ 99%] 2025-12-04T12:25:43.3312325Z test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_float64 PASSED [1.5516s] [ 99%] 2025-12-04T12:25:43.3312423Z test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_int16 PASSED [0.0058s] [ 99%] 2025-12-04T12:25:43.3312520Z test_meta.py::TestMetaCUDA::test_meta_outplace_zero__cuda_int8 PASSED [0.0037s] [ 99%] 2025-12-04T12:25:43.3312625Z test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_complex128 PASSED [1.5542s] [ 99%] 2025-12-04T12:25:43.3312728Z test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_complex32 PASSED [0.0044s] [ 99%] 2025-12-04T12:25:43.3312826Z test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_float32 PASSED [1.5786s] [ 99%] 2025-12-04T12:25:43.3312924Z test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_float64 PASSED [0.0045s] [ 99%] 2025-12-04T12:25:43.3313022Z test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_cuda_int8 PASSED [1.5509s] [ 99%] 2025-12-04T12:25:43.3313125Z test_meta.py::TestMetaCUDA::test_meta_outplace_zeros_like_cuda_uint8 PASSED [0.0080s] [ 99%] 2025-12-04T12:25:43.3313274Z test_meta.py::TestMetaCUDA::test_mixed_dtype_for_native_layer_norm_backward_float16_bias_dtype2_cuda PASSED [0.0030s] [ 99%] 2025-12-04T12:25:43.3313369Z test_meta.py::TestMetaCUDA::test_triangular_solve_out_cuda PASSED [0.0378s] [100%] 2025-12-04T12:25:43.3313373Z 2025-12-04T12:25:43.3313547Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_meta/test_meta-2408f44677edc4c0.xml - 2025-12-04T12:25:43.3313639Z = 2454 passed, 1137 skipped, 10079 deselected, 41 xfailed in 799.93s (0:13:19) = 2025-12-04T12:25:43.3313817Z The following tests failed and then succeeded when run in a new process['test/test_meta.py::TestMetaCUDA::test_meta_inplace_floor_cuda_float16'] 2025-12-04T12:25:43.3313820Z 2025-12-04T12:25:43.3313942Z FINISHED PRINTING LOG FILE of test_meta 3/3 (test/test-reports/test_meta_3.3_fff0f714bc69ad37_.log) 2025-12-04T12:25:43.3313945Z 2025-12-04T12:25:43.3314035Z Finished test_meta 3/3 ... [2025-12-04 12:25:42.823077][4975925.319251884], took 43.77min 2025-12-04T12:25:43.3314274Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:25:43.3314364Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:25:43.3314462Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T12:25:43.3314523Z Uploading artifacts took 0.00 seconds 2025-12-04T12:25:43.3314614Z Running test_ops_gradients 2/2 ... [2025-12-04 12:25:42.830102][4975925.326289108] 2025-12-04T12:25:43.3314665Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:25:43.3314966Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops_gradients.py', '--shard-id=2', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:25:42.830347] 2025-12-04T12:33:50.0289419Z 2025-12-04T12:33:50.0291071Z test_ops_gradients 2/2 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_gradients_2.2_fd51807903a71a76_.log 2025-12-04T12:33:50.0663836Z Running 2715 items in this shard: test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpyMulScalarCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpyNMSCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpySortCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpySplitCopyCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpyTakeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_NumpyViewCopyCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_T_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___getitem___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___radd___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rdiv___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rdiv___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rpow___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rsub___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad___rsub___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__chunk_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__chunk_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__segment_reduce_offsets_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__softmax_backward_data_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__unsafe_masked_index_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__unsafe_masked_index_put_accumulate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad__upsample_bilinear2d_aa_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_abs_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_acos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_acosh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_acosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addcdiv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addcdiv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addcmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addcmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addmm_decomposed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_addmv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_alias_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_all_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_allclose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_allclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_aminmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_any_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_any_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_arange_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_argwhere_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_as_strided_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_as_strided_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_as_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_as_strided_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_as_strided_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_asin_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_asin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_asinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atanh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atleast_1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_atleast_1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_baddbmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_bernoulli_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_block_diag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_block_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_bmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_broadcast_to_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_broadcast_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_bucketize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_byte_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_byte_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cartesian_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cdouble_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cfloat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_chalf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_char_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_char_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cholesky_inverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cholesky_inverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cholesky_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_clamp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_clamp_max_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_clone_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_clone_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_column_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_combinations_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_combinations_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_conj_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_conj_physical_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_conj_physical_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_constant_pad_nd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_constant_pad_nd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_copysign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cosh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cov_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cummax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cumulative_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_cumulative_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diagflat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diagonal_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diff_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_diff_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_digamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_dist_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_div_no_rounding_mode_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_dot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_dsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_dsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_dstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_dstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_einsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_empty_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_empty_permuted_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_empty_permuted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_eq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_erf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_exp2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_expand_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_expand_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_expand_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_expm1_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_expm1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_eye_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_eye_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_fft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_fft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_fft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_fft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_hfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_hfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_hfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_hfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ifft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ifft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ifft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ifft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ifftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ifftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ifftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_ihfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_irfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_irfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_rfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fft_rfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_flip_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fliplr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_float_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_floor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_floor_divide_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_fmod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_frac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_frexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_full_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_gather_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_geometric_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_geqrf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_grid_sampler_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_half_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_hash_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_heaviside_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_hsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_hsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_hstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_hypot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_igammac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_index_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_inner_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_int_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_int_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isfinite_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isfinite_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isnan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isnan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isneginf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isreal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_isreal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_istft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_item_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_2inputs_2outputs_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_binary_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_jiterator_unary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_kron_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_kthvalue_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ldexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ldexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_le_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lerp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lgamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_cholesky_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_cond_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_cond_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_det_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_det_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_eig_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_eig_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_eigh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_householder_product_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_inv_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_ldl_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_ldl_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_ldl_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lstsq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lstsq_grad_oriented_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lstsq_grad_oriented_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lu_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_lu_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_matrix_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_matrix_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_matrix_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_matrix_rank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_matrix_rank_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_multi_dot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_norm_subgradients_at_zero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_pinv_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_pinv_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_pinv_singular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_qr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_slogdet_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_slogdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_solve_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_solve_triangular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_solve_triangular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_svd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_svdvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_tensorinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_tensorinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_tensorsolve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_vander_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_vecdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linalg_vector_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_linspace_tensor_overload_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log10_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log1p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_log_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logaddexp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logaddexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logcumsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logdet_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logical_and_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logical_and_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logical_not_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logical_or_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logical_xor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logspace_tensor_overload_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lu_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_lu_unpack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_argmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_cumprod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_cumprod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_normalize_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_normalize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_masked_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_matrix_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_max_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_meshgrid_list_of_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_meshgrid_variadic_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_min_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_movedim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nan_to_num_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nanmean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nanmedian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nanquantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nansum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_narrow_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_narrow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_native_dropout_backward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_native_layer_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ne_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_neg_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_new_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nextafter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_alpha_dropout_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_batch_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_binary_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_celu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_channel_shuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv_transpose2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv_transpose3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_conv_transpose3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_ctc_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_dropout2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_dropout_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_elu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_embedding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_fractional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_fractional_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_gaussian_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_glu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_grid_sample_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_hardshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_hardsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_hardtanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_instance_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_interpolate_area_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_interpolate_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_interpolate_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_interpolate_trilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_leaky_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_local_response_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_logsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_max_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_max_unpool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_max_unpool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_max_unpool2d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_max_unpool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_max_unpool3d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_mish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_multi_head_attention_forward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_normalize_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_normalize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pad_replicate_negative_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pairwise_distance_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pairwise_distance_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_prelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_rms_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_selu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_silu_complex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_silu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_softplus_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_tanhshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_threshold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nn_functional_upsample_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_nonzero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_norm_fro_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_norm_inf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_norm_inf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_norm_nuc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_normal_in_place_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_normal_number_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ones_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ormqr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_ormqr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_pca_lowrank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_pca_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_permute_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_permute_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_polar_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_positive_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_positive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_pow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_qr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_rad2deg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_rand_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_rand_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_randint_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_randint_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_randn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_randn_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_real_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_reciprocal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_reciprocal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_remainder_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_renorm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_repeat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_repeat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_repeat_interleave_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_reshape_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_reshape_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_reshape_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_reshape_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_resize_as__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_resolve_conj_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_resolve_neg_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_resolve_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_roll_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_roll_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_rot90_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_round_decimals_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_round_decimals_neg_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_rsub_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scalar_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_scatter_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_searchsorted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_select_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_short_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_blackman_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_gaussian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_general_hamming_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_hamming_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signal_windows_kaiser_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_signbit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sinc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sinh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sparse_mm_reduce_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_bessel_j1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_bessel_y1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_chebyshev_polynomial_w_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_entr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_hermite_polynomial_h_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_i0e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_i1e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_log_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_modified_bessel_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_modified_bessel_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_modified_bessel_k1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_special_zeta_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_split_list_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_split_with_sizes_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sqrt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_square_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_squeeze_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_squeeze_multiple_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_std_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_stft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_stft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_sum_to_size_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_svd_lowrank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_t_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_take_along_dim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tensor_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tensor_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tensordot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_tile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_to_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_to_sparse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_to_sparse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_topk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_trace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_transpose_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_transpose_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_trapz_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_trapz_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_triangular_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_triu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_true_divide_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_true_divide_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unbind_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unbind_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unflatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unflatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unfold_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_uniform_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_uniform_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unique_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unsafe_chunk_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unsafe_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unsqueeze_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_unsqueeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_var_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_var_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_vdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_view_as_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_view_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_view_as_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_view_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_view_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_view_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_vsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_vstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_xlogy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_zero__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_zeros_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_fail_gradgrad_zeros_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_H_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpyCubeCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpyMulCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpyNMSCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpySortCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpySplitCopyCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpySplitCopyWithIntCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_NumpyViewCopyCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_T_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___getitem___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___radd___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rmatmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rmod___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad___rsub___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__batch_norm_with_update_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__segment_reduce_lengths_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__unsafe_masked_index_put_accumulate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad__upsample_bilinear2d_aa_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_abs_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_abs_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_acos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_acos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_acosh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addcdiv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addcmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addmv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addmv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_addr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_alias_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_all_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_all_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_allclose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_angle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_any_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_any_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_as_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_as_strided_partial_views_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_as_strided_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_as_strided_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_asin_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_asinh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_asinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atanh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atleast_1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atleast_2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_atleast_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_baddbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_baddbmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_bfloat16_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_block_diag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_bmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_bool_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_bool_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_broadcast_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_broadcast_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_broadcast_to_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_byte_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cartesian_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cdouble_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cfloat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_chalf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_char_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_clamp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_clamp_max_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_clone_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_column_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_column_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_combinations_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_combinations_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cond_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_conj_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_conj_physical_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_constant_pad_nd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_contiguous_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_contiguous_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_corrcoef_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_corrcoef_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_count_nonzero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_count_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cov_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cumprod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_cumprod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_deg2rad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diag_embed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diagflat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diagonal_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diagonal_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_diagonal_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_dist_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_div_floor_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_div_no_rounding_mode_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_div_no_rounding_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_div_trunc_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_dot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_dsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_dstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_empty_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_equal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_erf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_exp2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_expand_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_expand_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_expm1_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_eye_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_fft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_fft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_fftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_hfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ifftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_ihfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_irfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_irfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_irfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_irfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_rfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_rfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fft_rfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_flatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_flip_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_flip_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fliplr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fliplr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_flipud_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_float_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_fmod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_full_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_full_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_geqrf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_gradient_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_grid_sampler_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_half_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_half_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_hash_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_histc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_hsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_hsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_hstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_hstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_index_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_inner_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_invoke_quant_packed_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_invoke_subgraph_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isfinite_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isinf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isnan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isposinf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_isreal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_item_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_item_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_unary_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_jiterator_unary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_kron_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ldexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_eig_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_eigh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_eigvalsh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_householder_product_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_inv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_inv_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_inv_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_ldl_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_ldl_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_ldl_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_ldl_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lstsq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lstsq_grad_oriented_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_lu_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_matrix_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_matrix_rank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_matrix_rank_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_norm_subgradients_at_zero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_pinv_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_pinv_singular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_pinv_singular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_slogdet_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_svdvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_tensorinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_vander_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_vecdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_vecdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linalg_vector_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linspace_tensor_overload_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_linspace_tensor_overload_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log10_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log1p_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log1p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_log_softmax_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logdet_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logical_and_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logical_and_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logical_not_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logical_xor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logspace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logspace_tensor_overload_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logspace_tensor_overload_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_logsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_long_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_long_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_lt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_lu_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_lu_unpack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_lu_unpack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mH_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mH_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_map_nested_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_cumprod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_cumprod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_normalize_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_masked_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_matmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_matrix_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_matrix_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_max_pool2d_with_indices_backward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_meshgrid_variadic_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_meshgrid_variadic_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_min_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_minimum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_movedim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_movedim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_msort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_multinomial_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nan_to_num_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nanmean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nanmean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nanmedian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nanquantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_narrow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_native_batch_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_native_dropout_backward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ne_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_neg_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_zeros_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_new_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nextafter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_alpha_dropout_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_batch_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_celu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_channel_shuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_channel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv_transpose1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_conv_transpose3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_dropout_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_elu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_embedding_bag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_embedding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_fractional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_fractional_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_gelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_grid_sample_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_group_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_hardshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_hardswish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_hardtanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_huber_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_interpolate_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_kl_div_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_linear_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_logsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_unpool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_max_unpool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_mse_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_multi_head_attention_forward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_multilabel_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_normalize_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_normalize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pad_circular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pad_circular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pad_constant_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pad_reflect_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pad_reflect_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pad_replicate_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pad_replicate_negative_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_prelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_rms_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_rms_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_rrelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_selu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_silu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_softplus_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_softshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_softsign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_threshold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_nonzero_static_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_norm_fro_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_norm_inf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_norm_inf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_normal_in_place_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ones_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ormqr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_permute_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_permute_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_polygamma_polygamma_n_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_polygamma_polygamma_n_2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_polygamma_polygamma_n_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_positive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_pow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_qr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_rad2deg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_randn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_randn_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_randn_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_ravel_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_real_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_reciprocal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_remainder_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_renorm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_repeat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_repeat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_repeat_interleave_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_reshape_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_reshape_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_resize__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_resize_as__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_resolve_conj_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_resolve_neg_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_resolve_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_roll_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_roll_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_rot90_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_round_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_round_decimals_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_round_decimals_neg_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_rsqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_rsqrt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_rsub_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scalar_tensor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scalar_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scan_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scatter_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scatter_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scatter_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_scatter_reduce_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_select_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sgn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_short_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_short_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_signal_windows_bartlett_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_signal_windows_blackman_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_signal_windows_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_signal_windows_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_signal_windows_gaussian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_signal_windows_general_hamming_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_signal_windows_hamming_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_signal_windows_hann_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_signal_windows_kaiser_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_signal_windows_nuttall_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_signbit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sin_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sinc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sinc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sinh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_slice_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sparse_sampled_addmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sparse_sampled_addmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_airy_ai_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_entr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_hermite_polynomial_he_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_laguerre_polynomial_l_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_legendre_polynomial_p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_special_spherical_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_split_with_sizes_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_split_with_sizes_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_squeeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_squeeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_squeeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_std_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sub_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_sum_to_size_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_t_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_t_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_take_along_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_take_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tanh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tensordot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tile_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_to_sparse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_to_sparse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_topk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_trace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_transpose_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_transpose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_transpose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_trapz_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_trapz_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_triangular_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_triangular_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_tril_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_triu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_triu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_true_divide_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_trunc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unbind_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unbind_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_uniform_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_uniform_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unsafe_chunk_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unsafe_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unsqueeze_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unsqueeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_unsqueeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_var_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_var_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_var_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_var_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_var_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_view_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_vsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_vstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_where_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_while_loop_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_zero__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_zeros_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_zeros_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_grad_zeros_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_H_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpyCatCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpyMulScalarCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpyNMSCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_NumpySortCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_T_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_T_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___getitem___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rmatmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rsub___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad___rsub___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__batch_norm_with_update_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__native_batch_norm_legit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__segment_reduce_lengths_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__segment_reduce_offsets_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__softmax_backward_data_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad__upsample_bilinear2d_aa_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_abs_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_abs_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_acos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addcdiv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addmv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_addmv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_alias_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_all_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_all_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_allclose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_allclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_aminmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_angle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_angle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_argwhere_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_as_strided_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_asin_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_asinh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_asinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atanh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atleast_1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_atleast_1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_auto_functionalize_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_baddbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_baddbmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_bfloat16_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_block_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_bmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_bool_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_bool_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_broadcast_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_broadcast_to_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_broadcast_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_bucketize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_byte_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cartesian_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cauchy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cdouble_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ceil_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cfloat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cfloat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_chalf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_char_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_char_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cholesky_inverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cholesky_inverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_clamp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_clamp_max_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_clamp_min_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_clone_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_column_stack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_column_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_combinations_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_combinations_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_conj_physical_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_constant_pad_nd_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_constant_pad_nd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_contiguous_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cosh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_count_nonzero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cumprod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cumulative_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_cumulative_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_deg2rad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diag_embed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diagflat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diagonal_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diagonal_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_diff_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_digamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_dist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_div_no_rounding_mode_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_dot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_double_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_double_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_dstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_dstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_einsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_empty_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_empty_permuted_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_empty_permuted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_eq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_eq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_equal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_erf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_erfinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_exp2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_expand_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_expand_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_expand_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_expand_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_expand_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_expm1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_exponential_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_eye_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_fft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_fft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_fftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_fftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_fftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_hfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_hfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_hfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ifft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ifft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ifftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ihfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_ihfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_irfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_irfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_irfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_irfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_rfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_rfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fft_rfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_flip_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_flip_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fliplr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fliplr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_flipud_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_float_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_float_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_float_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_floor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_fmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_frac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_gather_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ge_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_geqrf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_half_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_heaviside_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_hstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_igamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_igammac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_imag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_index_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_inner_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_inner_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_invoke_subgraph_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isclose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isfinite_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isinf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isnan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isnan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isposinf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_isreal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_istft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_item_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_item_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_2inputs_2outputs_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_binary_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_binary_return_by_ref_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_binary_return_by_ref_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_jiterator_unary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_kthvalue_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ldexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_lerp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_lerp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_lgamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_cholesky_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_cholesky_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_cholesky_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_cholesky_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_eig_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_eigh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_eigvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_eigvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_ldl_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_ldl_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lstsq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lstsq_grad_oriented_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_lu_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_matrix_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_matrix_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_matrix_rank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_matrix_rank_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_multi_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_pinv_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_pinv_singular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_qr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_slogdet_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_solve_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_solve_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_solve_triangular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_tensorinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_tensorsolve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_vecdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linalg_vector_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linspace_tensor_overload_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_linspace_tensor_overload_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log1p_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_log_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logcumsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logical_and_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logical_not_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logical_xor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logspace_tensor_overload_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_logsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_long_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_lt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_lu_unpack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_lu_unpack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mH_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mH_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mT_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_map_nested_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_map_triple_nested_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_cumprod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_logsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_masked_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_matmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_matrix_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_max_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_max_pool2d_with_indices_backward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_maximum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_meshgrid_list_of_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_meshgrid_list_of_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_meshgrid_variadic_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_meshgrid_variadic_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_min_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_min_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_minimum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_movedim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_msort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nanmean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nanmean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nanmedian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nansum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_narrow_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_narrow_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_narrow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_native_batch_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_native_dropout_backward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_native_layer_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_zeros_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_new_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nextafter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_batch_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_channel_shuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_conv_transpose2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_cosine_similarity_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_ctc_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_dropout_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_elu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_embedding_bag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_embedding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_fractional_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_gaussian_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_hardswish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_hardtanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_huber_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_instance_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_interpolate_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_layer_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_linear_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_unpool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_max_unpool3d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_mish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_multi_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_normalize_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_normalize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_constant_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_constant_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_reflect_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_reflect_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_replicate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pad_replicate_negative_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_rms_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_rms_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_rrelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_selu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_silu_complex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_silu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_softplus_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_softshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_softsign_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_softsign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_tanhshrink_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_tanhshrink_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_threshold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nn_functional_upsample_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nonzero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_nonzero_static_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_norm_fro_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_norm_fro_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_norm_inf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_normal_in_place_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_normal_in_place_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ones_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ormqr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ormqr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_pca_lowrank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_permute_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_permute_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_permute_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_pinverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_polygamma_polygamma_n_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_polygamma_polygamma_n_2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_polygamma_polygamma_n_4_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_positive_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_positive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_qr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_qr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rad2deg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_rand_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_randn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_randn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_ravel_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_reciprocal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_renorm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_renorm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_repeat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_repeat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_repeat_interleave_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_reshape_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_reshape_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_reshape_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_resize__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_resize_as__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_resolve_neg_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_roll_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_round_decimals_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scalar_tensor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scalar_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scan_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scatter_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scatter_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scatter_reduce_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_scatter_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_select_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sgn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_bartlett_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_blackman_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_kaiser_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_signal_windows_nuttall_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sin_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sinc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_slice_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sparse_mm_reduce_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sparse_sampled_addmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sparse_sampled_addmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_airy_ai_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_bessel_j1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_bessel_y0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_bessel_y1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_hermite_polynomial_h_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_hermite_polynomial_he_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_i1e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_laguerre_polynomial_l_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_legendre_polynomial_p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_log_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_modified_bessel_i0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_modified_bessel_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_modified_bessel_k1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_ndtri_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_spherical_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_special_xlog1py_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_list_args_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_split_with_sizes_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sqrt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_squeeze_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_squeeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_squeeze_multiple_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_stack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_std_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_std_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_std_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_stft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sub_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sum_to_size_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_sum_to_size_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_svd_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_t_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_take_along_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tanh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tensor_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tensordot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tile_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_to_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_topk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_transpose_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_transpose_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_transpose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_trapz_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_triangular_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_triangular_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tril_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_tril_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_triu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_true_divide_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unbind_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unbind_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unflatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unflatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unfold_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_uniform_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unsafe_chunk_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unsafe_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_unsqueeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_var_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_var_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_var_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_var_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_view_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_view_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_view_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_vstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_while_loop_simple_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_xlogy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_zero__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_zero__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_zeros_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_fn_gradgrad_zeros_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_H_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_H_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyMulCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpySortCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpySplitCopyCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_NumpyViewCopyCustomOp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_T_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___getitem___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rdiv___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rdiv___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rmatmul___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rmatmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rmod___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rpow___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rpow___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad___rsub___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__batch_norm_with_update_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__segment_reduce_offsets_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__unsafe_masked_index_put_accumulate_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad__upsample_bilinear2d_aa_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_abs_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_acos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_acosh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_acosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addbmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addcdiv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addcmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addmv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_addr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_alias_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_all_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_allclose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_aminmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_angle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_any_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_argwhere_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_as_strided_partial_views_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_asin_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_asin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_asinh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_asinh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atanh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atleast_1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atleast_2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atleast_3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_atleast_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_baddbmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_bernoulli_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_bfloat16_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_block_diag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_bmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_bool_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_bool_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_broadcast_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_bucketize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_byte_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cartesian_prod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cfloat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_chalf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_chalf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cholesky_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_chunk_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_clamp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_clamp_max_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_combinations_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_conj_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_conj_physical_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_conj_physical_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_copysign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_corrcoef_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_corrcoef_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cosh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cosh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_count_nonzero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_count_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cross_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cummin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cumprod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_cumulative_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diag_embed_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diagflat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diagonal_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diff_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_diff_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_dist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_div_floor_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_div_no_rounding_mode_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_div_trunc_rounding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_dot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_double_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_double_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_dsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_dstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_einsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_empty_permuted_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_equal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_erf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_erfc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_exp2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_expand_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_expand_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_expm1_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_expm1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_eye_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_fftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_hfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_hfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_hfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_hfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_hfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_hfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ifft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ifft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ifft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ifft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ifftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ifftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ifftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_ihfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_irfft2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_irfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_irfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_irfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_rfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fft_rfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_flatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_flip_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fliplr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_flipud_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_flipud_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_float_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_float_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_float_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_floor_divide_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_fmod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_frac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_full_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_gather_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_gather_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ge_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_geometric_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_geqrf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_gradient_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_grid_sampler_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_gt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_half_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_hsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_hsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_hstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_index_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_int_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_int_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isclose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isfinite_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isfinite_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isinf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isinf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isneginf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isposinf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isreal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_isreal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_istft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_item_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_item_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_2inputs_2outputs_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_binary_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_binary_return_by_ref_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_jiterator_unary_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_kron_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ldexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ldexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_le_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lerp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cholesky_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_cond_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_eigh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_eigvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_eigvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_eigvalsh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_eigvalsh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_householder_product_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_inv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_inv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_inv_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_ldl_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_ldl_factor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_ldl_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_ldl_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_ldl_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_lu_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_matrix_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_matrix_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_matrix_rank_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_multi_dot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_multi_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_qr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_slogdet_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_slogdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_solve_triangular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_svdvals_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_tensorinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_tensorsolve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_tensorsolve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_vander_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linalg_vector_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_linspace_tensor_overload_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log10_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_log_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logaddexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logcumsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logdet_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logical_not_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logical_not_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logical_xor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logspace_tensor_overload_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logspace_tensor_overload_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lu_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lu_unpack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_lu_unpack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mH_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mT_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_cumprod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_logsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_normalize_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_normalize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_masked_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_matrix_exp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_matrix_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_max_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_max_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_maximum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_meshgrid_list_of_tensors_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_meshgrid_list_of_tensors_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_min_reduction_with_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_minimum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_movedim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_msort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nan_to_num_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nanmean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nanmean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nanquantile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_narrow_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_narrow_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_narrow_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_native_batch_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_native_layer_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ne_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ne_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_empty_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_full_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_new_zeros_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nextafter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_batch_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_celu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_channel_shuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_channel_shuffle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv_transpose1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_conv_transpose3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_ctc_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_dropout2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_elu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_embedding_bag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_embedding_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_grid_sample_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_hardswish_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_hardtanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_huber_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_instance_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_interpolate_area_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_interpolate_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_interpolate_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_layer_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_leaky_relu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_local_response_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_max_unpool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_mse_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_multi_head_attention_forward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_normalize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_constant_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_replicate_negative_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pad_replicate_negative_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pairwise_distance_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pdist_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_prelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_relu6_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_rms_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_rrelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_silu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_softsign_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_softsign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_tanhshrink_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_upsample_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nn_functional_upsample_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nonzero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_nonzero_static_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_norm_inf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ones_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ormqr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_outer_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_outer_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_pca_lowrank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_pca_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_permute_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_permute_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_pinverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_pinverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_polar_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_polygamma_polygamma_n_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_polygamma_polygamma_n_4_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_positive_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_positive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_pow_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_qr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_rad2deg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_rand_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_rand_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_randn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_randn_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ravel_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_ravel_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_reciprocal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_renorm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_renorm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_repeat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_repeat_interleave_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_reshape_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_reshape_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_resize__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_resize_as__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_resize_as__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_resolve_conj_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_resolve_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_resolve_neg_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_roll_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_roll_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_rot90_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_rot90_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_rsqrt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_rsub_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_rsub_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scalar_tensor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scalar_tensor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_scatter_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sgn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_short_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_bartlett_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_blackman_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_gaussian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_hamming_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signal_windows_hann_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_signbit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sin_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sinc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sinc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_slice_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_slice_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_slice_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sparse_mm_reduce_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sparse_sampled_addmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sparse_sampled_addmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_bessel_j1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_bessel_y0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_entr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_i0e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_i1e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_laguerre_polynomial_l_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_log_ndtr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_modified_bessel_k1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_scaled_modified_bessel_k1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_split_list_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_split_with_sizes_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_split_with_sizes_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_sqrt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_square_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_squeeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_squeeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_std_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_std_mean_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_std_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_stft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_stft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_svd_lowrank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_t_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_t_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_take_along_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_take_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tan_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tanh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tensor_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tensordot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_to_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_topk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_trace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_transpose_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_transpose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_transpose_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_trapz_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_triangular_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_tril_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_triu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_triu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_true_divide_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_true_divide_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_trunc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unfold_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unfold_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unique_consecutive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unique_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unsafe_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unsafe_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unsqueeze_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unsqueeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_unsqueeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_var_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_var_mean_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_var_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_var_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_vdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_view_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_view_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_view_as_real_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_view_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_vsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_vstack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_vstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_where_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_where_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_zero__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_zeros_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_zeros_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_grad_zeros_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_H_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_H_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_T_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___getitem___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___radd___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rdiv___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rmatmul___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rmod___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rpow___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rpow___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rsub___cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad___rsub___cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__batch_norm_with_update_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__chunk_cat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad__segment_reduce_lengths_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_abs_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_abs_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_acos_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_acos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_add_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addbmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addcmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addmm_decomposed_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_addr_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_alias_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_all_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_all_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_allclose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_aminmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_angle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_angle_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_any_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_any_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_argsort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_as_strided_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_as_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_as_strided_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_as_strided_partial_views_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_as_strided_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_asin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_asinh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atan_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atanh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atleast_1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atleast_2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atleast_2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atleast_3d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_atleast_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_baddbmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_bfloat16_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_block_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_bmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_bmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_broadcast_to_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_broadcast_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_bucketize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_byte_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_byte_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cartesian_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ceil_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_chalf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_char_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cholesky_inverse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cholesky_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_clamp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_combinations_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_conj_physical_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_constant_pad_nd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_contiguous_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cos_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cov_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cumprod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cumprod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_cumulative_trapezoid_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_deg2rad_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diag_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diagflat_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diagflat_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diagonal_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diagonal_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diagonal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diagonal_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_diff_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_digamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_dist_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_dot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_dot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_double_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_dstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_einsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_empty_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_empty_permuted_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_empty_strided_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_eq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_eq_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_equal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_equal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_erfc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_exp2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_exp2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_expand_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_expand_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_expm1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_fft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_fft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_fftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_hfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_hfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ifft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ifft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ifftshift_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ifftshift_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_ihfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_irfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_irfft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_irfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_irfftn_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_irfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_rfft2_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_rfft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fft_rfftn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_flatten_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_flatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_flip_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_float_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_float_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_float_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_floor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_floor_divide_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_fmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_frac_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_full_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_full_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_full_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_gather_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ge_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_geqrf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_gradient_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_grid_sampler_3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_half_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_heaviside_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_hsplit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_hstack_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_hypot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_igamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_fill_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_put_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_reduce_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_reduce_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_index_select_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_int_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isclose_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isfinite_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isinf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isinf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isposinf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isreal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_isreal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_istft_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_2inputs_2outputs_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_binary_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_binary_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_jiterator_unary_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_kthvalue_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lerp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lgamma_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cholesky_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_cross_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_det_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_det_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_diagonal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_eig_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_eig_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_eigh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_eigh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_eigvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_inv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_inv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_inv_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_inv_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_ldl_factor_ex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_ldl_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_ldl_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lstsq_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lstsq_grad_oriented_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lu_factor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_lu_factor_ex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_matrix_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_matrix_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_matrix_power_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_matrix_power_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_matrix_rank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_matrix_rank_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_multi_dot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_norm_subgradients_at_zero_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_pinv_hermitian_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_pinv_hermitian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_pinv_singular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_slogdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_solve_triangular_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_solve_triangular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_svd_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_svdvals_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_tensorinv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_tensorinv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_tensorsolve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linalg_vander_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linspace_tensor_overload_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_linspace_tensor_overload_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log1p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log2_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_log_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logaddexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logcumsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logcumsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logdet_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logical_not_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logical_not_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logical_or_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logical_or_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logical_xor_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logspace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_logspace_tensor_overload_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lu_solve_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_lu_unpack_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mH_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mH_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mT_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_amax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_argmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_cumprod_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_cumprod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_cumsum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_cumsum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_fill_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_log_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_logaddexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_logsumexp_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_logsumexp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_median_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_normalize_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_normalize_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_std_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_sum_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_masked_var_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_matmul_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_matmul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_matrix_exp_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_max_pool2d_with_indices_backward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_max_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_min_reduction_no_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_minimum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mode_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_movedim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_msort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mul_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mv_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mv_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nan_to_num_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nanmean_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nanmean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nanmedian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nansum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_native_dropout_backward_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_native_layer_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ne_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ne_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_neg_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_empty_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_empty_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_ones_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_new_zeros_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nextafter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_alpha_dropout_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_avg_pool1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_batch_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_bilinear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_channel_shuffle_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv1d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv2d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv_transpose2d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_conv_transpose3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_cross_entropy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_ctc_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_dropout3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_elu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_embedding_bag_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_fractional_max_pool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_gaussian_nll_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_glu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_hardsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_hardtanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_instance_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_interpolate_area_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_interpolate_linear_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_l1_loss_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_l1_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_local_response_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_logsigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_max_unpool3d_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_mse_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_multi_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_multilabel_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_circular_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_reflect_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_pad_reflect_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_rms_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_rms_norm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_rrelu_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_silu_complex_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_softmin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_softsign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_tanhshrink_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_threshold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nn_functional_upsample_nearest_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_nonzero_static_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_norm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_norm_fro_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_norm_inf_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_norm_inf_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_normal_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_normal_in_place_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_normal_in_place_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_normal_number_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ones_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ones_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ones_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_outer_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_pca_lowrank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_pca_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_permute_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_permute_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_permute_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_pinverse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_polygamma_polygamma_n_1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_prod_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_put_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_qr_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_rand_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_randint_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_randint_like_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_randn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_randn_like_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_ravel_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_reciprocal_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_remainder_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_repeat_interleave_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_reshape_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_reshape_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_resize__cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_resize__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_resize_as__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_resolve_conj_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_resolve_conj_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_resolve_neg_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_roll_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_rot90_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_round_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_round_decimals_0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_rsqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_rsqrt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_rsub_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scalar_tensor_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_add_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_reduce_amin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_scatter_reduce_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_select_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sgn_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_short_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sigmoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sign_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_bartlett_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_blackman_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_gaussian_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_general_cosine_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_hann_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_kaiser_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signal_windows_nuttall_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_signbit_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sin_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sin_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sinc_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sinc_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sinh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_slice_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_slice_scatter_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_softmax_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_softmax_with_dtype_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sort_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sparse_mm_reduce_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sparse_sampled_addmm_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sparse_sampled_addmm_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_airy_ai_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_bessel_j1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_bessel_y1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_erfcx_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_hermite_polynomial_h_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_i0e_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_i1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_laguerre_polynomial_l_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_legendre_polynomial_p_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_modified_bessel_k0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_scaled_modified_bessel_k1_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_special_spherical_bessel_j0_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_split_list_args_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_split_with_sizes_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_split_with_sizes_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_split_with_sizes_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sqrt_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sqrt_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_square_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_squeeze_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_squeeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_squeeze_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_squeeze_multiple_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_unbiased_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_std_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_stft_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sub_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sum_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sum_to_size_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_sum_to_size_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_svd_lowrank_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_svd_lowrank_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_t_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_t_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_t_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_t_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_take_along_dim_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_take_along_dim_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tanh_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tanh_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tensordot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tile_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_to_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_to_sparse_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_to_sparse_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_trace_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_trace_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_transpose_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_trapezoid_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_trapz_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_trapz_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_triangular_solve_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_tril_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_triu_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_true_divide_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_true_divide_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unbind_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unbind_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unflatten_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unfold_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unfold_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_uniform_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_uniform_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unique_consecutive_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unsafe_chunk_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unsafe_chunk_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unsafe_split_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unsafe_split_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unsqueeze_copy_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unsqueeze_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_unsqueeze_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_var_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_var_mean_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_var_unbiased_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_vdot_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_vdot_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_view_as_complex_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_view_as_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_view_as_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_view_copy_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_view_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_view_cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_vsplit_cuda_complex128, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_zero__cuda_float64, test/test_ops_gradients.py::TestBwdGradientsCUDA::test_inplace_gradgrad_zeros_cuda_complex128 2025-12-04T12:33:50.0998838Z 2025-12-04T12:33:50.0999003Z Finished test_ops_gradients 2/2 ... [2025-12-04 12:33:50.033207][4976412.529389919], took 8.12min 2025-12-04T12:33:50.0999430Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:33:50.0999800Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:33:50.1000017Z Running test_linalg 2/2 ... [2025-12-04 12:33:50.052039][4976412.54822547] 2025-12-04T12:33:50.1000192Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:33:50.1000579Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_linalg.py', '--shard-id=2', '--num-shards=2', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:33:50.052510] 2025-12-04T12:38:08.6854112Z 2025-12-04T12:38:08.6855178Z test_linalg 2/2 was successful, full logs can be found in artifacts with path test/test-reports/test_linalg_2.2_b60d9de6aaf60ebb_.log 2025-12-04T12:38:08.7087969Z Running 612 items in this shard: test/test_linalg.py::TestLinalgCUDA::test_1_sized_with_0_strided_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_1_sized_with_0_strided_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_1_k_128_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_1_k_128_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_1_k_64_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_32_k_128_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_matmul_4bit_m_32_k_64_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_256_n_32_cuda, test/test_linalg.py::TestLinalgCUDA::test__dyn_quant_pack_4bit_weight_k_256_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_32_k_32_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_64_k_32_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_64_k_32_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test__int4_mm_m_64_k_64_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_large_shape_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_48_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_64_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_32_n_64_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_48_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_48_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_64_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_64_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_32_k_64_n_64_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_48_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_48_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_48_compile_True_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_64_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_32_n_64_compile_False_slice_True_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_48_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_64_compile_False_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int8_mm_m_64_k_64_n_64_compile_True_slice_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_0_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_17_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_0_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_16_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_False_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_False_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_1_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_16_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_False_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_False_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_0_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_cpu_m_8_k_32_n_32_use_transpose_a_True_use_transpose_b_True_non_contig_type_2_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_16_n_32_use_transpose_a_False_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_16_use_transpose_a_True_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_32_use_transpose_a_False_use_transpose_b_False_cuda, test/test_linalg.py::TestLinalgCUDA::test__int_mm_k_32_n_32_use_transpose_a_False_use_transpose_b_True_cuda, test/test_linalg.py::TestLinalgCUDA::test_addbmm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addmm_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_gelu_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_gelu_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_gelu_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_0_2_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_False_alpha_1_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_0_2_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_False_transpose_b_True_alpha_1_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_0_2_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_0_5_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_False_alpha_1_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_0_beta_1_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_0_2_beta_0_5_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_0_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_0_5_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_1_0_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_mv_transpose_a_True_transpose_b_True_alpha_1_0_beta_1_0_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_tunableop_rocm_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_tunableop_rocm_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmm_relu_tunableop_rocm_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addmm_sizes_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_addmm_sizes_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_addmm_sizes_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addmv_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_addmv_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_addmv_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmv_rowmajor_colmajor_incx_incy_lda_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addmv_rowmajor_colmajor_incx_incy_lda_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addr_bool_cuda_bool, test/test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_addr_float_and_complex_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_int64, test/test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_int8, test/test_linalg.py::TestLinalgCUDA::test_addr_integral_cuda_uint8, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_baddbmm_nan_input_with_zero_beta_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_blas_alpha_beta_empty_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_blas_nan_out_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_blaslog_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_bmm_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_bmm_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_bmm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_broadcast_fused_matmul_cuda, test/test_linalg.py::TestLinalgCUDA::test_cholesky_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_non_pd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_non_pd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_ex_non_pd_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_inverse_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_broadcasting_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_broadcasting_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_broadcasting_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_broadcasting_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_many_batches_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_batched_many_batches_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_out_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_out_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_cholesky_solve_out_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_ck_blas_library_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_32_k_128_n_11008_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_32_k_128_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_dyn_quant_matmul_4bit_m_32_k_64_n_4096_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_32_k_32_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_32_k_64_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_64_k_32_n_48_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_64_k_32_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_compile_int4_mm_m_64_k_64_n_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_cond_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_cond_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_cond_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_corner_cases_of_cublasltmatmul_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_cross_with_and_without_dim_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_det_logdet_slogdet_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_disable_tuning_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_dot_invalid_args_cuda, test/test_linalg.py::TestLinalgCUDA::test_dump_results_on_exit_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eig_check_magma_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eig_compare_backends_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eig_compare_backends_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eig_cuda_complex_eigenvectors_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eig_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eig_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eig_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eig_with_nan_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eig_with_nan_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eig_with_nan_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigh_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigh_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigh_lwork_lapack_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigh_lwork_lapack_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigh_svd_illcondition_matrix_input_should_not_crash_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigvals_compare_backends_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigvals_compare_backends_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigvals_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigvals_numpy_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_eigvalsh_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_einsum_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_einsum_error_cases_cuda, test/test_linalg.py::TestLinalgCUDA::test_einsum_output_layout_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_einsum_random_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_einsum_sublist_format_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_einsum_sublist_format_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_40_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_32_k_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_36_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_35_k_40_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_36_k_36_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_32_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_36_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_40_k_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_35_cuda, test/test_linalg.py::TestLinalgCUDA::test_fp16_mv_transposed_first_argument_arm_cpu_m_64_k_64_cuda, test/test_linalg.py::TestLinalgCUDA::test_geqrf_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_householder_product_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_householder_product_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inv_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inv_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inv_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inv_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_singular_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inv_ex_singular_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_invariance_error_spectral_decompositions_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inverse_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_inverse_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_large_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inverse_errors_large_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_inverse_many_batches_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_inverse_many_batches_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_kron_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_kron_empty_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_kron_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_kron_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_lapack_empty_cuda, test/test_linalg.py::TestLinalgCUDA::test_large_bmm_mm_backward_cuda, test/test_linalg.py::TestLinalgCUDA::test_ldl_factor_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_ldl_factor_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_ldl_solve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_ldl_solve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_cross_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_cross_with_and_without_dim_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_batch_broadcasting_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_batch_broadcasting_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_batch_broadcasting_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lstsq_input_checks_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_cpu_errors_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_family_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_lu_solve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_batch_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_boundary_cases_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_boundary_cases_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_compare_with_taylor_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_compare_with_taylor_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linalg_matrix_exp_perverse_nan_values_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_qr_autograd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_large_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_linalg_solve_triangular_large_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_linear_algebra_scalar_raises_cuda, test/test_linalg.py::TestLinalgCUDA::test_lobpcg_scipy_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lower_precision_accumulation_with_ref_path_cuda, test/test_linalg.py::TestLinalgCUDA::test_lstsq_removed_error_cuda, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_broadcasting_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_broadcasting_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_many_batches_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_many_batches_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_batched_many_batches_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_lu_solve_large_matrices_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_lu_unpack_check_input_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matmul_empty_existing_file_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_mv_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_matmul_mv_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_offline_mgpu_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_offline_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_matmul_out_kernel_errors_with_autograd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_1d_Nd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_2d_Nd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_2d_Nd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_3d_Nd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matmul_small_brute_force_3d_Nd_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_exp_backward_input_validation_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_power_negative_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_power_negative_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_power_non_negative_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_atol_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_basic_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_basic_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_out_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_matrix_rank_out_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_minimum_tuning_iteration_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_mm_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_mm_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_mm_empty_inputs_mixed_dtype_errors_cuda, test/test_linalg.py::TestLinalgCUDA::test_norm_bfloat16_and_half_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_norm_bfloat16_and_half_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_norm_complex_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_norm_dtype_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_norm_errors_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_extreme_values_cuda, test/test_linalg.py::TestLinalgCUDA::test_norm_fastpaths_cuda, test/test_linalg.py::TestLinalgCUDA::test_norm_fro_2_equivalence_old_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_fused_type_promotion_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_old_numpy_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_norm_matrix_degenerate_shapes_old_numpy_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_norm_old_nan_propagation_cuda, test/test_linalg.py::TestLinalgCUDA::test_norm_vector_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_norm_vector_degenerate_shapes_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_norm_vector_degenerate_shapes_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_norm_vector_degenerate_shapes_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_nuclear_norm_exceptions_old_cuda, test/test_linalg.py::TestLinalgCUDA::test_nuclear_norm_out_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_nuclear_norm_out_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_numeric_check_leak_tunableop_rocm_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_numerical_check_python_binding_tunableop_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_batched_upper_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_empty_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_old_cholesky_empty_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_ops_append_to_existing_file_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_ormqr_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_ormqr_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_ormqr_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_ormqr_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_ormqr_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_cuda_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bfloat16_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_bool_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex128_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_complex64_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float16_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float32_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_float64_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int16_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_int16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int32_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_complex64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int64_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_float64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_int8_uint8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_bool, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_complex128, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_float16, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_float32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_int32, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_int64, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_int8, test/test_linalg.py::TestLinalgCUDA::test_outer_type_promotion_cuda_uint8_uint8, test/test_linalg.py::TestLinalgCUDA::test_pca_lowrank_cuda, test/test_linalg.py::TestLinalgCUDA::test_pinv_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_pinv_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_pinv_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_pinv_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_pinv_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_pinverse_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_pinverse_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_pinverse_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_preferred_blas_library_cuda, test/test_linalg.py::TestLinalgCUDA::test_preferred_linalg_library_cuda, test/test_linalg.py::TestLinalgCUDA::test_qr_batched_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_qr_batched_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_qr_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_qr_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_qr_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_qr_vs_numpy_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_qr_vs_numpy_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_renorm_cuda, test/test_linalg.py::TestLinalgCUDA::test_renorm_ps_cuda, test/test_linalg.py::TestLinalgCUDA::test_rotating_buffer_tunableop_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_rowwise_scaled_gemm_numerics_tunableop_cuda_float8_e4m3fnuz, test/test_linalg.py::TestLinalgCUDA::test_scaled_gemm_offline_tunableop_cuda_float8_e5m2fnuz, test/test_linalg.py::TestLinalgCUDA::test_scaled_gemm_tunableop_cuda_float8_e5m2fnuz, test/test_linalg.py::TestLinalgCUDA::test_slogdet_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_slogdet_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_slogdet_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_solve_batched_broadcasting_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_solve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_svd_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_svd_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_svd_lowrank_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_svd_memory_allocation_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_svd_memory_allocation_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_svd_memory_allocation_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_symeig_removed_error_cuda, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_empty_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_empty_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_singular_input_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_singular_input_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorinv_singular_input_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_empty_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_empty_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_tensorsolve_empty_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_batched_many_batches_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_large_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_out_errors_and_warnings_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_out_errors_and_warnings_cuda_complex64, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_out_errors_and_warnings_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_triangular_solve_out_errors_and_warnings_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_complex128, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_float16, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_float64, test/test_linalg.py::TestLinalgCUDA::test_triu_tril_extreme_k_values_cuda_int64, test/test_linalg.py::TestLinalgCUDA::test_vdot_vs_numpy_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_bfloat16, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_cuda_float32, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_decom_unbacked_checks_cuda, test/test_linalg.py::TestLinalgCUDA::test_vector_norm_reduce_over_1D_vector_cuda_float32 2025-12-04T12:38:08.7280372Z 2025-12-04T12:38:08.7280584Z Finished test_linalg 2/2 ... [2025-12-04 12:38:08.686255][4976671.182437458], took 4.31min 2025-12-04T12:38:08.7281356Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:38:08.7282093Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:38:08.7282552Z Running nn/test_packed_sequence 1/1 ... [2025-12-04 12:38:08.705319][4976671.201505562] 2025-12-04T12:38:08.7282933Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:38:08.7283731Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'nn/test_packed_sequence.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:38:08.705822] 2025-12-04T12:38:15.3959969Z 2025-12-04T12:38:15.3961425Z nn/test_packed_sequence 1/1 was successful, full logs can be found in artifacts with path test/test-reports/nn.test_packed_sequence_1.1_8764d33fc283c2a0_.log 2025-12-04T12:38:15.3966781Z Running 13 items in this shard: test/nn/test_packed_sequence.py::PackedSequenceTest::test_empty_packed_sequence, test/nn/test_packed_sequence.py::PackedSequenceTest::test_pack_padded_sequence, test/nn/test_packed_sequence.py::PackedSequenceTest::test_pack_sequence, test/nn/test_packed_sequence.py::PackedSequenceTest::test_pad_sequence, test/nn/test_packed_sequence.py::PackedSequenceTest::test_pad_sequence_with_non_iterable_sequences, test/nn/test_packed_sequence.py::PackedSequenceTest::test_pad_sequence_with_tensor_sequences, test/nn/test_packed_sequence.py::PackedSequenceTest::test_to, test/nn/test_packed_sequence.py::PackedSequenceTest::test_to_memory_format, test/nn/test_packed_sequence.py::PackedSequenceTest::test_total_length, test/nn/test_packed_sequence.py::PackedSequenceTest::test_type_casts, test/nn/test_packed_sequence.py::PackedSequenceTest::test_unpack_sequence, test/nn/test_packed_sequence.py::PackedSequenceTest::test_unpad_sequence, test/nn/test_packed_sequence.py::PackedSequenceTest::test_wrong_order 2025-12-04T12:38:15.3972271Z 2025-12-04T12:38:15.3972770Z Finished nn/test_packed_sequence 1/1 ... [2025-12-04 12:38:15.395644][4976677.891829888], took 0.11min 2025-12-04T12:38:15.3988061Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:38:15.4140697Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:38:15.4148563Z Running test_numa_binding 1/1 ... [2025-12-04 12:38:15.414426][4976677.910613456] 2025-12-04T12:38:15.4149331Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:38:15.4151287Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_numa_binding.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:38:15.414888] 2025-12-04T12:38:20.2889201Z 2025-12-04T12:38:20.2890655Z test_numa_binding 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_numa_binding_1.1_0173b38247c709ab_.log 2025-12-04T12:38:20.2899426Z Running 21 items in this shard: test/test_numa_binding.py::NumaBindingTest::test_bind_all_threads_in_current_process_to_logical_cpus, test/test_numa_binding.py::NumaBindingTest::test_binds_to_node_0_if_node_stored_as_minus_one, test/test_numa_binding.py::NumaBindingTest::test_callable_entrypoint_basic, test/test_numa_binding.py::NumaBindingTest::test_core_complex_numa_binding_with_extra_l3, test/test_numa_binding.py::NumaBindingTest::test_core_complex_numa_binding_with_fewer_l3_than_gpu, test/test_numa_binding.py::NumaBindingTest::test_core_complex_prefers_caches_with_more_cpus, test/test_numa_binding.py::NumaBindingTest::test_core_complex_tiebreak_prefers_lower_cache_key, test/test_numa_binding.py::NumaBindingTest::test_default_numa_binding, test/test_numa_binding.py::NumaBindingTest::test_exclusive_numa_binding, test/test_numa_binding.py::NumaBindingTest::test_exclusive_raises_if_too_few_physical_cores, test/test_numa_binding.py::NumaBindingTest::test_explicit_numa_options_overrides_default, test/test_numa_binding.py::NumaBindingTest::test_fallback, test/test_numa_binding.py::NumaBindingTest::test_fallback_if_numactl_not_available, test/test_numa_binding.py::NumaBindingTest::test_get_range_str_from_ints, test/test_numa_binding.py::NumaBindingTest::test_get_set_of_int_from_ranges_str, test/test_numa_binding.py::NumaBindingTest::test_no_numa_binding_if_numa_options_not_provided, test/test_numa_binding.py::NumaBindingTest::test_node_numa_binding, test/test_numa_binding.py::NumaBindingTest::test_nproc_must_equal_cuda_device_count_to_use_default_numa_options, test/test_numa_binding.py::NumaBindingTest::test_raises_if_binding_to_empty_set, test/test_numa_binding.py::NumaBindingTest::test_socket_numa_binding_with_multiple_numa_per_socket, test/test_numa_binding.py::NumaBindingTest::test_socket_numa_binding_with_single_numa_per_socket 2025-12-04T12:38:20.2907553Z 2025-12-04T12:38:20.2907914Z Finished test_numa_binding 1/1 ... [2025-12-04 12:38:20.288719][4976682.784903216], took 0.08min 2025-12-04T12:38:20.2920607Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:38:20.3074220Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:38:20.3080491Z Running test_pruning_op 1/1 ... [2025-12-04 12:38:20.307783][4976682.80396888] 2025-12-04T12:38:20.3084237Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:38:20.3085486Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_pruning_op.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:38:20.308271] 2025-12-04T12:38:22.7316808Z 2025-12-04T12:38:22.7318048Z test_pruning_op 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_pruning_op_1.1_522615d49a8a55d1_.log 2025-12-04T12:38:22.7320014Z Running 2 items in this shard: test/test_pruning_op.py::PruningOpTest::test_rowwise_prune_op_32bit_indices, test/test_pruning_op.py::PruningOpTest::test_rowwise_prune_op_64bit_indices 2025-12-04T12:38:22.7321018Z 2025-12-04T12:38:22.7321372Z Finished test_pruning_op 1/1 ... [2025-12-04 12:38:22.731379][4976685.227563608], took 0.04min 2025-12-04T12:38:22.7346396Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:38:22.7500015Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:38:22.7505341Z Running test_jit_fuser_te 1/1 ... [2025-12-04 12:38:22.750271][4976685.246457115] 2025-12-04T12:38:22.7505917Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:38:22.7510210Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_jit_fuser_te.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:38:22.750733] 2025-12-04T12:45:48.7333349Z 2025-12-04T12:45:48.7334460Z test_jit_fuser_te 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_jit_fuser_te_1.1_681025e785c88c3e_.log 2025-12-04T12:45:48.8550393Z Running 6825 items in this shard: test/test_jit_fuser_te.py::TestFuserCommon::test_autodiff_fallback, test/test_jit_fuser_te.py::TestTEFuserStatic::test_abs, test/test_jit_fuser_te.py::TestTEFuserStatic::test_adaptive_avg_pool2d, test/test_jit_fuser_te.py::TestTEFuserStatic::test_add_bool, test/test_jit_fuser_te.py::TestTEFuserStatic::test_addcmul, test/test_jit_fuser_te.py::TestTEFuserStatic::test_arg_configurations_smoke, test/test_jit_fuser_te.py::TestTEFuserStatic::test_autocast_down, test/test_jit_fuser_te.py::TestTEFuserStatic::test_autocast_up, test/test_jit_fuser_te.py::TestTEFuserStatic::test_batch_norm, test/test_jit_fuser_te.py::TestTEFuserStatic::test_binary_div_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_binary_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_binary_pow, test/test_jit_fuser_te.py::TestTEFuserStatic::test_binary_scalar_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_binary_tensor_scalar_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_bitwise_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_broadcast, test/test_jit_fuser_te.py::TestTEFuserStatic::test_cat_2k_args, test/test_jit_fuser_te.py::TestTEFuserStatic::test_cat_graph_opt, test/test_jit_fuser_te.py::TestTEFuserStatic::test_channels_last_dims_dynamic, test/test_jit_fuser_te.py::TestTEFuserStatic::test_checks_cat_inputs, test/test_jit_fuser_te.py::TestTEFuserStatic::test_chunk, test/test_jit_fuser_te.py::TestTEFuserStatic::test_chunk_correctness, test/test_jit_fuser_te.py::TestTEFuserStatic::test_chunk_distributes, test/test_jit_fuser_te.py::TestTEFuserStatic::test_chunk_motion_deduplicates_inputs, test/test_jit_fuser_te.py::TestTEFuserStatic::test_chunk_mul_one, test/test_jit_fuser_te.py::TestTEFuserStatic::test_chunk_multiple, test/test_jit_fuser_te.py::TestTEFuserStatic::test_clamp, test/test_jit_fuser_te.py::TestTEFuserStatic::test_clamp_double, test/test_jit_fuser_te.py::TestTEFuserStatic::test_clamp_int, test/test_jit_fuser_te.py::TestTEFuserStatic::test_comparison_eq_ne, test/test_jit_fuser_te.py::TestTEFuserStatic::test_comparison_ge_le, test/test_jit_fuser_te.py::TestTEFuserStatic::test_comparison_gt_lt, test/test_jit_fuser_te.py::TestTEFuserStatic::test_concat, test/test_jit_fuser_te.py::TestTEFuserStatic::test_concat_invariant, test/test_jit_fuser_te.py::TestTEFuserStatic::test_constant_chunk_shapes, test/test_jit_fuser_te.py::TestTEFuserStatic::test_conv2d, test/test_jit_fuser_te.py::TestTEFuserStatic::test_conv2d_depthwise, test/test_jit_fuser_te.py::TestTEFuserStatic::test_cuda_half, test/test_jit_fuser_te.py::TestTEFuserStatic::test_dims, test/test_jit_fuser_te.py::TestTEFuserStatic::test_disabled, test/test_jit_fuser_te.py::TestTEFuserStatic::test_div_bool, test/test_jit_fuser_te.py::TestTEFuserStatic::test_dynamic_cat, test/test_jit_fuser_te.py::TestTEFuserStatic::test_dynamic_shapes, test/test_jit_fuser_te.py::TestTEFuserStatic::test_eq_unsqueeze_type_as, test/test_jit_fuser_te.py::TestTEFuserStatic::test_erf, test/test_jit_fuser_te.py::TestTEFuserStatic::test_exhaust_specializations, test/test_jit_fuser_te.py::TestTEFuserStatic::test_exp, test/test_jit_fuser_te.py::TestTEFuserStatic::test_fusion_reuse_multi_gpu, test/test_jit_fuser_te.py::TestTEFuserStatic::test_gelu, test/test_jit_fuser_te.py::TestTEFuserStatic::test_hardsigmoid_fwd_bwd, test/test_jit_fuser_te.py::TestTEFuserStatic::test_hardswish_fwd_bwd, test/test_jit_fuser_te.py::TestTEFuserStatic::test_inlined_optimized_graph, test/test_jit_fuser_te.py::TestTEFuserStatic::test_isnan, test/test_jit_fuser_te.py::TestTEFuserStatic::test_kernel_cache_multi_gpu, test/test_jit_fuser_te.py::TestTEFuserStatic::test_lerp, test/test_jit_fuser_te.py::TestTEFuserStatic::test_list_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_lstm, test/test_jit_fuser_te.py::TestTEFuserStatic::test_lstm_concat, test/test_jit_fuser_te.py::TestTEFuserStatic::test_lstm_gates_permutations, test/test_jit_fuser_te.py::TestTEFuserStatic::test_lstm_traced, test/test_jit_fuser_te.py::TestTEFuserStatic::test_masked_fill, test/test_jit_fuser_te.py::TestTEFuserStatic::test_matmul, test/test_jit_fuser_te.py::TestTEFuserStatic::test_milstm, test/test_jit_fuser_te.py::TestTEFuserStatic::test_minmax, test/test_jit_fuser_te.py::TestTEFuserStatic::test_minmax_int_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_mul_bool, test/test_jit_fuser_te.py::TestTEFuserStatic::test_neg_pow, test/test_jit_fuser_te.py::TestTEFuserStatic::test_nonzero_device_cuda, test/test_jit_fuser_te.py::TestTEFuserStatic::test_nop, test/test_jit_fuser_te.py::TestTEFuserStatic::test_pow_multiple_dtype, test/test_jit_fuser_te.py::TestTEFuserStatic::test_profiler, test/test_jit_fuser_te.py::TestTEFuserStatic::test_rand_broadcast_cuda, test/test_jit_fuser_te.py::TestTEFuserStatic::test_rand_cuda, test/test_jit_fuser_te.py::TestTEFuserStatic::test_rand_diamond, test/test_jit_fuser_te.py::TestTEFuserStatic::test_relu, test/test_jit_fuser_te.py::TestTEFuserStatic::test_relu_fwd_bwd, test/test_jit_fuser_te.py::TestTEFuserStatic::test_remove_output_used_only_in_size, test/test_jit_fuser_te.py::TestTEFuserStatic::test_scalar, test/test_jit_fuser_te.py::TestTEFuserStatic::test_scalar_arg, test/test_jit_fuser_te.py::TestTEFuserStatic::test_scalar_only_inputs, test/test_jit_fuser_te.py::TestTEFuserStatic::test_skip_grad_in_check, test/test_jit_fuser_te.py::TestTEFuserStatic::test_small_constant, test/test_jit_fuser_te.py::TestTEFuserStatic::test_sub_gt_and, test/test_jit_fuser_te.py::TestTEFuserStatic::test_sum_dim, test/test_jit_fuser_te.py::TestTEFuserStatic::test_sum_keepdim_cast, test/test_jit_fuser_te.py::TestTEFuserStatic::test_sum_simple, test/test_jit_fuser_te.py::TestTEFuserStatic::test_superslomo, test/test_jit_fuser_te.py::TestTEFuserStatic::test_tensor_scalar_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_ternary_norm_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_ternary_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_threshold, test/test_jit_fuser_te.py::TestTEFuserStatic::test_to_device, test/test_jit_fuser_te.py::TestTEFuserStatic::test_to_dtype, test/test_jit_fuser_te.py::TestTEFuserStatic::test_torch_to, test/test_jit_fuser_te.py::TestTEFuserStatic::test_type_as_cat, test/test_jit_fuser_te.py::TestTEFuserStatic::test_typecheck, test/test_jit_fuser_te.py::TestTEFuserStatic::test_unary_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_unrolled_cat, test/test_jit_fuser_te.py::TestTEFuserStatic::test_unsqueeze_size_calculation, test/test_jit_fuser_te.py::TestTEFuserStatic::test_unsqueeze_var_dim, test/test_jit_fuser_te.py::TestTEFuserStatic::test_unsupported_dtypes, test/test_jit_fuser_te.py::TestTEFuserStatic::test_where_and_typing, test/test_jit_fuser_te.py::TestTEFuserStatic::test_where_ops, test/test_jit_fuser_te.py::TestTEFuserStatic::test_with_strict_fusion, test/test_jit_fuser_te.py::TestTEFuserStatic::test_zero_element_tensors, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_abs, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_adaptive_avg_pool2d, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_add_bool, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_addcmul, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_arg_configurations_smoke, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_autocast_down, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_autocast_up, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_batch_norm, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_binary_div_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_binary_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_binary_pow, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_binary_scalar_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_binary_tensor_scalar_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_bitwise_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_broadcast, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_cat_2k_args, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_cat_graph_opt, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_channels_last_dims_dynamic, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_checks_cat_inputs, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_chunk, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_chunk_correctness, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_chunk_distributes, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_chunk_motion_deduplicates_inputs, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_chunk_mul_one, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_chunk_multiple, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_clamp, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_clamp_double, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_clamp_int, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_comparison_eq_ne, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_comparison_ge_le, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_comparison_gt_lt, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_concat, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_concat_invariant, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_constant_chunk_shapes, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_conv2d, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_conv2d_depthwise, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_cuda_half, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_dims, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_disabled, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_div_bool, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_dynamic_cat, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_dynamic_shapes, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_eq_unsqueeze_type_as, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_erf, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_exhaust_specializations, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_exp, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_fusion_reuse_multi_gpu, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_gelu, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_hardsigmoid_fwd_bwd, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_hardswish_fwd_bwd, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_inlined_optimized_graph, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_isnan, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_kernel_cache_multi_gpu, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_lerp, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_list_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_lstm, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_lstm_concat, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_lstm_gates_permutations, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_lstm_traced, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_masked_fill, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_matmul, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_milstm, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_minmax, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_minmax_int_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_mul_bool, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_neg_pow, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_nonzero_device_cuda, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_nop, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_pow_multiple_dtype, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_profiler, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_rand_broadcast_cuda, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_rand_cuda, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_rand_diamond, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_relu, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_relu_fwd_bwd, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_remove_output_used_only_in_size, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_scalar, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_scalar_arg, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_scalar_only_inputs, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_skip_grad_in_check, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_small_constant, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_sub_gt_and, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_sum_dim, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_sum_keepdim_cast, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_sum_simple, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_superslomo, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_tensor_scalar_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_ternary_norm_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_ternary_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_threshold, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_to_device, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_to_dtype, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_torch_to, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_type_as_cat, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_typecheck, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_unary_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_unrolled_cat, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_unsqueeze_size_calculation, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_unsqueeze_var_dim, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_unsupported_dtypes, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_where_and_typing, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_where_ops, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_with_strict_fusion, test/test_jit_fuser_te.py::TestTEFuserDynamic::test_zero_element_tensors, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_failures___rmatmul___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_failures_frac_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_failures_matmul_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_H_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_T_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___getitem___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___radd___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rand___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rand___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rand___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rand___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rand___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rand___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rdiv___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmod___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rmul___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___ror___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___ror___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___ror___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___ror___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___ror___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___ror___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rpow___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rsub___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rxor___cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rxor___cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rxor___cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rxor___cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rxor___cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness___rxor___cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__batch_norm_with_update_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__batch_norm_with_update_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__batch_norm_with_update_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__batch_norm_with_update_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__chunk_cat_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__native_batch_norm_legit_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__native_batch_norm_legit_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__native_batch_norm_legit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__native_batch_norm_legit_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_lengths_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_lengths_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_lengths_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_lengths_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_offsets_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_offsets_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_offsets_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__segment_reduce_offsets_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__softmax_backward_data_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__softmax_backward_data_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__softmax_backward_data_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__softmax_backward_data_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__unsafe_masked_index_put_accumulate_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__upsample_bilinear2d_aa_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__upsample_bilinear2d_aa_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__upsample_bilinear2d_aa_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness__upsample_bilinear2d_aa_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_abs_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acos_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_acosh_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_add_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addbmm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addbmm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addbmm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addbmm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addbmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addbmm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcdiv_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcdiv_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcdiv_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcdiv_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcdiv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcdiv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addcmul_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_decomposed_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_decomposed_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_decomposed_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_decomposed_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_decomposed_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmm_decomposed_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmv_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmv_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmv_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmv_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addmv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_addr_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_alias_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_all_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_allclose_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_allclose_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_allclose_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_allclose_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_allclose_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_allclose_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_amin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_aminmax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_angle_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_any_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_arange_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argmin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argsort_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_argwhere_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_partial_views_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_as_strided_scatter_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_asinh_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atan_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atanh_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_1d_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_2d_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_atleast_3d_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_baddbmm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_baddbmm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_baddbmm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_baddbmm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_baddbmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_baddbmm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bernoulli_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bernoulli_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bernoulli_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bernoulli_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bfloat16_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bincount_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bincount_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bincount_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bincount_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bincount_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_and_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_and_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_and_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_and_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_and_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_and_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_left_shift_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_left_shift_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_left_shift_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_left_shift_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_left_shift_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_not_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_not_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_not_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_not_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_not_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_not_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_or_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_or_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_or_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_or_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_or_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_or_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_right_shift_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_right_shift_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_right_shift_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_right_shift_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_right_shift_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_xor_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_xor_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_xor_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_xor_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_xor_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bitwise_xor_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_block_diag_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bmm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bmm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bmm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bmm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bmm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bool_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_shapes_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_tensors_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_broadcast_to_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_bucketize_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_byte_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cartesian_prod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cat_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cauchy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cauchy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cauchy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cauchy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdist_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdist_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cdouble_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ceil_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cfloat_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chalf_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_char_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_inverse_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_inverse_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_inverse_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_inverse_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_solve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_solve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cholesky_solve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_chunk_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_max_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clamp_min_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_clone_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_column_stack_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_combinations_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_complex_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_complex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_complex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_conj_physical_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_constant_pad_nd_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_contiguous_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_copysign_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_corrcoef_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cos_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cosh_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_count_nonzero_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cov_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cross_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cummin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumprod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumsum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_cumulative_trapezoid_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_deg2rad_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diag_embed_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagflat_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diagonal_scatter_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_diff_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_digamma_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dist_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dist_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dist_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dist_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dist_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dist_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_floor_rounding_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_no_rounding_mode_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_div_trunc_rounding_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dot_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dot_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dot_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dot_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dot_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_double_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dsplit_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_dstack_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_einsum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_einsum_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_einsum_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_einsum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_einsum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_einsum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_like_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_permuted_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_empty_strided_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eq_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_equal_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erf_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfc_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_erfinv_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exp_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_as_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expand_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_expm1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exponential_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exponential_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exponential_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_exponential_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float8_e4m3fn, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float8_e4m3fnuz, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float8_e5m2, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_float8_e5m2fnuz, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_eye_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fft_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_fftshift_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfft_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_hfftn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifft_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ifftshift_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfft_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_ihfftn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfft_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_irfftn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfft_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fft_rfftn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fill_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flatten_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flip_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fliplr_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_flipud_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_float_power_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_floor_divide_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_fmod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_frexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_frexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_frexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_frexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_uint16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_uint32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_full_like_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gather_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gcd_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gcd_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gcd_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gcd_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gcd_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ge_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geometric_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geqrf_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geqrf_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geqrf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_geqrf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gradient_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_grid_sampler_3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_gt_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_half_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hash_tensor_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_heaviside_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_histc_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hsplit_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hstack_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hypot_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hypot_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hypot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_hypot_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_i0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_igamma_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_igamma_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_igammac_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_igammac_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_imag_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_imag_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_imag_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_add_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_fill_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_put_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_amin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_mean_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_reduce_prod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_index_select_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_inner_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_inner_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_inner_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_inner_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_inner_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_inner_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_int_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isclose_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isfinite_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isinf_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isnan_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isneginf_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isposinf_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_isreal_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_istft_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_istft_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_item_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_2inputs_2outputs_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_4inputs_with_extra_args_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_binary_return_by_ref_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_jiterator_unary_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kron_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_kthvalue_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lcm_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lcm_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lcm_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lcm_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lcm_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ldexp_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_le_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lerp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lgamma_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_ex_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_ex_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cholesky_ex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cond_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cond_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cond_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cond_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_cross_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_det_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_det_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_det_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_det_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_diagonal_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eig_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eig_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eig_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eig_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvals_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvals_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvals_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvals_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvalsh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvalsh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvalsh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_eigvalsh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_householder_product_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_householder_product_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_householder_product_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_householder_product_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_ex_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_ex_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_inv_ex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_ex_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_ex_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_factor_ex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_solve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_solve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_ldl_solve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_grad_oriented_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_grad_oriented_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_grad_oriented_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lstsq_grad_oriented_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_ex_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_ex_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_factor_ex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_solve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_solve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_lu_solve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_norm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_norm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_power_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_power_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_power_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_power_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_hermitian_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_hermitian_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_hermitian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_matrix_rank_hermitian_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_multi_dot_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_multi_dot_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_multi_dot_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_multi_dot_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_multi_dot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_multi_dot_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_subgradients_at_zero_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_subgradients_at_zero_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_subgradients_at_zero_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_norm_subgradients_at_zero_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_hermitian_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_hermitian_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_hermitian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_hermitian_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_singular_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_singular_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_singular_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_pinv_singular_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_qr_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_qr_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_qr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_qr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_slogdet_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_slogdet_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_slogdet_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_slogdet_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_ex_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_ex_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_ex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_triangular_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_triangular_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_triangular_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_solve_triangular_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svd_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svd_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svd_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svd_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svdvals_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svdvals_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svdvals_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_svdvals_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorinv_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorinv_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorinv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorinv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorsolve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorsolve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorsolve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_tensorsolve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vander_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vecdot_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vecdot_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vecdot_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vecdot_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vecdot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vecdot_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vector_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vector_norm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vector_norm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vector_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vector_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linalg_vector_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_linspace_tensor_overload_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log10_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log1p_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_normal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_normal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_normal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_normal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_log_softmax_with_dtype_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp2_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logaddexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logcumsumexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logcumsumexp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logcumsumexp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logcumsumexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logcumsumexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logcumsumexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logdet_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logdet_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logdet_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logdet_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_and_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_not_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_or_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logical_xor_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logit_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logspace_tensor_overload_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_logsumexp_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_long_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lt_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_solve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_solve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_solve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_unpack_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_unpack_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_unpack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_lu_unpack_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mH_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mT_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_amin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_argmin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumprod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_cumsum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_fill_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_log_softmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_log_softmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_log_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_log_softmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logaddexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logaddexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logaddexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logaddexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_logsumexp_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_mean_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_mean_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_median_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_median_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_median_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_median_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_normalize_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_normalize_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_normalize_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_normalize_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_normalize_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_normalize_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_prod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_scatter_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_select_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_softmin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_std_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_sum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_masked_var_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_matrix_exp_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_matrix_exp_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_matrix_exp_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_matrix_exp_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_matrix_exp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_matrix_exp_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_binary_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_pool2d_with_indices_backward_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_pool2d_with_indices_backward_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_pool2d_with_indices_backward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_pool2d_with_indices_backward_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_no_dim_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_max_reduction_with_dim_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_maximum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mean_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mean_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_median_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_list_of_tensors_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_meshgrid_variadic_tensors_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_binary_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_no_dim_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_min_reduction_with_dim_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_minimum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mode_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_movedim_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_msort_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mul_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_multinomial_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_multinomial_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_multinomial_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_multinomial_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mv_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mv_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mv_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mv_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mv_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nan_to_num_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanmedian_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanquantile_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nanquantile_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nansum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_narrow_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_batch_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_batch_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_batch_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_batch_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_dropout_backward_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_dropout_backward_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_dropout_backward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_dropout_backward_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_layer_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_layer_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_layer_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_native_layer_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ne_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_neg_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_empty_strided_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_full_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_ones_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_new_zeros_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nextafter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nextafter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nextafter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nextafter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_adaptive_max_pool3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_alpha_dropout_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_alpha_dropout_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_alpha_dropout_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_alpha_dropout_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_avg_pool3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_without_cudnn_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_without_cudnn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_batch_norm_without_cudnn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_bilinear_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_bilinear_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_bilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_bilinear_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_with_logits_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_with_logits_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_celu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_celu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_celu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_celu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_channel_shuffle_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_conv_transpose3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_embedding_loss_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_similarity_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_similarity_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_similarity_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cosine_similarity_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cross_entropy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cross_entropy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cross_entropy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_cross_entropy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_ctc_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_ctc_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_dropout_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_elu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_elu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_elu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_elu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_bag_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_bag_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_bag_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_bag_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_embedding_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_with_train_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_with_train_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_with_train_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_feature_alpha_dropout_without_train_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_fractional_max_pool3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gaussian_nll_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gaussian_nll_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gaussian_nll_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gaussian_nll_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gelu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gelu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gelu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_gelu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_glu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_glu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_glu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_glu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_grid_sample_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_grid_sample_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_grid_sample_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_grid_sample_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_group_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_group_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_group_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_group_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardshrink_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardshrink_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardshrink_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardshrink_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardsigmoid_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardsigmoid_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardsigmoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardsigmoid_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardswish_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardswish_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardswish_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardswish_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hardtanh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hinge_embedding_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hinge_embedding_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_hinge_embedding_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_huber_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_huber_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_huber_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_huber_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_instance_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_instance_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_instance_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_instance_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_area_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_area_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_area_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_area_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bicubic_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bicubic_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bicubic_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bicubic_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bilinear_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bilinear_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_bilinear_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_linear_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_linear_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_linear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_linear_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest-exact_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest-exact_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest-exact_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest-exact_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_nearest_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_trilinear_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_trilinear_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_trilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_interpolate_trilinear_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_kl_div_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_kl_div_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_kl_div_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_kl_div_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_l1_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_l1_loss_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_l1_loss_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_l1_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_l1_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_l1_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_layer_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_layer_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_layer_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_layer_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_leaky_relu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_leaky_relu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_leaky_relu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_leaky_relu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_linear_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_linear_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_linear_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_linear_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_linear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_linear_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_local_response_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_local_response_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_local_response_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_local_response_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_logsigmoid_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_logsigmoid_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_logsigmoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_logsigmoid_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_margin_ranking_loss_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_pool3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_grad_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_grad_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_grad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool1d_grad_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_grad_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_grad_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_grad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool2d_grad_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_grad_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_grad_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_grad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_max_unpool3d_grad_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mish_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mish_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mish_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mish_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mse_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mse_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mse_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_mse_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_head_attention_forward_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_head_attention_forward_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_head_attention_forward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_head_attention_forward_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_margin_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_margin_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multi_margin_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_margin_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_margin_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_margin_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_soft_margin_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_soft_margin_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_nll_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_nll_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_nll_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_nll_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_normalize_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_normalize_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_normalize_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_normalize_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_normalize_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_normalize_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_one_hot_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_circular_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_constant_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_reflect_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pad_replicate_negative_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pairwise_distance_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pdist_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pdist_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_shuffle_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_pixel_unshuffle_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_poisson_nll_loss_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_prelu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_prelu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_prelu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_prelu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu6_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_relu_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rms_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rms_norm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rms_norm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rms_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rms_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rms_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rrelu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rrelu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rrelu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_rrelu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_scaled_dot_product_attention_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_scaled_dot_product_attention_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_selu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_selu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_selu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_selu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_silu_complex_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_silu_complex_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_silu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_silu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_silu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_silu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_smooth_l1_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_smooth_l1_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_smooth_l1_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_soft_margin_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_soft_margin_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_soft_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_soft_margin_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softmin_with_dtype_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softplus_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softplus_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softplus_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softplus_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softshrink_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softshrink_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softshrink_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softshrink_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_softsign_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_tanhshrink_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_threshold_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_loss_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_triplet_margin_with_distance_loss_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_unfold_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_bilinear_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_bilinear_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_bilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_bilinear_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_nearest_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_nearest_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_nearest_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_nearest_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nn_functional_upsample_nearest_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_nonzero_static_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_fro_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_fro_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_fro_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_fro_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_fro_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_fro_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_inf_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_inf_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_inf_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_inf_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_inf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_inf_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_nuc_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_nuc_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_nuc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_norm_nuc_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_in_place_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_in_place_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_in_place_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_in_place_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_in_place_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_in_place_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_number_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_number_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_number_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_normal_number_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ones_like_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ormqr_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ormqr_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ormqr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ormqr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_outer_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pca_lowrank_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pca_lowrank_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pca_lowrank_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pca_lowrank_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_permute_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pinverse_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pinverse_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pinverse_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pinverse_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polar_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polar_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_2_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_3_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_polygamma_polygamma_n_4_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_positive_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_pow_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_prod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_put_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_qr_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_qr_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_qr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_qr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_quantile_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_quantile_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rad2deg_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rand_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randint_like_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_randn_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_ravel_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_real_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reciprocal_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_remainder_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_renorm_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_renorm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_renorm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_renorm_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_renorm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_renorm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_repeat_interleave_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_as_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_reshape_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize__cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resize_as__cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_conj_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_resolve_neg_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_roll_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rot90_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_0_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_0_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_3_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_3_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_3_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_neg_3_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_neg_3_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_neg_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_round_decimals_neg_3_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsqrt_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_rsub_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scalar_tensor_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_add_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amax_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_amin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_mean_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_prod_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_scatter_reduce_sum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_searchsorted_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_select_scatter_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sgn_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_short_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sigmoid_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sign_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_bartlett_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_bartlett_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_blackman_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_blackman_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_cosine_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_cosine_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_exponential_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_exponential_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_gaussian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_gaussian_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_general_cosine_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_general_cosine_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_general_hamming_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_general_hamming_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_hamming_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_hamming_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_hann_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_hann_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_kaiser_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_kaiser_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_nuttall_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signal_windows_nuttall_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_signbit_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sin_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinc_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sinh_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_slice_scatter_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_softmax_with_dtype_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sort_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_mm_reduce_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_mm_reduce_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_mm_reduce_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_mm_reduce_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_sampled_addmm_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_sampled_addmm_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_sampled_addmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sparse_sampled_addmm_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_airy_ai_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_j1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_bessel_y1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_t_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_u_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_v_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_chebyshev_polynomial_w_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_entr_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_erfcx_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_h_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_hermite_polynomial_he_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i0e_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_i1e_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_laguerre_polynomial_l_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_legendre_polynomial_p_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_log_ndtr_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_i1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_modified_bessel_k1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtr_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_ndtri_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_scaled_modified_bessel_k1_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_spherical_bessel_j0_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_xlog1py_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_special_zeta_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_list_args_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_split_with_sizes_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sqrt_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_square_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_squeeze_multiple_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stack_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_unbiased_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_unbiased_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_unbiased_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_unbiased_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_mean_unbiased_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_unbiased_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_unbiased_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_unbiased_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_unbiased_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_std_unbiased_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stft_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stft_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_stft_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sub_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_sum_to_size_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_lowrank_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_lowrank_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_lowrank_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_svd_lowrank_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_t_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_along_dim_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_take_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tan_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tanh_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensor_split_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensordot_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensordot_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensordot_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensordot_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensordot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tensordot_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tile_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_to_sparse_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_topk_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch__scaled_mm_cuda_float8_e4m3fn, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__efficient_attention_forward_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__efficient_attention_forward_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__flash_attention_forward_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_torch_ops_aten__safe_softmax_default_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trace_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_transpose_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapezoid_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trapz_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triangular_solve_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triangular_solve_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triangular_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triangular_solve_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_indices_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_tril_indices_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_indices_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_triu_indices_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_true_divide_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_trunc_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unbind_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unflatten_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unfold_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_uniform_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_uniform_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_uniform_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_uniform_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_uniform_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_uniform_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_consecutive_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_uint16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_uint32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_uint64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unique_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unravel_index_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unravel_index_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unravel_index_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unravel_index_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unravel_index_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_chunk_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsafe_split_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_unsqueeze_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_unbiased_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_unbiased_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_unbiased_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_unbiased_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_mean_unbiased_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_unbiased_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_unbiased_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_unbiased_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_unbiased_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_var_unbiased_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vdot_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vdot_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vdot_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vdot_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vdot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vdot_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_complex_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_complex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_complex_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_real_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_as_real_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_copy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_view_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vsplit_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_vstack_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_where_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_xlogy_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zero__cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_bfloat16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_bool, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_complex128, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_complex32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_complex64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_float16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_float64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_int16, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_int32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_int64, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_int8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_nnc_correctness_zeros_like_cuda_uint8, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_H_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_T_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported___getitem___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported___rpow___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported___rsub___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__batch_norm_with_update_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__chunk_cat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__native_batch_norm_legit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__segment_reduce_lengths_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__segment_reduce_offsets_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__softmax_backward_data_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__unsafe_masked_index_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__unsafe_masked_index_put_accumulate_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported__upsample_bilinear2d_aa_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_acosh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_addbmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_addcdiv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_addmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_addmv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_addr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_alias_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_all_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_allclose_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_aminmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_angle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_any_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_arange_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_argmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_argmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_argsort_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_argwhere_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_as_strided_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_as_strided_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_as_strided_partial_views_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_as_strided_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_asinh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_atanh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_atleast_1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_atleast_2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_atleast_3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_baddbmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_bernoulli_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_bfloat16_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_block_diag_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_bmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_broadcast_shapes_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_broadcast_tensors_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_broadcast_to_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_bucketize_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cartesian_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cauchy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cdist_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cdouble_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cfloat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_chalf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cholesky_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cholesky_inverse_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cholesky_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_chunk_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_clamp_max_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_clamp_min_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_clone_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_column_stack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_combinations_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_complex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_conj_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_conj_physical_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_constant_pad_nd_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_copysign_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_corrcoef_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_count_nonzero_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cov_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cross_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cummax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cummin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cumprod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cumsum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_cumulative_trapezoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_deg2rad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diag_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diag_embed_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diagflat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diagonal_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diagonal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diagonal_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_diff_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_digamma_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_dist_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_dot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_dsplit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_dstack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_einsum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_empty_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_empty_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_empty_permuted_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_empty_strided_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_equal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_erfinv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_exp2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_expand_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_exponential_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_eye_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_fft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_fft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_fftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_fftshift_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_hfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_hfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_hfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ifft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ifft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ifftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ifftshift_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ihfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ihfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_ihfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_irfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_irfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_irfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_rfft2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_rfft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fft_rfftn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fill_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_flatten_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_flip_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fliplr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_flipud_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_float_power_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_floor_divide_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_fmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_frexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_full_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_full_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_gather_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_geometric_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_geqrf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_gradient_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_grid_sampler_2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_grid_sampler_3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_hash_tensor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_heaviside_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_histc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_hsplit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_hstack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_hypot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_i0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_igamma_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_igammac_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_add_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_fill_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_put_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_reduce_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_reduce_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_reduce_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_reduce_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_index_select_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_inner_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isclose_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isfinite_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isinf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isneginf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isposinf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_isreal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_item_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_jiterator_2inputs_2outputs_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_jiterator_4inputs_with_extra_args_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_jiterator_binary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_jiterator_binary_return_by_ref_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_jiterator_unary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_kron_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_kthvalue_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_ldexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_cholesky_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_cholesky_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_cond_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_cross_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_det_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_diagonal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_eig_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_eigh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_eigvals_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_eigvalsh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_householder_product_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_inv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_inv_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_ldl_factor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_ldl_factor_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_ldl_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_lstsq_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_lstsq_grad_oriented_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_lu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_lu_factor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_lu_factor_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_lu_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_matrix_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_matrix_power_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_matrix_rank_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_matrix_rank_hermitian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_multi_dot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_norm_subgradients_at_zero_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_pinv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_pinv_hermitian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_pinv_singular_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_qr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_slogdet_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_solve_ex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_solve_triangular_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_svd_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_svdvals_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_tensorinv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_tensorsolve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_vander_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_vecdot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linalg_vector_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linspace_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_linspace_tensor_overload_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_log_normal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_log_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_log_softmax_with_dtype_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logaddexp2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logaddexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logcumsumexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logdet_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logical_and_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logical_not_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logical_or_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logical_xor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logspace_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logspace_tensor_overload_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_logsumexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_lu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_lu_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_lu_unpack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mH_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mT_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_argmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_argmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_cumprod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_cumsum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_log_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_logaddexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_logsumexp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_median_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_normalize_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_select_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_softmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_std_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_sum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_masked_var_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_matrix_exp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_max_pool2d_with_indices_backward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_max_reduction_no_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_max_reduction_with_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_maximum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_median_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_meshgrid_list_of_tensors_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_meshgrid_variadic_tensors_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_min_reduction_no_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_min_reduction_with_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_minimum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mode_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_movedim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_msort_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_multinomial_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mv_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nan_to_num_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nanmean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nanmedian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nanquantile_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nansum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_narrow_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_narrow_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_native_batch_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_native_dropout_backward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_native_layer_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_new_empty_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_new_empty_strided_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_new_full_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_new_ones_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_new_zeros_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nextafter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_alpha_dropout_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_avg_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_avg_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_avg_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_batch_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_bilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_binary_cross_entropy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_celu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_channel_shuffle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_conv1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_conv2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_conv3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_conv_transpose1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_conv_transpose2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_conv_transpose3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_cosine_embedding_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_cosine_similarity_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_cross_entropy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_ctc_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_dropout2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_dropout3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_dropout_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_elu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_embedding_bag_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_embedding_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_fractional_max_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_fractional_max_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_gaussian_nll_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_gelu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_glu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_grid_sample_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_group_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_hinge_embedding_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_huber_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_instance_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_area_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_bicubic_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_bilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_linear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_nearest_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_interpolate_trilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_kl_div_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_l1_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_layer_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_linear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_local_response_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_logsigmoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_margin_ranking_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_pool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_pool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_pool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_unpool1d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_unpool1d_grad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_unpool2d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_unpool2d_grad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_unpool3d_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_max_unpool3d_grad_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_mish_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_mse_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_multi_head_attention_forward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_multi_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_multilabel_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_nll_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_normalize_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pad_circular_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pad_constant_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pad_reflect_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pad_replicate_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pad_replicate_negative_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pairwise_distance_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pdist_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pixel_shuffle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_pixel_unshuffle_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_poisson_nll_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_prelu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_rms_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_rrelu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_selu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_silu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_smooth_l1_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_soft_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_softmin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_softmin_with_dtype_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_softshrink_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_triplet_margin_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_unfold_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_upsample_bilinear_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nn_functional_upsample_nearest_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nonzero_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_nonzero_static_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_norm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_norm_fro_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_norm_inf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_norm_nuc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_normal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_normal_in_place_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_normal_number_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_ones_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_ones_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_ormqr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_outer_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_pca_lowrank_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_permute_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_pinverse_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_polar_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_polygamma_polygamma_n_0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_polygamma_polygamma_n_1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_polygamma_polygamma_n_2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_polygamma_polygamma_n_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_polygamma_polygamma_n_4_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_positive_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_put_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_qr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_quantile_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_rad2deg_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_rand_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_randint_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_randint_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_randn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_randn_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_ravel_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_real_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_renorm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_repeat_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_repeat_interleave_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_resize__cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_resize_as__cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_resolve_conj_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_resolve_neg_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_roll_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_rot90_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_round_decimals_0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_round_decimals_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_round_decimals_neg_3_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scalar_tensor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_add_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_reduce_amax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_reduce_amin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_reduce_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_reduce_prod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_scatter_reduce_sum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_searchsorted_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_select_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_select_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_sgn_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_bartlett_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_blackman_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_cosine_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_exponential_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_gaussian_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_general_cosine_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_general_hamming_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_hamming_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_hann_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_kaiser_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signal_windows_nuttall_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_signbit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_sinc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_slice_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_slice_scatter_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_softmax_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_softmax_with_dtype_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_sort_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_sparse_mm_reduce_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_sparse_sampled_addmm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_airy_ai_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_bessel_j0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_bessel_j1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_bessel_y0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_bessel_y1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_chebyshev_polynomial_t_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_chebyshev_polynomial_u_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_chebyshev_polynomial_v_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_chebyshev_polynomial_w_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_entr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_erfcx_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_hermite_polynomial_h_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_hermite_polynomial_he_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_i0e_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_i1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_i1e_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_laguerre_polynomial_l_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_legendre_polynomial_p_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_log_ndtr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_modified_bessel_i0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_modified_bessel_i1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_modified_bessel_k0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_modified_bessel_k1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_ndtr_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_ndtri_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_scaled_modified_bessel_k0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_scaled_modified_bessel_k1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_spherical_bessel_j0_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_xlog1py_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_special_zeta_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_split_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_split_list_args_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_split_with_sizes_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_split_with_sizes_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_square_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_squeeze_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_squeeze_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_squeeze_multiple_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_stack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_std_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_std_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_std_mean_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_std_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_stft_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_sum_to_size_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_svd_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_svd_lowrank_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_t_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_take_along_dim_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_take_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_tensor_split_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_tensordot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_tile_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_to_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_to_sparse_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_topk_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_trace_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_transpose_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_trapezoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_trapz_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_triangular_solve_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_tril_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_triu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unbind_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unbind_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unflatten_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unfold_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unfold_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_uniform_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unique_consecutive_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unique_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unsafe_chunk_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unsafe_split_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_unsqueeze_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_var_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_var_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_var_mean_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_var_unbiased_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_vdot_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_view_as_complex_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_view_copy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_vsplit_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_vstack_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_xlogy_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_zero__cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_zeros_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_unsupported_zeros_like_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working___radd___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working___rdiv___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working___rmod___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working___rmul___cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_abs_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_acos_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_add_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_addcmul_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_addmm_decomposed_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_asin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_atan2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_atan_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_bool_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_byte_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_ceil_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_char_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_clamp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_contiguous_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_cos_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_cosh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_div_floor_rounding_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_div_no_rounding_mode_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_div_trunc_rounding_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_double_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_eq_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_erf_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_erfc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_exp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_expand_as_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_expand_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_expm1_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_float_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_floor_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_fmod_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_ge_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_gt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_half_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_int_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_isnan_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_le_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_lerp_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_lgamma_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_log10_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_log1p_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_log2_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_log_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_long_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_lt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_masked_fill_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_max_binary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_mean_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_min_binary_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_mm_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_mul_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_ne_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_neg_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_hardshrink_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_hardsigmoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_hardswish_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_hardtanh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_leaky_relu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_relu6_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_relu_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_softplus_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_softsign_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_tanhshrink_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_nn_functional_threshold_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_permute_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_pow_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_reciprocal_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_remainder_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_reshape_as_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_reshape_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_round_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_rsqrt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_rsub_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_short_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sigmoid_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sign_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sin_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sinh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sqrt_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sub_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_sum_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_t_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_tan_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_tanh_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_transpose_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_true_divide_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_trunc_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_unsqueeze_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_view_as_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_view_cuda_float32, test/test_jit_fuser_te.py::TestNNCOpInfoCUDA::test_working_where_cuda_float32 2025-12-04T12:45:48.9347592Z 2025-12-04T12:45:48.9347759Z Finished test_jit_fuser_te 1/1 ... [2025-12-04 12:45:48.743920][4977131.240102888], took 7.43min 2025-12-04T12:45:48.9348256Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:45:48.9348618Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:45:48.9348844Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T12:45:48.9349033Z Uploading artifacts took 0.00 seconds 2025-12-04T12:45:48.9349208Z Running nn/test_dropout 1/1 ... [2025-12-04 12:45:48.762877][4977131.259063721] 2025-12-04T12:45:48.9349387Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:45:48.9349821Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'nn/test_dropout.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:45:48.763360] 2025-12-04T12:45:52.1879258Z 2025-12-04T12:45:52.1880390Z nn/test_dropout 1/1 was successful, full logs can be found in artifacts with path test/test-reports/nn.test_dropout_1.1_b428b0c60966961e_.log 2025-12-04T12:45:52.1882321Z Running 9 items in this shard: test/nn/test_dropout.py::TestDropoutNN::test_AlphaDropout, test/nn/test_dropout.py::TestDropoutNN::test_FeatureAlphaDropout, test/nn/test_dropout.py::TestDropoutNN::test_invalid_dropout_p, test/nn/test_dropout.py::TestDropoutNN::test_native_dropout_corner_case, test/nn/test_dropout.py::TestDropoutNNDeviceTypeCUDA::test_Dropout1d_cuda_float64, test/nn/test_dropout.py::TestDropoutNNDeviceTypeCUDA::test_Dropout2d_cuda, test/nn/test_dropout.py::TestDropoutNNDeviceTypeCUDA::test_Dropout3d_cuda, test/nn/test_dropout.py::TestDropoutNNDeviceTypeCUDA::test_Dropout_cuda, test/nn/test_dropout.py::TestDropoutNNDeviceTypeCUDA::test_empty_dropout_cuda 2025-12-04T12:45:52.1883927Z 2025-12-04T12:45:52.1884122Z Finished nn/test_dropout 1/1 ... [2025-12-04 12:45:52.187619][4977134.683803856], took 0.06min 2025-12-04T12:45:52.1909447Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:45:52.2064562Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:45:52.2070671Z Running functorch/dim/test_split 1/1 ... [2025-12-04 12:45:52.206801][4977134.702986946] 2025-12-04T12:45:52.2071328Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:45:52.2074999Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'functorch/dim/test_split.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:45:52.207282] 2025-12-04T12:45:54.4261911Z 2025-12-04T12:45:54.4263109Z functorch/dim/test_split 1/1 was successful, full logs can be found in artifacts with path test/test-reports/functorch.dim.test_split_1.1_fccf1d6c356abbdd_.log 2025-12-04T12:45:54.4272870Z Running 23 items in this shard: test/functorch/dim/test_split.py::TestSplit::test_device_handling, test/functorch/dim/test_split.py::TestSplit::test_dim_object_split_all_bound, test/functorch/dim/test_split.py::TestSplit::test_dim_object_split_mixed_bound_unbound, test/functorch/dim/test_split.py::TestSplit::test_dim_object_split_multiple_unbound, test/functorch/dim/test_split.py::TestSplit::test_dim_object_split_unbound, test/functorch/dim/test_split.py::TestSplit::test_dim_object_split_uneven_remainder, test/functorch/dim/test_split.py::TestSplit::test_dimension_binding_consistency, test/functorch/dim/test_split.py::TestSplit::test_edge_case_single_element_splits, test/functorch/dim/test_split.py::TestSplit::test_error_bound_sizes_exceed_tensor, test/functorch/dim/test_split.py::TestSplit::test_error_dim_parameter_with_int_sizes, test/functorch/dim/test_split.py::TestSplit::test_error_mixed_types, test/functorch/dim/test_split.py::TestSplit::test_error_nonexistent_dimension, test/functorch/dim/test_split.py::TestSplit::test_error_size_mismatch, test/functorch/dim/test_split.py::TestSplit::test_large_tensor_split, test/functorch/dim/test_split.py::TestSplit::test_plain_tensor_regular_split_still_works, test/functorch/dim/test_split.py::TestSplit::test_split_different_dims, test/functorch/dim/test_split.py::TestSplit::test_split_function_directly, test/functorch/dim/test_split.py::TestSplit::test_split_on_plain_tensor_with_fcd_args, test/functorch/dim/test_split.py::TestSplit::test_split_preserves_dtype, test/functorch/dim/test_split.py::TestSplit::test_split_result_tensor_types, test/functorch/dim/test_split.py::TestSplit::test_split_single_dim_object, test/functorch/dim/test_split.py::TestSplit::test_split_with_dim_object_parameter, test/functorch/dim/test_split.py::TestSplit::test_split_with_requires_grad 2025-12-04T12:45:54.4281608Z 2025-12-04T12:45:54.4282004Z Finished functorch/dim/test_split 1/1 ... [2025-12-04 12:45:54.425906][4977136.922090426], took 0.04min 2025-12-04T12:45:54.4297230Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:45:54.4450489Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:45:54.4457428Z Running torch_np/numpy_tests/lib/test_type_check 1/1 ... [2025-12-04 12:45:54.445433][4977136.941619469] 2025-12-04T12:45:54.4458221Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:45:54.4461549Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'torch_np/numpy_tests/lib/test_type_check.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:45:54.445915] 2025-12-04T12:45:56.7646487Z 2025-12-04T12:45:56.7648455Z torch_np/numpy_tests/lib/test_type_check 1/1 was successful, full logs can be found in artifacts with path test/test-reports/torch_np.numpy_tests.lib.test_type_check_1.1_78d94179e9ec9b1c_.log 2025-12-04T12:45:56.7667064Z Running 50 items in this shard: test/torch_np/numpy_tests/lib/test_type_check.py::TestCommonType::test_basic, test/torch_np/numpy_tests/lib/test_type_check.py::TestMintypecode::test_default_1, test/torch_np/numpy_tests/lib/test_type_check.py::TestMintypecode::test_default_2, test/torch_np/numpy_tests/lib/test_type_check.py::TestMintypecode::test_default_3, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsscalar::test_basic, test/torch_np/numpy_tests/lib/test_type_check.py::TestReal::test_cmplx, test/torch_np/numpy_tests/lib/test_type_check.py::TestReal::test_real, test/torch_np/numpy_tests/lib/test_type_check.py::TestImag::test_cmplx, test/torch_np/numpy_tests/lib/test_type_check.py::TestImag::test_real, test/torch_np/numpy_tests/lib/test_type_check.py::TestIscomplex::test_fail, test/torch_np/numpy_tests/lib/test_type_check.py::TestIscomplex::test_pass, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsreal::test_fail, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsreal::test_isreal_real, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsreal::test_pass, test/torch_np/numpy_tests/lib/test_type_check.py::TestIscomplexobj::test_basic, test/torch_np/numpy_tests/lib/test_type_check.py::TestIscomplexobj::test_list, test/torch_np/numpy_tests/lib/test_type_check.py::TestIscomplexobj::test_scalar, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsrealobj::test_basic, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsnan::test_complex, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsnan::test_complex1, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsnan::test_goodvalues, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsnan::test_ind, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsnan::test_integer, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsnan::test_neginf, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsnan::test_posinf, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsfinite::test_complex, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsfinite::test_complex1, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsfinite::test_goodvalues, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsfinite::test_ind, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsfinite::test_integer, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsfinite::test_neginf, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsfinite::test_posinf, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsinf::test_goodvalues, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsinf::test_ind, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsinf::test_neginf, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsinf::test_neginf_scalar, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsinf::test_posinf, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsinf::test_posinf_scalar, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsposinf::test_generic, test/torch_np/numpy_tests/lib/test_type_check.py::TestIsneginf::test_generic, test/torch_np/numpy_tests/lib/test_type_check.py::TestNanToNum::test_array, test/torch_np/numpy_tests/lib/test_type_check.py::TestNanToNum::test_complex_bad, test/torch_np/numpy_tests/lib/test_type_check.py::TestNanToNum::test_complex_bad2, test/torch_np/numpy_tests/lib/test_type_check.py::TestNanToNum::test_complex_good, test/torch_np/numpy_tests/lib/test_type_check.py::TestNanToNum::test_do_not_rewrite_previous_keyword, test/torch_np/numpy_tests/lib/test_type_check.py::TestNanToNum::test_float, test/torch_np/numpy_tests/lib/test_type_check.py::TestNanToNum::test_generic, test/torch_np/numpy_tests/lib/test_type_check.py::TestNanToNum::test_integer, test/torch_np/numpy_tests/lib/test_type_check.py::TestRealIfClose::test_basic, test/torch_np/numpy_tests/lib/test_type_check.py::TestArrayConversion::test_asfarray 2025-12-04T12:45:56.7684206Z 2025-12-04T12:45:56.7684667Z Finished torch_np/numpy_tests/lib/test_type_check 1/1 ... [2025-12-04 12:45:56.764332][4977139.260517252], took 0.04min 2025-12-04T12:45:56.7686029Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:45:56.7830342Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:45:56.7836091Z Running cpp_extensions/test_libtorch_agnostic 1/1 ... [2025-12-04 12:45:56.783372][4977139.279558394] 2025-12-04T12:45:56.7836794Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:45:56.7841786Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'cpp_extensions/test_libtorch_agnostic.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:45:56.783886] 2025-12-04T12:46:13.5795684Z 2025-12-04T12:46:13.5797251Z cpp_extensions/test_libtorch_agnostic 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp_extensions.test_libtorch_agnostic_1.1_db0fc8526f21b465_.log 2025-12-04T12:46:13.5824785Z Running 55 items in this shard: test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_default_constructor_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_device_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_device_guard_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_device_guard_set_index_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_divide_neg_exp_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_exp_neg_is_leaf_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_fill_infinity_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_get_any_data_ptr_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_get_current_device_index_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_get_num_threads_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_get_template_any_data_ptr_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_identity_does_not_hog_memory_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_is_contiguous_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_make_tensor_clones_and_call_foreach_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_mv_tensor_accessor_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my__foreach_mul__cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my__foreach_mul_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_abs_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_amax_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_amax_vec_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_clone_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_copy__cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_cuda_stream_synchronize_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_empty_layout0_memory_format0_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_empty_layout0_memory_format1_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_empty_layout0_memory_format2_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_empty_layout1_memory_format0_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_empty_layout1_memory_format1_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_empty_layout1_memory_format2_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_empty_layout2_memory_format0_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_empty_layout2_memory_format1_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_empty_layout2_memory_format2_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_empty_like_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_flatten_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_get_cuda_stream_from_pool_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_get_curr_cuda_blas_handle_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_get_current_cuda_stream_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_is_cpu_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_narrow_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_new_empty_dtype_variant_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_new_zeros_dtype_variant_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_ones_like_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_pad_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_reshape_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_set_current_cuda_stream_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_shape_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_string_op_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_transpose_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_view_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_my_zero__cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_neg_exp_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_parallel_for_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_slow_sgd_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_stream_cuda, test/cpp_extensions/test_libtorch_agnostic.py::TestLibtorchAgnosticCUDA::test_tensor_device_cuda 2025-12-04T12:46:13.5850303Z 2025-12-04T12:46:13.5850751Z Finished cpp_extensions/test_libtorch_agnostic 1/1 ... [2025-12-04 12:46:13.579547][4977156.075730745], took 0.28min 2025-12-04T12:46:13.5852097Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:46:13.5981350Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:46:13.5987920Z Running test_cpp_extensions_stream_and_event 1/1 ... [2025-12-04 12:46:13.598532][4977156.094719187] 2025-12-04T12:46:13.5988683Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:46:13.5992942Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_cpp_extensions_stream_and_event.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:46:13.599035] 2025-12-04T12:46:15.4676972Z 2025-12-04T12:46:15.4678746Z test_cpp_extensions_stream_and_event 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_cpp_extensions_stream_and_event_1.1_48aaaa0ae62a5229_.log 2025-12-04T12:46:15.4680258Z Running 1 items in this shard: test/test_cpp_extensions_stream_and_event.py::TestCppExtensionStreamAndEvent::test_stream_event 2025-12-04T12:46:15.4680916Z 2025-12-04T12:46:15.4681341Z Finished test_cpp_extensions_stream_and_event 1/1 ... [2025-12-04 12:46:15.467285][4977157.963469213], took 0.03min 2025-12-04T12:46:15.4707375Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:46:15.4859250Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:46:15.4865311Z Running profiler/test_execution_trace 1/1 ... [2025-12-04 12:46:15.486257][4977157.982443836] 2025-12-04T12:46:15.4866008Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:46:15.4869854Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'profiler/test_execution_trace.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:46:15.486744] 2025-12-04T12:46:22.3256323Z 2025-12-04T12:46:22.3257642Z profiler/test_execution_trace 1/1 was successful, full logs can be found in artifacts with path test/test-reports/profiler.test_execution_trace_1.1_75a88de72abd6544_.log 2025-12-04T12:46:22.3265806Z Running 13 items in this shard: test/profiler/test_execution_trace.py::TestExecutionTraceCUDA::test_execution_trace_alone_cuda, test/profiler/test_execution_trace.py::TestExecutionTraceCUDA::test_execution_trace_env_disabled_cuda, test/profiler/test_execution_trace.py::TestExecutionTraceCUDA::test_execution_trace_env_enabled_with_kineto_cuda, test/profiler/test_execution_trace.py::TestExecutionTraceCUDA::test_execution_trace_env_enabled_with_pt2_cuda, test/profiler/test_execution_trace.py::TestExecutionTraceCUDA::test_execution_trace_nested_tensor_cuda, test/profiler/test_execution_trace.py::TestExecutionTraceCUDA::test_execution_trace_no_capture_cuda, test/profiler/test_execution_trace.py::TestExecutionTraceCUDA::test_execution_trace_record_integral_tensor_data_cuda, test/profiler/test_execution_trace.py::TestExecutionTraceCUDA::test_execution_trace_record_integral_tensor_range_cuda, test/profiler/test_execution_trace.py::TestExecutionTraceCUDA::test_execution_trace_repeat_in_loop_cuda, test/profiler/test_execution_trace.py::TestExecutionTraceCUDA::test_execution_trace_start_stop_cuda, test/profiler/test_execution_trace.py::TestExecutionTraceCUDA::test_execution_trace_with_kineto_cuda, test/profiler/test_execution_trace.py::TestExecutionTraceCUDA::test_execution_trace_with_pt2_cuda, test/profiler/test_execution_trace.py::TestExecutionTraceCUDA::test_triton_fx_graph_with_et_cuda 2025-12-04T12:46:22.3272579Z 2025-12-04T12:46:22.3273004Z Finished profiler/test_execution_trace 1/1 ... [2025-12-04 12:46:22.325299][4977164.821483285], took 0.11min 2025-12-04T12:46:22.3288013Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:46:22.3442554Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:46:22.3447960Z Running test_dispatch 1/1 ... [2025-12-04 12:46:22.344568][4977164.840754283] 2025-12-04T12:46:22.3448611Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:46:22.3453133Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_dispatch.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:46:22.345048] 2025-12-04T12:46:44.8500740Z 2025-12-04T12:46:44.8501581Z test_dispatch 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_dispatch_1.1_da1fd1cd694f930b_.log 2025-12-04T12:46:44.8506375Z Running 32 items in this shard: test/test_dispatch.py::TestDispatch::test_all_invariants, test/test_dispatch.py::TestDispatch::test_computed_table, test/test_dispatch.py::TestDispatch::test_computed_table_with_ambiguous_autogradother, test/test_dispatch.py::TestDispatch::test_computed_table_with_autograd, test/test_dispatch.py::TestDispatch::test_computed_table_with_cpu_autograd_defaultbackend, test/test_dispatch.py::TestDispatch::test_computed_table_with_cpu_autograd_math, test/test_dispatch.py::TestDispatch::test_computed_table_with_cpu_autograd_math_defaultbackend, test/test_dispatch.py::TestDispatch::test_computed_table_with_cpu_defaultbackend, test/test_dispatch.py::TestDispatch::test_computed_table_with_cpu_math, test/test_dispatch.py::TestDispatch::test_computed_table_with_cpu_math_autogradcpu_fallthrough, test/test_dispatch.py::TestDispatch::test_computed_table_with_math, test/test_dispatch.py::TestDispatch::test_def, test/test_dispatch.py::TestDispatch::test_def_impl_schema_mismatch, test/test_dispatch.py::TestDispatch::test_def_only, test/test_dispatch.py::TestDispatch::test_def_with_explicit_alias, test/test_dispatch.py::TestDispatch::test_def_with_inference, test/test_dispatch.py::TestDispatch::test_dispatch_print_registrations_for_dispatch_key_invalid, test/test_dispatch.py::TestDispatch::test_find_dangling_impls, test/test_dispatch.py::TestDispatch::test_find_dangling_impls_ext, test/test_dispatch.py::TestDispatch::test_impl_only, test/test_dispatch.py::TestDispatch::test_multiple_def_alias_defaulting, test/test_dispatch.py::TestDispatch::test_multiple_def_alias_mismatch, test/test_dispatch.py::TestDispatch::test_multiple_def_error, test/test_dispatch.py::TestDispatch::test_multiple_fallback, test/test_dispatch.py::TestDispatch::test_overwrite_math, test/test_dispatch.py::TestPythonDispatcher::test_autogradother, test/test_dispatch.py::TestPythonDispatcher::test_basic, test/test_dispatch.py::TestPythonDispatcher::test_defaultbackend_autogradcpu, test/test_dispatch.py::TestPythonDispatcher::test_defaultbackend_math, test/test_dispatch.py::TestPythonDispatcher::test_duplicate_registrations, test/test_dispatch.py::TestPythonDispatcher::test_math_autogradcpu, test/test_dispatch.py::TestPythonDispatcher::test_quantized_structured_not_implemented 2025-12-04T12:46:44.8512464Z 2025-12-04T12:46:44.8512814Z Finished test_dispatch 1/1 ... [2025-12-04 12:46:44.849675][4977187.345863718], took 0.38min 2025-12-04T12:46:44.8531693Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:46:44.8684645Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:46:44.8691054Z Running test_datapipe 1/1 ... [2025-12-04 12:46:44.868826][4977187.365011458] 2025-12-04T12:46:44.8691664Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:46:44.8694953Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_datapipe.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:46:44.869286] 2025-12-04T12:46:53.7982659Z 2025-12-04T12:46:53.7984022Z test_datapipe 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_datapipe_1.1_54423db16ae0cca3_.log 2025-12-04T12:46:53.8017295Z Running 93 items in this shard: test/test_datapipe.py::TestDataChunk::test_as_string, test/test_datapipe.py::TestDataChunk::test_getitem, test/test_datapipe.py::TestDataChunk::test_iter, test/test_datapipe.py::TestDataChunk::test_len, test/test_datapipe.py::TestDataChunk::test_random_shuffle, test/test_datapipe.py::TestDataChunk::test_reverse, test/test_datapipe.py::TestDataChunk::test_sort, test/test_datapipe.py::TestStreamWrapper::test_api, test/test_datapipe.py::TestStreamWrapper::test_dir, test/test_datapipe.py::TestStreamWrapper::test_pickle, test/test_datapipe.py::TestStreamWrapper::test_repr, test/test_datapipe.py::TestIterableDataPipeBasic::test_demux_mux_datapipe, test/test_datapipe.py::TestIterableDataPipeBasic::test_groupby_iterable_datapipe, test/test_datapipe.py::TestIterableDataPipeBasic::test_listdirfiles_iterable_datapipe, test/test_datapipe.py::TestIterableDataPipeBasic::test_listdirfilesdeterministic_iterable_datapipe, test/test_datapipe.py::TestIterableDataPipeBasic::test_map_with_col_file_handle_datapipe, test/test_datapipe.py::TestIterableDataPipeBasic::test_openfilesfromdisk_iterable_datapipe, test/test_datapipe.py::TestIterableDataPipeBasic::test_routeddecoder_iterable_datapipe, test/test_datapipe.py::TestCaptureDataFrame::test_basic_capture, test/test_datapipe.py::TestDataFramesPipes::test_batch, test/test_datapipe.py::TestDataFramesPipes::test_capture, test/test_datapipe.py::TestDataFramesPipes::test_collate, test/test_datapipe.py::TestDataFramesPipes::test_filter, test/test_datapipe.py::TestDataFramesPipes::test_shuffle, test/test_datapipe.py::TestDataFramesPipes::test_unbatch, test/test_datapipe.py::TestFunctionalIterDataPipe::test_batch_iterdatapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_collate_iterdatapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_concat_iterdatapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_demux_iterdatapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_docstring, test/test_datapipe.py::TestFunctionalIterDataPipe::test_filter_datapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_fork_iterdatapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_iterable_wrapper_datapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_map_dict_with_col_iterdatapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_map_iterdatapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_map_tuple_list_with_col_iterdatapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_mux_iterdatapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_sampler_iterdatapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_serializable, test/test_datapipe.py::TestFunctionalIterDataPipe::test_serializable_with_dill, test/test_datapipe.py::TestFunctionalIterDataPipe::test_shuffler_iterdatapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_stream_reader_iterdatapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_unbatch_iterdatapipe, test/test_datapipe.py::TestFunctionalIterDataPipe::test_zip_iterdatapipe, test/test_datapipe.py::TestFunctionalMapDataPipe::test_batch_mapdatapipe, test/test_datapipe.py::TestFunctionalMapDataPipe::test_concat_mapdatapipe, test/test_datapipe.py::TestFunctionalMapDataPipe::test_docstring, test/test_datapipe.py::TestFunctionalMapDataPipe::test_map_mapdatapipe, test/test_datapipe.py::TestFunctionalMapDataPipe::test_sequence_wrapper_datapipe, test/test_datapipe.py::TestFunctionalMapDataPipe::test_serializable, test/test_datapipe.py::TestFunctionalMapDataPipe::test_serializable_with_dill, test/test_datapipe.py::TestFunctionalMapDataPipe::test_shuffler_mapdatapipe, test/test_datapipe.py::TestFunctionalMapDataPipe::test_zip_mapdatapipe, test/test_datapipe.py::TestTyping::test_compile_time, test/test_datapipe.py::TestTyping::test_construct_time, test/test_datapipe.py::TestTyping::test_isinstance, test/test_datapipe.py::TestTyping::test_issubinstance, test/test_datapipe.py::TestTyping::test_protocol, test/test_datapipe.py::TestTyping::test_reinforce, test/test_datapipe.py::TestTyping::test_runtime, test/test_datapipe.py::TestTyping::test_subtype, test/test_datapipe.py::TestGraph::test_simple_traverse, test/test_datapipe.py::TestGraph::test_traverse_circular_datapipe, test/test_datapipe.py::TestGraph::test_traverse_forked, test/test_datapipe.py::TestGraph::test_traverse_mapdatapipe, test/test_datapipe.py::TestGraph::test_traverse_mixdatapipe, test/test_datapipe.py::TestGraph::test_traverse_unhashable_datapipe, test/test_datapipe.py::TestSerialization::test_spawn_lambdas_iter, test/test_datapipe.py::TestSerialization::test_spawn_lambdas_map, test/test_datapipe.py::TestCircularSerialization::test_circular_serialization_with_dill, test/test_datapipe.py::TestCircularSerialization::test_circular_serialization_with_pickle, test/test_datapipe.py::TestSharding::test_legacy_custom_sharding, test/test_datapipe.py::TestSharding::test_legacy_custom_sharding_with_old_dataloader, test/test_datapipe.py::TestSharding::test_multi_sharding, test/test_datapipe.py::TestSharding::test_old_dataloader, test/test_datapipe.py::TestSharding::test_sharding_groups, test/test_datapipe.py::TestSharding::test_sharding_length, test/test_datapipe.py::TestSharding::test_simple_sharding, test/test_datapipe.py::TestIterDataPipeSingletonConstraint::test_iterdatapipe_singleton_buggy, test/test_datapipe.py::TestIterDataPipeSingletonConstraint::test_iterdatapipe_singleton_constraint_multiple_outputs, test/test_datapipe.py::TestIterDataPipeSingletonConstraint::test_iterdatapipe_singleton_generator, test/test_datapipe.py::TestIterDataPipeSingletonConstraint::test_iterdatapipe_singleton_new_object, test/test_datapipe.py::TestIterDataPipeSingletonConstraint::test_iterdatapipe_singleton_self_next, test/test_datapipe.py::TestIterDataPipeCountSampleYielded::test_iterdatapipe_sample_yielded_generator_function, test/test_datapipe.py::TestIterDataPipeCountSampleYielded::test_iterdatapipe_sample_yielded_generator_function_exception, test/test_datapipe.py::TestIterDataPipeCountSampleYielded::test_iterdatapipe_sample_yielded_next, test/test_datapipe.py::TestIterDataPipeCountSampleYielded::test_iterdatapipe_sample_yielded_next_exception, test/test_datapipe.py::TestIterDataPipeCountSampleYielded::test_iterdatapipe_sample_yielded_return_self, test/test_datapipe.py::TestIterDataPipeGraphFastForward::test_simple_snapshot_custom_non_generator, test/test_datapipe.py::TestIterDataPipeGraphFastForward::test_simple_snapshot_custom_self_next, test/test_datapipe.py::TestIterDataPipeGraphFastForward::test_simple_snapshot_graph, test/test_datapipe.py::TestIterDataPipeGraphFastForward::test_simple_snapshot_graph_repeated, test/test_datapipe.py::TestIterDataPipeGraphFastForward::test_simple_snapshot_graph_with_serialization 2025-12-04T12:46:53.8048683Z 2025-12-04T12:46:53.8049148Z Finished test_datapipe 1/1 ... [2025-12-04 12:46:53.797954][4977196.294137802], took 0.15min 2025-12-04T12:46:53.8050434Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T12:46:53.8169637Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T12:46:53.8175900Z Running test_ops 1/5 ... [2025-12-04 12:46:53.817325][4977196.313511508] 2025-12-04T12:46:53.8176479Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T12:46:53.8180268Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '--shard-id=1', '--num-shards=5', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 12:46:53.817794] 2025-12-04T13:35:25.6566950Z 2025-12-04T13:35:25.6567712Z PRINTING LOG FILE of test_ops 1/5 (test/test-reports/test_ops_1.5_495b45214c930060_.log) 2025-12-04T13:35:25.6568944Z Test results will be stored in test-reports/python-pytest/test_ops/test_ops-6b6e7e1c3695e7c0.xml 2025-12-04T13:35:25.6570160Z ============================= test session starts ============================== 2025-12-04T13:35:25.6570876Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T13:35:25.6571514Z cachedir: .pytest_cache 2025-12-04T13:35:25.6572547Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T13:35:25.6573626Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T13:35:25.6574196Z configfile: pytest.ini 2025-12-04T13:35:25.6575190Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T13:35:25.6576018Z collecting ... collected 33666 items 2025-12-04T13:35:25.6576495Z stepcurrent: Cannot find last run test, not skipping 2025-12-04T13:35:25.7761580Z Running 6911 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rand___cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rmatmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rpow___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rxor___cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_special_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cov_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resize__cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_slice_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_torch__scaled_mm_v2_cuda_float8_e4m3fn, test/test_ops.py::TestCommonCUDA::test_compare_cpu_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing___getitem___cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_abs_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_alias_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_partial_views_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_bfloat16_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_chalf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_clone_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_diag_embed_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_double_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_dstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_empty_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_hfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_ifft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_ifftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_irfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_full_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_isfinite_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_long_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nanmean_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_new_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nonzero_static_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_ones_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_permute_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_prod_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_rand_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_real_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_resolve_neg_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_split_with_sizes_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sub_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_tan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_trace_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_triu_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unsafe_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unsqueeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_vsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_vstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_where_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes_H_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes___radd___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes___rpow___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes___rsub___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__chunk_cat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_double_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_float_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_abs_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_acos_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_addcmul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_addr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_alias_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atleast_3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_or_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_clamp_max_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_conj_physical_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cosh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_empty_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_erf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_fftshift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_flip_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_floor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_index_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lgamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_not_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_meshgrid_variadic_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_neg_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_alpha_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_softplus_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_real_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_reshape_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_rsqrt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sinh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_bessel_j1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_squeeze_multiple_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_stack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_tanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_transpose_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_true_divide_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_view_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__segment_reduce_offsets_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_abs_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_acosh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_add_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_alias_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_argmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_as_strided_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_as_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_baddbmm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bfloat16_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_right_shift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_broadcast_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cdist_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_chalf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cholesky_inverse_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cholesky_solve_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cummin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cumulative_trapezoid_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_deg2rad_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diagonal_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_digamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_empty_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_equal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_flatten_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_frac_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_histc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isfinite_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isposinf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lerp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_eig_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_eigh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_ldl_factor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lstsq_grad_oriented_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_qr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_svdvals_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_vecdot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logical_and_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logical_not_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_long_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mT_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_cumsum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_select_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_reduction_no_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mvlgamma_mvlgamma_p_5_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nanquantile_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ne_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_binary_cross_entropy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv_transpose3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_feature_alpha_dropout_without_train_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_fractional_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_gaussian_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_grid_sample_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_group_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_bicubic_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool1d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multi_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multilabel_soft_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_normalize_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pixel_unshuffle_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_rms_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_rrelu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_silu_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_silu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_triplet_margin_with_distance_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nonzero_static_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_norm_inf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_norm_nuc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_normal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_normal_number_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_quantile_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_rad2deg_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_real_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_roll_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scalar_tensor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_add_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_sum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_select_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_general_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_j1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_y0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_y1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_erfcx_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_ndtri_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_polygamma_special_polygamma_n_0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_split_with_sizes_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_square_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_stack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_stft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sub_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_svd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_to_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_torch_ops_aten__safe_softmax_default_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_transpose_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_trunc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unbind_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_var_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_var_mean_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_as_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_zero__cuda, test/test_ops.py::TestCommonCUDA::test_errors___rand___cuda, test/test_ops.py::TestCommonCUDA::test_errors___rmod___cuda, test/test_ops.py::TestCommonCUDA::test_errors__chunk_cat_cuda, test/test_ops.py::TestCommonCUDA::test_errors_amax_cuda, test/test_ops.py::TestCommonCUDA::test_errors_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bitwise_and_cuda, test/test_ops.py::TestCommonCUDA::test_errors_cat_cuda, test/test_ops.py::TestCommonCUDA::test_errors_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_errors_cov_cuda, test/test_ops.py::TestCommonCUDA::test_errors_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_errors_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_errors_dot_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ihfftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_gt_cuda, test/test_ops.py::TestCommonCUDA::test_errors_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_errors_ldexp_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logical_and_cuda, test/test_ops.py::TestCommonCUDA::test_errors_masked_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_gaussian_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_multi_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_rms_norm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_soft_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_normal_in_place_cuda, test/test_ops.py::TestCommonCUDA::test_errors_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_mul_layout4_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout1_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout4_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_hermite_polynomial_h_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_shifted_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_shifted_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sub_cuda, test/test_ops.py::TestCommonCUDA::test_errors_unbind_copy_cuda, test/test_ops.py::TestCommonCUDA::test_errors_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_errors_view_as_cuda, test/test_ops.py::TestCommonCUDA::test_errors_view_copy_cuda, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_aminmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_hash_tensor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rand___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argwhere_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_2d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_3d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_block_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bucketize_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_byte_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_combinations_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_corrcoef_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cummax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagflat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diff_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_double_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flipud_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gradient_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hash_tensor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_heaviside_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_put_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isreal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_eig_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mT_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_movedim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_narrow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_poisson_nll_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_silu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ones_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ones_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_4_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_interleave_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resolve_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_decimals_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_select_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_hermite_polynomial_h_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_hermite_polynomial_he_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_log_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_list_args_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_with_sizes_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_topk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_triu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unique_consecutive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unique_consecutive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unravel_index_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsafe_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_H_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___radd___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_abs_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_alias_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_argwhere_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_as_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_asinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atleast_1d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atleast_2d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_block_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_broadcast_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_chalf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_clone_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_combinations_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_conj_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cummin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diagonal_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_div_no_rounding_mode_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_dstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_permuted_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ifftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_irfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_rfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_flip_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fliplr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_float_power_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fmax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_gather_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_half_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_hash_tensor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_heaviside_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_hsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_int_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_le_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_masked_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_max_binary_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_min_binary_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_min_reduction_no_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_min_reduction_with_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_minimum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_msort_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_feature_alpha_dropout_without_train_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pad_circular_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pixel_unshuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_unfold_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nonzero_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nonzero_static_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_outer_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_polygamma_polygamma_n_0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_polygamma_polygamma_n_3_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_prod_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_put_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_repeat_interleave_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_rot90_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sinc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sort_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_bessel_y1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_i0e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_squeeze_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_squeeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tile_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_to_sparse_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_transpose_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_true_divide_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unbind_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unsqueeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_view_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_zero__cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_T_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rand___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rdiv___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmod___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rpow___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rpow___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rsub___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_abs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_alias_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_alias_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_all_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_arange_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_3d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cfloat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_inverse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_contiguous_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_contiguous_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagflat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diff_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_einsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eye_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gather_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_geqrf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_grid_sampler_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isposinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isreal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_item_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_4inputs_with_extra_args_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_return_by_ref_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_unary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kron_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kthvalue_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ldexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_le_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_slogdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorsolve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vander_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_and_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_not_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_long_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mH_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_argmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matrix_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nansum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_narrow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ne_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cosine_embedding_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_feature_alpha_dropout_without_train_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_margin_ranking_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_circular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_shuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_poisson_nll_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softsign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_inf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pca_lowrank_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pca_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_positive_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reshape_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize__cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_mean_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sgn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_short_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_hann_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sort_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_entr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_erfcx_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_laguerre_polynomial_l_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_k0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tile_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_topk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapezoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapz_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapz_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triangular_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unique_consecutive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zero__cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zero__cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_numpy_ref_allclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_equal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_item_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_4inputs_with_extra_args_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_cross_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_cross_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_tensorsolve_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_tensorsolve_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vecdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_roll_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_searchsorted_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out___rmatmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_aminmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_combinations_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_corrcoef_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cov_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_geqrf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_grid_sampler_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_vander_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_matrix_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nonzero_static_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_baddbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_bernoulli_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_matrix_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_multi_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logcumsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_masked_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nanmean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nansum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_fro_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_transpose_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unbind_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_view_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_resolve_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unsafe_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_var_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning___radd___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___rmatmul___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___rmod___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___rmul___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___ror___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__batch_norm_with_update_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__chunk_cat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__native_batch_norm_legit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_cdouble_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_chalf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_abs_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_arange_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_broadcast_shapes_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_broadcast_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ceil_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_count_nonzero_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_deg2rad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_diag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_erfinv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_exp2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_expand_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ifftshift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_floor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_imag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_index_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isfinite_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isnan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isneginf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_lerp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_matrix_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_svdvals_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_meshgrid_variadic_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_neg_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_gelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_mish_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_selu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softmin_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_normal__in_place_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_select_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_entr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_split_with_sizes_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_stft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sum_to_size_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_transpose_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unfold_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_view_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_view_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_where_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__segment_reduce_lengths_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__unsafe_masked_index_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addmm_decomposed_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_amax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_arange_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_asinh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_atanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_block_diag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bool_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_combinations_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_complex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_conj_physical_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_constant_pad_nd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diag_embed_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_erfinv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_exp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_expand_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_eye_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_floor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_floor_divide_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fmod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_frac_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_frexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_full_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_geqrf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_gradient_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_grid_sampler_2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_histogramdd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_hypot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isfinite_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isreal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_unary_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_eig_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_eigvals_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_eigvalsh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_inv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_inv_ex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_rank_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_multi_dot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_norm_subgradients_at_zero_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_pinv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_solve_ex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_svd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_tensorinv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_vander_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_vecdot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logaddexp2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logdet_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lu_unpack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_max_pool2d_with_indices_backward_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_max_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_median_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_meshgrid_list_of_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_min_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_msort_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_native_batch_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_binary_cross_entropy_with_logits_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_celu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_cosine_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_dropout3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_fractional_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hardsigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_bicubic_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_trilinear_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_margin_ranking_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool2d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_replicate_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_replicate_negative_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_relu6_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_normal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ormqr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_pca_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polar_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ravel_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_repeat_interleave_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_resize_as__cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scalar_tensor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_amin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_searchsorted_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_bartlett_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_hann_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_slice_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_bessel_y0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_bessel_y1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_hermite_polynomial_h_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_log_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_scaled_modified_bessel_k0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_scaled_modified_bessel_k1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_split_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_split_with_sizes_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_squeeze_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sub_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_t_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_take_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_tan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_to_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_topk_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_transpose_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_trapz_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unbind_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_h_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_float8_e4m3fn, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hypot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_celu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_group_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_huber_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_leaky_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_selu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softplus_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_T_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_cat_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_clamp_max_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_eye_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_pow_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_t_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_complex_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_polar_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_left_shift_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_igammac_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_istft_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lcm_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_normal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_normal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nextafter_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_dropout_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_elu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_elu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_group_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_huber_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_nll_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_prelu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_selu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softshrink_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_select_scatter_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_indices_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_complex_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exponential_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hypot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_igamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_imag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_imag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svdvals_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vecdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vecdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nextafter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_glu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_number_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cauchy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_igammac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svdvals_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_alpha_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_leaky_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mish_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softplus_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_all_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_min_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_nansum_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_prod_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_var_cuda, test/test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_var_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_H_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_T_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___getitem___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___getitem___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___radd___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___radd___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmatmul___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__chunk_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_acos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_baddbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cartesian_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_corrcoef_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diff_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gradient_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_int_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_binary_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cond_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_det_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_householder_product_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_factor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_factor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_singular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vecdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logaddexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_channel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_constant_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_in_place_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ormqr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pca_lowrank_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_positive_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randint_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_repeat_interleave_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resolve_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scalar_tensor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_slice_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sparse_sampled_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_square_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_stft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_take_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unbind_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsafe_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsqueeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zero__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rdiv___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__softmax_backward_data_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_alias_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_as_strided_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_baddbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_bernoulli_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_constant_pad_nd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_copysign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diagflat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_dsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_erfc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_exp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_hfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ihfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_rfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fmod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_gather_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_hsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_svdvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log1p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_native_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_native_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_norm_inf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_ormqr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_pca_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_permute_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_quantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_rsqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_rsub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_select_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_erfcx_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_i1e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_split_list_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_squeeze_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_squeeze_multiple_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_svd_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_true_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unbind_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unsafe_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_vstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_where_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___radd___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rpow___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__softmax_backward_data_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_all_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_aminmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_angle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_any_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_as_strided_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atleast_1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_baddbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bool_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bucketize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cartesian_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cdouble_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cholesky_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_copysign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_count_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diagflat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_einsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_equal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_exp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_expand_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_eye_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_frexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_geometric_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_grid_sampler_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_gt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_inner_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lgamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log10_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log1p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_and_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_or_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_maximum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_msort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_norm_inf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_normal_in_place_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_pca_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_permute_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_resize__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_short_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_airy_ai_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_to_sparse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_triu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unique_consecutive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unique_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_view_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___radd___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__chunk_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__softmax_backward_data_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_acos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_acosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addmv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_arange_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cauchy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cdouble_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_chalf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_char_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_column_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_combinations_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_count_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diag_embed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_permuted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_erfinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expand_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expm1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ifftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ihfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_full_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_igamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isfinite_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isreal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_matmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_matrix_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nansum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_native_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_mish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ones_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_pow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rand_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_randint_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_randint_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_renorm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resize__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resolve_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_roll_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rsqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rsub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sgn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_short_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_y1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_xlog1py_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_squeeze_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rsub___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_acos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_acosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_argwhere_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_asin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atleast_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_bernoulli_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_chalf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_deg2rad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diagonal_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_dist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_empty_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_erfc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_eye_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_fftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_float_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_float_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_frac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_frexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_full_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ge_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_grid_sampler_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_hash_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_igammac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_le_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_or_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_minimum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_narrow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_native_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_new_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nextafter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_norm_nuc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_normal_in_place_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_quantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_rand_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_randint_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_repeat_interleave_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_resolve_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sgn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_short_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_entr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_xlog1py_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_transpose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_triu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unfold_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_view_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rdiv___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addcdiv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addmm_decomposed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_allclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_angle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bernoulli_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cauchy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cdouble_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cholesky_inverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_constant_pad_nd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diff_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_digamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_double_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_einsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_empty_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_irfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_frac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_frexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hypot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_igamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_vecdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logical_not_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mT_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ne_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nonzero_static_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_norm_nuc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_permute_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_positive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_randint_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_randn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_resolve_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_airy_ai_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_list_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_with_sizes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_transpose_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_transpose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_uniform_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unique_consecutive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_vstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_zeros_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view___getitem___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view___rmatmul___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_bfloat16_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_char_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_addr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_alias_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_any_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_column_stack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_conj_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_empty_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_exp2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_exp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_fft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_hfftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fliplr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_hsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_hstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_index_add_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isclose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_item_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_log_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logical_or_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logspace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_channel_shuffle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_permute_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_reciprocal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_rsub_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sigmoid_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_special_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_stft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sub_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_t_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_triu_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_true_divide_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unflatten_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unsqueeze_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addmv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_any_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_argwhere_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_block_diag_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_broadcast_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_broadcast_to_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cfloat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_chalf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cholesky_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cholesky_inverse_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_constant_pad_nd_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_corrcoef_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cumulative_trapezoid_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diagonal_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diagonal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diagonal_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_dist_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_eq_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_fftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_hfft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_ifftshift_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_irfft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_flipud_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_gather_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_geqrf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_gradient_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_hsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_hstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isclose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isfinite_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_2inputs_2outputs_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_kron_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_lerp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_eig_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_ldl_factor_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lu_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_solve_triangular_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_svdvals_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logsumexp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_cumprod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_cumsum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_fill_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_logsumexp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_sum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_matrix_exp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_narrow_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv_transpose2d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_replicate_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nonzero_static_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_norm_nuc_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_normal_in_place_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_pinverse_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_put_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_qr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_repeat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_repeat_interleave_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_reshape_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_resize_as__cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_resolve_conj_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_scatter_add_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_select_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sparse_sampled_addmm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_squeeze_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_std_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_stft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_tensor_split_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_transpose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unsqueeze_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_where_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_T_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_cfloat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_chalf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_short_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_abs_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_addcmul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_allclose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_as_strided_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_asin_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atanh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_contiguous_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_dstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_empty_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_empty_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_eq_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_expand_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_fft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_irfft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_irfftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fill_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_index_select_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isclose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isinf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_istft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_cross_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_and_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_xor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logsumexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_masked_fill_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_neg_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_empty_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_full_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_zeros_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_prod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sinc_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_split_with_sizes_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_t_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_tanh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_trace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unbind_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unbind_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_acosh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_angle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_as_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atanh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atleast_2d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_bool_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cartesian_prod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_char_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cholesky_solve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_combinations_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_einsum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_eye_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_ifftshift_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_irfft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_geqrf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_isfinite_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_kron_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_inv_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_ldl_factor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_ldl_solve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lstsq_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_rank_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_rank_hermitian_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_solve_triangular_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_log2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logcumsumexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_not_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_or_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_movedim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nanmean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_narrow_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_empty_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_ones_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_zeros_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv2d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv_transpose2d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_circular_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_softsign_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_unfold_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_normal_in_place_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_outer_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_positive_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_qr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_randn_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_repeat_interleave_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_rsqrt_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_scatter_add_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sgn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_split_list_args_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_stack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_std_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sum_to_size_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_svd_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_take_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tensordot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_to_sparse_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unsafe_chunk_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_view_as_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_view_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_view_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view___getitem___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rmatmul___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rmod___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__batch_norm_with_update_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_char_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_complex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_int_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_polar_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_acosh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_allclose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_any_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_asinh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atan2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atanh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_chunk_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_clamp_max_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_clone_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_column_stack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_deg2rad_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_div_no_rounding_mode_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_equal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_erf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_erfc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_exp2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_fftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_hfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_rfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_rfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_flip_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_select_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isclose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_le_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_lerp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_svd_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_vecdot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logical_and_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_mul_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_narrow_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_celu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_l1_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_ones_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_reshape_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_round_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_rsub_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_bessel_j0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_logit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sqrt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_square_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_std_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_t_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_trace_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_transpose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_trunc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unfold_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unsqueeze_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__unsafe_masked_index_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__upsample_bilinear2d_aa_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_acosh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addmm_decomposed_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_alias_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_arange_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atanh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atleast_1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_broadcast_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ceil_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_clamp_min_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_complex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cumsum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diag_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diff_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_div_no_rounding_mode_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_erfinv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_fft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_hfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ifft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ihfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ihfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_rfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fliplr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_flipud_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_floor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_full_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_geqrf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_grid_sampler_2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_hash_tensor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_hsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_i0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_int_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_4inputs_with_extra_args_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lgamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_eigvalsh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_inv_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_ldl_solve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lstsq_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_factor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_pinv_singular_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_solve_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_svd_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_tensorinv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_vander_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log10_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logdet_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lu_solve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mT_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_argmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_argmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_matrix_exp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_maximum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_median_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_min_binary_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_min_reduction_no_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mode_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_movedim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_full_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_avg_pool3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_celu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv_transpose1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_dropout_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_gelu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_grid_sample_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_nearest_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_kl_div_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_leaky_relu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_linear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_pool3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_multi_head_attention_forward_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pad_circular_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_rrelu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_scaled_dot_product_attention_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_softplus_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_upsample_nearest_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_norm_fro_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_norm_nuc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_4_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_positive_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_quantile_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_real_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_repeat_interleave_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resize__cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resize_as__cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resolve_conj_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_rot90_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_rsub_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_searchsorted_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_select_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_bartlett_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_exponential_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_nuttall_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_slice_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_bessel_j1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_entr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_laguerre_polynomial_l_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_log_ndtr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_modified_bessel_i0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_modified_bessel_k0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_xlog1py_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_squeeze_multiple_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_stft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sub_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_t_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_take_along_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_tensordot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_to_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_trace_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_transpose_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_trapezoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_vdot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_view_as_complex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_view_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_view_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_zero__cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_zeros_like_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake_T_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake___rmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake___rsub___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_acos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_arange_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_argmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___ror___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_aminmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_argsort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_argwhere_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_not_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_right_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clamp_min_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cummax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cummin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagonal_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_double_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_expand_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_eye_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ifft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ihfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_irfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_geometric_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_grid_sampler_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_hash_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_heaviside_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_hstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_igamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_kron_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_inv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_multi_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_vecdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logaddexp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_matmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_min_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_minimum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nan_to_num_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_prelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_normal_in_place_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ormqr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_pinverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_resize__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scalar_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_searchsorted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_hann_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signbit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_airy_ai_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_j1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_xlog1py_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_torch__scaled_mm_cuda_float8_e4m3fn, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_trace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_trunc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unique_consecutive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_vstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cauchy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_char_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rsub___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_acos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addcmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_ceil_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cholesky_inverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cov_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagonal_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_digamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_dsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_erf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_hfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ifftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_float_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_gather_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_multi_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_min_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_selu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polar_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_repeat_interleave_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rsub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_select_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_i1e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_squeeze_multiple_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_triangular_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_triu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_view_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_xlogy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_T_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_abs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_acos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atan2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cdouble_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_clamp_max_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ifftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_grid_sampler_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_hstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_inv_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log10_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log1p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_min_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_glu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_selu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_pca_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_pinverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_quantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_list_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_square_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_std_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_take_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_transpose_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unbind_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unsqueeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_exp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ifft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ifftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_irfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_flip_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_flipud_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_frac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_geqrf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_hash_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_igammac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_unary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lcm_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_ldexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_le_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_inv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_inv_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_svdvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_tensorinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_vecdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logical_and_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logical_not_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_matrix_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_narrow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_native_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_new_empty_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_positive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rad2deg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_randn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_resolve_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rsub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_searchsorted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_select_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sgn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_hann_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_bessel_j1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_log_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_std_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tensordot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_transpose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_triangular_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rdiv___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rsub___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_acos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_alias_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_all_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_argmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atleast_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bernoulli_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_broadcast_shapes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_byte_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cauchy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_char_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_div_trunc_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_double_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_permuted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_expand_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_flip_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_flipud_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_geometric_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_gt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_hstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_hypot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_int_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isneginf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logaddexp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logical_or_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_min_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nan_to_num_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_narrow_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_native_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_new_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_gelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_prelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_permute_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_randint_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_resolve_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_round_decimals_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_cosine_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_hann_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_entr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_log_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_split_list_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_squeeze_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_squeeze_multiple_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_topk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_triu_indices_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unbind_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unique_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unravel_index_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsafe_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsqueeze_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsqueeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_where_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_zeros_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_complex32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_complex32, test/test_ops.py::TestTagsCUDA::test_tags___rdiv___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___rsub___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_bool_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_short_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_allclose_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_asinh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_atan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_chunk_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_clamp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_clamp_max_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_clamp_min_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_clone_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_conj_physical_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_contiguous_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_copysign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diag_embed_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diagonal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diagonal_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_dstack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_erfc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_irfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fliplr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_floor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_floor_divide_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_frac_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_hstack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isnan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_istft_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags__refs_lerp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_svd_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_vecdot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linspace_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logaddexp2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logical_not_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_maximum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_new_ones_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_new_zeros_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nextafter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_softplus_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_rad2deg_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_round_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_rsub_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_select_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sinc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_ndtri_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_squeeze_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_squeeze_multiple_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_std_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_t_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_tan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_tril_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_triu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_trunc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_view_as_complex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_view_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_xlogy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_acosh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addbmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addmv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_amin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_arange_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_argmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_argsort_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_as_strided_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_asin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bernoulli_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bfloat16_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bitwise_right_shift_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_bmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_broadcast_to_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ceil_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cfloat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_char_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cholesky_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_chunk_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cummax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_deg2rad_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_double_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_eq_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_equal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_erfc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_exp2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_exponential_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ihfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_rfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_rfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_full_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_full_like_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_hsplit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_put_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_reduce_prod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_select_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_inner_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isclose_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isreal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ldexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_le_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lgamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_householder_product_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_pinv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_solve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logaddexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logical_and_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logical_xor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_amin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_argmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_softmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_std_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_matrix_exp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_multinomial_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_narrow_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_native_layer_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ne_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_neg_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_zeros_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nextafter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_celu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_linear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_mish_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_selu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softsign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_unfold_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_pca_lowrank_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_permute_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_pinverse_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_put_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_randint_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_reciprocal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_reshape_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_reshape_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_resize__cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_roll_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_blackman_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_cosine_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_bessel_j1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_bessel_y1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_log_ndtr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_xlog1py_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_square_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_squeeze_multiple_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_std_unbiased_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_take_along_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_to_sparse_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_trapz_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tril_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_trunc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unbind_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unsqueeze_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_var_mean_unbiased_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_vdot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_view_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_zero__cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_zeros_like_cuda_float32, test/test_ops.py::TestForwardADWithScalarsCUDA::test_0d_tensor_with_python_scalar_div_floor_rounding_cuda_float32, test/test_ops.py::TestForwardADWithScalarsCUDA::test_0d_tensor_with_python_scalar_mul_cuda_float32 2025-12-04T13:35:25.8522302Z 2025-12-04T13:35:25.8522991Z test_ops.py::TestCommonCUDA::test_compare_cpu_H_cuda_float32 SKIPPED [0.0972s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8523385Z test_ops.py::TestCommonCUDA::test_compare_cpu___rand___cuda_int64 SKIPPED [0.0013s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8523776Z test_ops.py::TestCommonCUDA::test_compare_cpu___rmatmul___cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8524170Z test_ops.py::TestCommonCUDA::test_compare_cpu___rpow___cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8524556Z test_ops.py::TestCommonCUDA::test_compare_cpu___rxor___cuda_int64 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8524940Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_T_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8525390Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_alias_copy_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8525827Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_as_strided_scatter_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8526241Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_atan2_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8526643Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_atleast_3d_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8527051Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_bucketize_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8527460Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_chunk_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8527887Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_column_stack_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8528350Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diagonal_copy_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8528781Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_div_trunc_rounding_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8529171Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_empty_cuda_float32 SKIPPED [0.0001s] (output is non-deterministic) [ 0%] 2025-12-04T13:35:25.8529546Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_expand_as_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8529950Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_expand_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8530359Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fft_fftshift_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8530830Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fmax_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8531228Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_igammac_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8531629Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_index_fill_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8532038Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_narrow_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8532418Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_new_empty_cuda_float32 SKIPPED [0.0001s] (output is non-deterministic) [ 0%] 2025-12-04T13:35:25.8532829Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_hinge_embedding_loss_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8533298Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_pixel_unshuffle_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8533699Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_normal_cuda_float32 SKIPPED [0.0001s] (output is non-deterministic) [ 0%] 2025-12-04T13:35:25.8534107Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_reshape_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8534553Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_special_softmax_with_dtype_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8534991Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_split_with_sizes_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8535397Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_tril_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8535805Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_unsqueeze_copy_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8536218Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_vstack_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8536616Z test_ops.py::TestCommonCUDA::test_compare_cpu__refs_zeros_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8537043Z test_ops.py::TestCommonCUDA::test_compare_cpu__softmax_backward_data_cuda_float32 SKIPPED [0.0012s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8537474Z test_ops.py::TestCommonCUDA::test_compare_cpu__upsample_bilinear2d_aa_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8537895Z test_ops.py::TestCommonCUDA::test_compare_cpu_addcdiv_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8538323Z test_ops.py::TestCommonCUDA::test_compare_cpu_addcmul_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8538711Z test_ops.py::TestCommonCUDA::test_compare_cpu_addmm_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8539122Z test_ops.py::TestCommonCUDA::test_compare_cpu_as_strided_partial_views_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8539532Z test_ops.py::TestCommonCUDA::test_compare_cpu_atan2_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8539926Z test_ops.py::TestCommonCUDA::test_compare_cpu_atleast_1d_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8540322Z test_ops.py::TestCommonCUDA::test_compare_cpu_bfloat16_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8540713Z test_ops.py::TestCommonCUDA::test_compare_cpu_bool_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8541094Z test_ops.py::TestCommonCUDA::test_compare_cpu_byte_cuda_float32 SKIPPED [0.0001s] (Overflow when downcasting signed type is undefined) [ 0%] 2025-12-04T13:35:25.8541479Z test_ops.py::TestCommonCUDA::test_compare_cpu_cdist_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8541863Z test_ops.py::TestCommonCUDA::test_compare_cpu_chunk_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8542261Z test_ops.py::TestCommonCUDA::test_compare_cpu_column_stack_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8542660Z test_ops.py::TestCommonCUDA::test_compare_cpu_complex_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8543095Z test_ops.py::TestCommonCUDA::test_compare_cpu_cov_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8543508Z test_ops.py::TestCommonCUDA::test_compare_cpu_diagonal_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8543899Z test_ops.py::TestCommonCUDA::test_compare_cpu_expand_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8544294Z test_ops.py::TestCommonCUDA::test_compare_cpu_fft_fftshift_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8544751Z test_ops.py::TestCommonCUDA::test_compare_cpu_histc_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8545140Z test_ops.py::TestCommonCUDA::test_compare_cpu_igamma_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8545533Z test_ops.py::TestCommonCUDA::test_compare_cpu_index_copy_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8545956Z test_ops.py::TestCommonCUDA::test_compare_cpu_index_reduce_mean_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8546363Z test_ops.py::TestCommonCUDA::test_compare_cpu_index_select_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8546763Z test_ops.py::TestCommonCUDA::test_compare_cpu_ldexp_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8547157Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eigh_cuda_float32 SKIPPED [0.0011s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8547569Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eigvals_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8547987Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_ldl_factor_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8548459Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_matrix_rank_hermitian_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8548886Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_slogdet_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8549290Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_solve_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8549699Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_solve_ex_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8550108Z test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_svd_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8550520Z test_ops.py::TestCommonCUDA::test_compare_cpu_log_softmax_with_dtype_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8550930Z test_ops.py::TestCommonCUDA::test_compare_cpu_logaddexp_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 0%] 2025-12-04T13:35:25.8551330Z test_ops.py::TestCommonCUDA::test_compare_cpu_logsumexp_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8551716Z test_ops.py::TestCommonCUDA::test_compare_cpu_mH_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8552146Z test_ops.py::TestCommonCUDA::test_compare_cpu_masked_logsumexp_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8552581Z test_ops.py::TestCommonCUDA::test_compare_cpu_masked_normalize_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8552994Z test_ops.py::TestCommonCUDA::test_compare_cpu_masked_scatter_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8553403Z test_ops.py::TestCommonCUDA::test_compare_cpu_masked_softmin_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8553750Z test_ops.py::TestCommonCUDA::test_compare_cpu_mode_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 1%] 2025-12-04T13:35:25.8554062Z test_ops.py::TestCommonCUDA::test_compare_cpu_multinomial_cuda_float32 SKIPPED [0.0001s] (output is non-deterministic) [ 1%] 2025-12-04T13:35:25.8554435Z test_ops.py::TestCommonCUDA::test_compare_cpu_nanmedian_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8554861Z test_ops.py::TestCommonCUDA::test_compare_cpu_native_batch_norm_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8555269Z test_ops.py::TestCommonCUDA::test_compare_cpu_new_zeros_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8555697Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_max_pool1d_cuda_float32 SKIPPED [0.0011s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8556477Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_conv1d_cuda_float32 SKIPPED [0.0002s] (Test is disabled because an issue exists disabling it: https://github.com/pytorch/pytorch/issues/155822 for platform(s) linux, mac, macos, rocm, win. If you're seeing this on your local machine and would like to enable this test, please make sure CI is not set and you are not using the flag --import-disabled-tests.) [ 1%] 2025-12-04T13:35:25.8557233Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_cross_entropy_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8557639Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_dropout2d_cuda_float32 SKIPPED [0.0001s] (output is non-deterministic) [ 1%] 2025-12-04T13:35:25.8558010Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_dropout3d_cuda_float32 SKIPPED [0.0001s] (output is non-deterministic) [ 1%] 2025-12-04T13:35:25.8558388Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_embedding_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 1%] 2025-12-04T13:35:25.8558764Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_hardtanh_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8559223Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_nearest-exact_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8559674Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_kl_div_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8560112Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool2d_grad_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8560562Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool3d_grad_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8561037Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_nll_loss_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8561480Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_normalize_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8561914Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_constant_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8562367Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_scaled_dot_product_attention_cuda_float32 SKIPPED [0.0002s] (output is non-deterministic (when dropout_p > 0)) [ 1%] 2025-12-04T13:35:25.8562808Z test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_unfold_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8563220Z test_ops.py::TestCommonCUDA::test_compare_cpu_norm_fro_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8563619Z test_ops.py::TestCommonCUDA::test_compare_cpu_quantile_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8563999Z test_ops.py::TestCommonCUDA::test_compare_cpu_rand_like_cuda_float32 SKIPPED [0.0001s] (output is non-deterministic) [ 1%] 2025-12-04T13:35:25.8564359Z test_ops.py::TestCommonCUDA::test_compare_cpu_resize__cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8564744Z test_ops.py::TestCommonCUDA::test_compare_cpu_rot90_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8565131Z test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8565539Z test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_reduce_mean_cuda_float32 SKIPPED [0.0011s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8565940Z test_ops.py::TestCommonCUDA::test_compare_cpu_short_cuda_float32 SKIPPED [0.0001s] (Overflow when downcasting signed type is undefined) [ 1%] 2025-12-04T13:35:25.8566321Z test_ops.py::TestCommonCUDA::test_compare_cpu_slice_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8566740Z test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_t_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8567195Z test_ops.py::TestCommonCUDA::test_compare_cpu_special_laguerre_polynomial_l_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8567624Z test_ops.py::TestCommonCUDA::test_compare_cpu_split_list_args_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8568038Z test_ops.py::TestCommonCUDA::test_compare_cpu_split_with_sizes_copy_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8568565Z test_ops.py::TestCommonCUDA::test_compare_cpu_std_mean_unbiased_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8568966Z test_ops.py::TestCommonCUDA::test_compare_cpu_sum_to_size_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8569352Z test_ops.py::TestCommonCUDA::test_compare_cpu_t_copy_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8569734Z test_ops.py::TestCommonCUDA::test_compare_cpu_topk_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8570139Z test_ops.py::TestCommonCUDA::test_compare_cpu_torch__scaled_mm_v2_cuda_float8_e4m3fn SKIPPED [0.0005s] (Requires CUDA SM >= 8.9) [ 1%] 2025-12-04T13:35:25.8570533Z test_ops.py::TestCommonCUDA::test_compare_cpu_transpose_copy_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8570936Z test_ops.py::TestCommonCUDA::test_compare_cpu_trapezoid_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8571322Z test_ops.py::TestCommonCUDA::test_compare_cpu_tril_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8571709Z test_ops.py::TestCommonCUDA::test_compare_cpu_true_divide_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8572071Z test_ops.py::TestCommonCUDA::test_compare_cpu_uniform_cuda_float32 SKIPPED [0.0001s] (output is non-deterministic) [ 1%] 2025-12-04T13:35:25.8572435Z test_ops.py::TestCommonCUDA::test_compare_cpu_unsafe_split_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8572854Z test_ops.py::TestCommonCUDA::test_compare_cpu_var_mean_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8573237Z test_ops.py::TestCommonCUDA::test_compare_cpu_vdot_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8573633Z test_ops.py::TestCommonCUDA::test_compare_cpu_view_as_complex_cuda_float32 SKIPPED [0.0010s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8574029Z test_ops.py::TestCommonCUDA::test_compare_cpu_vsplit_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8574417Z test_ops.py::TestCommonCUDA::test_compare_cpu_xlogy_cuda_float32 SKIPPED [0.0009s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 1%] 2025-12-04T13:35:25.8574768Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing___getitem___cuda_complex32 PASSED [1.0654s] [ 1%] 2025-12-04T13:35:25.8575069Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_abs_cuda_complex32 PASSED [0.8105s] [ 1%] 2025-12-04T13:35:25.8575369Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_alias_copy_cuda_complex32 PASSED [0.7711s] [ 1%] 2025-12-04T13:35:25.8575727Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_cuda_complex32 SKIPPED [0.0002s] (Errors when storage_offset is included) [ 1%] 2025-12-04T13:35:25.8576099Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_partial_views_cuda_complex32 XFAIL [0.0950s] [ 1%] 2025-12-04T13:35:25.8576419Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_bfloat16_cuda_complex32 PASSED [1.5586s] [ 1%] 2025-12-04T13:35:25.8576718Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_chalf_cuda_complex32 PASSED [0.7793s] [ 1%] 2025-12-04T13:35:25.8577015Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_clone_cuda_complex32 PASSED [0.7501s] [ 1%] 2025-12-04T13:35:25.8577311Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_diag_embed_cuda_complex32 PASSED [0.7684s] [ 1%] 2025-12-04T13:35:25.8577616Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_double_cuda_complex32 PASSED [0.7682s] [ 1%] 2025-12-04T13:35:25.8577910Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_dstack_cuda_complex32 PASSED [0.7614s] [ 1%] 2025-12-04T13:35:25.8578279Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_empty_cuda_complex32 SKIPPED [0.0002s] (Expected: empty is not comparable) [ 1%] 2025-12-04T13:35:25.8578642Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_empty_like_cuda_complex32 SKIPPED [0.0002s] (Skipped!) [ 1%] 2025-12-04T13:35:25.8578991Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fftn_cuda_complex32 PASSED [7.5872s] [ 2%] 2025-12-04T13:35:25.8579312Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fftshift_cuda_complex32 PASSED [0.7765s] [ 2%] 2025-12-04T13:35:25.8579615Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_hfft_cuda_complex32 PASSED [3.2430s] [ 2%] 2025-12-04T13:35:25.8579916Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_ifft2_cuda_complex32 PASSED [2.0237s] [ 2%] 2025-12-04T13:35:25.8580222Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_ifftshift_cuda_complex32 PASSED [0.7619s] [ 2%] 2025-12-04T13:35:25.8580529Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_irfftn_cuda_complex32 PASSED [1.7262s] [ 2%] 2025-12-04T13:35:25.8580825Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_full_cuda_complex32 PASSED [0.7714s] [ 2%] 2025-12-04T13:35:25.8581121Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_isfinite_cuda_complex32 PASSED [0.7656s] [ 2%] 2025-12-04T13:35:25.8581433Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_isreal_cuda_complex32 PASSED [0.7711s] [ 2%] 2025-12-04T13:35:25.8581723Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_long_cuda_complex32 PASSED [0.7596s] [ 2%] 2025-12-04T13:35:25.8582016Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nanmean_cuda_complex32 PASSED [0.0435s] [ 2%] 2025-12-04T13:35:25.8582314Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_new_ones_cuda_complex32 PASSED [0.7685s] [ 2%] 2025-12-04T13:35:25.8582644Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nonzero_static_cuda_complex32 SKIPPED [0.0011s] (Only runs on cpu) [ 2%] 2025-12-04T13:35:25.8582976Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_ones_like_cuda_complex32 PASSED [0.7722s] [ 2%] 2025-12-04T13:35:25.8583277Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_permute_cuda_complex32 PASSED [0.7477s] [ 2%] 2025-12-04T13:35:25.8583570Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_prod_cuda_complex32 PASSED [0.0215s] [ 2%] 2025-12-04T13:35:25.8583936Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_rand_like_cuda_complex32 SKIPPED [0.0002s] (Expected: randn_like is not comparable between dtypes) [ 2%] 2025-12-04T13:35:25.8584299Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_real_cuda_complex32 PASSED [0.7630s] [ 2%] 2025-12-04T13:35:25.8584598Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_resolve_neg_cuda_complex32 PASSED [0.0061s] [ 2%] 2025-12-04T13:35:25.8584913Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_split_with_sizes_cuda_complex32 PASSED [0.7620s] [ 2%] 2025-12-04T13:35:25.8585215Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sqrt_cuda_complex32 PASSED [0.0082s] [ 2%] 2025-12-04T13:35:25.8585502Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sub_cuda_complex32 PASSED [0.7832s] [ 2%] 2025-12-04T13:35:25.8585792Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_tan_cuda_complex32 PASSED [0.0115s] [ 2%] 2025-12-04T13:35:25.8586080Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_trace_cuda_complex32 PASSED [0.7669s] [ 2%] 2025-12-04T13:35:25.8586368Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_triu_cuda_complex32 PASSED [0.0131s] [ 2%] 2025-12-04T13:35:25.8586665Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unsafe_chunk_cuda_complex32 PASSED [0.7822s] [ 2%] 2025-12-04T13:35:25.8586978Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unsqueeze_copy_cuda_complex32 PASSED [0.0068s] [ 2%] 2025-12-04T13:35:25.8587284Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_vsplit_cuda_complex32 PASSED [0.7806s] [ 2%] 2025-12-04T13:35:25.8587604Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_vstack_cuda_complex32 PASSED [0.0047s] [ 2%] 2025-12-04T13:35:25.8587907Z test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_where_cuda_complex32 PASSED [0.7877s] [ 2%] 2025-12-04T13:35:25.8588204Z test_ops.py::TestCommonCUDA::test_dtypes_H_cuda PASSED [0.7694s] [ 2%] 2025-12-04T13:35:25.8588420Z test_ops.py::TestCommonCUDA::test_dtypes___radd___cuda PASSED [0.7902s] [ 2%] 2025-12-04T13:35:25.8588631Z test_ops.py::TestCommonCUDA::test_dtypes___rpow___cuda PASSED [0.9155s] [ 2%] 2025-12-04T13:35:25.8588843Z test_ops.py::TestCommonCUDA::test_dtypes___rsub___cuda PASSED [0.8053s] [ 2%] 2025-12-04T13:35:25.8589058Z test_ops.py::TestCommonCUDA::test_dtypes__chunk_cat_cuda PASSED [0.7998s] [ 2%] 2025-12-04T13:35:25.8589297Z test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_double_cuda PASSED [0.7916s] [ 2%] 2025-12-04T13:35:25.8589551Z test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_float_cuda PASSED [0.7994s] [ 2%] 2025-12-04T13:35:25.8589787Z test_ops.py::TestCommonCUDA::test_dtypes__refs_abs_cuda PASSED [0.7830s] [ 2%] 2025-12-04T13:35:25.8590021Z test_ops.py::TestCommonCUDA::test_dtypes__refs_acos_cuda PASSED [0.7977s] [ 2%] 2025-12-04T13:35:25.8590240Z test_ops.py::TestCommonCUDA::test_dtypes__refs_addcmul_cuda PASSED [0.8483s] [ 2%] 2025-12-04T13:35:25.8590460Z test_ops.py::TestCommonCUDA::test_dtypes__refs_addr_cuda PASSED [0.7859s] [ 2%] 2025-12-04T13:35:25.8590683Z test_ops.py::TestCommonCUDA::test_dtypes__refs_alias_copy_cuda PASSED [0.7536s] [ 2%] 2025-12-04T13:35:25.8590925Z test_ops.py::TestCommonCUDA::test_dtypes__refs_as_strided_scatter_cuda PASSED [0.7597s] [ 2%] 2025-12-04T13:35:25.8591165Z test_ops.py::TestCommonCUDA::test_dtypes__refs_atleast_3d_cuda PASSED [0.7715s] [ 2%] 2025-12-04T13:35:25.8591395Z test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_or_cuda PASSED [0.7662s] [ 2%] 2025-12-04T13:35:25.8591625Z test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_xor_cuda PASSED [0.7547s] [ 2%] 2025-12-04T13:35:25.8591858Z test_ops.py::TestCommonCUDA::test_dtypes__refs_clamp_max_cuda PASSED [0.7999s] [ 2%] 2025-12-04T13:35:25.8592088Z test_ops.py::TestCommonCUDA::test_dtypes__refs_clamp_min_cuda PASSED [0.7539s] [ 2%] 2025-12-04T13:35:25.8592320Z test_ops.py::TestCommonCUDA::test_dtypes__refs_conj_physical_cuda PASSED [0.7287s] [ 2%] 2025-12-04T13:35:25.8592548Z test_ops.py::TestCommonCUDA::test_dtypes__refs_cosh_cuda PASSED [0.7494s] [ 2%] 2025-12-04T13:35:25.8592781Z test_ops.py::TestCommonCUDA::test_dtypes__refs_div_floor_rounding_cuda PASSED [0.8844s] [ 2%] 2025-12-04T13:35:25.8593019Z test_ops.py::TestCommonCUDA::test_dtypes__refs_empty_like_cuda PASSED [0.7623s] [ 2%] 2025-12-04T13:35:25.8593252Z test_ops.py::TestCommonCUDA::test_dtypes__refs_empty_strided_cuda PASSED [0.7413s] [ 2%] 2025-12-04T13:35:25.8593477Z test_ops.py::TestCommonCUDA::test_dtypes__refs_erf_cuda PASSED [0.7569s] [ 2%] 2025-12-04T13:35:25.8593719Z test_ops.py::TestCommonCUDA::test_dtypes__refs_exponential_cuda SKIPPED [0.0002s] (Skipped!) [ 2%] 2025-12-04T13:35:25.8593967Z test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_fftn_cuda PASSED [8.5552s] [ 2%] 2025-12-04T13:35:25.8594200Z test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_fftshift_cuda PASSED [1.2216s] [ 2%] 2025-12-04T13:35:25.8594436Z test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_ihfft2_cuda PASSED [7.0667s] [ 2%] 2025-12-04T13:35:25.8594668Z test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_rfft_cuda PASSED [1.7840s] [ 2%] 2025-12-04T13:35:25.8594898Z test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_rfftn_cuda PASSED [1.8124s] [ 2%] 2025-12-04T13:35:25.8595126Z test_ops.py::TestCommonCUDA::test_dtypes__refs_flip_cuda PASSED [0.7535s] [ 2%] 2025-12-04T13:35:25.8595350Z test_ops.py::TestCommonCUDA::test_dtypes__refs_floor_cuda PASSED [0.7453s] [ 2%] 2025-12-04T13:35:25.8595574Z test_ops.py::TestCommonCUDA::test_dtypes__refs_gcd_cuda PASSED [1.0679s] [ 2%] 2025-12-04T13:35:25.8595849Z test_ops.py::TestCommonCUDA::test_dtypes__refs_geometric_cuda SKIPPED [0.0002s] (Skipped!) [ 2%] 2025-12-04T13:35:25.8596113Z test_ops.py::TestCommonCUDA::test_dtypes__refs_hstack_cuda PASSED [0.7397s] [ 2%] 2025-12-04T13:35:25.8596345Z test_ops.py::TestCommonCUDA::test_dtypes__refs_index_copy_cuda PASSED [0.8325s] [ 2%] 2025-12-04T13:35:25.8596576Z test_ops.py::TestCommonCUDA::test_dtypes__refs_lgamma_cuda PASSED [0.7453s] [ 2%] 2025-12-04T13:35:25.8596809Z test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_cross_cuda PASSED [0.7500s] [ 2%] 2025-12-04T13:35:25.8597042Z test_ops.py::TestCommonCUDA::test_dtypes__refs_log2_cuda PASSED [0.8833s] [ 2%] 2025-12-04T13:35:25.8597262Z test_ops.py::TestCommonCUDA::test_dtypes__refs_log_cuda PASSED [0.7410s] [ 3%] 2025-12-04T13:35:25.8597488Z test_ops.py::TestCommonCUDA::test_dtypes__refs_log_normal_cuda PASSED [0.7685s] [ 3%] 2025-12-04T13:35:25.8597723Z test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_not_cuda PASSED [0.7437s] [ 3%] 2025-12-04T13:35:25.8597981Z test_ops.py::TestCommonCUDA::test_dtypes__refs_meshgrid_variadic_tensors_cuda PASSED [0.7791s] [ 3%] 2025-12-04T13:35:25.8598288Z test_ops.py::TestCommonCUDA::test_dtypes__refs_native_layer_norm_cuda PASSED [0.8399s] [ 3%] 2025-12-04T13:35:25.8598542Z test_ops.py::TestCommonCUDA::test_dtypes__refs_neg_cuda PASSED [0.7396s] [ 3%] 2025-12-04T13:35:25.8598778Z test_ops.py::TestCommonCUDA::test_dtypes__refs_new_empty_strided_cuda PASSED [0.7697s] [ 3%] 2025-12-04T13:35:25.8599042Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_alpha_dropout_cuda PASSED [0.7695s] [ 3%] 2025-12-04T13:35:25.8599314Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_huber_loss_cuda PASSED [0.7842s] [ 3%] 2025-12-04T13:35:25.8599596Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_log_softmax_with_dtype_cuda PASSED [0.7711s] [ 3%] 2025-12-04T13:35:25.8599879Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_nll_loss_cuda PASSED [1.0053s] [ 3%] 2025-12-04T13:35:25.8600156Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_poisson_nll_loss_cuda PASSED [0.9748s] [ 3%] 2025-12-04T13:35:25.8600432Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_prelu_cuda PASSED [0.7957s] [ 3%] 2025-12-04T13:35:25.8600696Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_softplus_cuda PASSED [0.7579s] [ 3%] 2025-12-04T13:35:25.8600975Z test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_triplet_margin_loss_cuda PASSED [0.7882s] [ 3%] 2025-12-04T13:35:25.8601229Z test_ops.py::TestCommonCUDA::test_dtypes__refs_norm_cuda PASSED [0.8318s] [ 3%] 2025-12-04T13:35:25.8601449Z test_ops.py::TestCommonCUDA::test_dtypes__refs_ones_cuda PASSED [0.7368s] [ 3%] 2025-12-04T13:35:25.8601668Z test_ops.py::TestCommonCUDA::test_dtypes__refs_prod_cuda PASSED [0.8710s] [ 3%] 2025-12-04T13:35:25.8601886Z test_ops.py::TestCommonCUDA::test_dtypes__refs_real_cuda PASSED [0.7485s] [ 3%] 2025-12-04T13:35:25.8602109Z test_ops.py::TestCommonCUDA::test_dtypes__refs_reshape_cuda PASSED [0.7662s] [ 3%] 2025-12-04T13:35:25.8602337Z test_ops.py::TestCommonCUDA::test_dtypes__refs_rsqrt_cuda PASSED [0.7462s] [ 3%] 2025-12-04T13:35:25.8602560Z test_ops.py::TestCommonCUDA::test_dtypes__refs_sinh_cuda PASSED [0.9469s] [ 3%] 2025-12-04T13:35:25.8602795Z test_ops.py::TestCommonCUDA::test_dtypes__refs_special_bessel_j0_cuda PASSED [0.7661s] [ 3%] 2025-12-04T13:35:25.8603045Z test_ops.py::TestCommonCUDA::test_dtypes__refs_special_bessel_j1_cuda PASSED [0.7490s] [ 3%] 2025-12-04T13:35:25.8603289Z test_ops.py::TestCommonCUDA::test_dtypes__refs_special_ndtr_cuda PASSED [0.7438s] [ 3%] 2025-12-04T13:35:25.8603532Z test_ops.py::TestCommonCUDA::test_dtypes__refs_squeeze_multiple_cuda PASSED [0.7498s] [ 3%] 2025-12-04T13:35:25.8603768Z test_ops.py::TestCommonCUDA::test_dtypes__refs_stack_cuda PASSED [0.7617s] [ 3%] 2025-12-04T13:35:25.8603989Z test_ops.py::TestCommonCUDA::test_dtypes__refs_tanh_cuda PASSED [0.7451s] [ 3%] 2025-12-04T13:35:25.8604214Z test_ops.py::TestCommonCUDA::test_dtypes__refs_transpose_cuda PASSED [0.7603s] [ 3%] 2025-12-04T13:35:25.8604485Z test_ops.py::TestCommonCUDA::test_dtypes__refs_true_divide_cuda PASSED [0.7732s] [ 3%] 2025-12-04T13:35:25.8604748Z test_ops.py::TestCommonCUDA::test_dtypes__refs_unbind_cuda PASSED [0.7343s] [ 3%] 2025-12-04T13:35:25.8604978Z test_ops.py::TestCommonCUDA::test_dtypes__refs_view_copy_cuda PASSED [0.7211s] [ 3%] 2025-12-04T13:35:25.8605222Z test_ops.py::TestCommonCUDA::test_dtypes__segment_reduce_offsets_cuda PASSED [1.0040s] [ 3%] 2025-12-04T13:35:25.8605456Z test_ops.py::TestCommonCUDA::test_dtypes_abs_cuda PASSED [0.7346s] [ 3%] 2025-12-04T13:35:25.8605673Z test_ops.py::TestCommonCUDA::test_dtypes_acosh_cuda PASSED [0.7416s] [ 3%] 2025-12-04T13:35:25.8605889Z test_ops.py::TestCommonCUDA::test_dtypes_add_cuda PASSED [0.7613s] [ 3%] 2025-12-04T13:35:25.8606105Z test_ops.py::TestCommonCUDA::test_dtypes_alias_copy_cuda PASSED [0.7191s] [ 3%] 2025-12-04T13:35:25.8606321Z test_ops.py::TestCommonCUDA::test_dtypes_argmin_cuda PASSED [0.7416s] [ 3%] 2025-12-04T13:35:25.8606546Z test_ops.py::TestCommonCUDA::test_dtypes_as_strided_copy_cuda PASSED [0.7265s] [ 3%] 2025-12-04T13:35:25.8606788Z test_ops.py::TestCommonCUDA::test_dtypes_as_strided_cuda PASSED [0.7360s] [ 3%] 2025-12-04T13:35:25.8607007Z test_ops.py::TestCommonCUDA::test_dtypes_baddbmm_cuda PASSED [8.1871s] [ 3%] 2025-12-04T13:35:25.8607217Z test_ops.py::TestCommonCUDA::test_dtypes_bfloat16_cuda PASSED [0.7267s] [ 3%] 2025-12-04T13:35:25.8607444Z test_ops.py::TestCommonCUDA::test_dtypes_bitwise_right_shift_cuda PASSED [0.7601s] [ 3%] 2025-12-04T13:35:25.8607687Z test_ops.py::TestCommonCUDA::test_dtypes_broadcast_tensors_cuda PASSED [0.7551s] [ 3%] 2025-12-04T13:35:25.8607917Z test_ops.py::TestCommonCUDA::test_dtypes_cdist_cuda PASSED [1.3013s] [ 3%] 2025-12-04T13:35:25.8608165Z test_ops.py::TestCommonCUDA::test_dtypes_chalf_cuda PASSED [0.7400s] [ 3%] 2025-12-04T13:35:25.8608389Z test_ops.py::TestCommonCUDA::test_dtypes_cholesky_inverse_cuda PASSED [1.9623s] [ 3%] 2025-12-04T13:35:25.8608628Z test_ops.py::TestCommonCUDA::test_dtypes_cholesky_solve_cuda PASSED [0.7544s] [ 3%] 2025-12-04T13:35:25.8608856Z test_ops.py::TestCommonCUDA::test_dtypes_clamp_min_cuda PASSED [0.7387s] [ 3%] 2025-12-04T13:35:25.8609071Z test_ops.py::TestCommonCUDA::test_dtypes_cummin_cuda PASSED [0.7562s] [ 3%] 2025-12-04T13:35:25.8609303Z test_ops.py::TestCommonCUDA::test_dtypes_cumulative_trapezoid_cuda PASSED [0.7635s] [ 3%] 2025-12-04T13:35:25.8609539Z test_ops.py::TestCommonCUDA::test_dtypes_deg2rad_cuda PASSED [0.7227s] [ 3%] 2025-12-04T13:35:25.8609762Z test_ops.py::TestCommonCUDA::test_dtypes_diagonal_copy_cuda PASSED [0.7568s] [ 3%] 2025-12-04T13:35:25.8609999Z test_ops.py::TestCommonCUDA::test_dtypes_diagonal_scatter_cuda PASSED [0.0493s] [ 3%] 2025-12-04T13:35:25.8610227Z test_ops.py::TestCommonCUDA::test_dtypes_digamma_cuda PASSED [0.7446s] [ 3%] 2025-12-04T13:35:25.8610456Z test_ops.py::TestCommonCUDA::test_dtypes_div_floor_rounding_cuda PASSED [0.7694s] [ 3%] 2025-12-04T13:35:25.8610703Z test_ops.py::TestCommonCUDA::test_dtypes_div_no_rounding_mode_cuda PASSED [0.7665s] [ 3%] 2025-12-04T13:35:25.8610935Z test_ops.py::TestCommonCUDA::test_dtypes_dstack_cuda PASSED [0.7264s] [ 3%] 2025-12-04T13:35:25.8611150Z test_ops.py::TestCommonCUDA::test_dtypes_empty_cuda PASSED [0.0219s] [ 3%] 2025-12-04T13:35:25.8611373Z test_ops.py::TestCommonCUDA::test_dtypes_empty_strided_cuda PASSED [0.7167s] [ 3%] 2025-12-04T13:35:25.8611596Z test_ops.py::TestCommonCUDA::test_dtypes_equal_cuda PASSED [0.7379s] [ 3%] 2025-12-04T13:35:25.8611811Z test_ops.py::TestCommonCUDA::test_dtypes_fft_fft2_cuda PASSED [1.4143s] [ 3%] 2025-12-04T13:35:25.8612025Z test_ops.py::TestCommonCUDA::test_dtypes_fft_fft_cuda PASSED [0.7981s] [ 3%] 2025-12-04T13:35:25.8612240Z test_ops.py::TestCommonCUDA::test_dtypes_fft_rfftn_cuda PASSED [0.7444s] [ 3%] 2025-12-04T13:35:25.8612454Z test_ops.py::TestCommonCUDA::test_dtypes_fill_cuda PASSED [0.7330s] [ 3%] 2025-12-04T13:35:25.8612708Z test_ops.py::TestCommonCUDA::test_dtypes_flatten_cuda PASSED [0.7388s] [ 3%] 2025-12-04T13:35:25.8612939Z test_ops.py::TestCommonCUDA::test_dtypes_fliplr_cuda PASSED [0.7397s] [ 3%] 2025-12-04T13:35:25.8613162Z test_ops.py::TestCommonCUDA::test_dtypes_float_power_cuda PASSED [0.8432s] [ 3%] 2025-12-04T13:35:25.8613383Z test_ops.py::TestCommonCUDA::test_dtypes_frac_cuda PASSED [0.7506s] [ 3%] 2025-12-04T13:35:25.8613598Z test_ops.py::TestCommonCUDA::test_dtypes_geometric_cuda PASSED [0.7413s] [ 4%] 2025-12-04T13:35:25.8613815Z test_ops.py::TestCommonCUDA::test_dtypes_heaviside_cuda PASSED [0.7600s] [ 4%] 2025-12-04T13:35:25.8614030Z test_ops.py::TestCommonCUDA::test_dtypes_histc_cuda PASSED [0.8860s] [ 4%] 2025-12-04T13:35:25.8614244Z test_ops.py::TestCommonCUDA::test_dtypes_igammac_cuda PASSED [0.7357s] [ 4%] 2025-12-04T13:35:25.8614462Z test_ops.py::TestCommonCUDA::test_dtypes_index_fill_cuda PASSED [0.8899s] [ 4%] 2025-12-04T13:35:25.8614686Z test_ops.py::TestCommonCUDA::test_dtypes_isfinite_cuda PASSED [0.7375s] [ 4%] 2025-12-04T13:35:25.8614905Z test_ops.py::TestCommonCUDA::test_dtypes_isposinf_cuda PASSED [0.7137s] [ 4%] 2025-12-04T13:35:25.8615136Z test_ops.py::TestCommonCUDA::test_dtypes_lerp_cuda PASSED [1.4972s] [ 4%] 2025-12-04T13:35:25.8615361Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_cross_cuda PASSED [0.7388s] [ 4%] 2025-12-04T13:35:25.8615586Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_eig_cuda PASSED [1.5568s] [ 4%] 2025-12-04T13:35:25.8615808Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_eigh_cuda PASSED [1.2779s] [ 4%] 2025-12-04T13:35:25.8616041Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_ldl_factor_cuda PASSED [1.3495s] [ 4%] 2025-12-04T13:35:25.8616296Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_lstsq_grad_oriented_cuda PASSED [1.7406s] [ 4%] 2025-12-04T13:35:25.8616539Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_lu_cuda PASSED [1.5809s] [ 4%] 2025-12-04T13:35:25.8616760Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_norm_cuda PASSED [1.4557s] [ 4%] 2025-12-04T13:35:25.8616981Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_qr_cuda PASSED [1.3439s] [ 4%] 2025-12-04T13:35:25.8617206Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_svdvals_cuda PASSED [1.2778s] [ 4%] 2025-12-04T13:35:25.8617436Z test_ops.py::TestCommonCUDA::test_dtypes_linalg_vecdot_cuda PASSED [1.3191s] [ 4%] 2025-12-04T13:35:25.8617659Z test_ops.py::TestCommonCUDA::test_dtypes_log2_cuda PASSED [1.2148s] [ 4%] 2025-12-04T13:35:25.8617873Z test_ops.py::TestCommonCUDA::test_dtypes_log_cuda PASSED [1.2194s] [ 4%] 2025-12-04T13:35:25.8618120Z test_ops.py::TestCommonCUDA::test_dtypes_log_softmax_cuda PASSED [1.2511s] [ 4%] 2025-12-04T13:35:25.8618344Z test_ops.py::TestCommonCUDA::test_dtypes_logical_and_cuda PASSED [1.2450s] [ 4%] 2025-12-04T13:35:25.8618568Z test_ops.py::TestCommonCUDA::test_dtypes_logical_not_cuda PASSED [1.2004s] [ 4%] 2025-12-04T13:35:25.8618786Z test_ops.py::TestCommonCUDA::test_dtypes_long_cuda PASSED [1.1857s] [ 4%] 2025-12-04T13:35:25.8619002Z test_ops.py::TestCommonCUDA::test_dtypes_lu_cuda PASSED [1.3226s] [ 4%] 2025-12-04T13:35:25.8619218Z test_ops.py::TestCommonCUDA::test_dtypes_mT_cuda PASSED [1.2106s] [ 4%] 2025-12-04T13:35:25.8619439Z test_ops.py::TestCommonCUDA::test_dtypes_masked_cumsum_cuda PASSED [1.2566s] [ 4%] 2025-12-04T13:35:25.8619673Z test_ops.py::TestCommonCUDA::test_dtypes_masked_logsumexp_cuda PASSED [1.4631s] [ 4%] 2025-12-04T13:35:25.8619905Z test_ops.py::TestCommonCUDA::test_dtypes_masked_norm_cuda PASSED [2.2615s] [ 4%] 2025-12-04T13:35:25.8620128Z test_ops.py::TestCommonCUDA::test_dtypes_masked_select_cuda PASSED [1.3448s] [ 4%] 2025-12-04T13:35:25.8620366Z test_ops.py::TestCommonCUDA::test_dtypes_max_reduction_no_dim_cuda PASSED [1.1958s] [ 4%] 2025-12-04T13:35:25.8620614Z test_ops.py::TestCommonCUDA::test_dtypes_mvlgamma_mvlgamma_p_5_cuda PASSED [1.2508s] [ 4%] 2025-12-04T13:35:25.8620850Z test_ops.py::TestCommonCUDA::test_dtypes_nanquantile_cuda PASSED [1.3497s] [ 4%] 2025-12-04T13:35:25.8621106Z test_ops.py::TestCommonCUDA::test_dtypes_narrow_copy_cuda PASSED [1.2120s] [ 4%] 2025-12-04T13:35:25.8621337Z test_ops.py::TestCommonCUDA::test_dtypes_ne_cuda PASSED [1.2218s] [ 4%] 2025-12-04T13:35:25.8621560Z test_ops.py::TestCommonCUDA::test_dtypes_new_empty_strided_cuda PASSED [1.2299s] [ 4%] 2025-12-04T13:35:25.8621786Z test_ops.py::TestCommonCUDA::test_dtypes_new_full_cuda PASSED [1.2028s] [ 4%] 2025-12-04T13:35:25.8621998Z test_ops.py::TestCommonCUDA::test_dtypes_new_ones_cuda PASSED [1.1984s] [ 4%] 2025-12-04T13:35:25.8622244Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_avg_pool2d_cuda PASSED [1.2556s] [ 4%] 2025-12-04T13:35:25.8622517Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_avg_pool2d_cuda PASSED [1.2094s] [ 4%] 2025-12-04T13:35:25.8622789Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_binary_cross_entropy_cuda PASSED [1.2349s] [ 4%] 2025-12-04T13:35:25.8623073Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv_transpose3d_cuda PASSED [2.3884s] [ 4%] 2025-12-04T13:35:25.8623370Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_feature_alpha_dropout_without_train_cuda PASSED [1.2257s] [ 4%] 2025-12-04T13:35:25.8623690Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_fractional_max_pool2d_cuda PASSED [1.3328s] [ 4%] 2025-12-04T13:35:25.8623966Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_gaussian_nll_loss_cuda PASSED [2.3519s] [ 4%] 2025-12-04T13:35:25.8624229Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_grid_sample_cuda PASSED [1.2614s] [ 4%] 2025-12-04T13:35:25.8624486Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_group_norm_cuda PASSED [1.3020s] [ 4%] 2025-12-04T13:35:25.8624736Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_hardtanh_cuda PASSED [1.2199s] [ 4%] 2025-12-04T13:35:25.8624989Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_huber_loss_cuda PASSED [1.2984s] [ 4%] 2025-12-04T13:35:25.8625255Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_bicubic_cuda PASSED [1.2301s] [ 4%] 2025-12-04T13:35:25.8625518Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_layer_norm_cuda PASSED [1.2104s] [ 4%] 2025-12-04T13:35:25.8625772Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool1d_cuda PASSED [1.4068s] [ 4%] 2025-12-04T13:35:25.8626035Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool1d_grad_cuda PASSED [1.2742s] [ 4%] 2025-12-04T13:35:25.8626296Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool2d_cuda PASSED [1.5271s] [ 4%] 2025-12-04T13:35:25.8626557Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multi_margin_loss_cuda PASSED [1.2410s] [ 4%] 2025-12-04T13:35:25.8626843Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multilabel_soft_margin_loss_cuda PASSED [1.2530s] [ 4%] 2025-12-04T13:35:25.8627122Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_nll_loss_cuda PASSED [1.3390s] [ 4%] 2025-12-04T13:35:25.8627379Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_normalize_cuda PASSED [1.2369s] [ 4%] 2025-12-04T13:35:25.8627643Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pixel_unshuffle_cuda PASSED [1.2044s] [ 4%] 2025-12-04T13:35:25.8627906Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_rms_norm_cuda PASSED [1.2281s] [ 4%] 2025-12-04T13:35:25.8628193Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_rrelu_cuda PASSED [1.2082s] [ 4%] 2025-12-04T13:35:25.8628443Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_silu_complex_cuda XFAIL [0.0298s] [ 4%] 2025-12-04T13:35:25.8628689Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_silu_cuda PASSED [2.4090s] [ 4%] 2025-12-04T13:35:25.8628966Z test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_triplet_margin_with_distance_loss_cuda PASSED [1.2437s] [ 4%] 2025-12-04T13:35:25.8629265Z test_ops.py::TestCommonCUDA::test_dtypes_nonzero_static_cuda SKIPPED [0.0012s] (Only runs on cpu) [ 4%] 2025-12-04T13:35:25.8629547Z test_ops.py::TestCommonCUDA::test_dtypes_norm_inf_cuda PASSED [1.2170s] [ 4%] 2025-12-04T13:35:25.8629779Z test_ops.py::TestCommonCUDA::test_dtypes_norm_nuc_cuda PASSED [1.1952s] [ 4%] 2025-12-04T13:35:25.8629994Z test_ops.py::TestCommonCUDA::test_dtypes_normal_cuda PASSED [1.2085s] [ 4%] 2025-12-04T13:35:25.8630219Z test_ops.py::TestCommonCUDA::test_dtypes_normal_number_mean_cuda PASSED [1.1998s] [ 4%] 2025-12-04T13:35:25.8630443Z test_ops.py::TestCommonCUDA::test_dtypes_ones_cuda PASSED [1.1962s] [ 5%] 2025-12-04T13:35:25.8630690Z test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_2_cuda SKIPPED [0.0002s] (Skipped!) [ 5%] 2025-12-04T13:35:25.8630938Z test_ops.py::TestCommonCUDA::test_dtypes_quantile_cuda PASSED [1.3153s] [ 5%] 2025-12-04T13:35:25.8631150Z test_ops.py::TestCommonCUDA::test_dtypes_rad2deg_cuda PASSED [1.1845s] [ 5%] 2025-12-04T13:35:25.8631360Z test_ops.py::TestCommonCUDA::test_dtypes_real_cuda PASSED [1.2052s] [ 5%] 2025-12-04T13:35:25.8631572Z test_ops.py::TestCommonCUDA::test_dtypes_renorm_cuda PASSED [1.2329s] [ 5%] 2025-12-04T13:35:25.8631782Z test_ops.py::TestCommonCUDA::test_dtypes_roll_cuda PASSED [1.2224s] [ 5%] 2025-12-04T13:35:25.8632005Z test_ops.py::TestCommonCUDA::test_dtypes_rot90_cuda PASSED [1.3329s] [ 5%] 2025-12-04T13:35:25.8632214Z test_ops.py::TestCommonCUDA::test_dtypes_rsub_cuda PASSED [1.2924s] [ 5%] 2025-12-04T13:35:25.8632429Z test_ops.py::TestCommonCUDA::test_dtypes_scalar_tensor_cuda PASSED [1.2210s] [ 5%] 2025-12-04T13:35:25.8632650Z test_ops.py::TestCommonCUDA::test_dtypes_scatter_add_cuda PASSED [1.2527s] [ 5%] 2025-12-04T13:35:25.8632879Z test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_mean_cuda PASSED [1.2765s] [ 5%] 2025-12-04T13:35:25.8633115Z test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_sum_cuda PASSED [1.2814s] [ 5%] 2025-12-04T13:35:25.8633339Z test_ops.py::TestCommonCUDA::test_dtypes_select_cuda PASSED [1.2385s] [ 5%] 2025-12-04T13:35:25.8633581Z test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_general_cosine_cuda PASSED [1.2577s] [ 5%] 2025-12-04T13:35:25.8633821Z test_ops.py::TestCommonCUDA::test_dtypes_softmax_cuda PASSED [1.2472s] [ 5%] 2025-12-04T13:35:25.8634046Z test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_j1_cuda PASSED [1.1938s] [ 5%] 2025-12-04T13:35:25.8634282Z test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_y0_cuda PASSED [1.2159s] [ 5%] 2025-12-04T13:35:25.8634515Z test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_y1_cuda PASSED [1.4159s] [ 5%] 2025-12-04T13:35:25.8634742Z test_ops.py::TestCommonCUDA::test_dtypes_special_erfcx_cuda PASSED [1.2145s] [ 5%] 2025-12-04T13:35:25.8634965Z test_ops.py::TestCommonCUDA::test_dtypes_special_ndtri_cuda PASSED [1.2350s] [ 5%] 2025-12-04T13:35:25.8635228Z test_ops.py::TestCommonCUDA::test_dtypes_special_polygamma_special_polygamma_n_0_cuda PASSED [1.2295s] [ 5%] 2025-12-04T13:35:25.8635519Z test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_t_cuda PASSED [1.2174s] [ 5%] 2025-12-04T13:35:25.8635812Z test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_u_cuda PASSED [1.2141s] [ 5%] 2025-12-04T13:35:25.8636172Z test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_v_cuda PASSED [1.2217s] [ 5%] 2025-12-04T13:35:25.8636432Z test_ops.py::TestCommonCUDA::test_dtypes_special_zeta_cuda PASSED [1.6118s] [ 5%] 2025-12-04T13:35:25.8636669Z test_ops.py::TestCommonCUDA::test_dtypes_split_with_sizes_copy_cuda PASSED [1.2321s] [ 5%] 2025-12-04T13:35:25.8636899Z test_ops.py::TestCommonCUDA::test_dtypes_square_cuda PASSED [1.2053s] [ 5%] 2025-12-04T13:35:25.8637111Z test_ops.py::TestCommonCUDA::test_dtypes_stack_cuda PASSED [1.2653s] [ 5%] 2025-12-04T13:35:25.8637321Z test_ops.py::TestCommonCUDA::test_dtypes_stft_cuda PASSED [2.7230s] [ 5%] 2025-12-04T13:35:25.8637531Z test_ops.py::TestCommonCUDA::test_dtypes_sub_cuda PASSED [1.2557s] [ 5%] 2025-12-04T13:35:25.8637740Z test_ops.py::TestCommonCUDA::test_dtypes_svd_cuda PASSED [1.7480s] [ 5%] 2025-12-04T13:35:25.8637988Z test_ops.py::TestCommonCUDA::test_dtypes_tanh_cuda PASSED [1.5730s] [ 5%] 2025-12-04T13:35:25.8638253Z test_ops.py::TestCommonCUDA::test_dtypes_to_cuda PASSED [1.2564s] [ 5%] 2025-12-04T13:35:25.8638500Z test_ops.py::TestCommonCUDA::test_dtypes_torch_ops_aten__safe_softmax_default_cuda PASSED [1.2235s] [ 5%] 2025-12-04T13:35:25.8638755Z test_ops.py::TestCommonCUDA::test_dtypes_transpose_copy_cuda PASSED [1.2382s] [ 5%] 2025-12-04T13:35:25.8638974Z test_ops.py::TestCommonCUDA::test_dtypes_trunc_cuda PASSED [1.1981s] [ 5%] 2025-12-04T13:35:25.8639191Z test_ops.py::TestCommonCUDA::test_dtypes_unbind_copy_cuda PASSED [1.2054s] [ 5%] 2025-12-04T13:35:25.8639405Z test_ops.py::TestCommonCUDA::test_dtypes_var_cuda PASSED [1.2699s] [ 5%] 2025-12-04T13:35:25.8639629Z test_ops.py::TestCommonCUDA::test_dtypes_var_mean_unbiased_cuda PASSED [1.2241s] [ 5%] 2025-12-04T13:35:25.8639861Z test_ops.py::TestCommonCUDA::test_dtypes_view_as_complex_cuda PASSED [1.1662s] [ 5%] 2025-12-04T13:35:25.8640084Z test_ops.py::TestCommonCUDA::test_dtypes_view_cuda PASSED [1.2278s] [ 5%] 2025-12-04T13:35:25.8640313Z test_ops.py::TestCommonCUDA::test_dtypes_zero__cuda PASSED [1.1968s] [ 5%] 2025-12-04T13:35:25.8640522Z test_ops.py::TestCommonCUDA::test_errors___rand___cuda PASSED [1.2274s] [ 5%] 2025-12-04T13:35:25.8640734Z test_ops.py::TestCommonCUDA::test_errors___rmod___cuda PASSED [0.0032s] [ 5%] 2025-12-04T13:35:25.8640948Z test_ops.py::TestCommonCUDA::test_errors__chunk_cat_cuda PASSED [1.1933s] [ 5%] 2025-12-04T13:35:25.8641164Z test_ops.py::TestCommonCUDA::test_errors_amax_cuda PASSED [1.2028s] [ 5%] 2025-12-04T13:35:25.8641386Z test_ops.py::TestCommonCUDA::test_errors_as_strided_scatter_cuda PASSED [0.0037s] [ 5%] 2025-12-04T13:35:25.8641612Z test_ops.py::TestCommonCUDA::test_errors_bitwise_and_cuda PASSED [0.0024s] [ 5%] 2025-12-04T13:35:25.8641825Z test_ops.py::TestCommonCUDA::test_errors_cat_cuda PASSED [1.1982s] [ 5%] 2025-12-04T13:35:25.8642036Z test_ops.py::TestCommonCUDA::test_errors_cauchy_cuda PASSED [0.0038s] [ 5%] 2025-12-04T13:35:25.8642247Z test_ops.py::TestCommonCUDA::test_errors_cov_cuda PASSED [1.1649s] [ 5%] 2025-12-04T13:35:25.8642468Z test_ops.py::TestCommonCUDA::test_errors_div_floor_rounding_cuda PASSED [0.0024s] [ 5%] 2025-12-04T13:35:25.8642706Z test_ops.py::TestCommonCUDA::test_errors_div_no_rounding_mode_cuda PASSED [0.0017s] [ 5%] 2025-12-04T13:35:25.8642931Z test_ops.py::TestCommonCUDA::test_errors_dot_cuda PASSED [1.2072s] [ 5%] 2025-12-04T13:35:25.8643141Z test_ops.py::TestCommonCUDA::test_errors_fft_fft_cuda PASSED [1.1853s] [ 5%] 2025-12-04T13:35:25.8643351Z test_ops.py::TestCommonCUDA::test_errors_fft_ifftn_cuda PASSED [1.2035s] [ 5%] 2025-12-04T13:35:25.8643564Z test_ops.py::TestCommonCUDA::test_errors_fft_ihfft2_cuda PASSED [1.1961s] [ 5%] 2025-12-04T13:35:25.8643779Z test_ops.py::TestCommonCUDA::test_errors_fft_ihfftn_cuda PASSED [1.1842s] [ 5%] 2025-12-04T13:35:25.8643994Z test_ops.py::TestCommonCUDA::test_errors_fft_irfft2_cuda PASSED [1.1821s] [ 5%] 2025-12-04T13:35:25.8644208Z test_ops.py::TestCommonCUDA::test_errors_gt_cuda PASSED [1.1733s] [ 5%] 2025-12-04T13:35:25.8644421Z test_ops.py::TestCommonCUDA::test_errors_igammac_cuda PASSED [1.1951s] [ 5%] 2025-12-04T13:35:25.8644631Z test_ops.py::TestCommonCUDA::test_errors_ldexp_cuda PASSED [1.1995s] [ 5%] 2025-12-04T13:35:25.8644864Z test_ops.py::TestCommonCUDA::test_errors_linspace_tensor_overload_cuda PASSED [1.2003s] [ 5%] 2025-12-04T13:35:25.8645098Z test_ops.py::TestCommonCUDA::test_errors_logaddexp_cuda PASSED [1.2119s] [ 5%] 2025-12-04T13:35:25.8645314Z test_ops.py::TestCommonCUDA::test_errors_logical_and_cuda PASSED [1.1999s] [ 5%] 2025-12-04T13:35:25.8645535Z test_ops.py::TestCommonCUDA::test_errors_masked_scatter_cuda PASSED [1.2178s] [ 5%] 2025-12-04T13:35:25.8645786Z test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_avg_pool1d_cuda PASSED [1.2168s] [ 5%] 2025-12-04T13:35:25.8646096Z test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_avg_pool3d_cuda PASSED [1.2097s] [ 5%] 2025-12-04T13:35:25.8646389Z test_ops.py::TestCommonCUDA::test_errors_nn_functional_gaussian_nll_loss_cuda PASSED [1.1901s] [ 6%] 2025-12-04T13:35:25.8646662Z test_ops.py::TestCommonCUDA::test_errors_nn_functional_hinge_embedding_loss_cuda PASSED [0.0034s] [ 6%] 2025-12-04T13:35:25.8646926Z test_ops.py::TestCommonCUDA::test_errors_nn_functional_max_pool1d_cuda PASSED [1.2110s] [ 6%] 2025-12-04T13:35:25.8647186Z test_ops.py::TestCommonCUDA::test_errors_nn_functional_multi_margin_loss_cuda PASSED [1.1862s] [ 6%] 2025-12-04T13:35:25.8647444Z test_ops.py::TestCommonCUDA::test_errors_nn_functional_rms_norm_cuda PASSED [1.2189s] [ 6%] 2025-12-04T13:35:25.8647703Z test_ops.py::TestCommonCUDA::test_errors_nn_functional_soft_margin_loss_cuda PASSED [1.2016s] [ 6%] 2025-12-04T13:35:25.8647953Z test_ops.py::TestCommonCUDA::test_errors_normal_in_place_cuda PASSED [0.0038s] [ 6%] 2025-12-04T13:35:25.8648217Z test_ops.py::TestCommonCUDA::test_errors_rsub_cuda PASSED [1.2116s] [ 6%] 2025-12-04T13:35:25.8648448Z test_ops.py::TestCommonCUDA::test_errors_signal_windows_cosine_cuda PASSED [0.0055s] [ 6%] 2025-12-04T13:35:25.8648718Z test_ops.py::TestCommonCUDA::test_errors_signal_windows_exponential_cuda PASSED [0.0056s] [ 6%] 2025-12-04T13:35:25.8648968Z test_ops.py::TestCommonCUDA::test_errors_sparse_mul_layout4_cuda PASSED [0.0589s] [ 6%] 2025-12-04T13:35:25.8649203Z test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout2_cuda PASSED [0.0030s] [ 6%] 2025-12-04T13:35:25.8649449Z test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout1_cuda PASSED [0.0314s] [ 6%] 2025-12-04T13:35:25.8649705Z test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout4_cuda PASSED [0.0014s] [ 6%] 2025-12-04T13:35:25.8649965Z test_ops.py::TestCommonCUDA::test_errors_special_hermite_polynomial_h_cuda PASSED [0.0024s] [ 6%] 2025-12-04T13:35:25.8650247Z test_ops.py::TestCommonCUDA::test_errors_special_shifted_chebyshev_polynomial_t_cuda PASSED [0.0020s] [ 6%] 2025-12-04T13:35:25.8650537Z test_ops.py::TestCommonCUDA::test_errors_special_shifted_chebyshev_polynomial_u_cuda PASSED [0.0021s] [ 6%] 2025-12-04T13:35:25.8650793Z test_ops.py::TestCommonCUDA::test_errors_sub_cuda PASSED [0.0014s] [ 6%] 2025-12-04T13:35:25.8651009Z test_ops.py::TestCommonCUDA::test_errors_unbind_copy_cuda PASSED [1.1853s] [ 6%] 2025-12-04T13:35:25.8651223Z test_ops.py::TestCommonCUDA::test_errors_unbind_cuda PASSED [1.2057s] [ 6%] 2025-12-04T13:35:25.8651434Z test_ops.py::TestCommonCUDA::test_errors_view_as_cuda PASSED [1.1959s] [ 6%] 2025-12-04T13:35:25.8651644Z test_ops.py::TestCommonCUDA::test_errors_view_copy_cuda PASSED [0.0081s] [ 6%] 2025-12-04T13:35:25.8651927Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_addmv_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8652285Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_alias_copy_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8652640Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_aminmax_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8652992Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_arange_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8653338Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_asin_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8653681Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_atanh_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8654024Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_bmm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8654382Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cholesky_inverse_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8654775Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cross_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8655135Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cummax_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8655481Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cummin_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8655829Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cumsum_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8656172Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_diff_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8656533Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_div_trunc_rounding_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8656894Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_dot_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8657248Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_eq_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8657591Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_fftn_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8657942Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_irfft2_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8658351Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_irfft_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8658705Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_rfftn_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8659055Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_frac_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8659397Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ge_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8659745Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_hash_tensor_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8660103Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_igamma_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8660451Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_igammac_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8660813Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_reduce_amax_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8661187Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_reduce_mean_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8661556Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_select_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8661912Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_inner_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8662273Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_cholesky_ex_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8662639Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_inv_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8662997Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_inv_ex_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8663383Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_lstsq_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 6%] 2025-12-04T13:35:25.8663760Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_matrix_norm_cuda_float32 SKIPPED [0.0012s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8664125Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linspace_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8664498Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_log_softmax_with_dtype_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8664875Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logcumsumexp_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8665238Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logical_or_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8665603Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logical_xor_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8665961Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logspace_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8666344Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_max_reduction_no_dim_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8666707Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mean_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8667072Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mvlgamma_mvlgamma_p_5_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8667444Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nan_to_num_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8667789Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_neg_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8668273Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nextafter_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8668653Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_hardshrink_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 6%] 2025-12-04T13:35:25.8669030Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_norm_inf_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8669375Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ormqr_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8669742Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_polygamma_polygamma_n_0_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8670125Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_polygamma_polygamma_n_4_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 7%] 2025-12-04T13:35:25.8670481Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_qr_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8670827Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_randn_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8671171Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sinc_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8671524Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_slice_scatter_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8671893Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_bessel_j0_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8672261Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_entr_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8672671Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_erfcx_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8673050Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_i1e_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8673430Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_modified_bessel_k1_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8673845Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_polygamma_special_polygamma_n_0_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8674265Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_scaled_modified_bessel_k0_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8674693Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_shifted_chebyshev_polynomial_t_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8675091Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_zeta_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8675465Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sqrt_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8675809Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_tan_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8676152Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_vstack_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8676500Z test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_where_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 7%] 2025-12-04T13:35:25.8676834Z test_ops.py::TestCommonCUDA::test_multiple_devices_T_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8677161Z test_ops.py::TestCommonCUDA::test_multiple_devices___rand___cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8677496Z test_ops.py::TestCommonCUDA::test_multiple_devices___rdiv___cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8677851Z test_ops.py::TestCommonCUDA::test_multiple_devices__native_batch_norm_legit_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8678243Z test_ops.py::TestCommonCUDA::test_multiple_devices_add_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8678574Z test_ops.py::TestCommonCUDA::test_multiple_devices_addbmm_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8678911Z test_ops.py::TestCommonCUDA::test_multiple_devices_addmm_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8679251Z test_ops.py::TestCommonCUDA::test_multiple_devices_alias_copy_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8679590Z test_ops.py::TestCommonCUDA::test_multiple_devices_amin_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8679914Z test_ops.py::TestCommonCUDA::test_multiple_devices_any_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8680239Z test_ops.py::TestCommonCUDA::test_multiple_devices_any_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8680568Z test_ops.py::TestCommonCUDA::test_multiple_devices_argwhere_cuda_int64 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8680916Z test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_copy_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8681257Z test_ops.py::TestCommonCUDA::test_multiple_devices_asin_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8681623Z test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_1d_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8681978Z test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_2d_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8682319Z test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_3d_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8682667Z test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_left_shift_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8683020Z test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_not_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8683359Z test_ops.py::TestCommonCUDA::test_multiple_devices_block_diag_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8683712Z test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_tensors_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8684084Z test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_to_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8684431Z test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_to_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8684771Z test_ops.py::TestCommonCUDA::test_multiple_devices_bucketize_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8685103Z test_ops.py::TestCommonCUDA::test_multiple_devices_byte_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8685427Z test_ops.py::TestCommonCUDA::test_multiple_devices_ceil_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8685754Z test_ops.py::TestCommonCUDA::test_multiple_devices_cfloat_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8686089Z test_ops.py::TestCommonCUDA::test_multiple_devices_cholesky_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8686427Z test_ops.py::TestCommonCUDA::test_multiple_devices_chunk_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8686757Z test_ops.py::TestCommonCUDA::test_multiple_devices_clone_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8687091Z test_ops.py::TestCommonCUDA::test_multiple_devices_column_stack_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8687436Z test_ops.py::TestCommonCUDA::test_multiple_devices_combinations_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8687784Z test_ops.py::TestCommonCUDA::test_multiple_devices_copysign_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8688161Z test_ops.py::TestCommonCUDA::test_multiple_devices_corrcoef_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8688510Z test_ops.py::TestCommonCUDA::test_multiple_devices_count_nonzero_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8688859Z test_ops.py::TestCommonCUDA::test_multiple_devices_count_nonzero_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8689194Z test_ops.py::TestCommonCUDA::test_multiple_devices_cummax_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8689525Z test_ops.py::TestCommonCUDA::test_multiple_devices_cumprod_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8689857Z test_ops.py::TestCommonCUDA::test_multiple_devices_cumsum_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8690217Z test_ops.py::TestCommonCUDA::test_multiple_devices_diag_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8690563Z test_ops.py::TestCommonCUDA::test_multiple_devices_diagflat_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8690894Z test_ops.py::TestCommonCUDA::test_multiple_devices_diff_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8691217Z test_ops.py::TestCommonCUDA::test_multiple_devices_diff_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8691547Z test_ops.py::TestCommonCUDA::test_multiple_devices_digamma_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8691876Z test_ops.py::TestCommonCUDA::test_multiple_devices_dist_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8692221Z test_ops.py::TestCommonCUDA::test_multiple_devices_div_trunc_rounding_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8692569Z test_ops.py::TestCommonCUDA::test_multiple_devices_double_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8692918Z test_ops.py::TestCommonCUDA::test_multiple_devices_dsplit_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 7%] 2025-12-04T13:35:25.8693254Z test_ops.py::TestCommonCUDA::test_multiple_devices_dstack_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8693586Z test_ops.py::TestCommonCUDA::test_multiple_devices_empty_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8693922Z test_ops.py::TestCommonCUDA::test_multiple_devices_empty_strided_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8694263Z test_ops.py::TestCommonCUDA::test_multiple_devices_erfinv_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8694602Z test_ops.py::TestCommonCUDA::test_multiple_devices_expand_as_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8694943Z test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fft_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8695280Z test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfft_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8695615Z test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfft_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8695952Z test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifft2_cuda_float32 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8696287Z test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifft_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8696624Z test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftshift_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8696965Z test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfft2_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8697303Z test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfftn_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8697632Z test_ops.py::TestCommonCUDA::test_multiple_devices_fill_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8697956Z test_ops.py::TestCommonCUDA::test_multiple_devices_flipud_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8698341Z test_ops.py::TestCommonCUDA::test_multiple_devices_float_power_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8698678Z test_ops.py::TestCommonCUDA::test_multiple_devices_floor_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8699040Z test_ops.py::TestCommonCUDA::test_multiple_devices_fmax_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8699385Z test_ops.py::TestCommonCUDA::test_multiple_devices_gradient_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8699726Z test_ops.py::TestCommonCUDA::test_multiple_devices_hash_tensor_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8700065Z test_ops.py::TestCommonCUDA::test_multiple_devices_heaviside_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8700400Z test_ops.py::TestCommonCUDA::test_multiple_devices_index_add_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8700736Z test_ops.py::TestCommonCUDA::test_multiple_devices_index_add_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8701070Z test_ops.py::TestCommonCUDA::test_multiple_devices_index_copy_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8701406Z test_ops.py::TestCommonCUDA::test_multiple_devices_index_put_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8701775Z test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_amax_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8702137Z test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_mean_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8702493Z test_ops.py::TestCommonCUDA::test_multiple_devices_index_select_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8702833Z test_ops.py::TestCommonCUDA::test_multiple_devices_isclose_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8703166Z test_ops.py::TestCommonCUDA::test_multiple_devices_isfinite_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8703499Z test_ops.py::TestCommonCUDA::test_multiple_devices_isin_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8703822Z test_ops.py::TestCommonCUDA::test_multiple_devices_isinf_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8704153Z test_ops.py::TestCommonCUDA::test_multiple_devices_isneginf_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8704485Z test_ops.py::TestCommonCUDA::test_multiple_devices_isreal_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8704845Z test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_2inputs_2outputs_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8705204Z test_ops.py::TestCommonCUDA::test_multiple_devices_kron_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8705535Z test_ops.py::TestCommonCUDA::test_multiple_devices_ldexp_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8705873Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cross_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8706222Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_eig_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8706570Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_eigh_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8706983Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_householder_product_cuda_float32 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 8%] 2025-12-04T13:35:25.8707393Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_inv_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8707748Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_ldl_factor_ex_cuda_float32 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8708185Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_multi_dot_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8708557Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_solve_ex_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8708914Z test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_vector_norm_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8709256Z test_ops.py::TestCommonCUDA::test_multiple_devices_log10_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8709582Z test_ops.py::TestCommonCUDA::test_multiple_devices_log2_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8709917Z test_ops.py::TestCommonCUDA::test_multiple_devices_logaddexp2_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8710264Z test_ops.py::TestCommonCUDA::test_multiple_devices_logical_xor_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8710618Z test_ops.py::TestCommonCUDA::test_multiple_devices_logit_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8710945Z test_ops.py::TestCommonCUDA::test_multiple_devices_logit_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8711268Z test_ops.py::TestCommonCUDA::test_multiple_devices_mT_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8711600Z test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmax_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8711956Z test_ops.py::TestCommonCUDA::test_multiple_devices_masked_cumprod_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8712309Z test_ops.py::TestCommonCUDA::test_multiple_devices_masked_select_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8712655Z test_ops.py::TestCommonCUDA::test_multiple_devices_masked_std_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8713026Z test_ops.py::TestCommonCUDA::test_multiple_devices_max_pool2d_with_indices_backward_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8713414Z test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_list_of_tensors_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8713768Z test_ops.py::TestCommonCUDA::test_multiple_devices_movedim_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8714093Z test_ops.py::TestCommonCUDA::test_multiple_devices_mul_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8714428Z test_ops.py::TestCommonCUDA::test_multiple_devices_multinomial_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8714786Z test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_1_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8715153Z test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_1_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8715506Z test_ops.py::TestCommonCUDA::test_multiple_devices_narrow_copy_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8715841Z test_ops.py::TestCommonCUDA::test_multiple_devices_narrow_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8716196Z test_ops.py::TestCommonCUDA::test_multiple_devices_native_dropout_backward_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8716545Z test_ops.py::TestCommonCUDA::test_multiple_devices_neg_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8716930Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_max_pool2d_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 8%] 2025-12-04T13:35:25.8717336Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_avg_pool1d_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8717711Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv2d_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8718130Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_cross_entropy_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8718513Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_ctc_loss_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8718893Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_dropout2d_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8719280Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_dropout3d_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8719677Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_dropout_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8720045Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_elu_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8720447Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_feature_alpha_dropout_without_train_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8720872Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_fractional_max_pool2d_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8721272Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hardsigmoid_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8721670Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hinge_embedding_loss_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8722077Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_bicubic_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8722495Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_nearest-exact_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8722910Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_nearest_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8723317Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_trilinear_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8723713Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_layer_norm_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8724107Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_margin_ranking_loss_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8724504Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool1d_cuda_float32 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8724894Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool1d_grad_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8725306Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_multi_head_attention_forward_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8725702Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_one_hot_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8734317Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_reflect_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8734754Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_shuffle_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8735150Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_poisson_nll_loss_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8735529Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8735894Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_silu_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8736270Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softplus_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8736649Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_threshold_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8737057Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_triplet_margin_loss_cuda_float32 SKIPPED [0.0012s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8737477Z test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_triplet_margin_with_distance_loss_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8737862Z test_ops.py::TestCommonCUDA::test_multiple_devices_nonzero_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8738236Z test_ops.py::TestCommonCUDA::test_multiple_devices_norm_fro_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8738579Z test_ops.py::TestCommonCUDA::test_multiple_devices_norm_nuc_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8738917Z test_ops.py::TestCommonCUDA::test_multiple_devices_ones_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8739257Z test_ops.py::TestCommonCUDA::test_multiple_devices_ones_like_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8739596Z test_ops.py::TestCommonCUDA::test_multiple_devices_ones_like_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8739931Z test_ops.py::TestCommonCUDA::test_multiple_devices_outer_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8740264Z test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_1_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 9%] 2025-12-04T13:35:25.8740591Z test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_4_cuda_int64 SKIPPED [0.0001s] (Skipped!) [ 9%] 2025-12-04T13:35:25.8740922Z test_ops.py::TestCommonCUDA::test_multiple_devices_rad2deg_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8741256Z test_ops.py::TestCommonCUDA::test_multiple_devices_randn_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8741593Z test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8741944Z test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_interleave_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8742298Z test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_as_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8742638Z test_ops.py::TestCommonCUDA::test_multiple_devices_resolve_conj_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8743016Z test_ops.py::TestCommonCUDA::test_multiple_devices_round_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8743378Z test_ops.py::TestCommonCUDA::test_multiple_devices_round_decimals_0_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8743736Z test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_add_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8744093Z test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amin_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8744454Z test_ops.py::TestCommonCUDA::test_multiple_devices_select_scatter_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8744803Z test_ops.py::TestCommonCUDA::test_multiple_devices_select_scatter_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8745152Z test_ops.py::TestCommonCUDA::test_multiple_devices_sigmoid_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8745492Z test_ops.py::TestCommonCUDA::test_multiple_devices_sigmoid_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8745875Z test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_exponential_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8746262Z test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_hamming_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8746636Z test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_kaiser_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8747010Z test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_nuttall_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8747374Z test_ops.py::TestCommonCUDA::test_multiple_devices_signbit_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8747708Z test_ops.py::TestCommonCUDA::test_multiple_devices_sin_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8748057Z test_ops.py::TestCommonCUDA::test_multiple_devices_sparse_sampled_addmm_cuda_float32 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8748463Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y0_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8748844Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_u_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8749241Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_hermite_polynomial_h_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8749630Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_hermite_polynomial_h_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8750018Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_hermite_polynomial_he_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8750392Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_i0e_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8750767Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_laguerre_polynomial_l_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8751162Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_legendre_polynomial_p_cuda_float32 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8751536Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_log_ndtr_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 9%] 2025-12-04T13:35:25.8751905Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i1_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8752323Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_k1_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8752701Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_ndtr_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8753072Z test_ops.py::TestCommonCUDA::test_multiple_devices_special_scaled_modified_bessel_k0_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8753441Z test_ops.py::TestCommonCUDA::test_multiple_devices_split_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8753782Z test_ops.py::TestCommonCUDA::test_multiple_devices_split_list_args_cuda_int64 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8754132Z test_ops.py::TestCommonCUDA::test_multiple_devices_split_with_sizes_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8754482Z test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_copy_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8754845Z test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_multiple_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8755194Z test_ops.py::TestCommonCUDA::test_multiple_devices_std_mean_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8755533Z test_ops.py::TestCommonCUDA::test_multiple_devices_stft_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8755869Z test_ops.py::TestCommonCUDA::test_multiple_devices_t_copy_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8756205Z test_ops.py::TestCommonCUDA::test_multiple_devices_tan_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8756532Z test_ops.py::TestCommonCUDA::test_multiple_devices_tanh_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8756866Z test_ops.py::TestCommonCUDA::test_multiple_devices_tensor_split_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8757206Z test_ops.py::TestCommonCUDA::test_multiple_devices_tile_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8757534Z test_ops.py::TestCommonCUDA::test_multiple_devices_to_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8757860Z test_ops.py::TestCommonCUDA::test_multiple_devices_topk_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8758268Z test_ops.py::TestCommonCUDA::test_multiple_devices_torch_ops_aten__safe_softmax_default_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8758641Z test_ops.py::TestCommonCUDA::test_multiple_devices_trace_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8758985Z test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_copy_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8759339Z test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8759678Z test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8760010Z test_ops.py::TestCommonCUDA::test_multiple_devices_tril_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8760336Z test_ops.py::TestCommonCUDA::test_multiple_devices_triu_cuda_int64 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8760663Z test_ops.py::TestCommonCUDA::test_multiple_devices_unbind_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8761028Z test_ops.py::TestCommonCUDA::test_multiple_devices_unfold_copy_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8761379Z test_ops.py::TestCommonCUDA::test_multiple_devices_uniform_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8761732Z test_ops.py::TestCommonCUDA::test_multiple_devices_unique_consecutive_cuda_float32 SKIPPED [0.0010s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8762104Z test_ops.py::TestCommonCUDA::test_multiple_devices_unique_consecutive_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8762458Z test_ops.py::TestCommonCUDA::test_multiple_devices_unravel_index_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8762805Z test_ops.py::TestCommonCUDA::test_multiple_devices_unsafe_chunk_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8763157Z test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_copy_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8763512Z test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_copy_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8763873Z test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8764208Z test_ops.py::TestCommonCUDA::test_multiple_devices_vdot_cuda_float32 SKIPPED [0.0011s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8764540Z test_ops.py::TestCommonCUDA::test_multiple_devices_vsplit_cuda_float32 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8764876Z test_ops.py::TestCommonCUDA::test_multiple_devices_vstack_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8765208Z test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_like_cuda_int64 SKIPPED [0.0009s] (fewer than 2 devices detected) [ 10%] 2025-12-04T13:35:25.8765508Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_H_cuda_bool PASSED [1.2370s] [ 10%] 2025-12-04T13:35:25.8765772Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values___radd___cuda_bool PASSED [0.0067s] [ 10%] 2025-12-04T13:35:25.8766039Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_abs_cuda_bool PASSED [1.2154s] [ 10%] 2025-12-04T13:35:25.8766306Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_alias_copy_cuda_bool PASSED [0.0040s] [ 10%] 2025-12-04T13:35:25.8766594Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_argwhere_cuda_bool SKIPPED [0.0002s] (Skipped!) [ 10%] 2025-12-04T13:35:25.8766885Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_as_strided_cuda_bool PASSED [1.2433s] [ 10%] 2025-12-04T13:35:25.8767153Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_asinh_cuda_bool PASSED [0.0123s] [ 10%] 2025-12-04T13:35:25.8767421Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atleast_1d_cuda_bool PASSED [1.2454s] [ 10%] 2025-12-04T13:35:25.8767693Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atleast_2d_cuda_bool PASSED [0.0057s] [ 10%] 2025-12-04T13:35:25.8767971Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_xor_cuda_bool PASSED [0.0054s] [ 10%] 2025-12-04T13:35:25.8768288Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_block_diag_cuda_bool PASSED [1.2450s] [ 10%] 2025-12-04T13:35:25.8768566Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_broadcast_to_cuda_bool PASSED [0.0051s] [ 10%] 2025-12-04T13:35:25.8768832Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cat_cuda_bool PASSED [1.2412s] [ 10%] 2025-12-04T13:35:25.8769092Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_chalf_cuda_bool PASSED [0.0057s] [ 10%] 2025-12-04T13:35:25.8769356Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_clone_cuda_bool PASSED [1.1891s] [ 10%] 2025-12-04T13:35:25.8769658Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_combinations_cuda_bool PASSED [0.0136s] [ 10%] 2025-12-04T13:35:25.8769942Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_conj_cuda_bool PASSED [1.2275s] [ 10%] 2025-12-04T13:35:25.8770208Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cummin_cuda_bool PASSED [0.0046s] [ 10%] 2025-12-04T13:35:25.8770482Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diagonal_copy_cuda_bool PASSED [1.2089s] [ 10%] 2025-12-04T13:35:25.8770774Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_div_no_rounding_mode_cuda_bool PASSED [0.0083s] [ 10%] 2025-12-04T13:35:25.8771055Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_dstack_cuda_bool PASSED [1.2072s] [ 10%] 2025-12-04T13:35:25.8771340Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_like_cuda_bool SKIPPED [0.0002s] (Skipped!) [ 10%] 2025-12-04T13:35:25.8771656Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_permuted_cuda_bool SKIPPED [0.0002s] (Skipped!) [ 10%] 2025-12-04T13:35:25.8771950Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_exp2_cuda_bool PASSED [1.2043s] [ 10%] 2025-12-04T13:35:25.8772232Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fft2_cuda_bool PASSED [1.2468s] [ 10%] 2025-12-04T13:35:25.8772506Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ifftshift_cuda_bool PASSED [0.0053s] [ 10%] 2025-12-04T13:35:25.8772782Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_irfftn_cuda_bool PASSED [2.2976s] [ 10%] 2025-12-04T13:35:25.8773052Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_rfft_cuda_bool PASSED [1.2020s] [ 10%] 2025-12-04T13:35:25.8773314Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_flip_cuda_bool PASSED [0.0062s] [ 10%] 2025-12-04T13:35:25.8773575Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fliplr_cuda_bool PASSED [0.0029s] [ 10%] 2025-12-04T13:35:25.8773843Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_float_power_cuda_bool PASSED [0.0065s] [ 11%] 2025-12-04T13:35:25.8774112Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fmax_cuda_bool PASSED [0.0053s] [ 11%] 2025-12-04T13:35:25.8774371Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_full_cuda_bool PASSED [1.1806s] [ 11%] 2025-12-04T13:35:25.8774628Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_gather_cuda_bool PASSED [0.0065s] [ 11%] 2025-12-04T13:35:25.8774885Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_half_cuda_bool PASSED [1.1845s] [ 11%] 2025-12-04T13:35:25.8775147Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_hash_tensor_cuda_bool PASSED [0.0088s] [ 11%] 2025-12-04T13:35:25.8775419Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_heaviside_cuda_bool PASSED [0.0055s] [ 11%] 2025-12-04T13:35:25.8775685Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_hsplit_cuda_bool PASSED [1.2336s] [ 11%] 2025-12-04T13:35:25.8775944Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_int_cuda_bool PASSED [0.0052s] [ 11%] 2025-12-04T13:35:25.8776201Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_le_cuda_bool PASSED [0.0058s] [ 11%] 2025-12-04T13:35:25.8776488Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_masked_select_cuda_bool SKIPPED [0.0002s] (Skipped!) [ 11%] 2025-12-04T13:35:25.8776783Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_max_binary_cuda_bool PASSED [0.0052s] [ 11%] 2025-12-04T13:35:25.8777051Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_min_binary_cuda_bool PASSED [0.0052s] [ 11%] 2025-12-04T13:35:25.8777333Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_min_reduction_no_dim_cuda_bool PASSED [1.2179s] [ 11%] 2025-12-04T13:35:25.8777633Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_min_reduction_with_dim_cuda_bool PASSED [0.0051s] [ 11%] 2025-12-04T13:35:25.8777919Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_minimum_cuda_bool PASSED [0.0056s] [ 11%] 2025-12-04T13:35:25.8778253Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_msort_cuda_bool PASSED [1.2734s] [ 11%] 2025-12-04T13:35:25.8778546Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_empty_cuda_bool SKIPPED [0.0002s] (Skipped!) [ 11%] 2025-12-04T13:35:25.8778886Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_feature_alpha_dropout_without_train_cuda_bool PASSED [1.2004s] [ 11%] 2025-12-04T13:35:25.8779231Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pad_circular_cuda_bool PASSED [0.0044s] [ 11%] 2025-12-04T13:35:25.8779549Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pixel_unshuffle_cuda_bool PASSED [0.0036s] [ 11%] 2025-12-04T13:35:25.8779873Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_unfold_cuda_bool SKIPPED [0.0001s] (Skipped!) [ 11%] 2025-12-04T13:35:25.8780185Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nonzero_cuda_bool SKIPPED [0.0001s] (Skipped!) [ 11%] 2025-12-04T13:35:25.8780490Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nonzero_static_cuda_bool SKIPPED [0.0001s] (Skipped!) [ 11%] 2025-12-04T13:35:25.8780795Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ones_cuda_bool PASSED [1.2023s] [ 11%] 2025-12-04T13:35:25.8781054Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_outer_cuda_bool PASSED [0.0039s] [ 11%] 2025-12-04T13:35:25.8781339Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_polygamma_polygamma_n_0_cuda_bool PASSED [1.1850s] [ 11%] 2025-12-04T13:35:25.8781660Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_polygamma_polygamma_n_3_cuda_bool SKIPPED [0.0002s] (Skipped!) [ 11%] 2025-12-04T13:35:25.8781958Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_prod_cuda_bool PASSED [1.2239s] [ 11%] 2025-12-04T13:35:25.8782213Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_put_cuda_bool PASSED [0.0130s] [ 11%] 2025-12-04T13:35:25.8782486Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_repeat_interleave_cuda_bool PASSED [1.2158s] [ 11%] 2025-12-04T13:35:25.8782760Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_rot90_cuda_bool PASSED [0.0125s] [ 11%] 2025-12-04T13:35:25.8783016Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sin_cuda_bool PASSED [1.1975s] [ 11%] 2025-12-04T13:35:25.8783269Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sinc_cuda_bool PASSED [0.0064s] [ 11%] 2025-12-04T13:35:25.8783523Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sort_cuda_bool PASSED [1.2128s] [ 11%] 2025-12-04T13:35:25.8783794Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_bessel_y1_cuda_bool PASSED [0.0049s] [ 11%] 2025-12-04T13:35:25.8784075Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_i0e_cuda_bool PASSED [1.1876s] [ 11%] 2025-12-04T13:35:25.8784386Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_u_cuda_bool PASSED [0.0080s] [ 11%] 2025-12-04T13:35:25.8784699Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_squeeze_copy_cuda_bool PASSED [1.2113s] [ 11%] 2025-12-04T13:35:25.8784969Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_squeeze_cuda_bool PASSED [0.0055s] [ 11%] 2025-12-04T13:35:25.8785233Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_stack_cuda_bool PASSED [0.0053s] [ 11%] 2025-12-04T13:35:25.8785490Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tile_cuda_bool PASSED [0.0153s] [ 11%] 2025-12-04T13:35:25.8785744Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_to_cuda_bool PASSED [1.2289s] [ 11%] 2025-12-04T13:35:25.8786019Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_to_sparse_cuda_bool SKIPPED [0.0002s] (Skipped!) [ 11%] 2025-12-04T13:35:25.8786311Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_transpose_copy_cuda_bool PASSED [1.2063s] [ 11%] 2025-12-04T13:35:25.8786589Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_true_divide_cuda_bool PASSED [0.0085s] [ 11%] 2025-12-04T13:35:25.8786886Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unbind_copy_cuda_bool PASSED [1.1844s] [ 11%] 2025-12-04T13:35:25.8787167Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unsqueeze_cuda_bool PASSED [0.0059s] [ 11%] 2025-12-04T13:35:25.8787430Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_view_as_cuda_bool PASSED [1.2013s] [ 11%] 2025-12-04T13:35:25.8787691Z test_ops.py::TestCommonCUDA::test_non_standard_bool_values_zero__cuda_bool PASSED [0.0044s] [ 11%] 2025-12-04T13:35:25.8787943Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_int64 PASSED [1.1900s] [ 11%] 2025-12-04T13:35:25.8788237Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_T_cuda_complex64 PASSED [1.2029s] [ 11%] 2025-12-04T13:35:25.8788498Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rand___cuda_int64 PASSED [0.0063s] [ 11%] 2025-12-04T13:35:25.8788766Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rdiv___cuda_complex64 PASSED [0.0136s] [ 11%] 2025-12-04T13:35:25.8789036Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmod___cuda_int64 PASSED [0.0045s] [ 11%] 2025-12-04T13:35:25.8789324Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rpow___cuda_complex64 PASSED [0.3192s] [ 11%] 2025-12-04T13:35:25.8789590Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rpow___cuda_int64 PASSED [0.0148s] [ 11%] 2025-12-04T13:35:25.8789854Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rsub___cuda_float32 PASSED [0.0177s] [ 11%] 2025-12-04T13:35:25.8790140Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples__batch_norm_with_update_cuda_float32 XFAIL [0.0821s] [ 11%] 2025-12-04T13:35:25.8790469Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_put_accumulate_cuda_complex64 PASSED [2.4352s] [ 11%] 2025-12-04T13:35:25.8790771Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_abs_cuda_int64 PASSED [0.0040s] [ 11%] 2025-12-04T13:35:25.8791030Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acos_cuda_int64 PASSED [1.2464s] [ 11%] 2025-12-04T13:35:25.8791293Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acosh_cuda_complex64 PASSED [0.0073s] [ 11%] 2025-12-04T13:35:25.8791558Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcmul_cuda_float32 PASSED [1.2272s] [ 11%] 2025-12-04T13:35:25.8791822Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_cuda_complex64 PASSED [0.0141s] [ 11%] 2025-12-04T13:35:25.8792095Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_alias_copy_cuda_complex64 PASSED [1.2122s] [ 11%] 2025-12-04T13:35:25.8792371Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_alias_copy_cuda_int64 PASSED [0.0041s] [ 11%] 2025-12-04T13:35:25.8792638Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_all_cuda_complex64 PASSED [1.1902s] [ 11%] 2025-12-04T13:35:25.8792902Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_arange_cuda_float32 PASSED [0.0107s] [ 11%] 2025-12-04T13:35:25.8793166Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_arange_cuda_int64 PASSED [0.0073s] [ 12%] 2025-12-04T13:35:25.8793428Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argmax_cuda_float32 PASSED [1.2089s] [ 12%] 2025-12-04T13:35:25.8793767Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_scatter_cuda_float32 SKIPPED [0.0002s] (Works for int64, fails for everything else) [ 12%] 2025-12-04T13:35:25.8794102Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asin_cuda_int64 PASSED [1.1979s] [ 12%] 2025-12-04T13:35:25.8794358Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan2_cuda_int64 PASSED [0.0205s] [ 12%] 2025-12-04T13:35:25.8794615Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atanh_cuda_int64 PASSED [1.2296s] [ 12%] 2025-12-04T13:35:25.8794877Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_3d_cuda_int64 PASSED [0.0056s] [ 12%] 2025-12-04T13:35:25.8795143Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_baddbmm_cuda_float32 PASSED [1.2398s] [ 12%] 2025-12-04T13:35:25.8795443Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_and_cuda_int64 PASSED [0.0066s] [ 12%] 2025-12-04T13:35:25.8795739Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_left_shift_cuda_int64 PASSED [0.0049s] [ 12%] 2025-12-04T13:35:25.8796020Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_xor_cuda_int64 PASSED [0.0046s] [ 12%] 2025-12-04T13:35:25.8796287Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bmm_cuda_complex64 PASSED [1.1941s] [ 12%] 2025-12-04T13:35:25.8796546Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bmm_cuda_float32 PASSED [0.0064s] [ 12%] 2025-12-04T13:35:25.8796804Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bool_cuda_float32 PASSED [1.1881s] [ 12%] 2025-12-04T13:35:25.8797061Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bool_cuda_int64 PASSED [0.0051s] [ 12%] 2025-12-04T13:35:25.8797338Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_tensors_cuda_complex64 PASSED [1.2011s] [ 12%] 2025-12-04T13:35:25.8797639Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_tensors_cuda_float32 PASSED [0.0061s] [ 12%] 2025-12-04T13:35:25.8797936Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bucketize_cuda_float32 PASSED [1.2175s] [ 12%] 2025-12-04T13:35:25.8798229Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ceil_cuda_float32 PASSED [0.0050s] [ 12%] 2025-12-04T13:35:25.8798488Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ceil_cuda_int64 PASSED [1.1860s] [ 12%] 2025-12-04T13:35:25.8798750Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cfloat_cuda_complex64 PASSED [0.0080s] [ 12%] 2025-12-04T13:35:25.8799032Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_inverse_cuda_complex64 PASSED [1.2223s] [ 12%] 2025-12-04T13:35:25.8799310Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clone_cuda_int64 PASSED [1.2025s] [ 12%] 2025-12-04T13:35:25.8799579Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_column_stack_cuda_int64 PASSED [0.0047s] [ 12%] 2025-12-04T13:35:25.8799850Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_complex_cuda_float32 PASSED [0.0115s] [ 12%] 2025-12-04T13:35:25.8800118Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_cuda_complex64 PASSED [1.2050s] [ 12%] 2025-12-04T13:35:25.8800394Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_complex64 PASSED [0.0051s] [ 12%] 2025-12-04T13:35:25.8800677Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_int64 PASSED [1.2040s] [ 12%] 2025-12-04T13:35:25.8800957Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_constant_pad_nd_cuda_int64 PASSED [0.0150s] [ 12%] 2025-12-04T13:35:25.8801241Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_contiguous_cuda_complex64 PASSED [0.0043s] [ 12%] 2025-12-04T13:35:25.8801519Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_contiguous_cuda_int64 PASSED [1.2268s] [ 12%] 2025-12-04T13:35:25.8801788Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_copysign_cuda_float32 PASSED [0.0241s] [ 12%] 2025-12-04T13:35:25.8802055Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_copysign_cuda_int64 PASSED [0.0056s] [ 12%] 2025-12-04T13:35:25.8802321Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cosh_cuda_complex64 PASSED [1.9762s] [ 12%] 2025-12-04T13:35:25.8802590Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_count_nonzero_cuda_int64 PASSED [1.2091s] [ 12%] 2025-12-04T13:35:25.8802860Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummin_cuda_float32 PASSED [0.0071s] [ 12%] 2025-12-04T13:35:25.8803122Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummin_cuda_int64 PASSED [1.2155s] [ 12%] 2025-12-04T13:35:25.8803383Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumprod_cuda_int64 PASSED [0.0251s] [ 12%] 2025-12-04T13:35:25.8803642Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumsum_cuda_int64 PASSED [1.2209s] [ 12%] 2025-12-04T13:35:25.8803941Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagflat_cuda_float32 PASSED [0.0083s] [ 12%] 2025-12-04T13:35:25.8804228Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_cuda_complex64 PASSED [0.0131s] [ 12%] 2025-12-04T13:35:25.8804501Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_cuda_float32 PASSED [1.2397s] [ 12%] 2025-12-04T13:35:25.8804782Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_scatter_cuda_float32 PASSED [0.0186s] [ 12%] 2025-12-04T13:35:25.8805060Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diff_cuda_complex64 PASSED [0.3060s] [ 12%] 2025-12-04T13:35:25.8805323Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diff_cuda_float32 PASSED [0.0808s] [ 12%] 2025-12-04T13:35:25.8805585Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_digamma_cuda_float32 PASSED [1.3405s] [ 12%] 2025-12-04T13:35:25.8805854Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_einsum_cuda_float32 PASSED [0.7672s] [ 12%] 2025-12-04T13:35:25.8806139Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 12%] 2025-12-04T13:35:25.8806462Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_like_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 12%] 2025-12-04T13:35:25.8806774Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_strided_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 12%] 2025-12-04T13:35:25.8807059Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erf_cuda_int64 PASSED [0.7199s] [ 12%] 2025-12-04T13:35:25.8807318Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erfinv_cuda_float32 PASSED [0.0071s] [ 12%] 2025-12-04T13:35:25.8807591Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_copy_cuda_complex64 PASSED [0.7264s] [ 12%] 2025-12-04T13:35:25.8807865Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_cuda_float32 PASSED [0.0105s] [ 12%] 2025-12-04T13:35:25.8808165Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_cuda_int64 PASSED [0.7164s] [ 12%] 2025-12-04T13:35:25.8808436Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exponential_cuda_float32 PASSED [0.0131s] [ 12%] 2025-12-04T13:35:25.8808717Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eye_cuda_int64 SKIPPED [0.0002s] (Skipped!) [ 12%] 2025-12-04T13:35:25.8808993Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft2_cuda_complex64 PASSED [0.0561s] [ 12%] 2025-12-04T13:35:25.8809263Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft2_cuda_float32 PASSED [0.7294s] [ 12%] 2025-12-04T13:35:25.8809535Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft2_cuda_complex64 PASSED [1.0664s] [ 12%] 2025-12-04T13:35:25.8809810Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft_cuda_complex64 PASSED [0.7960s] [ 12%] 2025-12-04T13:35:25.8810078Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft_cuda_int64 PASSED [0.7621s] [ 12%] 2025-12-04T13:35:25.8810356Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftshift_cuda_complex64 PASSED [0.0090s] [ 12%] 2025-12-04T13:35:25.8810643Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft2_cuda_float32 PASSED [1.2345s] [ 12%] 2025-12-04T13:35:25.8810920Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft_cuda_complex64 PASSED [1.2624s] [ 12%] 2025-12-04T13:35:25.8811192Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft_cuda_float32 PASSED [0.7860s] [ 12%] 2025-12-04T13:35:25.8811462Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfftn_cuda_float32 PASSED [0.7808s] [ 12%] 2025-12-04T13:35:25.8811730Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfft2_cuda_int64 PASSED [0.7882s] [ 12%] 2025-12-04T13:35:25.8811997Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfftn_cuda_float32 PASSED [0.7973s] [ 12%] 2025-12-04T13:35:25.8812261Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfftn_cuda_int64 PASSED [0.7885s] [ 13%] 2025-12-04T13:35:25.8812563Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fill_cuda_complex64 PASSED [0.0068s] [ 13%] 2025-12-04T13:35:25.8812852Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_power_cuda_complex64 PASSED [0.0158s] [ 13%] 2025-12-04T13:35:25.8813132Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_power_cuda_float32 PASSED [0.0132s] [ 13%] 2025-12-04T13:35:25.8813408Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_divide_cuda_int64 PASSED [0.0052s] [ 13%] 2025-12-04T13:35:25.8813674Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmax_cuda_float32 PASSED [0.0113s] [ 13%] 2025-12-04T13:35:25.8813933Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmin_cuda_int64 PASSED [0.0045s] [ 13%] 2025-12-04T13:35:25.8814193Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gather_cuda_int64 PASSED [0.7739s] [ 13%] 2025-12-04T13:35:25.8814455Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_geqrf_cuda_complex64 PASSED [0.0270s] [ 13%] 2025-12-04T13:35:25.8814751Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_grid_sampler_3d_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 13%] 2025-12-04T13:35:25.8815062Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_heaviside_cuda_float32 PASSED [0.0059s] [ 13%] 2025-12-04T13:35:25.8815334Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hsplit_cuda_complex64 PASSED [0.7726s] [ 13%] 2025-12-04T13:35:25.8815597Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hstack_cuda_int64 PASSED [0.0045s] [ 13%] 2025-12-04T13:35:25.8815859Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_add_cuda_int64 PASSED [0.7757s] [ 13%] 2025-12-04T13:35:25.8816136Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_amax_cuda_int64 PASSED [0.0067s] [ 13%] 2025-12-04T13:35:25.8816420Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_select_cuda_float32 PASSED [0.7800s] [ 13%] 2025-12-04T13:35:25.8816694Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_inner_cuda_float32 PASSED [0.0072s] [ 13%] 2025-12-04T13:35:25.8816960Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isfinite_cuda_float32 PASSED [0.7710s] [ 13%] 2025-12-04T13:35:25.8817224Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isin_cuda_float32 PASSED [0.0381s] [ 13%] 2025-12-04T13:35:25.8817482Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isinf_cuda_int64 PASSED [0.7291s] [ 13%] 2025-12-04T13:35:25.8817744Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isposinf_cuda_int64 PASSED [0.0038s] [ 13%] 2025-12-04T13:35:25.8818010Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isreal_cuda_complex64 PASSED [0.7074s] [ 13%] 2025-12-04T13:35:25.8818327Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isreal_cuda_int64 PASSED [0.0045s] [ 13%] 2025-12-04T13:35:25.8818586Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_item_cuda_complex64 PASSED [0.7233s] [ 13%] 2025-12-04T13:35:25.8818847Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_item_cuda_float32 PASSED [0.0059s] [ 13%] 2025-12-04T13:35:25.8819143Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_4inputs_with_extra_args_cuda_int64 PASSED [0.1187s] [ 13%] 2025-12-04T13:35:25.8819457Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_cuda_int64 PASSED [0.1129s] [ 13%] 2025-12-04T13:35:25.8819764Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_return_by_ref_cuda_int64 PASSED [0.1129s] [ 13%] 2025-12-04T13:35:25.8820072Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_unary_cuda_float32 PASSED [0.9671s] [ 13%] 2025-12-04T13:35:25.8820346Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kron_cuda_int64 PASSED [0.0040s] [ 13%] 2025-12-04T13:35:25.8820607Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kthvalue_cuda_int64 PASSED [0.7437s] [ 13%] 2025-12-04T13:35:25.8820870Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ldexp_cuda_complex64 PASSED [0.0171s] [ 13%] 2025-12-04T13:35:25.8821164Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_le_cuda_int64 PASSED [0.0050s] [ 13%] 2025-12-04T13:35:25.8821446Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cross_cuda_float32 PASSED [0.7121s] [ 13%] 2025-12-04T13:35:25.8821723Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigh_cuda_float32 PASSED [0.0249s] [ 13%] 2025-12-04T13:35:25.8822006Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvals_cuda_complex64 PASSED [0.0809s] [ 13%] 2025-12-04T13:35:25.8822289Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_cuda_float32 PASSED [0.0105s] [ 13%] 2025-12-04T13:35:25.8822566Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_ex_cuda_float32 PASSED [0.0099s] [ 13%] 2025-12-04T13:35:25.8822937Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_solve_cuda_float32 SKIPPED [0.0007s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 13%] 2025-12-04T13:35:25.8823308Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lstsq_cuda_float32 PASSED [0.9575s] [ 13%] 2025-12-04T13:35:25.8823614Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_factor_ex_cuda_float32 PASSED [0.0564s] [ 13%] 2025-12-04T13:35:25.8823908Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_solve_cuda_float32 PASSED [0.3712s] [ 13%] 2025-12-04T13:35:25.8824223Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_subgradients_at_zero_cuda_complex64 PASSED [0.7896s] [ 13%] 2025-12-04T13:35:25.8824562Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_subgradients_at_zero_cuda_float32 PASSED [0.7992s] [ 13%] 2025-12-04T13:35:25.8824958Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_singular_cuda_float32 SKIPPED [0.0011s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 13%] 2025-12-04T13:35:25.8825329Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_slogdet_cuda_complex64 PASSED [0.7514s] [ 13%] 2025-12-04T13:35:25.8825623Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorinv_cuda_float32 PASSED [0.7449s] [ 13%] 2025-12-04T13:35:25.8825922Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorsolve_cuda_complex64 PASSED [0.7480s] [ 13%] 2025-12-04T13:35:25.8826219Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vander_cuda_complex64 PASSED [0.7420s] [ 13%] 2025-12-04T13:35:25.8826506Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vecdot_cuda_float32 PASSED [0.0393s] [ 13%] 2025-12-04T13:35:25.8826781Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_cuda_float32 PASSED [0.0170s] [ 13%] 2025-12-04T13:35:25.8827075Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_tensor_overload_cuda_complex64 PASSED [0.0708s] [ 13%] 2025-12-04T13:35:25.8827391Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_tensor_overload_cuda_float32 PASSED [0.0697s] [ 13%] 2025-12-04T13:35:25.8827679Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log10_cuda_float32 PASSED [0.7366s] [ 13%] 2025-12-04T13:35:25.8827953Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_and_cuda_complex64 PASSED [0.0066s] [ 13%] 2025-12-04T13:35:25.8828284Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_not_cuda_complex64 PASSED [0.7274s] [ 13%] 2025-12-04T13:35:25.8828563Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_or_cuda_float32 PASSED [0.0063s] [ 13%] 2025-12-04T13:35:25.8828835Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_or_cuda_int64 PASSED [0.0048s] [ 13%] 2025-12-04T13:35:25.8829108Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logsumexp_cuda_complex64 PASSED [0.7522s] [ 13%] 2025-12-04T13:35:25.8829377Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_long_cuda_float32 PASSED [0.0048s] [ 13%] 2025-12-04T13:35:25.8829639Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_long_cuda_int64 PASSED [0.7293s] [ 13%] 2025-12-04T13:35:25.8829931Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lt_cuda_int64 PASSED [0.0064s] [ 13%] 2025-12-04T13:35:25.8830211Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_unpack_cuda_float32 PASSED [0.0489s] [ 13%] 2025-12-04T13:35:25.8830479Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mH_cuda_complex64 PASSED [0.7455s] [ 13%] 2025-12-04T13:35:25.8830748Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_amin_cuda_float32 PASSED [0.0781s] [ 13%] 2025-12-04T13:35:25.8831026Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_argmax_cuda_int64 PASSED [0.0170s] [ 13%] 2025-12-04T13:35:25.8831306Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumprod_cuda_int64 PASSED [0.7443s] [ 13%] 2025-12-04T13:35:25.8831587Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumsum_cuda_float32 PASSED [0.0196s] [ 13%] 2025-12-04T13:35:25.8831867Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_fill_cuda_int64 PASSED [0.7398s] [ 13%] 2025-12-04T13:35:25.8832154Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logsumexp_cuda_complex64 PASSED [0.0894s] [ 14%] 2025-12-04T13:35:25.8832478Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logsumexp_cuda_float32 PASSED [0.0834s] [ 14%] 2025-12-04T13:35:25.8832772Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logsumexp_cuda_int64 PASSED [0.0359s] [ 14%] 2025-12-04T13:35:25.8833063Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_mean_cuda_complex64 PASSED [0.0776s] [ 14%] 2025-12-04T13:35:25.8833347Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_mean_cuda_float32 PASSED [0.0735s] [ 14%] 2025-12-04T13:35:25.8833631Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_select_cuda_float32 PASSED [0.7722s] [ 14%] 2025-12-04T13:35:25.8833916Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_softmax_cuda_float32 PASSED [0.0254s] [ 14%] 2025-12-04T13:35:25.8834205Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_softmin_cuda_float32 PASSED [0.7849s] [ 14%] 2025-12-04T13:35:25.8834491Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_std_cuda_float32 PASSED [0.1078s] [ 14%] 2025-12-04T13:35:25.8834770Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_sum_cuda_float32 PASSED [0.0679s] [ 14%] 2025-12-04T13:35:25.8835042Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matmul_cuda_complex64 PASSED [0.8120s] [ 14%] 2025-12-04T13:35:25.8835319Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matrix_exp_cuda_complex64 PASSED [0.0222s] [ 14%] 2025-12-04T13:35:25.8835619Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_list_of_tensors_cuda_complex64 PASSED [0.7726s] [ 14%] 2025-12-04T13:35:25.8835916Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_binary_cuda_int64 PASSED [0.0065s] [ 14%] 2025-12-04T13:35:25.8836186Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_minimum_cuda_float32 PASSED [0.0127s] [ 14%] 2025-12-04T13:35:25.8836450Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mode_cuda_float32 PASSED [0.9495s] [ 14%] 2025-12-04T13:35:25.8836718Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_movedim_cuda_complex64 PASSED [0.7580s] [ 14%] 2025-12-04T13:35:25.8837009Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_1_cuda_float32 PASSED [0.0125s] [ 14%] 2025-12-04T13:35:25.8837316Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_1_cuda_int64 PASSED [0.7534s] [ 14%] 2025-12-04T13:35:25.8837623Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_3_cuda_float32 PASSED [0.0129s] [ 14%] 2025-12-04T13:35:25.8837926Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_5_cuda_int64 PASSED [0.7735s] [ 14%] 2025-12-04T13:35:25.8838258Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmean_cuda_float32 PASSED [0.0344s] [ 14%] 2025-12-04T13:35:25.8838557Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nansum_cuda_int64 PASSED [0.0076s] [ 14%] 2025-12-04T13:35:25.8838845Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_narrow_cuda_complex64 PASSED [0.7730s] [ 14%] 2025-12-04T13:35:25.8839125Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_native_layer_norm_cuda_float32 PASSED [0.0268s] [ 14%] 2025-12-04T13:35:25.8839405Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ne_cuda_complex64 PASSED [0.0050s] [ 14%] 2025-12-04T13:35:25.8839668Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_neg_cuda_complex64 PASSED [0.7482s] [ 14%] 2025-12-04T13:35:25.8839936Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_ones_cuda_float32 PASSED [0.0066s] [ 14%] 2025-12-04T13:35:25.8840205Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_ones_cuda_int64 PASSED [0.7677s] [ 14%] 2025-12-04T13:35:25.8840476Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_zeros_cuda_complex64 PASSED [0.0065s] [ 14%] 2025-12-04T13:35:25.8840787Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_avg_pool2d_cuda_float32 PASSED [0.7368s] [ 14%] 2025-12-04T13:35:25.8841143Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_max_pool1d_cuda_float32 PASSED [0.0174s] [ 14%] 2025-12-04T13:35:25.8841482Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_max_pool2d_cuda_float32 PASSED [0.0186s] [ 14%] 2025-12-04T13:35:25.8841818Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_binary_cross_entropy_cuda_float32 PASSED [0.7375s] [ 14%] 2025-12-04T13:35:25.8842170Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_binary_cross_entropy_with_logits_cuda_float32 PASSED [0.0320s] [ 14%] 2025-12-04T13:35:25.8842559Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv2d_cuda_complex64 GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8842861Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8843041Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8843216Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8843390Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8843561Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8843733Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8843904Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8844076Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8844250Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8844423Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8844594Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8844765Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8844936Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8845108Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8845281Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8845451Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8845622Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8845792Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8845962Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8846135Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8846306Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8846374Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8846444Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8846512Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8846609Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8846691Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8846759Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8846827Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8846896Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8846963Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847031Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847099Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847169Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847237Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847306Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847375Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847445Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847529Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847596Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847665Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847733Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847801Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847869Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8847937Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848004Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848122Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848189Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848261Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848328Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848399Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848466Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848534Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848602Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848670Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848737Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848807Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848874Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8848944Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8849011Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8849081Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8849150Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8849219Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8849285Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8849354Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8849421Z GridwiseOp: Problemsize descriptor dimension check failure 2025-12-04T13:35:25.8849618Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 2400, provided ptr: 0x768bed604800 size: 1024 2025-12-04T13:35:25.8849799Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 2400, provided ptr: 0x768bed604800 size: 1024 2025-12-04T13:35:25.8850034Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 2400, provided ptr: 0x768bed604200 size: 1024 2025-12-04T13:35:25.8850230Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 2400, provided ptr: 0x768bed604200 size: 1024 2025-12-04T13:35:25.8850429Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 2400, provided ptr: 0x768bed604200 size: 1024 2025-12-04T13:35:25.8850617Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 2400, provided ptr: 0x768bed604200 size: 1024 2025-12-04T13:35:25.8850662Z PASSED [0.3556s] [ 14%] 2025-12-04T13:35:25.8850820Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose2d_cuda_complex64 PASSED [0.0409s] [ 14%] 2025-12-04T13:35:25.8850973Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose3d_cuda_complex64 PASSED [0.7361s] [ 14%] 2025-12-04T13:35:25.8851125Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose3d_cuda_float32 PASSED [0.0199s] [ 14%] 2025-12-04T13:35:25.8851299Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cosine_embedding_loss_cuda_int64 PASSED [0.7180s] [ 14%] 2025-12-04T13:35:25.8851440Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_ctc_loss_cuda_float32 PASSED [0.0313s] [ 14%] 2025-12-04T13:35:25.8851578Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_dropout3d_cuda_float32 PASSED [0.0263s] [ 14%] 2025-12-04T13:35:25.8851709Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_elu_cuda_float32 PASSED [0.7183s] [ 14%] 2025-12-04T13:35:25.8851879Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_feature_alpha_dropout_without_train_cuda_float32 PASSED [0.0132s] [ 14%] 2025-12-04T13:35:25.8852051Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_feature_alpha_dropout_without_train_cuda_int64 PASSED [0.7241s] [ 14%] 2025-12-04T13:35:25.8852200Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_gaussian_nll_loss_cuda_float32 PASSED [0.5099s] [ 14%] 2025-12-04T13:35:25.8852333Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_glu_cuda_float32 PASSED [0.7544s] [ 14%] 2025-12-04T13:35:25.8852484Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_bicubic_cuda_float32 PASSED [0.7239s] [ 14%] 2025-12-04T13:35:25.8852634Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_linear_cuda_float32 PASSED [0.0231s] [ 14%] 2025-12-04T13:35:25.8852794Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_nearest-exact_cuda_float32 PASSED [0.0307s] [ 14%] 2025-12-04T13:35:25.8852933Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_l1_loss_cuda_complex64 PASSED [0.7132s] [ 14%] 2025-12-04T13:35:25.8853068Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_l1_loss_cuda_float32 PASSED [0.0102s] [ 14%] 2025-12-04T13:35:25.8853210Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_layer_norm_cuda_float32 PASSED [0.7250s] [ 14%] 2025-12-04T13:35:25.8853344Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_linear_cuda_float32 PASSED [0.0306s] [ 14%] 2025-12-04T13:35:25.8853494Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_margin_ranking_loss_cuda_int64 PASSED [0.0124s] [ 14%] 2025-12-04T13:35:25.8853641Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multi_margin_loss_cuda_float32 PASSED [0.7189s] [ 14%] 2025-12-04T13:35:25.8853776Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_nll_loss_cuda_float32 PASSED [0.0517s] [ 14%] 2025-12-04T13:35:25.8853921Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_circular_cuda_complex64 PASSED [0.7164s] [ 14%] 2025-12-04T13:35:25.8854081Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_reflect_cuda_float32 PASSED [0.0121s] [ 14%] 2025-12-04T13:35:25.8854232Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_cuda_int64 PASSED [0.7292s] [ 14%] 2025-12-04T13:35:25.8854365Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pdist_cuda_float32 PASSED [0.0104s] [ 14%] 2025-12-04T13:35:25.8854507Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_shuffle_cuda_int64 PASSED [0.0035s] [ 14%] 2025-12-04T13:35:25.8854654Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_unshuffle_cuda_complex64 PASSED [0.0050s] [ 14%] 2025-12-04T13:35:25.8854803Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_poisson_nll_loss_cuda_float32 PASSED [0.8029s] [ 14%] 2025-12-04T13:35:25.8854945Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_poisson_nll_loss_cuda_int64 PASSED [0.0374s] [ 14%] 2025-12-04T13:35:25.8855126Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_scaled_dot_product_attention_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 14%] 2025-12-04T13:35:25.8855284Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_soft_margin_loss_cuda_float32 PASSED [0.7157s] [ 14%] 2025-12-04T13:35:25.8855422Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softplus_cuda_float32 PASSED [0.0192s] [ 14%] 2025-12-04T13:35:25.8855556Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softsign_cuda_int64 PASSED [0.7197s] [ 14%] 2025-12-04T13:35:25.8855702Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_tanhshrink_cuda_complex64 PASSED [0.0061s] [ 15%] 2025-12-04T13:35:25.8855845Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_tanhshrink_cuda_float32 PASSED [0.6968s] [ 15%] 2025-12-04T13:35:25.8855983Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_threshold_cuda_float32 PASSED [0.0239s] [ 15%] 2025-12-04T13:35:25.8856118Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_threshold_cuda_int64 PASSED [0.7048s] [ 15%] 2025-12-04T13:35:25.8856275Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_loss_cuda_complex64 PASSED [0.0159s] [ 15%] 2025-12-04T13:35:25.8856444Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_with_distance_loss_cuda_float32 PASSED [0.0140s] [ 15%] 2025-12-04T13:35:25.8856590Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_upsample_nearest_cuda_float32 PASSED [0.7157s] [ 15%] 2025-12-04T13:35:25.8856707Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nonzero_cuda_float32 PASSED [0.0115s] [ 15%] 2025-12-04T13:35:25.8856828Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_inf_cuda_complex64 PASSED [0.7204s] [ 15%] 2025-12-04T13:35:25.8856947Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_nuc_cuda_float32 PASSED [0.0084s] [ 15%] 2025-12-04T13:35:25.8857063Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_normal_cuda_float32 PASSED [0.7347s] [ 15%] 2025-12-04T13:35:25.8857181Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_cuda_complex64 PASSED [0.0044s] [ 15%] 2025-12-04T13:35:25.8857293Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_int64 PASSED [0.7195s] [ 15%] 2025-12-04T13:35:25.8857419Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pca_lowrank_cuda_complex64 PASSED [0.8549s] [ 15%] 2025-12-04T13:35:25.8857539Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pca_lowrank_cuda_float32 PASSED [0.8715s] [ 15%] 2025-12-04T13:35:25.8857690Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_2_cuda_int64 SKIPPED [0.0002s] (Skipped!) [ 15%] 2025-12-04T13:35:25.8857811Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_positive_cuda_complex64 PASSED [0.7325s] [ 15%] 2025-12-04T13:35:25.8857926Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_put_cuda_float32 PASSED [0.0290s] [ 15%] 2025-12-04T13:35:25.8858103Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_qr_cuda_float32 PASSED [0.0538s] [ 15%] 2025-12-04T13:35:25.8858243Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_float32 PASSED [0.7494s] [ 15%] 2025-12-04T13:35:25.8858362Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_int64 PASSED [0.0050s] [ 15%] 2025-12-04T13:35:25.8858476Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_renorm_cuda_float32 PASSED [0.7544s] [ 15%] 2025-12-04T13:35:25.8858591Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_cuda_int64 PASSED [0.0130s] [ 15%] 2025-12-04T13:35:25.8858706Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reshape_as_cuda_int64 PASSED [0.0035s] [ 15%] 2025-12-04T13:35:25.8858823Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize__cuda_int64 PASSED [0.7324s] [ 15%] 2025-12-04T13:35:25.8858946Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_conj_cuda_float32 PASSED [0.7186s] [ 15%] 2025-12-04T13:35:25.8859067Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_conj_cuda_int64 PASSED [0.7258s] [ 15%] 2025-12-04T13:35:25.8859203Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_neg_cuda_float32 PASSED [0.7184s] [ 15%] 2025-12-04T13:35:25.8859319Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rot90_cuda_complex64 PASSED [0.0286s] [ 15%] 2025-12-04T13:35:25.8859433Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rot90_cuda_float32 PASSED [0.0244s] [ 15%] 2025-12-04T13:35:25.8859548Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_cuda_float32 PASSED [0.0030s] [ 15%] 2025-12-04T13:35:25.8859696Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_decimals_neg_3_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 15%] 2025-12-04T13:35:25.8859810Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsqrt_cuda_float32 PASSED [0.7114s] [ 15%] 2025-12-04T13:35:25.8859923Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsqrt_cuda_int64 PASSED [0.0049s] [ 15%] 2025-12-04T13:35:25.8860058Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amax_cuda_float32 PASSED [0.0296s] [ 15%] 2025-12-04T13:35:25.8860188Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amax_cuda_int64 PASSED [0.7242s] [ 15%] 2025-12-04T13:35:25.8860321Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amin_cuda_float32 PASSED [0.0314s] [ 15%] 2025-12-04T13:35:25.8860450Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_mean_cuda_int64 PASSED [0.0113s] [ 15%] 2025-12-04T13:35:25.8860580Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_prod_cuda_float32 PASSED [0.7430s] [ 15%] 2025-12-04T13:35:25.8860696Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_cuda_float32 PASSED [0.0089s] [ 15%] 2025-12-04T13:35:25.8860808Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_cuda_int64 PASSED [0.7247s] [ 15%] 2025-12-04T13:35:25.8860922Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sgn_cuda_float32 PASSED [0.0056s] [ 15%] 2025-12-04T13:35:25.8861034Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sgn_cuda_int64 PASSED [0.7199s] [ 15%] 2025-12-04T13:35:25.8861149Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_short_cuda_int64 PASSED [0.0050s] [ 15%] 2025-12-04T13:35:25.8861264Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sigmoid_cuda_float32 PASSED [0.7082s] [ 15%] 2025-12-04T13:35:25.8861378Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sign_cuda_float32 PASSED [0.0049s] [ 15%] 2025-12-04T13:35:25.8861531Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_bartlett_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 15%] 2025-12-04T13:35:25.8861680Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_hann_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 15%] 2025-12-04T13:35:25.8861820Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sin_cuda_complex64 PASSED [0.6563s] [ 15%] 2025-12-04T13:35:25.8861944Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sin_cuda_int64 PASSED [0.7144s] [ 15%] 2025-12-04T13:35:25.8862055Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinc_cuda_int64 PASSED [0.2142s] [ 15%] 2025-12-04T13:35:25.8862169Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinh_cuda_int64 PASSED [0.7245s] [ 15%] 2025-12-04T13:35:25.8862291Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_scatter_cuda_int64 PASSED [0.0063s] [ 15%] 2025-12-04T13:35:25.8862404Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sort_cuda_float32 PASSED [0.0287s] [ 15%] 2025-12-04T13:35:25.8862515Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sort_cuda_int64 PASSED [0.0130s] [ 15%] 2025-12-04T13:35:25.8862644Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_j1_cuda_int64 PASSED [0.9258s] [ 15%] 2025-12-04T13:35:25.8862792Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_w_cuda_float32 PASSED [0.1961s] [ 15%] 2025-12-04T13:35:25.8862915Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_entr_cuda_int64 PASSED [0.8930s] [ 15%] 2025-12-04T13:35:25.8863049Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_erfcx_cuda_int64 PASSED [0.9524s] [ 15%] 2025-12-04T13:35:25.8863168Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1_cuda_float32 PASSED [0.9849s] [ 15%] 2025-12-04T13:35:25.8863312Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_laguerre_polynomial_l_cuda_int64 PASSED [0.0102s] [ 15%] 2025-12-04T13:35:25.8863453Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_i1_cuda_float32 PASSED [0.9791s] [ 15%] 2025-12-04T13:35:25.8863592Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_k0_cuda_int64 PASSED [0.9258s] [ 15%] 2025-12-04T13:35:25.8863712Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_ndtr_cuda_int64 PASSED [0.7066s] [ 15%] 2025-12-04T13:35:25.8863862Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k1_cuda_int64 PASSED [0.2151s] [ 15%] 2025-12-04T13:35:25.8864020Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_t_cuda_int64 PASSED [0.0056s] [ 15%] 2025-12-04T13:35:25.8864153Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_copy_cuda_float32 PASSED [0.7183s] [ 15%] 2025-12-04T13:35:25.8864266Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_stack_cuda_int64 PASSED [0.0060s] [ 15%] 2025-12-04T13:35:25.8864396Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_unbiased_cuda_float32 PASSED [0.7112s] [ 15%] 2025-12-04T13:35:25.8864507Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sub_cuda_float32 PASSED [0.0137s] [ 16%] 2025-12-04T13:35:25.8864617Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_cuda_int64 PASSED [0.7278s] [ 16%] 2025-12-04T13:35:25.8864731Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_copy_cuda_int64 PASSED [0.0045s] [ 16%] 2025-12-04T13:35:25.8864852Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensordot_cuda_float32 PASSED [0.7176s] [ 16%] 2025-12-04T13:35:25.8864961Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tile_cuda_int64 PASSED [0.0158s] [ 16%] 2025-12-04T13:35:25.8865072Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_topk_cuda_int64 PASSED [0.0220s] [ 16%] 2025-12-04T13:35:25.8865188Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_cuda_int64 PASSED [0.7098s] [ 16%] 2025-12-04T13:35:25.8865306Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapezoid_cuda_int64 PASSED [0.0080s] [ 16%] 2025-12-04T13:35:25.8865423Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapz_cuda_complex64 PASSED [0.7275s] [ 16%] 2025-12-04T13:35:25.8865536Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapz_cuda_float32 PASSED [0.0148s] [ 16%] 2025-12-04T13:35:25.8865669Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapz_cuda_int64 PASSED [0.7148s] [ 16%] 2025-12-04T13:35:25.8865811Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triangular_solve_cuda_complex64 PASSED [0.0255s] [ 16%] 2025-12-04T13:35:25.8865925Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trunc_cuda_float32 PASSED [0.7304s] [ 16%] 2025-12-04T13:35:25.8866043Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_copy_cuda_int64 PASSED [0.0089s] [ 16%] 2025-12-04T13:35:25.8866158Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_uniform_cuda_float32 PASSED [0.7162s] [ 16%] 2025-12-04T13:35:25.8866286Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unique_consecutive_cuda_int64 PASSED [0.1225s] [ 16%] 2025-12-04T13:35:25.8866413Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_chunk_cuda_complex64 PASSED [0.7129s] [ 16%] 2025-12-04T13:35:25.8866533Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_chunk_cuda_int64 PASSED [0.0048s] [ 16%] 2025-12-04T13:35:25.8866657Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_split_cuda_float32 PASSED [0.7146s] [ 16%] 2025-12-04T13:35:25.8866801Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_copy_cuda_float32 PASSED [0.0102s] [ 16%] 2025-12-04T13:35:25.8866920Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_cuda_float32 PASSED [0.7231s] [ 16%] 2025-12-04T13:35:25.8867035Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_cuda_int64 PASSED [0.0059s] [ 16%] 2025-12-04T13:35:25.8867152Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_mean_cuda_float32 PASSED [0.7297s] [ 16%] 2025-12-04T13:35:25.8867271Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_copy_cuda_complex64 PASSED [0.0095s] [ 16%] 2025-12-04T13:35:25.8867386Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_cuda_complex64 PASSED [0.7106s] [ 16%] 2025-12-04T13:35:25.8867500Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vstack_cuda_int64 PASSED [0.0044s] [ 16%] 2025-12-04T13:35:25.8867617Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_where_cuda_complex64 PASSED [0.7255s] [ 16%] 2025-12-04T13:35:25.8867734Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zero__cuda_complex64 PASSED [0.0067s] [ 16%] 2025-12-04T13:35:25.8867846Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zero__cuda_int64 PASSED [0.7043s] [ 16%] 2025-12-04T13:35:25.8867959Z test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_cuda_float32 PASSED [0.0042s] [ 16%] 2025-12-04T13:35:25.8868060Z test_ops.py::TestCommonCUDA::test_numpy_ref_allclose_cuda_float64 PASSED [0.0107s] [ 16%] 2025-12-04T13:35:25.8868208Z test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_tensors_cuda_int64 PASSED [0.7263s] [ 16%] 2025-12-04T13:35:25.8868306Z test_ops.py::TestCommonCUDA::test_numpy_ref_equal_cuda_float64 PASSED [0.0056s] [ 16%] 2025-12-04T13:35:25.8868412Z test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_complex128 PASSED [0.0141s] [ 16%] 2025-12-04T13:35:25.8868513Z test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_float64 PASSED [0.0128s] [ 16%] 2025-12-04T13:35:25.8868612Z test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_int64 PASSED [0.0143s] [ 16%] 2025-12-04T13:35:25.8868705Z test_ops.py::TestCommonCUDA::test_numpy_ref_item_cuda_int64 PASSED [0.7314s] [ 16%] 2025-12-04T13:35:25.8868837Z test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_4inputs_with_extra_args_cuda_int64 PASSED [0.0071s] [ 16%] 2025-12-04T13:35:25.8868943Z test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_cross_cuda_complex128 PASSED [0.7100s] [ 16%] 2025-12-04T13:35:25.8869049Z test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_cross_cuda_float64 PASSED [0.0049s] [ 16%] 2025-12-04T13:35:25.8869165Z test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_tensorsolve_cuda_complex128 PASSED [0.7149s] [ 16%] 2025-12-04T13:35:25.8869279Z test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_tensorsolve_cuda_float64 PASSED [0.0067s] [ 16%] 2025-12-04T13:35:25.8869425Z test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vecdot_cuda_complex128 PASSED [0.0140s] [ 16%] 2025-12-04T13:35:25.8869561Z test_ops.py::TestCommonCUDA::test_numpy_ref_meshgrid_variadic_tensors_cuda_int64 PASSED [0.7202s] [ 16%] 2025-12-04T13:35:25.8869680Z test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_layer_norm_cuda_float64 PASSED [0.0064s] [ 16%] 2025-12-04T13:35:25.8869797Z test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_mse_loss_cuda_float64 PASSED [0.7303s] [ 16%] 2025-12-04T13:35:25.8869908Z test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_one_hot_cuda_int64 PASSED [0.0058s] [ 16%] 2025-12-04T13:35:25.8870044Z test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_pairwise_distance_cuda_complex128 PASSED [0.7291s] [ 16%] 2025-12-04T13:35:25.8870139Z test_ops.py::TestCommonCUDA::test_numpy_ref_roll_cuda_float64 PASSED [0.0079s] [ 16%] 2025-12-04T13:35:25.8870244Z test_ops.py::TestCommonCUDA::test_numpy_ref_searchsorted_cuda_int64 PASSED [0.0847s] [ 16%] 2025-12-04T13:35:25.8870353Z test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_copy_cuda_float64 PASSED [0.7285s] [ 16%] 2025-12-04T13:35:25.8870476Z test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_multiple_cuda_int64 PASSED [0.0050s] [ 16%] 2025-12-04T13:35:25.8870586Z test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_copy_cuda_complex128 PASSED [0.7212s] [ 16%] 2025-12-04T13:35:25.8870681Z test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_cuda_int64 PASSED [0.0061s] [ 16%] 2025-12-04T13:35:25.8870779Z test_ops.py::TestCommonCUDA::test_out___rmatmul___cuda_float32 PASSED [0.7170s] [ 16%] 2025-12-04T13:35:25.8870890Z test_ops.py::TestCommonCUDA::test_out__refs__conversions_byte_cuda_float32 PASSED [0.0031s] [ 16%] 2025-12-04T13:35:25.8870984Z test_ops.py::TestCommonCUDA::test_out__refs_acos_cuda_float32 PASSED [0.7118s] [ 16%] 2025-12-04T13:35:25.8871076Z test_ops.py::TestCommonCUDA::test_out__refs_addr_cuda_float32 PASSED [0.0112s] [ 16%] 2025-12-04T13:35:25.8871180Z test_ops.py::TestCommonCUDA::test_out__refs_alias_copy_cuda_float32 PASSED [0.7219s] [ 16%] 2025-12-04T13:35:25.8871289Z test_ops.py::TestCommonCUDA::test_out__refs_as_strided_copy_cuda_float32 PASSED [0.0083s] [ 16%] 2025-12-04T13:35:25.8871385Z test_ops.py::TestCommonCUDA::test_out__refs_asinh_cuda_float32 PASSED [0.7288s] [ 16%] 2025-12-04T13:35:25.8871477Z test_ops.py::TestCommonCUDA::test_out__refs_atanh_cuda_float32 PASSED [0.0052s] [ 16%] 2025-12-04T13:35:25.8871589Z test_ops.py::TestCommonCUDA::test_out__refs_bitwise_right_shift_cuda_int64 PASSED [0.0107s] [ 16%] 2025-12-04T13:35:25.8871693Z test_ops.py::TestCommonCUDA::test_out__refs_broadcast_to_cuda_float32 PASSED [0.7180s] [ 16%] 2025-12-04T13:35:25.8871787Z test_ops.py::TestCommonCUDA::test_out__refs_cat_cuda_float32 PASSED [0.0158s] [ 16%] 2025-12-04T13:35:25.8871889Z test_ops.py::TestCommonCUDA::test_out__refs_column_stack_cuda_float32 PASSED [0.0064s] [ 16%] 2025-12-04T13:35:25.8871983Z test_ops.py::TestCommonCUDA::test_out__refs_cosh_cuda_float32 PASSED [0.7562s] [ 16%] 2025-12-04T13:35:25.8872090Z test_ops.py::TestCommonCUDA::test_out__refs_count_nonzero_cuda_float32 PASSED [0.0033s] [ 16%] 2025-12-04T13:35:25.8872186Z test_ops.py::TestCommonCUDA::test_out__refs_cumsum_cuda_float32 XFAIL [0.0051s] [ 16%] 2025-12-04T13:35:25.8872282Z test_ops.py::TestCommonCUDA::test_out__refs_deg2rad_cuda_float32 PASSED [0.7433s] [ 17%] 2025-12-04T13:35:25.8872389Z test_ops.py::TestCommonCUDA::test_out__refs_diagonal_copy_cuda_float32 PASSED [0.0158s] [ 17%] 2025-12-04T13:35:25.8872499Z test_ops.py::TestCommonCUDA::test_out__refs_div_no_rounding_mode_cuda_float32 PASSED [0.0146s] [ 17%] 2025-12-04T13:35:25.8872597Z test_ops.py::TestCommonCUDA::test_out__refs_dsplit_cuda_float32 PASSED [0.7409s] [ 17%] 2025-12-04T13:35:25.8872693Z test_ops.py::TestCommonCUDA::test_out__refs_dstack_cuda_float32 PASSED [0.0102s] [ 17%] 2025-12-04T13:35:25.8872891Z test_ops.py::TestCommonCUDA::test_out__refs_equal_cuda_float32 SKIPPED [0.0026s] (Skipped! Only supports single tensor or iterable of tensor outputs.) [ 17%] 2025-12-04T13:35:25.8873022Z test_ops.py::TestCommonCUDA::test_out__refs_exp_cuda_float32 PASSED [0.7330s] [ 17%] 2025-12-04T13:35:25.8873137Z test_ops.py::TestCommonCUDA::test_out__refs_expand_copy_cuda_float32 PASSED [0.0115s] [ 17%] 2025-12-04T13:35:25.8873233Z test_ops.py::TestCommonCUDA::test_out__refs_fft_fft_cuda_float32 PASSED [0.7433s] [ 17%] 2025-12-04T13:35:25.8873330Z test_ops.py::TestCommonCUDA::test_out__refs_fft_hfftn_cuda_float32 PASSED [0.0185s] [ 17%] 2025-12-04T13:35:25.8873431Z test_ops.py::TestCommonCUDA::test_out__refs_fft_irfft2_cuda_float32 PASSED [0.0088s] [ 17%] 2025-12-04T13:35:25.8873528Z test_ops.py::TestCommonCUDA::test_out__refs_fft_irfft_cuda_float32 PASSED [0.7214s] [ 17%] 2025-12-04T13:35:25.8873634Z test_ops.py::TestCommonCUDA::test_out__refs_floor_divide_cuda_float32 PASSED [0.0982s] [ 17%] 2025-12-04T13:35:25.8873727Z test_ops.py::TestCommonCUDA::test_out__refs_fmax_cuda_float32 PASSED [0.0141s] [ 17%] 2025-12-04T13:35:25.8873822Z test_ops.py::TestCommonCUDA::test_out__refs_fmin_cuda_float32 PASSED [0.0138s] [ 17%] 2025-12-04T13:35:25.8873918Z test_ops.py::TestCommonCUDA::test_out__refs_hsplit_cuda_float32 PASSED [0.7182s] [ 17%] 2025-12-04T13:35:25.8874029Z test_ops.py::TestCommonCUDA::test_out__refs_i0_cuda_float32 PASSED [0.0064s] [ 17%] 2025-12-04T13:35:25.8874128Z test_ops.py::TestCommonCUDA::test_out__refs_isneginf_cuda_float32 PASSED [0.7216s] [ 17%] 2025-12-04T13:35:25.8874224Z test_ops.py::TestCommonCUDA::test_out__refs_isreal_cuda_float32 PASSED [0.0055s] [ 17%] 2025-12-04T13:35:25.8874315Z test_ops.py::TestCommonCUDA::test_out__refs_lcm_cuda_int64 PASSED [0.0128s] [ 17%] 2025-12-04T13:35:25.8874408Z test_ops.py::TestCommonCUDA::test_out__refs_le_cuda_float32 PASSED [0.0104s] [ 17%] 2025-12-04T13:35:25.8874514Z test_ops.py::TestCommonCUDA::test_out__refs_linalg_svdvals_cuda_float32 PASSED [0.0531s] [ 17%] 2025-12-04T13:35:25.8874639Z test_ops.py::TestCommonCUDA::test_out__refs_linspace_tensor_overload_cuda_float32 PASSED [0.1852s] [ 17%] 2025-12-04T13:35:25.8874733Z test_ops.py::TestCommonCUDA::test_out__refs_log_cuda_float32 PASSED [0.7368s] [ 17%] 2025-12-04T13:35:25.8874829Z test_ops.py::TestCommonCUDA::test_out__refs_mean_cuda_float32 PASSED [0.0384s] [ 17%] 2025-12-04T13:35:25.8874979Z test_ops.py::TestCommonCUDA::test_out__refs_new_empty_cuda_float32 SKIPPED [0.0002s] (Expected: empty is not comparable) [ 17%] 2025-12-04T13:35:25.8875078Z test_ops.py::TestCommonCUDA::test_out__refs_new_full_cuda_float32 PASSED [0.7140s] [ 17%] 2025-12-04T13:35:25.8875196Z test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_leaky_relu_cuda_float32 PASSED [0.0132s] [ 17%] 2025-12-04T13:35:25.8875312Z test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_nll_loss_cuda_float32 PASSED [0.0937s] [ 17%] 2025-12-04T13:35:25.8875436Z test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_pixel_unshuffle_cuda_float32 PASSED [0.7181s] [ 17%] 2025-12-04T13:35:25.8875548Z test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_relu6_cuda_float32 PASSED [0.0061s] [ 17%] 2025-12-04T13:35:25.8875660Z test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_relu_cuda_float32 PASSED [0.7190s] [ 17%] 2025-12-04T13:35:25.8875807Z test_ops.py::TestCommonCUDA::test_out__refs_normal_cuda_float32 SKIPPED [0.0002s] (Expected: normal is not comparable) [ 17%] 2025-12-04T13:35:25.8875909Z test_ops.py::TestCommonCUDA::test_out__refs_positive_cuda_float32 PASSED [0.7105s] [ 17%] 2025-12-04T13:35:25.8876001Z test_ops.py::TestCommonCUDA::test_out__refs_rsub_cuda_float32 PASSED [0.0032s] [ 17%] 2025-12-04T13:35:25.8876094Z test_ops.py::TestCommonCUDA::test_out__refs_sgn_cuda_float32 PASSED [0.7476s] [ 17%] 2025-12-04T13:35:25.8876189Z test_ops.py::TestCommonCUDA::test_out__refs_signbit_cuda_float32 PASSED [0.0047s] [ 17%] 2025-12-04T13:35:25.8876283Z test_ops.py::TestCommonCUDA::test_out__refs_sin_cuda_float32 PASSED [0.7299s] [ 17%] 2025-12-04T13:35:25.8876373Z test_ops.py::TestCommonCUDA::test_out__refs_sinh_cuda_float32 PASSED [0.0049s] [ 17%] 2025-12-04T13:35:25.8876507Z test_ops.py::TestCommonCUDA::test_out__refs_special_bessel_j0_cuda_float32 PASSED [0.7216s] [ 17%] 2025-12-04T13:35:25.8876619Z test_ops.py::TestCommonCUDA::test_out__refs_special_i0e_cuda_float32 PASSED [0.0075s] [ 17%] 2025-12-04T13:35:25.8876745Z test_ops.py::TestCommonCUDA::test_out__refs_special_spherical_bessel_j0_cuda_float32 PASSED [0.7315s] [ 17%] 2025-12-04T13:35:25.8876850Z test_ops.py::TestCommonCUDA::test_out__refs_special_zeta_cuda_float32 PASSED [0.0162s] [ 17%] 2025-12-04T13:35:25.8876962Z test_ops.py::TestCommonCUDA::test_out__refs_squeeze_multiple_cuda_float32 PASSED [0.7239s] [ 17%] 2025-12-04T13:35:25.8877061Z test_ops.py::TestCommonCUDA::test_out__refs_sum_to_size_cuda_float32 PASSED [0.0032s] [ 17%] 2025-12-04T13:35:25.8877153Z test_ops.py::TestCommonCUDA::test_out__refs_t_cuda_float32 PASSED [0.7152s] [ 17%] 2025-12-04T13:35:25.8877253Z test_ops.py::TestCommonCUDA::test_out__refs_transpose_cuda_float32 PASSED [0.0032s] [ 17%] 2025-12-04T13:35:25.8877350Z test_ops.py::TestCommonCUDA::test_out__refs_tril_cuda_float32 PASSED [0.7406s] [ 17%] 2025-12-04T13:35:25.8877452Z test_ops.py::TestCommonCUDA::test_out__refs_tril_indices_cuda_int64 PASSED [0.0032s] [ 17%] 2025-12-04T13:35:25.8877568Z test_ops.py::TestCommonCUDA::test_out__refs_true_divide_cuda_float32 PASSED [0.0168s] [ 17%] 2025-12-04T13:35:25.8877661Z test_ops.py::TestCommonCUDA::test_out__refs_trunc_cuda_float32 PASSED [0.7273s] [ 17%] 2025-12-04T13:35:25.8877757Z test_ops.py::TestCommonCUDA::test_out__refs_unbind_cuda_float32 PASSED [0.0031s] [ 17%] 2025-12-04T13:35:25.8877852Z test_ops.py::TestCommonCUDA::test_out__refs_unfold_cuda_float32 PASSED [0.7529s] [ 17%] 2025-12-04T13:35:25.8877951Z test_ops.py::TestCommonCUDA::test_out__refs_unsqueeze_cuda_float32 PASSED [0.0033s] [ 17%] 2025-12-04T13:35:25.8878044Z test_ops.py::TestCommonCUDA::test_out__refs_xlogy_cuda_float32 PASSED [0.0140s] [ 17%] 2025-12-04T13:35:25.8878207Z test_ops.py::TestCommonCUDA::test_out__unsafe_masked_index_put_accumulate_cuda_float32 PASSED [0.7642s] [ 17%] 2025-12-04T13:35:25.8878299Z test_ops.py::TestCommonCUDA::test_out_abs_cuda_float32 PASSED [0.0045s] [ 17%] 2025-12-04T13:35:25.8878389Z test_ops.py::TestCommonCUDA::test_out_add_cuda_float32 PASSED [0.7467s] [ 17%] 2025-12-04T13:35:25.8878484Z test_ops.py::TestCommonCUDA::test_out_alias_copy_cuda_float32 PASSED [0.0057s] [ 17%] 2025-12-04T13:35:25.8878571Z test_ops.py::TestCommonCUDA::test_out_all_cuda_float32 PASSED [0.7490s] [ 17%] 2025-12-04T13:35:25.8878660Z test_ops.py::TestCommonCUDA::test_out_amax_cuda_float32 PASSED [0.0205s] [ 17%] 2025-12-04T13:35:25.8878750Z test_ops.py::TestCommonCUDA::test_out_aminmax_cuda_float32 PASSED [0.0106s] [ 17%] 2025-12-04T13:35:25.8878837Z test_ops.py::TestCommonCUDA::test_out_any_cuda_float32 PASSED [0.7360s] [ 17%] 2025-12-04T13:35:25.8878926Z test_ops.py::TestCommonCUDA::test_out_arange_cuda_float32 PASSED [0.0240s] [ 17%] 2025-12-04T13:35:25.8879016Z test_ops.py::TestCommonCUDA::test_out_argmin_cuda_float32 PASSED [0.7353s] [ 17%] 2025-12-04T13:35:25.8879111Z test_ops.py::TestCommonCUDA::test_out_atleast_2d_cuda_float32 PASSED [0.0031s] [ 17%] 2025-12-04T13:35:25.8879205Z test_ops.py::TestCommonCUDA::test_out_bfloat16_cuda_float32 PASSED [0.7429s] [ 17%] 2025-12-04T13:35:25.8879300Z test_ops.py::TestCommonCUDA::test_out_bitwise_or_cuda_int64 PASSED [0.0085s] [ 17%] 2025-12-04T13:35:25.8879405Z test_ops.py::TestCommonCUDA::test_out_bitwise_right_shift_cuda_int64 PASSED [0.0068s] [ 17%] 2025-12-04T13:35:25.8879502Z test_ops.py::TestCommonCUDA::test_out_broadcast_to_cuda_float32 PASSED [0.7323s] [ 18%] 2025-12-04T13:35:25.8879596Z test_ops.py::TestCommonCUDA::test_out_bucketize_cuda_float32 PASSED [0.0161s] [ 18%] 2025-12-04T13:35:25.8879685Z test_ops.py::TestCommonCUDA::test_out_cdist_cuda_float32 PASSED [0.7263s] [ 18%] 2025-12-04T13:35:25.8879773Z test_ops.py::TestCommonCUDA::test_out_ceil_cuda_float32 PASSED [0.0048s] [ 18%] 2025-12-04T13:35:25.8879861Z test_ops.py::TestCommonCUDA::test_out_chunk_cuda_float32 PASSED [0.7233s] [ 18%] 2025-12-04T13:35:25.8879994Z test_ops.py::TestCommonCUDA::test_out_combinations_cuda_float32 PASSED [0.0035s] [ 18%] 2025-12-04T13:35:25.8880092Z test_ops.py::TestCommonCUDA::test_out_conj_cuda_float32 PASSED [0.7297s] [ 18%] 2025-12-04T13:35:25.8880186Z test_ops.py::TestCommonCUDA::test_out_corrcoef_cuda_float32 PASSED [0.0037s] [ 18%] 2025-12-04T13:35:25.8880271Z test_ops.py::TestCommonCUDA::test_out_cov_cuda_float32 PASSED [0.7130s] [ 18%] 2025-12-04T13:35:25.8880383Z test_ops.py::TestCommonCUDA::test_out_cumulative_trapezoid_cuda_float32 PASSED [0.0034s] [ 18%] 2025-12-04T13:35:25.8880468Z test_ops.py::TestCommonCUDA::test_out_diff_cuda_float32 PASSED [0.8028s] [ 18%] 2025-12-04T13:35:25.8880576Z test_ops.py::TestCommonCUDA::test_out_div_no_rounding_mode_cuda_float32 PASSED [0.0120s] [ 18%] 2025-12-04T13:35:25.8880662Z test_ops.py::TestCommonCUDA::test_out_dot_cuda_float32 PASSED [0.7266s] [ 18%] 2025-12-04T13:35:25.8880754Z test_ops.py::TestCommonCUDA::test_out_double_cuda_float32 PASSED [0.0032s] [ 18%] 2025-12-04T13:35:25.8880844Z test_ops.py::TestCommonCUDA::test_out_dsplit_cuda_float32 PASSED [0.7132s] [ 18%] 2025-12-04T13:35:25.8881010Z test_ops.py::TestCommonCUDA::test_out_empty_permuted_cuda_float32 SKIPPED [0.0002s] (Expected: empty_permuted is not comparable) [ 18%] 2025-12-04T13:35:25.8881110Z test_ops.py::TestCommonCUDA::test_out_erf_cuda_float32 PASSED [0.7105s] [ 18%] 2025-12-04T13:35:25.8881198Z test_ops.py::TestCommonCUDA::test_out_exp_cuda_float32 PASSED [0.0062s] [ 18%] 2025-12-04T13:35:25.8881292Z test_ops.py::TestCommonCUDA::test_out_fft_fft_cuda_float32 PASSED [0.7277s] [ 18%] 2025-12-04T13:35:25.8881384Z test_ops.py::TestCommonCUDA::test_out_fft_hfftn_cuda_float32 PASSED [0.0133s] [ 18%] 2025-12-04T13:35:25.8881478Z test_ops.py::TestCommonCUDA::test_out_fft_ifftn_cuda_float32 PASSED [0.7188s] [ 18%] 2025-12-04T13:35:25.8881568Z test_ops.py::TestCommonCUDA::test_out_fft_irfft_cuda_float32 PASSED [0.0115s] [ 18%] 2025-12-04T13:35:25.8881664Z test_ops.py::TestCommonCUDA::test_out_fft_irfftn_cuda_float32 PASSED [0.7283s] [ 18%] 2025-12-04T13:35:25.8881755Z test_ops.py::TestCommonCUDA::test_out_fft_rfft2_cuda_float32 PASSED [0.0111s] [ 18%] 2025-12-04T13:35:25.8881851Z test_ops.py::TestCommonCUDA::test_out_fft_rfft_cuda_float32 PASSED [0.7656s] [ 18%] 2025-12-04T13:35:25.8881946Z test_ops.py::TestCommonCUDA::test_out_float_power_cuda_float32 PASSED [0.0120s] [ 18%] 2025-12-04T13:35:25.8882044Z test_ops.py::TestCommonCUDA::test_out_floor_divide_cuda_float32 PASSED [0.0100s] [ 18%] 2025-12-04T13:35:25.8882131Z test_ops.py::TestCommonCUDA::test_out_fmin_cuda_float32 PASSED [0.0091s] [ 18%] 2025-12-04T13:35:25.8882219Z test_ops.py::TestCommonCUDA::test_out_ge_cuda_float32 PASSED [0.0066s] [ 18%] 2025-12-04T13:35:25.8882304Z test_ops.py::TestCommonCUDA::test_out_geqrf_cuda_float32 XFAIL [0.0093s] [ 18%] 2025-12-04T13:35:25.8882421Z test_ops.py::TestCommonCUDA::test_out_grid_sampler_3d_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 18%] 2025-12-04T13:35:25.8882513Z test_ops.py::TestCommonCUDA::test_out_heaviside_cuda_float32 PASSED [0.7772s] [ 18%] 2025-12-04T13:35:25.8882604Z test_ops.py::TestCommonCUDA::test_out_histc_cuda_float32 XFAIL [0.0038s] [ 18%] 2025-12-04T13:35:25.8882695Z test_ops.py::TestCommonCUDA::test_out_hstack_cuda_float32 PASSED [0.7605s] [ 18%] 2025-12-04T13:35:25.8882788Z test_ops.py::TestCommonCUDA::test_out_i0_cuda_float32 PASSED [0.1643s] [ 18%] 2025-12-04T13:35:25.8882880Z test_ops.py::TestCommonCUDA::test_out_index_add_cuda_float32 PASSED [0.7721s] [ 18%] 2025-12-04T13:35:25.8882986Z test_ops.py::TestCommonCUDA::test_out_index_reduce_amax_cuda_float32 PASSED [0.0113s] [ 18%] 2025-12-04T13:35:25.8883090Z test_ops.py::TestCommonCUDA::test_out_index_reduce_amin_cuda_float32 PASSED [0.7768s] [ 18%] 2025-12-04T13:35:25.8883183Z test_ops.py::TestCommonCUDA::test_out_ldexp_cuda_float32 PASSED [0.0125s] [ 18%] 2025-12-04T13:35:25.8883272Z test_ops.py::TestCommonCUDA::test_out_le_cuda_float32 PASSED [0.0068s] [ 18%] 2025-12-04T13:35:25.8883363Z test_ops.py::TestCommonCUDA::test_out_lerp_cuda_float32 PASSED [0.7835s] [ 18%] 2025-12-04T13:35:25.8883485Z test_ops.py::TestCommonCUDA::test_out_linalg_cholesky_cuda_float32 PASSED [0.0200s] [ 18%] 2025-12-04T13:35:25.8883703Z test_ops.py::TestCommonCUDA::test_out_linalg_householder_product_cuda_float32 SKIPPED [0.0008s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 18%] 2025-12-04T13:35:25.8883820Z test_ops.py::TestCommonCUDA::test_out_linalg_lstsq_grad_oriented_cuda_float32 PASSED [0.7768s] [ 18%] 2025-12-04T13:35:25.8883925Z test_ops.py::TestCommonCUDA::test_out_linalg_solve_ex_cuda_float32 PASSED [0.0362s] [ 18%] 2025-12-04T13:35:25.8884032Z test_ops.py::TestCommonCUDA::test_out_linalg_tensorsolve_cuda_float32 PASSED [0.7543s] [ 18%] 2025-12-04T13:35:25.8884132Z test_ops.py::TestCommonCUDA::test_out_linalg_vander_cuda_float32 PASSED [0.0034s] [ 18%] 2025-12-04T13:35:25.8884229Z test_ops.py::TestCommonCUDA::test_out_linspace_cuda_float32 PASSED [0.0427s] [ 18%] 2025-12-04T13:35:25.8884321Z test_ops.py::TestCommonCUDA::test_out_log1p_cuda_float32 PASSED [0.7660s] [ 18%] 2025-12-04T13:35:25.8884415Z test_ops.py::TestCommonCUDA::test_out_log2_cuda_float32 PASSED [0.0063s] [ 18%] 2025-12-04T13:35:25.8884531Z test_ops.py::TestCommonCUDA::test_out_logaddexp2_cuda_float32 PASSED [0.7847s] [ 18%] 2025-12-04T13:35:25.8884630Z test_ops.py::TestCommonCUDA::test_out_logical_or_cuda_float32 PASSED [0.0087s] [ 18%] 2025-12-04T13:35:25.8884720Z test_ops.py::TestCommonCUDA::test_out_logit_cuda_float32 PASSED [0.0058s] [ 18%] 2025-12-04T13:35:25.8884817Z test_ops.py::TestCommonCUDA::test_out_logspace_cuda_float32 PASSED [0.2707s] [ 18%] 2025-12-04T13:35:25.8884911Z test_ops.py::TestCommonCUDA::test_out_logsumexp_cuda_float32 PASSED [0.7879s] [ 18%] 2025-12-04T13:35:25.8885001Z test_ops.py::TestCommonCUDA::test_out_mH_cuda_float32 PASSED [0.0031s] [ 18%] 2025-12-04T13:35:25.8885099Z test_ops.py::TestCommonCUDA::test_out_masked_cumsum_cuda_float32 PASSED [0.7688s] [ 18%] 2025-12-04T13:35:25.8885208Z test_ops.py::TestCommonCUDA::test_out_masked_logsumexp_cuda_float32 PASSED [0.0034s] [ 18%] 2025-12-04T13:35:25.8885304Z test_ops.py::TestCommonCUDA::test_out_masked_norm_cuda_float32 PASSED [0.7513s] [ 18%] 2025-12-04T13:35:25.8885402Z test_ops.py::TestCommonCUDA::test_out_masked_std_cuda_float32 PASSED [0.0036s] [ 18%] 2025-12-04T13:35:25.8885495Z test_ops.py::TestCommonCUDA::test_out_matrix_exp_cuda_float32 PASSED [0.7602s] [ 18%] 2025-12-04T13:35:25.8885607Z test_ops.py::TestCommonCUDA::test_out_max_reduction_no_dim_cuda_float32 PASSED [0.0055s] [ 18%] 2025-12-04T13:35:25.8885698Z test_ops.py::TestCommonCUDA::test_out_minimum_cuda_float32 PASSED [0.0095s] [ 18%] 2025-12-04T13:35:25.8885816Z test_ops.py::TestCommonCUDA::test_out_multinomial_cuda_float32 SKIPPED [0.0003s] (Skipped!) [ 18%] 2025-12-04T13:35:25.8885903Z test_ops.py::TestCommonCUDA::test_out_mv_cuda_float32 PASSED [0.7506s] [ 18%] 2025-12-04T13:35:25.8886019Z test_ops.py::TestCommonCUDA::test_out_mvlgamma_mvlgamma_p_1_cuda_float32 PASSED [0.0133s] [ 18%] 2025-12-04T13:35:25.8886114Z test_ops.py::TestCommonCUDA::test_out_narrow_copy_cuda_float32 XFAIL [0.0028s] [ 18%] 2025-12-04T13:35:25.8886210Z test_ops.py::TestCommonCUDA::test_out_new_zeros_cuda_float32 PASSED [0.7447s] [ 18%] 2025-12-04T13:35:25.8886338Z test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_avg_pool1d_cuda_float32 PASSED [0.0030s] [ 18%] 2025-12-04T13:35:25.8886468Z test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_max_pool3d_cuda_float32 PASSED [0.7552s] [ 18%] 2025-12-04T13:35:25.8886579Z test_ops.py::TestCommonCUDA::test_out_nn_functional_bilinear_cuda_float32 PASSED [0.7492s] [ 19%] 2025-12-04T13:35:25.8886708Z test_ops.py::TestCommonCUDA::test_out_nn_functional_binary_cross_entropy_cuda_float32 PASSED [0.0033s] [ 19%] 2025-12-04T13:35:25.8886818Z test_ops.py::TestCommonCUDA::test_out_nn_functional_conv1d_cuda_float32 PASSED [0.7521s] [ 19%] 2025-12-04T13:35:25.8886932Z test_ops.py::TestCommonCUDA::test_out_nn_functional_conv2d_cuda_float32 PASSED [0.0035s] [ 19%] 2025-12-04T13:35:25.8887081Z test_ops.py::TestCommonCUDA::test_out_nn_functional_cosine_similarity_cuda_float32 PASSED [0.7572s] [ 19%] 2025-12-04T13:35:25.8887220Z test_ops.py::TestCommonCUDA::test_out_nn_functional_fractional_max_pool2d_cuda_float32 PASSED [0.0035s] [ 19%] 2025-12-04T13:35:25.8887351Z test_ops.py::TestCommonCUDA::test_out_nn_functional_hinge_embedding_loss_cuda_float32 PASSED [0.7620s] [ 19%] 2025-12-04T13:35:25.8887476Z test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_bicubic_cuda_float32 PASSED [0.0033s] [ 19%] 2025-12-04T13:35:25.8887616Z test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_nearest-exact_cuda_float32 PASSED [0.7180s] [ 19%] 2025-12-04T13:35:25.8887741Z test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_nearest_cuda_float32 PASSED [0.0031s] [ 19%] 2025-12-04T13:35:25.8887874Z test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_trilinear_cuda_float32 PASSED [0.7133s] [ 19%] 2025-12-04T13:35:25.8887987Z test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool1d_cuda_float32 PASSED [0.0034s] [ 19%] 2025-12-04T13:35:25.8888159Z test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool2d_grad_cuda_float32 PASSED [0.7240s] [ 19%] 2025-12-04T13:35:25.8888316Z test_ops.py::TestCommonCUDA::test_out_nn_functional_multi_head_attention_forward_cuda_float32 PASSED [0.0044s] [ 19%] 2025-12-04T13:35:25.8888442Z test_ops.py::TestCommonCUDA::test_out_nn_functional_multi_margin_loss_cuda_float32 PASSED [0.7159s] [ 19%] 2025-12-04T13:35:25.8888553Z test_ops.py::TestCommonCUDA::test_out_nn_functional_nll_loss_cuda_float32 PASSED [0.0032s] [ 19%] 2025-12-04T13:35:25.8888671Z test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_constant_cuda_float32 PASSED [0.7208s] [ 19%] 2025-12-04T13:35:25.8888778Z test_ops.py::TestCommonCUDA::test_out_nn_functional_pdist_cuda_float32 PASSED [0.0032s] [ 19%] 2025-12-04T13:35:25.8888916Z test_ops.py::TestCommonCUDA::test_out_nn_functional_scaled_dot_product_attention_cuda_float32 PASSED [0.7171s] [ 19%] 2025-12-04T13:35:25.8889025Z test_ops.py::TestCommonCUDA::test_out_nn_functional_selu_cuda_float32 PASSED [0.0031s] [ 19%] 2025-12-04T13:35:25.8889141Z test_ops.py::TestCommonCUDA::test_out_nn_functional_softplus_cuda_float32 PASSED [0.7358s] [ 19%] 2025-12-04T13:35:25.8889285Z test_ops.py::TestCommonCUDA::test_out_nn_functional_triplet_margin_with_distance_loss_cuda_float32 PASSED [0.0034s] [ 19%] 2025-12-04T13:35:25.8889413Z test_ops.py::TestCommonCUDA::test_out_nonzero_static_cuda_float32 SKIPPED [0.0007s] (Only runs on cpu) [ 19%] 2025-12-04T13:35:25.8889511Z test_ops.py::TestCommonCUDA::test_out_norm_fro_cuda_float32 PASSED [0.7261s] [ 19%] 2025-12-04T13:35:25.8889599Z test_ops.py::TestCommonCUDA::test_out_ormqr_cuda_float32 XFAIL [0.0146s] [ 19%] 2025-12-04T13:35:25.8889699Z test_ops.py::TestCommonCUDA::test_out_permute_copy_cuda_float32 PASSED [1.4555s] [ 19%] 2025-12-04T13:35:25.8889792Z test_ops.py::TestCommonCUDA::test_out_pinverse_cuda_float32 PASSED [0.0051s] [ 19%] 2025-12-04T13:35:25.8889884Z test_ops.py::TestCommonCUDA::test_out_polar_cuda_float32 PASSED [0.0110s] [ 19%] 2025-12-04T13:35:25.8890013Z test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_1_cuda_float32 SKIPPED [0.0003s] (Skipped!) [ 19%] 2025-12-04T13:35:25.8890145Z test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_3_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 19%] 2025-12-04T13:35:25.8890238Z test_ops.py::TestCommonCUDA::test_out_rad2deg_cuda_float32 PASSED [0.7324s] [ 19%] 2025-12-04T13:35:25.8890335Z test_ops.py::TestCommonCUDA::test_out_remainder_cuda_float32 PASSED [0.0120s] [ 19%] 2025-12-04T13:35:25.8890449Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_abs_cuda_float32 PASSED [0.7262s] [ 19%] 2025-12-04T13:35:25.8890567Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acosh_cuda_float32 PASSED [0.0031s] [ 19%] 2025-12-04T13:35:25.8890681Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addr_cuda_float32 PASSED [0.7211s] [ 19%] 2025-12-04T13:35:25.8890798Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_arange_cuda_float32 PASSED [0.0031s] [ 19%] 2025-12-04T13:35:25.8890950Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_as_strided_copy_cuda_float32 PASSED [0.7027s] [ 19%] 2025-12-04T13:35:25.8891082Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asin_cuda_float32 PASSED [0.0032s] [ 19%] 2025-12-04T13:35:25.8891197Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_atan_cuda_complex64 PASSED [0.9029s] [ 19%] 2025-12-04T13:35:25.8891319Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_baddbmm_cuda_complex64 PASSED [0.0035s] [ 19%] 2025-12-04T13:35:25.8891433Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_baddbmm_cuda_float32 PASSED [0.7176s] [ 19%] 2025-12-04T13:35:25.8891557Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_bernoulli_cuda_float32 PASSED [0.0036s] [ 19%] 2025-12-04T13:35:25.8891669Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_bmm_cuda_float32 PASSED [0.7190s] [ 19%] 2025-12-04T13:35:25.8891804Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_solve_cuda_complex64 PASSED [0.0052s] [ 19%] 2025-12-04T13:35:25.8891924Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_clamp_max_cuda_float32 PASSED [0.7176s] [ 19%] 2025-12-04T13:35:25.8892061Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_column_stack_cuda_float32 PASSED [0.0033s] [ 19%] 2025-12-04T13:35:25.8892190Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_conj_physical_cuda_complex64 PASSED [0.7179s] [ 19%] 2025-12-04T13:35:25.8892302Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cos_cuda_float32 PASSED [0.0031s] [ 19%] 2025-12-04T13:35:25.8892420Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_deg2rad_cuda_float32 PASSED [0.6988s] [ 19%] 2025-12-04T13:35:25.8892535Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diag_cuda_complex64 PASSED [0.0033s] [ 19%] 2025-12-04T13:35:25.8892664Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diagonal_copy_cuda_float32 PASSED [0.7077s] [ 19%] 2025-12-04T13:35:25.8892799Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_no_rounding_mode_cuda_complex64 PASSED [0.0036s] [ 19%] 2025-12-04T13:35:25.8892933Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_trunc_rounding_cuda_float32 PASSED [0.7205s] [ 19%] 2025-12-04T13:35:25.8893045Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dot_cuda_float32 PASSED [0.0032s] [ 19%] 2025-12-04T13:35:25.8893162Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dstack_cuda_float32 PASSED [0.7124s] [ 19%] 2025-12-04T13:35:25.8893276Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_exp_cuda_complex64 PASSED [0.0031s] [ 19%] 2025-12-04T13:35:25.8893397Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_fft2_cuda_complex64 PASSED [0.7161s] [ 19%] 2025-12-04T13:35:25.8893514Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfftn_cuda_float32 PASSED [0.0038s] [ 19%] 2025-12-04T13:35:25.8893636Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifft_cuda_float32 PASSED [0.7160s] [ 19%] 2025-12-04T13:35:25.8893759Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifftn_cuda_complex64 PASSED [0.0035s] [ 19%] 2025-12-04T13:35:25.8893882Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ihfftn_cuda_float32 PASSED [0.7100s] [ 19%] 2025-12-04T13:35:25.8893997Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_rfft2_cuda_float32 PASSED [0.0036s] [ 19%] 2025-12-04T13:35:25.8894124Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_float_power_cuda_complex64 PASSED [0.7225s] [ 19%] 2025-12-04T13:35:25.8894238Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fmax_cuda_float32 PASSED [0.0032s] [ 19%] 2025-12-04T13:35:25.8894351Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fmin_cuda_float32 PASSED [0.7465s] [ 19%] 2025-12-04T13:35:25.8894476Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_add_cuda_complex64 PASSED [0.0035s] [ 19%] 2025-12-04T13:35:25.8894619Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_copy_cuda_complex64 PASSED [0.7297s] [ 19%] 2025-12-04T13:35:25.8894761Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_reduce_mean_cuda_float32 PASSED [0.0034s] [ 19%] 2025-12-04T13:35:25.8894887Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_select_cuda_complex64 PASSED [0.7542s] [ 19%] 2025-12-04T13:35:25.8895001Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lerp_cuda_complex64 PASSED [0.0032s] [ 20%] 2025-12-04T13:35:25.8895124Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cond_cuda_float32 PASSED [0.7223s] [ 20%] 2025-12-04T13:35:25.8895249Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigh_cuda_complex64 PASSED [0.0047s] [ 20%] 2025-12-04T13:35:25.8895376Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigvals_cuda_complex64 PASSED [0.7193s] [ 20%] 2025-12-04T13:35:25.8895505Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigvals_cuda_float32 PASSED [0.0048s] [ 20%] 2025-12-04T13:35:25.8895629Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lstsq_cuda_float32 PASSED [0.7365s] [ 20%] 2025-12-04T13:35:25.8895775Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_factor_ex_cuda_float32 PASSED [0.0033s] [ 20%] 2025-12-04T13:35:25.8895906Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_matrix_norm_cuda_float32 PASSED [0.7127s] [ 20%] 2025-12-04T13:35:25.8896044Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_matrix_power_cuda_complex64 PASSED [0.0034s] [ 20%] 2025-12-04T13:35:25.8896174Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_multi_dot_cuda_complex64 PASSED [0.7205s] [ 20%] 2025-12-04T13:35:25.8896329Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_norm_subgradients_at_zero_cuda_complex64 PASSED [0.0031s] [ 20%] 2025-12-04T13:35:25.8896450Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_svd_cuda_complex64 PASSED [0.0024s] [ 20%] 2025-12-04T13:35:25.8896576Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_svd_cuda_float32 PASSED [0.0022s] [ 20%] 2025-12-04T13:35:25.8896710Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorsolve_cuda_float32 PASSED [0.0026s] [ 20%] 2025-12-04T13:35:25.8896833Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_vecdot_cuda_float32 PASSED [0.7267s] [ 20%] 2025-12-04T13:35:25.8896967Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_vector_norm_cuda_complex64 PASSED [0.0031s] [ 20%] 2025-12-04T13:35:25.8897087Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logaddexp2_cuda_float32 PASSED [0.7207s] [ 20%] 2025-12-04T13:35:25.8897216Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logcumsumexp_cuda_complex64 PASSED [0.0175s] [ 20%] 2025-12-04T13:35:25.8897328Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_cuda_complex64 PASSED [0.7275s] [ 20%] 2025-12-04T13:35:25.8897448Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_solve_cuda_float32 PASSED [0.0047s] [ 20%] 2025-12-04T13:35:25.8897577Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_masked_select_cuda_complex64 PASSED [0.7153s] [ 20%] 2025-12-04T13:35:25.8897693Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mv_cuda_float32 PASSED [0.0031s] [ 20%] 2025-12-04T13:35:25.8897830Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mvlgamma_mvlgamma_p_5_cuda_float32 PASSED [0.7168s] [ 20%] 2025-12-04T13:35:25.8897951Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nanmean_cuda_complex64 PASSED [0.0034s] [ 20%] 2025-12-04T13:35:25.8898103Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nansum_cuda_float32 PASSED [0.7134s] [ 20%] 2025-12-04T13:35:25.8898246Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_avg_pool3d_cuda_float32 PASSED [0.0112s] [ 20%] 2025-12-04T13:35:25.8898417Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_normalize_cuda_complex64 PASSED [0.7249s] [ 20%] 2025-12-04T13:35:25.8898558Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_fro_cuda_complex64 PASSED [0.0032s] [ 20%] 2025-12-04T13:35:25.8898676Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_fro_cuda_float32 PASSED [0.7166s] [ 20%] 2025-12-04T13:35:25.8898793Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ones_cuda_complex64 PASSED [0.0030s] [ 20%] 2025-12-04T13:35:25.8898905Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ormqr_cuda_float32 PASSED [0.7156s] [ 20%] 2025-12-04T13:35:25.8899020Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_outer_cuda_float32 PASSED [0.0032s] [ 20%] 2025-12-04T13:35:25.8899161Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_0_cuda_float32 PASSED [0.7221s] [ 20%] 2025-12-04T13:35:25.8899313Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_3_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 20%] 2025-12-04T13:35:25.8899468Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_polygamma_polygamma_n_4_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 20%] 2025-12-04T13:35:25.8899597Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_pow_cuda_complex64 PASSED [0.7257s] [ 20%] 2025-12-04T13:35:25.8899711Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_qr_cuda_complex64 PASSED [0.0047s] [ 20%] 2025-12-04T13:35:25.8899835Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_reciprocal_cuda_complex64 PASSED [0.7028s] [ 20%] 2025-12-04T13:35:25.8899958Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_reciprocal_cuda_float32 PASSED [0.0031s] [ 20%] 2025-12-04T13:35:25.8900077Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_remainder_cuda_float32 PASSED [0.7166s] [ 20%] 2025-12-04T13:35:25.8900192Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_renorm_cuda_float32 PASSED [0.0031s] [ 20%] 2025-12-04T13:35:25.8900305Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_round_cuda_float32 PASSED [0.7312s] [ 20%] 2025-12-04T13:35:25.8900441Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_amin_cuda_float32 PASSED [0.0047s] [ 20%] 2025-12-04T13:35:25.8900555Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sin_cuda_complex64 PASSED [0.7173s] [ 20%] 2025-12-04T13:35:25.8900669Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sin_cuda_float32 PASSED [0.0031s] [ 20%] 2025-12-04T13:35:25.8900783Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sinh_cuda_complex64 PASSED [0.7310s] [ 20%] 2025-12-04T13:35:25.8900918Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_softmax_with_dtype_cuda_complex64 PASSED [0.0033s] [ 20%] 2025-12-04T13:35:25.8901042Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_erfcx_cuda_float32 PASSED [0.7057s] [ 20%] 2025-12-04T13:35:25.8901166Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_i0e_cuda_float32 PASSED [0.0031s] [ 20%] 2025-12-04T13:35:25.8901281Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_stack_cuda_float32 PASSED [0.7160s] [ 20%] 2025-12-04T13:35:25.8901397Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_std_cuda_complex64 PASSED [0.0033s] [ 20%] 2025-12-04T13:35:25.8901511Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_svd_cuda_complex64 PASSED [0.0026s] [ 20%] 2025-12-04T13:35:25.8901637Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_take_along_dim_cuda_float32 PASSED [0.7097s] [ 20%] 2025-12-04T13:35:25.8901751Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tan_cuda_complex64 PASSED [0.0032s] [ 20%] 2025-12-04T13:35:25.8901862Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tan_cuda_float32 PASSED [0.7136s] [ 20%] 2025-12-04T13:35:25.8901975Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_topk_cuda_float32 PASSED [0.0031s] [ 20%] 2025-12-04T13:35:25.8902104Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_transpose_copy_cuda_complex64 PASSED [0.7163s] [ 20%] 2025-12-04T13:35:25.8902242Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tril_cuda_complex64 PASSED [0.0032s] [ 20%] 2025-12-04T13:35:25.8902365Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_triu_cuda_float32 PASSED [0.6993s] [ 20%] 2025-12-04T13:35:25.8902491Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_true_divide_cuda_complex64 PASSED [0.0035s] [ 20%] 2025-12-04T13:35:25.8902612Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unbind_copy_cuda_float32 PASSED [0.7095s] [ 20%] 2025-12-04T13:35:25.8902739Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unfold_copy_cuda_complex64 PASSED [0.0032s] [ 20%] 2025-12-04T13:35:25.8902859Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unfold_copy_cuda_float32 PASSED [0.7078s] [ 20%] 2025-12-04T13:35:25.8902991Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unsqueeze_copy_cuda_complex64 PASSED [0.0032s] [ 20%] 2025-12-04T13:35:25.8903115Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_view_copy_cuda_complex64 PASSED [0.7147s] [ 20%] 2025-12-04T13:35:25.8903236Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vstack_cuda_complex64 PASSED [0.0032s] [ 20%] 2025-12-04T13:35:25.8903360Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vstack_cuda_float32 PASSED [0.7145s] [ 20%] 2025-12-04T13:35:25.8903476Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_where_cuda_complex64 PASSED [0.0035s] [ 20%] 2025-12-04T13:35:25.8903587Z test_ops.py::TestCommonCUDA::test_out_requires_grad_error_xlogy_cuda_float32 PASSED [0.7091s] [ 20%] 2025-12-04T13:35:25.8903683Z test_ops.py::TestCommonCUDA::test_out_reshape_cuda_float32 PASSED [0.0031s] [ 21%] 2025-12-04T13:35:25.8903784Z test_ops.py::TestCommonCUDA::test_out_resolve_conj_cuda_float32 PASSED [0.7108s] [ 21%] 2025-12-04T13:35:25.8903874Z test_ops.py::TestCommonCUDA::test_out_roll_cuda_float32 PASSED [0.0029s] [ 21%] 2025-12-04T13:35:25.8903987Z test_ops.py::TestCommonCUDA::test_out_scatter_reduce_amax_cuda_float32 PASSED [0.7374s] [ 21%] 2025-12-04T13:35:25.8904098Z test_ops.py::TestCommonCUDA::test_out_scatter_reduce_mean_cuda_float32 PASSED [0.0276s] [ 21%] 2025-12-04T13:35:25.8904190Z test_ops.py::TestCommonCUDA::test_out_sgn_cuda_float32 PASSED [0.7155s] [ 21%] 2025-12-04T13:35:25.8904301Z test_ops.py::TestCommonCUDA::test_out_signal_windows_kaiser_cuda_float32 PASSED [0.0028s] [ 21%] 2025-12-04T13:35:25.8904416Z test_ops.py::TestCommonCUDA::test_out_signal_windows_nuttall_cuda_float32 PASSED [0.0021s] [ 21%] 2025-12-04T13:35:25.8904504Z test_ops.py::TestCommonCUDA::test_out_sinc_cuda_float32 PASSED [0.8824s] [ 21%] 2025-12-04T13:35:25.8904615Z test_ops.py::TestCommonCUDA::test_out_special_bessel_j0_cuda_float32 PASSED [0.7217s] [ 21%] 2025-12-04T13:35:25.8904739Z test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_u_cuda_float32 PASSED [0.0130s] [ 21%] 2025-12-04T13:35:25.8904859Z test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_k1_cuda_float32 PASSED [0.1692s] [ 21%] 2025-12-04T13:35:25.8904960Z test_ops.py::TestCommonCUDA::test_out_special_ndtr_cuda_float32 PASSED [0.7265s] [ 21%] 2025-12-04T13:35:25.8905064Z test_ops.py::TestCommonCUDA::test_out_special_ndtri_cuda_float32 PASSED [0.1620s] [ 21%] 2025-12-04T13:35:25.8905198Z test_ops.py::TestCommonCUDA::test_out_special_shifted_chebyshev_polynomial_t_cuda_float32 PASSED [0.0093s] [ 21%] 2025-12-04T13:35:25.8905334Z test_ops.py::TestCommonCUDA::test_out_special_shifted_chebyshev_polynomial_w_cuda_float32 PASSED [0.0107s] [ 21%] 2025-12-04T13:35:25.8905423Z test_ops.py::TestCommonCUDA::test_out_split_cuda_float32 PASSED [0.0021s] [ 21%] 2025-12-04T13:35:25.8905535Z test_ops.py::TestCommonCUDA::test_out_split_with_sizes_copy_cuda_float32 PASSED [0.7372s] [ 21%] 2025-12-04T13:35:25.8905624Z test_ops.py::TestCommonCUDA::test_out_std_cuda_float32 PASSED [0.0153s] [ 21%] 2025-12-04T13:35:25.8905714Z test_ops.py::TestCommonCUDA::test_out_sub_cuda_float32 PASSED [0.7425s] [ 21%] 2025-12-04T13:35:25.8905829Z test_ops.py::TestCommonCUDA::test_out_to_sparse_cuda_float32 PASSED [0.0034s] [ 21%] 2025-12-04T13:35:25.8905933Z test_ops.py::TestCommonCUDA::test_out_topk_cuda_float32 PASSED [0.0182s] [ 21%] 2025-12-04T13:35:25.8906021Z test_ops.py::TestCommonCUDA::test_out_tril_cuda_float32 PASSED [0.7284s] [ 21%] 2025-12-04T13:35:25.8906118Z test_ops.py::TestCommonCUDA::test_out_triu_indices_cuda_int64 PASSED [0.0028s] [ 21%] 2025-12-04T13:35:25.8906206Z test_ops.py::TestCommonCUDA::test_out_unbind_cuda_float32 PASSED [0.7190s] [ 21%] 2025-12-04T13:35:25.8906297Z test_ops.py::TestCommonCUDA::test_out_unfold_cuda_float32 PASSED [0.0031s] [ 21%] 2025-12-04T13:35:25.8906394Z test_ops.py::TestCommonCUDA::test_out_unsafe_chunk_cuda_float32 PASSED [0.7227s] [ 21%] 2025-12-04T13:35:25.8906493Z test_ops.py::TestCommonCUDA::test_out_unsafe_split_cuda_float32 PASSED [0.0031s] [ 21%] 2025-12-04T13:35:25.8906582Z test_ops.py::TestCommonCUDA::test_out_var_cuda_float32 PASSED [0.7346s] [ 21%] 2025-12-04T13:35:25.8906680Z test_ops.py::TestCommonCUDA::test_out_var_unbiased_cuda_float32 PASSED [0.0033s] [ 21%] 2025-12-04T13:35:25.8906772Z test_ops.py::TestCommonCUDA::test_out_vsplit_cuda_float32 PASSED [0.7093s] [ 21%] 2025-12-04T13:35:25.8906877Z test_ops.py::TestCommonCUDA::test_out_warning___radd___cuda PASSED [0.0033s] [ 21%] 2025-12-04T13:35:25.8906976Z test_ops.py::TestCommonCUDA::test_out_warning___rmatmul___cuda PASSED [0.7145s] [ 21%] 2025-12-04T13:35:25.8907067Z test_ops.py::TestCommonCUDA::test_out_warning___rmod___cuda PASSED [0.0031s] [ 21%] 2025-12-04T13:35:25.8907160Z test_ops.py::TestCommonCUDA::test_out_warning___rmul___cuda PASSED [0.7156s] [ 21%] 2025-12-04T13:35:25.8907252Z test_ops.py::TestCommonCUDA::test_out_warning___ror___cuda PASSED [0.0032s] [ 21%] 2025-12-04T13:35:25.8907364Z test_ops.py::TestCommonCUDA::test_out_warning__batch_norm_with_update_cuda XFAIL [0.0169s] [ 21%] 2025-12-04T13:35:25.8907459Z test_ops.py::TestCommonCUDA::test_out_warning__chunk_cat_cuda PASSED [0.7476s] [ 21%] 2025-12-04T13:35:25.8907576Z test_ops.py::TestCommonCUDA::test_out_warning__native_batch_norm_legit_cuda PASSED [0.0520s] [ 21%] 2025-12-04T13:35:25.8907691Z test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_cdouble_cuda PASSED [0.7193s] [ 21%] 2025-12-04T13:35:25.8907808Z test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_chalf_cuda PASSED [0.0031s] [ 21%] 2025-12-04T13:35:25.8907901Z test_ops.py::TestCommonCUDA::test_out_warning__refs_abs_cuda PASSED [0.7327s] [ 21%] 2025-12-04T13:35:25.8908002Z test_ops.py::TestCommonCUDA::test_out_warning__refs_arange_cuda PASSED [0.0584s] [ 21%] 2025-12-04T13:35:25.8908151Z test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_copy_cuda PASSED [0.7185s] [ 21%] 2025-12-04T13:35:25.8908266Z test_ops.py::TestCommonCUDA::test_out_warning__refs_broadcast_shapes_cuda PASSED [0.0027s] [ 21%] 2025-12-04T13:35:25.8908379Z test_ops.py::TestCommonCUDA::test_out_warning__refs_broadcast_tensors_cuda PASSED [0.7376s] [ 21%] 2025-12-04T13:35:25.8908483Z test_ops.py::TestCommonCUDA::test_out_warning__refs_ceil_cuda PASSED [0.0089s] [ 21%] 2025-12-04T13:35:25.8908588Z test_ops.py::TestCommonCUDA::test_out_warning__refs_clamp_min_cuda PASSED [0.0195s] [ 21%] 2025-12-04T13:35:25.8908701Z test_ops.py::TestCommonCUDA::test_out_warning__refs_count_nonzero_cuda PASSED [0.7303s] [ 21%] 2025-12-04T13:35:25.8908801Z test_ops.py::TestCommonCUDA::test_out_warning__refs_cumprod_cuda PASSED [0.0297s] [ 21%] 2025-12-04T13:35:25.8908903Z test_ops.py::TestCommonCUDA::test_out_warning__refs_deg2rad_cuda PASSED [0.7204s] [ 21%] 2025-12-04T13:35:25.8909000Z test_ops.py::TestCommonCUDA::test_out_warning__refs_diag_cuda PASSED [0.0394s] [ 21%] 2025-12-04T13:35:25.8909103Z test_ops.py::TestCommonCUDA::test_out_warning__refs_dsplit_cuda PASSED [0.7283s] [ 21%] 2025-12-04T13:35:25.8909211Z test_ops.py::TestCommonCUDA::test_out_warning__refs_empty_strided_cuda PASSED [0.0029s] [ 21%] 2025-12-04T13:35:25.8909313Z test_ops.py::TestCommonCUDA::test_out_warning__refs_erfinv_cuda PASSED [0.7266s] [ 21%] 2025-12-04T13:35:25.8909441Z test_ops.py::TestCommonCUDA::test_out_warning__refs_exp2_cuda PASSED [0.0139s] [ 21%] 2025-12-04T13:35:25.8909556Z test_ops.py::TestCommonCUDA::test_out_warning__refs_expand_as_cuda PASSED [0.7210s] [ 21%] 2025-12-04T13:35:25.8909670Z test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ifftshift_cuda PASSED [0.0029s] [ 21%] 2025-12-04T13:35:25.8909771Z test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_irfft_cuda PASSED [0.7417s] [ 21%] 2025-12-04T13:35:25.8909872Z test_ops.py::TestCommonCUDA::test_out_warning__refs_floor_cuda PASSED [0.0085s] [ 21%] 2025-12-04T13:35:25.8909967Z test_ops.py::TestCommonCUDA::test_out_warning__refs_fmax_cuda PASSED [0.0222s] [ 21%] 2025-12-04T13:35:25.8910066Z test_ops.py::TestCommonCUDA::test_out_warning__refs_gcd_cuda PASSED [0.0168s] [ 21%] 2025-12-04T13:35:25.8910164Z test_ops.py::TestCommonCUDA::test_out_warning__refs_hstack_cuda PASSED [0.7254s] [ 21%] 2025-12-04T13:35:25.8910266Z test_ops.py::TestCommonCUDA::test_out_warning__refs_igamma_cuda PASSED [0.0256s] [ 21%] 2025-12-04T13:35:25.8910363Z test_ops.py::TestCommonCUDA::test_out_warning__refs_imag_cuda PASSED [0.0025s] [ 21%] 2025-12-04T13:35:25.8910489Z test_ops.py::TestCommonCUDA::test_out_warning__refs_index_fill_cuda PASSED [0.7160s] [ 21%] 2025-12-04T13:35:25.8910589Z test_ops.py::TestCommonCUDA::test_out_warning__refs_isfinite_cuda PASSED [0.0128s] [ 21%] 2025-12-04T13:35:25.8910691Z test_ops.py::TestCommonCUDA::test_out_warning__refs_isnan_cuda PASSED [0.7296s] [ 21%] 2025-12-04T13:35:25.8910790Z test_ops.py::TestCommonCUDA::test_out_warning__refs_isneginf_cuda PASSED [0.0081s] [ 21%] 2025-12-04T13:35:25.8910888Z test_ops.py::TestCommonCUDA::test_out_warning__refs_lcm_cuda PASSED [0.0233s] [ 22%] 2025-12-04T13:35:25.8910983Z test_ops.py::TestCommonCUDA::test_out_warning__refs_lerp_cuda PASSED [0.0303s] [ 22%] 2025-12-04T13:35:25.8911102Z test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_matrix_norm_cuda PASSED [0.1406s] [ 22%] 2025-12-04T13:35:25.8911213Z test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_svdvals_cuda PASSED [0.0605s] [ 22%] 2025-12-04T13:35:25.8911314Z test_ops.py::TestCommonCUDA::test_out_warning__refs_log2_cuda PASSED [0.7302s] [ 22%] 2025-12-04T13:35:25.8911420Z test_ops.py::TestCommonCUDA::test_out_warning__refs_masked_fill_cuda PASSED [0.0032s] [ 22%] 2025-12-04T13:35:25.8911548Z test_ops.py::TestCommonCUDA::test_out_warning__refs_meshgrid_variadic_tensors_cuda PASSED [0.7115s] [ 22%] 2025-12-04T13:35:25.8911660Z test_ops.py::TestCommonCUDA::test_out_warning__refs_native_layer_norm_cuda PASSED [0.0033s] [ 22%] 2025-12-04T13:35:25.8911759Z test_ops.py::TestCommonCUDA::test_out_warning__refs_neg_cuda PASSED [0.7257s] [ 22%] 2025-12-04T13:35:25.8911873Z test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_gelu_cuda PASSED [0.0252s] [ 22%] 2025-12-04T13:35:25.8912013Z test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_hinge_embedding_loss_cuda PASSED [0.7202s] [ 22%] 2025-12-04T13:35:25.8912136Z test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_leaky_relu_cuda PASSED [0.0291s] [ 22%] 2025-12-04T13:35:25.8912318Z test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_log_softmax_with_dtype_cuda PASSED [0.7185s] [ 22%] 2025-12-04T13:35:25.8912460Z test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_mish_cuda PASSED [0.0132s] [ 22%] 2025-12-04T13:35:25.8912584Z test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_prelu_cuda PASSED [0.7173s] [ 22%] 2025-12-04T13:35:25.8912861Z test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_selu_cuda PASSED [0.0136s] [ 22%] 2025-12-04T13:35:25.8913037Z test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softmax_with_dtype_cuda PASSED [0.7273s] [ 22%] 2025-12-04T13:35:25.8913181Z test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softmin_with_dtype_cuda PASSED [0.0032s] [ 22%] 2025-12-04T13:35:25.8913342Z test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_triplet_margin_loss_cuda PASSED [0.7260s] [ 22%] 2025-12-04T13:35:25.8913537Z test_ops.py::TestCommonCUDA::test_out_warning__refs_normal__in_place_cuda SKIPPED [0.0002s] (Expected: normal is not comparable) [ 22%] 2025-12-04T13:35:25.8917449Z test_ops.py::TestCommonCUDA::test_out_warning__refs_rot90_cuda PASSED [0.7325s] [ 22%] 2025-12-04T13:35:25.8917577Z test_ops.py::TestCommonCUDA::test_out_warning__refs_select_scatter_cuda PASSED [0.0033s] [ 22%] 2025-12-04T13:35:25.8917692Z test_ops.py::TestCommonCUDA::test_out_warning__refs_special_bessel_j0_cuda PASSED [0.7349s] [ 22%] 2025-12-04T13:35:25.8917798Z test_ops.py::TestCommonCUDA::test_out_warning__refs_special_entr_cuda PASSED [0.0136s] [ 22%] 2025-12-04T13:35:25.8917921Z test_ops.py::TestCommonCUDA::test_out_warning__refs_special_softmax_with_dtype_cuda PASSED [0.7237s] [ 22%] 2025-12-04T13:35:25.8918032Z test_ops.py::TestCommonCUDA::test_out_warning__refs_split_with_sizes_cuda PASSED [0.0031s] [ 22%] 2025-12-04T13:35:25.8918183Z test_ops.py::TestCommonCUDA::test_out_warning__refs_stft_cuda PASSED [0.7006s] [ 22%] 2025-12-04T13:35:25.8918283Z test_ops.py::TestCommonCUDA::test_out_warning__refs_sum_cuda PASSED [0.0488s] [ 22%] 2025-12-04T13:35:25.8918420Z test_ops.py::TestCommonCUDA::test_out_warning__refs_sum_to_size_cuda PASSED [0.7123s] [ 22%] 2025-12-04T13:35:25.8918530Z test_ops.py::TestCommonCUDA::test_out_warning__refs_transpose_copy_cuda PASSED [0.0253s] [ 22%] 2025-12-04T13:35:25.8918631Z test_ops.py::TestCommonCUDA::test_out_warning__refs_unfold_copy_cuda PASSED [0.7486s] [ 22%] 2025-12-04T13:35:25.8918730Z test_ops.py::TestCommonCUDA::test_out_warning__refs_unfold_cuda PASSED [0.0031s] [ 22%] 2025-12-04T13:35:25.8918829Z test_ops.py::TestCommonCUDA::test_out_warning__refs_view_copy_cuda PASSED [0.7359s] [ 22%] 2025-12-04T13:35:25.8918923Z test_ops.py::TestCommonCUDA::test_out_warning__refs_view_cuda PASSED [0.0034s] [ 22%] 2025-12-04T13:35:25.8919017Z test_ops.py::TestCommonCUDA::test_out_warning__refs_where_cuda PASSED [0.7152s] [ 22%] 2025-12-04T13:35:25.8919131Z test_ops.py::TestCommonCUDA::test_out_warning__segment_reduce_lengths_cuda PASSED [0.0036s] [ 22%] 2025-12-04T13:35:25.8919239Z test_ops.py::TestCommonCUDA::test_out_warning__unsafe_masked_index_cuda PASSED [0.7207s] [ 22%] 2025-12-04T13:35:25.8919344Z test_ops.py::TestCommonCUDA::test_out_warning_addmm_decomposed_cuda PASSED [0.0192s] [ 22%] 2025-12-04T13:35:25.8919435Z test_ops.py::TestCommonCUDA::test_out_warning_amax_cuda PASSED [0.7495s] [ 22%] 2025-12-04T13:35:25.8919525Z test_ops.py::TestCommonCUDA::test_out_warning_arange_cuda XFAIL [0.0083s] [ 22%] 2025-12-04T13:35:25.8919632Z test_ops.py::TestCommonCUDA::test_out_warning_as_strided_scatter_cuda PASSED [1.4229s] [ 22%] 2025-12-04T13:35:25.8919720Z test_ops.py::TestCommonCUDA::test_out_warning_asinh_cuda PASSED [0.0081s] [ 22%] 2025-12-04T13:35:25.8919810Z test_ops.py::TestCommonCUDA::test_out_warning_atanh_cuda PASSED [0.7184s] [ 22%] 2025-12-04T13:35:25.8919904Z test_ops.py::TestCommonCUDA::test_out_warning_block_diag_cuda PASSED [0.0031s] [ 22%] 2025-12-04T13:35:25.8919993Z test_ops.py::TestCommonCUDA::test_out_warning_bool_cuda PASSED [0.7183s] [ 22%] 2025-12-04T13:35:25.8920091Z test_ops.py::TestCommonCUDA::test_out_warning_combinations_cuda PASSED [0.0036s] [ 22%] 2025-12-04T13:35:25.8920186Z test_ops.py::TestCommonCUDA::test_out_warning_complex_cuda PASSED [0.0227s] [ 22%] 2025-12-04T13:35:25.8920284Z test_ops.py::TestCommonCUDA::test_out_warning_conj_physical_cuda PASSED [0.7277s] [ 22%] 2025-12-04T13:35:25.8920387Z test_ops.py::TestCommonCUDA::test_out_warning_constant_pad_nd_cuda PASSED [0.0032s] [ 22%] 2025-12-04T13:35:25.8920481Z test_ops.py::TestCommonCUDA::test_out_warning_copysign_cuda PASSED [0.7641s] [ 22%] 2025-12-04T13:35:25.8920575Z test_ops.py::TestCommonCUDA::test_out_warning_cumprod_cuda PASSED [0.0247s] [ 22%] 2025-12-04T13:35:25.8920669Z test_ops.py::TestCommonCUDA::test_out_warning_diag_embed_cuda PASSED [0.7307s] [ 22%] 2025-12-04T13:35:25.8920769Z test_ops.py::TestCommonCUDA::test_out_warning_diagonal_copy_cuda PASSED [0.0355s] [ 22%] 2025-12-04T13:35:25.8920916Z test_ops.py::TestCommonCUDA::test_out_warning_div_no_rounding_mode_cuda PASSED [0.0158s] [ 22%] 2025-12-04T13:35:25.8921023Z test_ops.py::TestCommonCUDA::test_out_warning_erfinv_cuda PASSED [0.7292s] [ 22%] 2025-12-04T13:35:25.8921111Z test_ops.py::TestCommonCUDA::test_out_warning_exp_cuda PASSED [0.0109s] [ 22%] 2025-12-04T13:35:25.8921204Z test_ops.py::TestCommonCUDA::test_out_warning_expand_as_cuda PASSED [0.7268s] [ 22%] 2025-12-04T13:35:25.8921290Z test_ops.py::TestCommonCUDA::test_out_warning_eye_cuda XFAIL [0.0104s] [ 22%] 2025-12-04T13:35:25.8921382Z test_ops.py::TestCommonCUDA::test_out_warning_fft_ihfft2_cuda XFAIL [0.7347s] [ 22%] 2025-12-04T13:35:25.8921471Z test_ops.py::TestCommonCUDA::test_out_warning_fliplr_cuda PASSED [0.7279s] [ 22%] 2025-12-04T13:35:25.8921560Z test_ops.py::TestCommonCUDA::test_out_warning_floor_cuda PASSED [0.0086s] [ 22%] 2025-12-04T13:35:25.8921656Z test_ops.py::TestCommonCUDA::test_out_warning_floor_divide_cuda PASSED [0.0209s] [ 22%] 2025-12-04T13:35:25.8921747Z test_ops.py::TestCommonCUDA::test_out_warning_fmax_cuda PASSED [0.0152s] [ 22%] 2025-12-04T13:35:25.8921834Z test_ops.py::TestCommonCUDA::test_out_warning_fmod_cuda PASSED [0.0155s] [ 22%] 2025-12-04T13:35:25.8921937Z test_ops.py::TestCommonCUDA::test_out_warning_frac_cuda PASSED [0.7374s] [ 22%] 2025-12-04T13:35:25.8922025Z test_ops.py::TestCommonCUDA::test_out_warning_frexp_cuda PASSED [0.0128s] [ 22%] 2025-12-04T13:35:25.8922111Z test_ops.py::TestCommonCUDA::test_out_warning_full_cuda XFAIL [0.0056s] [ 22%] 2025-12-04T13:35:25.8922197Z test_ops.py::TestCommonCUDA::test_out_warning_gcd_cuda PASSED [0.0153s] [ 22%] 2025-12-04T13:35:25.8922286Z test_ops.py::TestCommonCUDA::test_out_warning_geqrf_cuda PASSED [0.7976s] [ 22%] 2025-12-04T13:35:25.8922380Z test_ops.py::TestCommonCUDA::test_out_warning_gradient_cuda PASSED [0.0035s] [ 23%] 2025-12-04T13:35:25.8922481Z test_ops.py::TestCommonCUDA::test_out_warning_grid_sampler_2d_cuda PASSED [0.7234s] [ 23%] 2025-12-04T13:35:25.8922580Z test_ops.py::TestCommonCUDA::test_out_warning_heaviside_cuda PASSED [0.0251s] [ 23%] 2025-12-04T13:35:25.8922753Z test_ops.py::TestCommonCUDA::test_out_warning_histogramdd_cuda SKIPPED [0.0011s] (Skipped! Op has not supported dtypes on this device.) [ 23%] 2025-12-04T13:35:25.8922844Z test_ops.py::TestCommonCUDA::test_out_warning_hypot_cuda PASSED [0.0153s] [ 23%] 2025-12-04T13:35:25.8922935Z test_ops.py::TestCommonCUDA::test_out_warning_isfinite_cuda PASSED [0.7447s] [ 23%] 2025-12-04T13:35:25.8923026Z test_ops.py::TestCommonCUDA::test_out_warning_isreal_cuda PASSED [0.0032s] [ 23%] 2025-12-04T13:35:25.8923126Z test_ops.py::TestCommonCUDA::test_out_warning_jiterator_unary_cuda PASSED [0.7145s] [ 23%] 2025-12-04T13:35:25.8923221Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_eig_cuda PASSED [0.1451s] [ 23%] 2025-12-04T13:35:25.8923318Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_eigvals_cuda PASSED [0.8071s] [ 23%] 2025-12-04T13:35:25.8923418Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_eigvalsh_cuda PASSED [0.0271s] [ 23%] 2025-12-04T13:35:25.8923516Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_inv_cuda PASSED [0.0114s] [ 23%] 2025-12-04T13:35:25.8923615Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_inv_ex_cuda PASSED [0.7870s] [ 23%] 2025-12-04T13:35:25.8923720Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_rank_cuda PASSED [0.1570s] [ 23%] 2025-12-04T13:35:25.8923825Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_multi_dot_cuda PASSED [0.7738s] [ 23%] 2025-12-04T13:35:25.8923950Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_norm_subgradients_at_zero_cuda PASSED [0.1612s] [ 23%] 2025-12-04T13:35:25.8924048Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_pinv_cuda PASSED [0.8049s] [ 23%] 2025-12-04T13:35:25.8924147Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_solve_ex_cuda PASSED [0.0579s] [ 23%] 2025-12-04T13:35:25.8924243Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_svd_cuda PASSED [0.3627s] [ 23%] 2025-12-04T13:35:25.8924369Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_tensorinv_cuda PASSED [0.7546s] [ 23%] 2025-12-04T13:35:25.8924479Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_vander_cuda PASSED [0.0033s] [ 23%] 2025-12-04T13:35:25.8924577Z test_ops.py::TestCommonCUDA::test_out_warning_linalg_vecdot_cuda PASSED [0.8568s] [ 23%] 2025-12-04T13:35:25.8924691Z test_ops.py::TestCommonCUDA::test_out_warning_linspace_tensor_overload_cuda XFAIL [0.0089s] [ 23%] 2025-12-04T13:35:25.8924779Z test_ops.py::TestCommonCUDA::test_out_warning_log2_cuda PASSED [1.5503s] [ 23%] 2025-12-04T13:35:25.8924866Z test_ops.py::TestCommonCUDA::test_out_warning_log_cuda PASSED [0.0128s] [ 23%] 2025-12-04T13:35:25.8924960Z test_ops.py::TestCommonCUDA::test_out_warning_logaddexp2_cuda PASSED [0.7715s] [ 23%] 2025-12-04T13:35:25.8925055Z test_ops.py::TestCommonCUDA::test_out_warning_logaddexp_cuda PASSED [0.0269s] [ 23%] 2025-12-04T13:35:25.8925146Z test_ops.py::TestCommonCUDA::test_out_warning_logdet_cuda PASSED [0.7838s] [ 23%] 2025-12-04T13:35:25.8925242Z test_ops.py::TestCommonCUDA::test_out_warning_lu_unpack_cuda PASSED [0.0795s] [ 23%] 2025-12-04T13:35:25.8925346Z test_ops.py::TestCommonCUDA::test_out_warning_masked_logsumexp_cuda PASSED [0.0024s] [ 23%] 2025-12-04T13:35:25.8925454Z test_ops.py::TestCommonCUDA::test_out_warning_masked_mean_cuda PASSED [0.7768s] [ 23%] 2025-12-04T13:35:25.8925549Z test_ops.py::TestCommonCUDA::test_out_warning_masked_norm_cuda PASSED [0.0032s] [ 23%] 2025-12-04T13:35:25.8925673Z test_ops.py::TestCommonCUDA::test_out_warning_max_pool2d_with_indices_backward_cuda XFAIL [0.0033s] [ 23%] 2025-12-04T13:35:25.8925784Z test_ops.py::TestCommonCUDA::test_out_warning_max_reduction_with_dim_cuda PASSED [1.5360s] [ 23%] 2025-12-04T13:35:25.8925875Z test_ops.py::TestCommonCUDA::test_out_warning_maximum_cuda PASSED [0.0268s] [ 23%] 2025-12-04T13:35:25.8925965Z test_ops.py::TestCommonCUDA::test_out_warning_median_cuda PASSED [0.7602s] [ 23%] 2025-12-04T13:35:25.8926079Z test_ops.py::TestCommonCUDA::test_out_warning_meshgrid_list_of_tensors_cuda PASSED [0.0032s] [ 23%] 2025-12-04T13:35:25.8926191Z test_ops.py::TestCommonCUDA::test_out_warning_min_reduction_with_dim_cuda PASSED [0.7843s] [ 23%] 2025-12-04T13:35:25.8926284Z test_ops.py::TestCommonCUDA::test_out_warning_minimum_cuda PASSED [0.0275s] [ 23%] 2025-12-04T13:35:25.8926373Z test_ops.py::TestCommonCUDA::test_out_warning_msort_cuda PASSED [0.7837s] [ 23%] 2025-12-04T13:35:25.8926477Z test_ops.py::TestCommonCUDA::test_out_warning_native_batch_norm_cuda PASSED [0.0312s] [ 23%] 2025-12-04T13:35:25.8926605Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_avg_pool2d_cuda PASSED [0.7677s] [ 23%] 2025-12-04T13:35:25.8926730Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_max_pool3d_cuda PASSED [0.0032s] [ 23%] 2025-12-04T13:35:25.8926844Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_avg_pool1d_cuda PASSED [0.7602s] [ 23%] 2025-12-04T13:35:25.8926988Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_binary_cross_entropy_with_logits_cuda PASSED [0.0036s] [ 23%] 2025-12-04T13:35:25.8927098Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_celu_cuda PASSED [0.7709s] [ 23%] 2025-12-04T13:35:25.8927208Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv3d_cuda PASSED [0.0038s] [ 23%] 2025-12-04T13:35:25.8927336Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_cosine_embedding_loss_cuda PASSED [0.7817s] [ 23%] 2025-12-04T13:35:25.8927447Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_dropout3d_cuda PASSED [0.0035s] [ 23%] 2025-12-04T13:35:25.8927576Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_fractional_max_pool2d_cuda PASSED [0.7643s] [ 23%] 2025-12-04T13:35:25.8927689Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hardsigmoid_cuda PASSED [0.0128s] [ 23%] 2025-12-04T13:35:25.8927814Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_bicubic_cuda PASSED [0.7735s] [ 23%] 2025-12-04T13:35:25.8927966Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_trilinear_cuda PASSED [0.0032s] [ 23%] 2025-12-04T13:35:25.8928150Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_margin_ranking_loss_cuda PASSED [0.7585s] [ 23%] 2025-12-04T13:35:25.8928263Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_pool2d_cuda PASSED [0.0032s] [ 23%] 2025-12-04T13:35:25.8928385Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool2d_grad_cuda PASSED [0.7888s] [ 23%] 2025-12-04T13:35:25.8928504Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_replicate_cuda PASSED [0.0032s] [ 23%] 2025-12-04T13:35:25.8928632Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_replicate_negative_cuda PASSED [0.7819s] [ 23%] 2025-12-04T13:35:25.8928756Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_poisson_nll_loss_cuda PASSED [0.0034s] [ 23%] 2025-12-04T13:35:25.8928864Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_prelu_cuda PASSED [0.7667s] [ 23%] 2025-12-04T13:35:25.8928974Z test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_relu6_cuda PASSED [0.0032s] [ 23%] 2025-12-04T13:35:25.8929086Z test_ops.py::TestCommonCUDA::test_out_warning_normal_cuda XFAIL [0.0180s] [ 23%] 2025-12-04T13:35:25.8929175Z test_ops.py::TestCommonCUDA::test_out_warning_ormqr_cuda PASSED [0.1510s] [ 23%] 2025-12-04T13:35:25.8929270Z test_ops.py::TestCommonCUDA::test_out_warning_pca_lowrank_cuda PASSED [0.7744s] [ 23%] 2025-12-04T13:35:25.8929360Z test_ops.py::TestCommonCUDA::test_out_warning_polar_cuda PASSED [0.0270s] [ 23%] 2025-12-04T13:35:25.8929491Z test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_1_cuda SKIPPED [0.0002s] (Skipped!) [ 23%] 2025-12-04T13:35:25.8929585Z test_ops.py::TestCommonCUDA::test_out_warning_ravel_cuda PASSED [0.7833s] [ 23%] 2025-12-04T13:35:25.8929690Z test_ops.py::TestCommonCUDA::test_out_warning_repeat_interleave_cuda PASSED [0.0034s] [ 23%] 2025-12-04T13:35:25.8929791Z test_ops.py::TestCommonCUDA::test_out_warning_resize_as__cuda PASSED [0.7655s] [ 23%] 2025-12-04T13:35:25.8929890Z test_ops.py::TestCommonCUDA::test_out_warning_scalar_tensor_cuda PASSED [0.0031s] [ 24%] 2025-12-04T13:35:25.8929989Z test_ops.py::TestCommonCUDA::test_out_warning_scatter_add_cuda PASSED [0.7953s] [ 24%] 2025-12-04T13:35:25.8930094Z test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_amin_cuda PASSED [0.0474s] [ 24%] 2025-12-04T13:35:25.8930201Z test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_prod_cuda PASSED [0.8063s] [ 24%] 2025-12-04T13:35:25.8930298Z test_ops.py::TestCommonCUDA::test_out_warning_searchsorted_cuda PASSED [0.3689s] [ 24%] 2025-12-04T13:35:25.8930412Z test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_bartlett_cuda PASSED [0.0021s] [ 24%] 2025-12-04T13:35:25.8930531Z test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_exponential_cuda PASSED [0.0020s] [ 24%] 2025-12-04T13:35:25.8930637Z test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_hann_cuda PASSED [0.0020s] [ 24%] 2025-12-04T13:35:25.8930736Z test_ops.py::TestCommonCUDA::test_out_warning_slice_scatter_cuda PASSED [0.7948s] [ 24%] 2025-12-04T13:35:25.8930839Z test_ops.py::TestCommonCUDA::test_out_warning_special_bessel_y0_cuda PASSED [0.0135s] [ 24%] 2025-12-04T13:35:25.8930946Z test_ops.py::TestCommonCUDA::test_out_warning_special_bessel_y1_cuda PASSED [0.8000s] [ 24%] 2025-12-04T13:35:25.8931069Z test_ops.py::TestCommonCUDA::test_out_warning_special_chebyshev_polynomial_u_cuda PASSED [0.0264s] [ 24%] 2025-12-04T13:35:25.8931189Z test_ops.py::TestCommonCUDA::test_out_warning_special_hermite_polynomial_h_cuda PASSED [0.0171s] [ 24%] 2025-12-04T13:35:25.8931291Z test_ops.py::TestCommonCUDA::test_out_warning_special_log_ndtr_cuda PASSED [0.7962s] [ 24%] 2025-12-04T13:35:25.8931389Z test_ops.py::TestCommonCUDA::test_out_warning_special_ndtr_cuda PASSED [0.0137s] [ 24%] 2025-12-04T13:35:25.8931515Z test_ops.py::TestCommonCUDA::test_out_warning_special_scaled_modified_bessel_k0_cuda PASSED [0.8093s] [ 24%] 2025-12-04T13:35:25.8931670Z test_ops.py::TestCommonCUDA::test_out_warning_special_scaled_modified_bessel_k1_cuda PASSED [0.0155s] [ 24%] 2025-12-04T13:35:25.8931817Z test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_t_cuda PASSED [0.0184s] [ 24%] 2025-12-04T13:35:25.8931953Z test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_w_cuda PASSED [0.0150s] [ 24%] 2025-12-04T13:35:25.8932044Z test_ops.py::TestCommonCUDA::test_out_warning_split_cuda PASSED [0.0021s] [ 24%] 2025-12-04T13:35:25.8932156Z test_ops.py::TestCommonCUDA::test_out_warning_split_with_sizes_copy_cuda PASSED [0.8078s] [ 24%] 2025-12-04T13:35:25.8932247Z test_ops.py::TestCommonCUDA::test_out_warning_squeeze_cuda PASSED [0.0033s] [ 24%] 2025-12-04T13:35:25.8932341Z test_ops.py::TestCommonCUDA::test_out_warning_std_mean_cuda PASSED [0.7781s] [ 24%] 2025-12-04T13:35:25.8932430Z test_ops.py::TestCommonCUDA::test_out_warning_sub_cuda PASSED [0.0284s] [ 24%] 2025-12-04T13:35:25.8932519Z test_ops.py::TestCommonCUDA::test_out_warning_sum_cuda PASSED [0.8032s] [ 24%] 2025-12-04T13:35:25.8932607Z test_ops.py::TestCommonCUDA::test_out_warning_t_cuda PASSED [0.0032s] [ 24%] 2025-12-04T13:35:25.8932710Z test_ops.py::TestCommonCUDA::test_out_warning_take_cuda PASSED [0.8192s] [ 24%] 2025-12-04T13:35:25.8932795Z test_ops.py::TestCommonCUDA::test_out_warning_tan_cuda PASSED [0.0089s] [ 24%] 2025-12-04T13:35:25.8932882Z test_ops.py::TestCommonCUDA::test_out_warning_to_cuda PASSED [0.8116s] [ 24%] 2025-12-04T13:35:25.8932968Z test_ops.py::TestCommonCUDA::test_out_warning_topk_cuda PASSED [0.0353s] [ 24%] 2025-12-04T13:35:25.8933069Z test_ops.py::TestCommonCUDA::test_out_warning_transpose_copy_cuda PASSED [0.8143s] [ 24%] 2025-12-04T13:35:25.8933159Z test_ops.py::TestCommonCUDA::test_out_warning_trapz_cuda PASSED [0.0035s] [ 24%] 2025-12-04T13:35:25.8933256Z test_ops.py::TestCommonCUDA::test_out_warning_unbind_copy_cuda PASSED [0.8112s] [ 24%] 2025-12-04T13:35:25.8933343Z test_ops.py::TestCommonCUDA::test_out_warning_vdot_cuda PASSED [0.0091s] [ 24%] 2025-12-04T13:35:25.8933435Z test_ops.py::TestCommonCUDA::test_out_warning_vstack_cuda PASSED [0.7940s] [ 24%] 2025-12-04T13:35:25.8933549Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_int32 PASSED [0.0052s] [ 24%] 2025-12-04T13:35:25.8933659Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_uint8 PASSED [0.0040s] [ 24%] 2025-12-04T13:35:25.8933770Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int32 PASSED [0.7917s] [ 24%] 2025-12-04T13:35:25.8933878Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_int16 PASSED [0.0032s] [ 24%] 2025-12-04T13:35:25.8933989Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_int32 PASSED [0.7873s] [ 24%] 2025-12-04T13:35:25.8934094Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_uint8 PASSED [0.0033s] [ 24%] 2025-12-04T13:35:25.8934207Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_bool PASSED [0.0038s] [ 24%] 2025-12-04T13:35:25.8934319Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_int32 PASSED [0.0036s] [ 24%] 2025-12-04T13:35:25.8934435Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_uint8 PASSED [0.0035s] [ 24%] 2025-12-04T13:35:25.8934540Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int64 PASSED [0.8010s] [ 24%] 2025-12-04T13:35:25.8934648Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int8 PASSED [0.0035s] [ 24%] 2025-12-04T13:35:25.8934756Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_uint8 PASSED [0.7993s] [ 24%] 2025-12-04T13:35:25.8934863Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_uint8 PASSED [0.0035s] [ 24%] 2025-12-04T13:35:25.8934973Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_int64 PASSED [0.8122s] [ 24%] 2025-12-04T13:35:25.8935079Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_bool PASSED [0.0032s] [ 24%] 2025-12-04T13:35:25.8935212Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_int16 PASSED [0.7992s] [ 24%] 2025-12-04T13:35:25.8935329Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_bool PASSED [0.0035s] [ 24%] 2025-12-04T13:35:25.8935434Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_int16 PASSED [0.7825s] [ 24%] 2025-12-04T13:35:25.8935545Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_int32 PASSED [0.0032s] [ 24%] 2025-12-04T13:35:25.8935651Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_int8 PASSED [0.7798s] [ 24%] 2025-12-04T13:35:25.8935757Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int64 PASSED [0.0049s] [ 24%] 2025-12-04T13:35:25.8935870Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_bool PASSED [0.8027s] [ 24%] 2025-12-04T13:35:25.8935980Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_int32 PASSED [0.0034s] [ 24%] 2025-12-04T13:35:25.8936088Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_bool PASSED [0.8044s] [ 24%] 2025-12-04T13:35:25.8936196Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_int64 PASSED [0.0035s] [ 24%] 2025-12-04T13:35:25.8936316Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int64 PASSED [0.7947s] [ 24%] 2025-12-04T13:35:25.8936422Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int64 PASSED [0.0036s] [ 24%] 2025-12-04T13:35:25.8936530Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int8 PASSED [0.8153s] [ 24%] 2025-12-04T13:35:25.8936637Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_int16 PASSED [0.0037s] [ 24%] 2025-12-04T13:35:25.8936767Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_int32 PASSED [0.8275s] [ 24%] 2025-12-04T13:35:25.8936893Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_int8 PASSED [0.0048s] [ 24%] 2025-12-04T13:35:25.8937023Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_int64 PASSED [0.8059s] [ 24%] 2025-12-04T13:35:25.8937149Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_int8 PASSED [0.0048s] [ 24%] 2025-12-04T13:35:25.8937284Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int16 PASSED [0.8071s] [ 24%] 2025-12-04T13:35:25.8937415Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int32 PASSED [0.0044s] [ 25%] 2025-12-04T13:35:25.8937563Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_bool SKIPPED [0.0002s] (Skipped!) [ 25%] 2025-12-04T13:35:25.8937709Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_bool SKIPPED [0.0001s] (Skipped!) [ 25%] 2025-12-04T13:35:25.8937859Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_int32 SKIPPED [0.0001s] (Skipped!) [ 25%] 2025-12-04T13:35:25.8938006Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_bool SKIPPED [0.0001s] (Skipped!) [ 25%] 2025-12-04T13:35:25.8938195Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_int64 SKIPPED [0.0001s] (Skipped!) [ 25%] 2025-12-04T13:35:25.8938342Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_int8 SKIPPED [0.0002s] (Skipped!) [ 25%] 2025-12-04T13:35:25.8938452Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_int8 PASSED [0.8219s] [ 25%] 2025-12-04T13:35:25.8938564Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int64 PASSED [0.0037s] [ 25%] 2025-12-04T13:35:25.8938675Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int8 PASSED [0.8198s] [ 25%] 2025-12-04T13:35:25.8938782Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int16 PASSED [0.0036s] [ 25%] 2025-12-04T13:35:25.8938888Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int8 PASSED [0.8048s] [ 25%] 2025-12-04T13:35:25.8939027Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_uint8 PASSED [0.0033s] [ 25%] 2025-12-04T13:35:25.8939182Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_bool PASSED [0.0064s] [ 25%] 2025-12-04T13:35:25.8939323Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_int64 PASSED [0.0055s] [ 25%] 2025-12-04T13:35:25.8939462Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_bool PASSED [0.0051s] [ 25%] 2025-12-04T13:35:25.8939605Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int16 PASSED [0.0033s] [ 25%] 2025-12-04T13:35:25.8939744Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int64 PASSED [0.0034s] [ 25%] 2025-12-04T13:35:25.8939887Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int8 PASSED [0.0032s] [ 25%] 2025-12-04T13:35:25.8940027Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_bool PASSED [0.0051s] [ 25%] 2025-12-04T13:35:25.8940182Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_h_cuda_uint8 PASSED [0.0049s] [ 25%] 2025-12-04T13:35:25.8940319Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_int8 PASSED [0.0034s] [ 25%] 2025-12-04T13:35:25.8940458Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_uint8 PASSED [0.0048s] [ 25%] 2025-12-04T13:35:25.8940610Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_int16 PASSED [0.0032s] [ 25%] 2025-12-04T13:35:25.8940761Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int16 PASSED [0.0050s] [ 25%] 2025-12-04T13:35:25.8940913Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int8 PASSED [0.0032s] [ 25%] 2025-12-04T13:35:25.8941035Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int64 PASSED [0.0034s] [ 25%] 2025-12-04T13:35:25.8941160Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int8 PASSED [0.0032s] [ 25%] 2025-12-04T13:35:25.8941279Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_uint8 PASSED [0.0034s] [ 25%] 2025-12-04T13:35:25.8941396Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_bool PASSED [0.0032s] [ 25%] 2025-12-04T13:35:25.8941503Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int16 PASSED [0.8064s] [ 25%] 2025-12-04T13:35:25.8941610Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int16 PASSED [0.0035s] [ 25%] 2025-12-04T13:35:25.8941718Z test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_int64 PASSED [0.0041s] [ 25%] 2025-12-04T13:35:25.8941826Z test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_complex32 PASSED [0.8340s] [ 25%] 2025-12-04T13:35:25.8941932Z test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_complex64 PASSED [0.0051s] [ 25%] 2025-12-04T13:35:25.8942033Z test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_float64 PASSED [0.8341s] [ 25%] 2025-12-04T13:35:25.8942129Z test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_int16 PASSED [0.0045s] [ 25%] 2025-12-04T13:35:25.8942262Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_complex128 PASSED [0.0350s] [ 25%] 2025-12-04T13:35:25.8942387Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_float16 PASSED [0.0205s] [ 25%] 2025-12-04T13:35:25.8942511Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_uint8 PASSED [0.0177s] [ 25%] 2025-12-04T13:35:25.8942624Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_bool PASSED [0.0129s] [ 25%] 2025-12-04T13:35:25.8942770Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_complex128 PASSED [0.0290s] [ 25%] 2025-12-04T13:35:25.8942903Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_complex64 PASSED [0.0291s] [ 25%] 2025-12-04T13:35:25.8943017Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_int16 PASSED [0.0152s] [ 25%] 2025-12-04T13:35:25.8943137Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int16 PASSED [0.0202s] [ 25%] 2025-12-04T13:35:25.8943255Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int8 PASSED [0.8421s] [ 25%] 2025-12-04T13:35:25.8943379Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_bfloat16 PASSED [0.0253s] [ 25%] 2025-12-04T13:35:25.8943498Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int16 PASSED [0.0200s] [ 25%] 2025-12-04T13:35:25.8943620Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_bfloat16 PASSED [0.0218s] [ 25%] 2025-12-04T13:35:25.8943745Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_complex64 PASSED [0.8434s] [ 25%] 2025-12-04T13:35:25.8943875Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_float64 PASSED [0.0249s] [ 25%] 2025-12-04T13:35:25.8943993Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_int64 PASSED [0.0201s] [ 25%] 2025-12-04T13:35:25.8944109Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_uint8 PASSED [0.0183s] [ 25%] 2025-12-04T13:35:25.8944225Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_bool PASSED [0.8222s] [ 25%] 2025-12-04T13:35:25.8944340Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_int32 PASSED [0.0181s] [ 25%] 2025-12-04T13:35:25.8944456Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_uint8 PASSED [0.0151s] [ 25%] 2025-12-04T13:35:25.8944585Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_complex64 PASSED [0.0337s] [ 25%] 2025-12-04T13:35:25.8944701Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_bool PASSED [0.0219s] [ 25%] 2025-12-04T13:35:25.8944824Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_complex32 PASSED [0.0330s] [ 25%] 2025-12-04T13:35:25.8944938Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_uint8 PASSED [0.0176s] [ 25%] 2025-12-04T13:35:25.8945057Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_bfloat16 PASSED [0.0218s] [ 25%] 2025-12-04T13:35:25.8945180Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_complex128 PASSED [0.0335s] [ 25%] 2025-12-04T13:35:25.8945298Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float64 PASSED [0.8287s] [ 25%] 2025-12-04T13:35:25.8945411Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int16 PASSED [0.0226s] [ 25%] 2025-12-04T13:35:25.8945529Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_uint8 PASSED [0.0182s] [ 25%] 2025-12-04T13:35:25.8945652Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_complex128 PASSED [0.0295s] [ 25%] 2025-12-04T13:35:25.8945772Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_complex64 PASSED [0.0289s] [ 25%] 2025-12-04T13:35:25.8945883Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int8 PASSED [0.0145s] [ 25%] 2025-12-04T13:35:25.8945997Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_bool PASSED [0.8469s] [ 25%] 2025-12-04T13:35:25.8946111Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_int32 PASSED [0.0179s] [ 26%] 2025-12-04T13:35:25.8946226Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_uint8 PASSED [0.0150s] [ 26%] 2025-12-04T13:35:25.8946362Z test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int8 PASSED [0.0148s] [ 26%] 2025-12-04T13:35:25.8946473Z test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int8 PASSED [0.0111s] [ 26%] 2025-12-04T13:35:25.8946571Z test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_int8 PASSED [0.0502s] [ 26%] 2025-12-04T13:35:25.8946670Z test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_uint8 PASSED [0.0509s] [ 26%] 2025-12-04T13:35:25.8946782Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_complex128 PASSED [1.1578s] [ 26%] 2025-12-04T13:35:25.8946889Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_float32 PASSED [0.0553s] [ 26%] 2025-12-04T13:35:25.8946996Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_float16 PASSED [0.3385s] [ 26%] 2025-12-04T13:35:25.8947100Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_int16 PASSED [0.9033s] [ 26%] 2025-12-04T13:35:25.8947204Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_int32 PASSED [0.0560s] [ 26%] 2025-12-04T13:35:25.8947306Z test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_float32 XFAIL [0.0068s] [ 26%] 2025-12-04T13:35:25.8947431Z test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_bool PASSED [1.7034s] [ 26%] 2025-12-04T13:35:25.8947543Z test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_complex64 PASSED [0.0054s] [ 26%] 2025-12-04T13:35:25.8947654Z test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_float16 PASSED [0.8606s] [ 26%] 2025-12-04T13:35:25.8947762Z test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_float32 PASSED [0.0052s] [ 26%] 2025-12-04T13:35:25.8947870Z test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_int32 PASSED [0.8572s] [ 26%] 2025-12-04T13:35:25.8947967Z test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_bool PASSED [0.0190s] [ 26%] 2025-12-04T13:35:25.8948108Z test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_complex128 PASSED [0.0168s] [ 26%] 2025-12-04T13:35:25.8948213Z test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_complex64 PASSED [0.0163s] [ 26%] 2025-12-04T13:35:25.8948315Z test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int32 PASSED [0.0162s] [ 26%] 2025-12-04T13:35:25.8948412Z test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int8 PASSED [0.0162s] [ 26%] 2025-12-04T13:35:25.8948520Z test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_float32 PASSED [0.0387s] [ 26%] 2025-12-04T13:35:25.8948627Z test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_float64 PASSED [0.0379s] [ 26%] 2025-12-04T13:35:25.8948727Z test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_bool PASSED [0.8625s] [ 26%] 2025-12-04T13:35:25.8948830Z test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_bfloat16 PASSED [0.0168s] [ 26%] 2025-12-04T13:35:25.8948934Z test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_float32 PASSED [0.0105s] [ 26%] 2025-12-04T13:35:25.8949032Z test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_int8 PASSED [0.0143s] [ 26%] 2025-12-04T13:35:25.8949145Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_uint8 XFAIL [0.0028s] [ 26%] 2025-12-04T13:35:25.8949258Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_bfloat16 PASSED [0.8171s] [ 26%] 2025-12-04T13:35:25.8949370Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_complex64 PASSED [0.0048s] [ 26%] 2025-12-04T13:35:25.8949480Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float32 PASSED [0.8145s] [ 26%] 2025-12-04T13:35:25.8949587Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float64 PASSED [0.0063s] [ 26%] 2025-12-04T13:35:25.8949693Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_int32 PASSED [0.0040s] [ 26%] 2025-12-04T13:35:25.8949818Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_bool PASSED [0.7837s] [ 26%] 2025-12-04T13:35:25.8949983Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_complex128 PASSED [0.0058s] [ 26%] 2025-12-04T13:35:25.8950124Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_float16 PASSED [0.7881s] [ 26%] 2025-12-04T13:35:25.8950247Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_int8 PASSED [0.0048s] [ 26%] 2025-12-04T13:35:25.8950368Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_complex128 PASSED [0.8057s] [ 26%] 2025-12-04T13:35:25.8950490Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_complex32 PASSED [0.0075s] [ 26%] 2025-12-04T13:35:25.8950610Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_complex64 PASSED [0.8145s] [ 26%] 2025-12-04T13:35:25.8950725Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_int8 PASSED [0.0061s] [ 26%] 2025-12-04T13:35:25.8950840Z test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_uint8 PASSED [0.8130s] [ 26%] 2025-12-04T13:35:25.8950946Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_bfloat16 PASSED [0.0256s] [ 26%] 2025-12-04T13:35:25.8951068Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_complex32 PASSED [0.0360s] [ 26%] 2025-12-04T13:35:25.8951170Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_float64 PASSED [0.8179s] [ 26%] 2025-12-04T13:35:25.8951270Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int16 PASSED [0.0210s] [ 26%] 2025-12-04T13:35:25.8951377Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_complex128 PASSED [0.0298s] [ 26%] 2025-12-04T13:35:25.8951480Z test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_float32 PASSED [0.8441s] [ 26%] 2025-12-04T13:35:25.8951584Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_float16 PASSED [0.0932s] [ 26%] 2025-12-04T13:35:25.8951687Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_float32 PASSED [0.0592s] [ 26%] 2025-12-04T13:35:25.8951790Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_int16 PASSED [0.0776s] [ 26%] 2025-12-04T13:35:25.8951892Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_int32 PASSED [0.0775s] [ 26%] 2025-12-04T13:35:25.8951995Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_float32 PASSED [0.8414s] [ 26%] 2025-12-04T13:35:25.8952095Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_int64 PASSED [0.0206s] [ 26%] 2025-12-04T13:35:25.8952197Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_float16 PASSED [0.0224s] [ 26%] 2025-12-04T13:35:25.8952297Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int8 PASSED [0.8330s] [ 26%] 2025-12-04T13:35:25.8952407Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_float32 PASSED [0.0066s] [ 26%] 2025-12-04T13:35:25.8952515Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_int32 PASSED [0.0043s] [ 26%] 2025-12-04T13:35:25.8952626Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_bfloat16 PASSED [0.0054s] [ 26%] 2025-12-04T13:35:25.8952740Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_complex32 PASSED [0.0055s] [ 26%] 2025-12-04T13:35:25.8952848Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int16 PASSED [0.0044s] [ 26%] 2025-12-04T13:35:25.8952954Z test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int8 PASSED [0.0044s] [ 26%] 2025-12-04T13:35:25.8953061Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int32 PASSED [0.0486s] [ 26%] 2025-12-04T13:35:25.8953171Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int64 PASSED [0.0484s] [ 26%] 2025-12-04T13:35:25.8953276Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int8 PASSED [0.0476s] [ 26%] 2025-12-04T13:35:25.8953381Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_bool PASSED [0.0146s] [ 26%] 2025-12-04T13:35:25.8953507Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_int32 PASSED [0.0480s] [ 26%] 2025-12-04T13:35:25.8953623Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_uint8 PASSED [0.0470s] [ 26%] 2025-12-04T13:35:25.8953743Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_uint8 PASSED [0.0473s] [ 27%] 2025-12-04T13:35:25.8953852Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int32 PASSED [0.0476s] [ 27%] 2025-12-04T13:35:25.8953958Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int64 PASSED [0.0481s] [ 27%] 2025-12-04T13:35:25.8954069Z test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_float16 PASSED [0.0090s] [ 27%] 2025-12-04T13:35:25.8954179Z test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_float32 PASSED [0.0085s] [ 27%] 2025-12-04T13:35:25.8954286Z test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_float64 PASSED [0.0085s] [ 27%] 2025-12-04T13:35:25.8954395Z test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int64 PASSED [0.0080s] [ 27%] 2025-12-04T13:35:25.8954502Z test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_uint8 PASSED [0.0079s] [ 27%] 2025-12-04T13:35:25.8954636Z test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_bfloat16 PASSED [0.0075s] [ 27%] 2025-12-04T13:35:25.8954749Z test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_bool PASSED [0.0059s] [ 27%] 2025-12-04T13:35:25.8954874Z test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_complex128 PASSED [0.0078s] [ 27%] 2025-12-04T13:35:25.8954997Z test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_complex64 PASSED [0.0077s] [ 27%] 2025-12-04T13:35:25.8955115Z test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_float16 PASSED [0.0074s] [ 27%] 2025-12-04T13:35:25.8955228Z test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_bfloat16 PASSED [0.0048s] [ 27%] 2025-12-04T13:35:25.8955342Z test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_float16 PASSED [0.0048s] [ 27%] 2025-12-04T13:35:25.8955452Z test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_int32 PASSED [0.8174s] [ 27%] 2025-12-04T13:35:25.8955562Z test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_uint8 PASSED [0.8119s] [ 27%] 2025-12-04T13:35:25.8955668Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_int16 PASSED [0.3338s] [ 27%] 2025-12-04T13:35:25.8955775Z test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_int64 PASSED [0.3305s] [ 27%] 2025-12-04T13:35:25.8955880Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_complex128 PASSED [0.0087s] [ 27%] 2025-12-04T13:35:25.8955986Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_complex32 PASSED [0.0086s] [ 27%] 2025-12-04T13:35:25.8956087Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_float64 PASSED [0.0080s] [ 27%] 2025-12-04T13:35:25.8956186Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_int8 PASSED [0.0069s] [ 27%] 2025-12-04T13:35:25.8956368Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cauchy_cuda_float32 SKIPPED [0.0001s] (TODO: RuntimeError: no _refs support for torch.rand_like) [ 27%] 2025-12-04T13:35:25.8956474Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_float16 PASSED [0.0249s] [ 27%] 2025-12-04T13:35:25.8956578Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_float32 PASSED [0.8202s] [ 27%] 2025-12-04T13:35:25.8956677Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_int32 PASSED [0.0134s] [ 27%] 2025-12-04T13:35:25.8956778Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_int64 PASSED [0.0118s] [ 27%] 2025-12-04T13:35:25.8956884Z test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_complex64 PASSED [0.0212s] [ 27%] 2025-12-04T13:35:25.8956988Z test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_float16 PASSED [0.0197s] [ 27%] 2025-12-04T13:35:25.8957112Z test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_float64 PASSED [0.0196s] [ 27%] 2025-12-04T13:35:25.8957225Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int16 PASSED [0.0350s] [ 27%] 2025-12-04T13:35:25.8957326Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int64 PASSED [0.0343s] [ 27%] 2025-12-04T13:35:25.8957434Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_int32 PASSED [0.0790s] [ 27%] 2025-12-04T13:35:25.8957538Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_int8 PASSED [0.0770s] [ 27%] 2025-12-04T13:35:25.8957647Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_bfloat16 PASSED [0.1220s] [ 27%] 2025-12-04T13:35:25.8957753Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_float32 PASSED [0.9213s] [ 27%] 2025-12-04T13:35:25.8957861Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_float64 PASSED [0.0943s] [ 27%] 2025-12-04T13:35:25.8957966Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_int16 PASSED [0.0784s] [ 27%] 2025-12-04T13:35:25.8958110Z test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_int8 PASSED [0.0770s] [ 27%] 2025-12-04T13:35:25.8958233Z test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_bool PASSED [0.8102s] [ 27%] 2025-12-04T13:35:25.8958348Z test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_complex64 PASSED [0.0068s] [ 27%] 2025-12-04T13:35:25.8958456Z test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int32 PASSED [0.0043s] [ 27%] 2025-12-04T13:35:25.8958556Z test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_bool PASSED [0.8215s] [ 27%] 2025-12-04T13:35:25.8958658Z test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_float16 PASSED [0.0165s] [ 27%] 2025-12-04T13:35:25.8958761Z test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_float32 PASSED [0.8061s] [ 27%] 2025-12-04T13:35:25.8958867Z test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int64 PASSED [0.0119s] [ 27%] 2025-12-04T13:35:25.8958983Z test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int16 PASSED [0.0355s] [ 27%] 2025-12-04T13:35:25.8959097Z test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int32 PASSED [0.0346s] [ 27%] 2025-12-04T13:35:25.8959202Z test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_bool PASSED [0.8164s] [ 27%] 2025-12-04T13:35:25.8959316Z test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_complex128 PASSED [0.0261s] [ 27%] 2025-12-04T13:35:25.8959429Z test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_complex64 PASSED [0.8140s] [ 27%] 2025-12-04T13:35:25.8959539Z test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_float32 PASSED [0.0255s] [ 27%] 2025-12-04T13:35:25.8959644Z test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_int64 PASSED [0.1112s] [ 27%] 2025-12-04T13:35:25.8959751Z test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_uint8 PASSED [0.1081s] [ 27%] 2025-12-04T13:35:25.8959853Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_bfloat16 PASSED [0.0223s] [ 27%] 2025-12-04T13:35:25.8959957Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_float32 PASSED [0.0160s] [ 27%] 2025-12-04T13:35:25.8960058Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_float64 PASSED [0.8109s] [ 27%] 2025-12-04T13:35:25.8960159Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_int16 PASSED [0.0218s] [ 27%] 2025-12-04T13:35:25.8960257Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_int8 PASSED [0.0177s] [ 27%] 2025-12-04T13:35:25.8960375Z test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_complex128 PASSED [0.0117s] [ 27%] 2025-12-04T13:35:25.8960485Z test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int8 PASSED [0.8235s] [ 27%] 2025-12-04T13:35:25.8960592Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_float32 PASSED [0.0154s] [ 27%] 2025-12-04T13:35:25.8960721Z test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_uint8 PASSED [0.0136s] [ 27%] 2025-12-04T13:35:25.8960847Z test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_bfloat16 PASSED [0.0249s] [ 27%] 2025-12-04T13:35:25.8960952Z test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_float16 PASSED [0.0243s] [ 27%] 2025-12-04T13:35:25.8961055Z test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int16 PASSED [0.8313s] [ 27%] 2025-12-04T13:35:25.8961156Z test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int8 PASSED [0.0206s] [ 27%] 2025-12-04T13:35:25.8961257Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_uint8 PASSED [0.0089s] [ 27%] 2025-12-04T13:35:25.8961365Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_float64 PASSED [0.8316s] [ 27%] 2025-12-04T13:35:25.8961472Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int64 PASSED [0.0375s] [ 28%] 2025-12-04T13:35:25.8961578Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int8 PASSED [0.0347s] [ 28%] 2025-12-04T13:35:25.8961704Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_bfloat16 PASSED [0.0130s] [ 28%] 2025-12-04T13:35:25.8961818Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_float16 PASSED [0.8098s] [ 28%] 2025-12-04T13:35:25.8961930Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_float64 PASSED [0.0156s] [ 28%] 2025-12-04T13:35:25.8962037Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_float16 PASSED [0.0117s] [ 28%] 2025-12-04T13:35:25.8962143Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_float64 PASSED [0.0113s] [ 28%] 2025-12-04T13:35:25.8962256Z test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_uint8 PASSED [0.8127s] [ 28%] 2025-12-04T13:35:25.8962361Z test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_bfloat16 PASSED [0.4266s] [ 28%] 2025-12-04T13:35:25.8962468Z test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_float16 PASSED [0.4121s] [ 28%] 2025-12-04T13:35:25.8962576Z test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_float64 PASSED [0.3389s] [ 28%] 2025-12-04T13:35:25.8962682Z test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_int16 PASSED [0.2335s] [ 28%] 2025-12-04T13:35:25.8962784Z test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_int64 PASSED [0.8193s] [ 28%] 2025-12-04T13:35:25.8962887Z test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_uint8 PASSED [0.0205s] [ 28%] 2025-12-04T13:35:25.8963005Z test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_float64 PASSED [0.3153s] [ 28%] 2025-12-04T13:35:25.8963121Z test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_int16 PASSED [0.1436s] [ 28%] 2025-12-04T13:35:25.8963239Z test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_bool PASSED [0.0847s] [ 28%] 2025-12-04T13:35:25.8963360Z test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_int16 PASSED [0.0880s] [ 28%] 2025-12-04T13:35:25.8963483Z test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_bfloat16 PASSED [0.1172s] [ 28%] 2025-12-04T13:35:25.8963601Z test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_float64 PASSED [0.8148s] [ 28%] 2025-12-04T13:35:25.8963706Z test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_float64 PASSED [0.0057s] [ 28%] 2025-12-04T13:35:25.8963809Z test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_int64 PASSED [0.0037s] [ 28%] 2025-12-04T13:35:25.8963911Z test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int64 PASSED [0.0041s] [ 28%] 2025-12-04T13:35:25.8964013Z test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int8 PASSED [0.7302s] [ 28%] 2025-12-04T13:35:25.8964172Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_complex128 SKIPPED [0.0002s] (Expected: empty is not comparable) [ 28%] 2025-12-04T13:35:25.8964344Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_float64 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 28%] 2025-12-04T13:35:25.8964507Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_int8 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 28%] 2025-12-04T13:35:25.8964665Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_bfloat16 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 28%] 2025-12-04T13:35:25.8964824Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_float32 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 28%] 2025-12-04T13:35:25.8964974Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_int8 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 28%] 2025-12-04T13:35:25.8965150Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_complex64 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 28%] 2025-12-04T13:35:25.8965318Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int32 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 28%] 2025-12-04T13:35:25.8965495Z test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int8 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 28%] 2025-12-04T13:35:25.8965596Z test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_float32 PASSED [0.7881s] [ 28%] 2025-12-04T13:35:25.8965695Z test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_int32 PASSED [0.0511s] [ 28%] 2025-12-04T13:35:25.8965795Z test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_bool PASSED [0.7396s] [ 28%] 2025-12-04T13:35:25.8965897Z test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int16 PASSED [0.0078s] [ 28%] 2025-12-04T13:35:25.8965997Z test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int64 PASSED [0.0061s] [ 28%] 2025-12-04T13:35:25.8966096Z test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_bool PASSED [0.0211s] [ 28%] 2025-12-04T13:35:25.8966205Z test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_float32 PASSED [0.7468s] [ 28%] 2025-12-04T13:35:25.8966309Z test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int16 PASSED [0.2419s] [ 28%] 2025-12-04T13:35:25.8966413Z test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float32 PASSED [0.2688s] [ 28%] 2025-12-04T13:35:25.8966514Z test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float64 PASSED [1.0437s] [ 28%] 2025-12-04T13:35:25.8966616Z test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_bfloat16 PASSED [0.0265s] [ 28%] 2025-12-04T13:35:25.8966720Z test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_complex32 PASSED [0.6086s] [ 28%] 2025-12-04T13:35:25.8966819Z test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int64 PASSED [0.0191s] [ 28%] 2025-12-04T13:35:25.8966931Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_bfloat16 PASSED [0.0065s] [ 28%] 2025-12-04T13:35:25.8967040Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_bool PASSED [0.0055s] [ 28%] 2025-12-04T13:35:25.8967154Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_float32 PASSED [0.0064s] [ 28%] 2025-12-04T13:35:25.8967266Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_uint8 PASSED [0.0054s] [ 28%] 2025-12-04T13:35:25.8967374Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_complex128 PASSED [0.7527s] [ 28%] 2025-12-04T13:35:25.8967482Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_float16 PASSED [0.0079s] [ 28%] 2025-12-04T13:35:25.8967583Z test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int64 PASSED [0.0052s] [ 28%] 2025-12-04T13:35:25.8967773Z test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_float16 SKIPPED [0.0001s] (TODO: RuntimeError: no _refs support for torch.rand_like) [ 28%] 2025-12-04T13:35:25.8967880Z test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_float8_e4m3fn PASSED [0.0719s] [ 28%] 2025-12-04T13:35:25.8967999Z test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_int32 PASSED [0.0680s] [ 28%] 2025-12-04T13:35:25.8968142Z test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_uint8 PASSED [0.0677s] [ 28%] 2025-12-04T13:35:25.8968248Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_float32 PASSED [0.0081s] [ 28%] 2025-12-04T13:35:25.8968353Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_float64 PASSED [0.0078s] [ 28%] 2025-12-04T13:35:25.8968457Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_int16 PASSED [0.0076s] [ 28%] 2025-12-04T13:35:25.8968563Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_uint8 PASSED [0.7606s] [ 28%] 2025-12-04T13:35:25.8968665Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_bool PASSED [0.0101s] [ 28%] 2025-12-04T13:35:25.8968775Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_complex128 PASSED [0.7411s] [ 28%] 2025-12-04T13:35:25.8968883Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_float16 PASSED [0.0095s] [ 28%] 2025-12-04T13:35:25.8969007Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_float32 PASSED [0.0071s] [ 28%] 2025-12-04T13:35:25.8969118Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_complex128 PASSED [0.0079s] [ 28%] 2025-12-04T13:35:25.8969223Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_int32 PASSED [0.0090s] [ 28%] 2025-12-04T13:35:25.8969334Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_float64 PASSED [0.0124s] [ 28%] 2025-12-04T13:35:25.8969444Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int32 PASSED [0.7498s] [ 28%] 2025-12-04T13:35:25.8969552Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_uint8 PASSED [0.0076s] [ 29%] 2025-12-04T13:35:25.8969664Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_complex32 PASSED [0.0268s] [ 29%] 2025-12-04T13:35:25.8969769Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_int32 PASSED [0.0078s] [ 29%] 2025-12-04T13:35:25.8969877Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_float16 PASSED [0.0080s] [ 29%] 2025-12-04T13:35:25.8969980Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_int16 PASSED [0.7482s] [ 29%] 2025-12-04T13:35:25.8970084Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_int64 PASSED [0.0106s] [ 29%] 2025-12-04T13:35:25.8970189Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_uint8 PASSED [0.0079s] [ 29%] 2025-12-04T13:35:25.8970296Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_float64 PASSED [1.2826s] [ 29%] 2025-12-04T13:35:25.8970406Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_complex128 PASSED [0.0216s] [ 29%] 2025-12-04T13:35:25.8970514Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_int32 PASSED [0.0146s] [ 29%] 2025-12-04T13:35:25.8970626Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_complex128 PASSED [0.0069s] [ 29%] 2025-12-04T13:35:25.8970737Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_float64 PASSED [0.0079s] [ 29%] 2025-12-04T13:35:25.8970842Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_bool PASSED [0.0094s] [ 29%] 2025-12-04T13:35:25.8970948Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_uint8 PASSED [0.8110s] [ 29%] 2025-12-04T13:35:25.8971057Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int16 PASSED [0.0075s] [ 29%] 2025-12-04T13:35:25.8971167Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int64 PASSED [0.0057s] [ 29%] 2025-12-04T13:35:25.8971276Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int8 PASSED [0.8090s] [ 29%] 2025-12-04T13:35:25.8971381Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_bool PASSED [0.0121s] [ 29%] 2025-12-04T13:35:25.8971518Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_float32 PASSED [0.0082s] [ 29%] 2025-12-04T13:35:25.8971637Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_bool PASSED [0.0089s] [ 29%] 2025-12-04T13:35:25.8971747Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_float32 PASSED [0.0098s] [ 29%] 2025-12-04T13:35:25.8971851Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_int32 PASSED [0.0097s] [ 29%] 2025-12-04T13:35:25.8971988Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_float16 SKIPPED [0.0010s] (Skipped on ROCm) [ 29%] 2025-12-04T13:35:25.8972097Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_float64 PASSED [0.0094s] [ 29%] 2025-12-04T13:35:25.8972205Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_float32 PASSED [0.0073s] [ 29%] 2025-12-04T13:35:25.8972308Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int32 PASSED [0.0072s] [ 29%] 2025-12-04T13:35:25.8972416Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int8 PASSED [0.0071s] [ 29%] 2025-12-04T13:35:25.8972535Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_bool PASSED [0.0075s] [ 29%] 2025-12-04T13:35:25.8972643Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_float64 PASSED [0.1515s] [ 29%] 2025-12-04T13:35:25.8972749Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_int64 PASSED [0.0073s] [ 29%] 2025-12-04T13:35:25.8972862Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_complex64 PASSED [0.8085s] [ 29%] 2025-12-04T13:35:25.8972968Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_float32 PASSED [0.0116s] [ 29%] 2025-12-04T13:35:25.8973074Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int32 PASSED [0.8157s] [ 29%] 2025-12-04T13:35:25.8973177Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int8 PASSED [0.0126s] [ 29%] 2025-12-04T13:35:25.8973286Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_float32 PASSED [0.8082s] [ 29%] 2025-12-04T13:35:25.8973391Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int32 PASSED [0.0126s] [ 29%] 2025-12-04T13:35:25.8973499Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_float16 PASSED [0.0074s] [ 29%] 2025-12-04T13:35:25.8973603Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int64 PASSED [0.0080s] [ 29%] 2025-12-04T13:35:25.8973709Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_uint8 PASSED [0.0077s] [ 29%] 2025-12-04T13:35:25.8973816Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_float16 PASSED [0.0071s] [ 29%] 2025-12-04T13:35:25.8973919Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int64 PASSED [0.0089s] [ 29%] 2025-12-04T13:35:25.8974023Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_bfloat16 PASSED [0.0176s] [ 29%] 2025-12-04T13:35:25.8974125Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_bool PASSED [0.0146s] [ 29%] 2025-12-04T13:35:25.8974232Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_complex128 PASSED [0.0304s] [ 29%] 2025-12-04T13:35:25.8974335Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_float32 PASSED [0.8297s] [ 29%] 2025-12-04T13:35:25.8974435Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int64 PASSED [0.0151s] [ 29%] 2025-12-04T13:35:25.8974535Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_uint8 PASSED [0.0124s] [ 29%] 2025-12-04T13:35:25.8974640Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_float64 PASSED [0.0192s] [ 29%] 2025-12-04T13:35:25.8974743Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_int64 PASSED [0.0149s] [ 29%] 2025-12-04T13:35:25.8974849Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_complex64 PASSED [0.0054s] [ 29%] 2025-12-04T13:35:25.8974951Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_float16 PASSED [0.0052s] [ 29%] 2025-12-04T13:35:25.8975079Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_int32 PASSED [0.0057s] [ 29%] 2025-12-04T13:35:25.8975193Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_bfloat16 PASSED [0.0031s] [ 29%] 2025-12-04T13:35:25.8975294Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_bool PASSED [0.0028s] [ 29%] 2025-12-04T13:35:25.8975402Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_complex128 PASSED [0.0029s] [ 29%] 2025-12-04T13:35:25.8975509Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_complex64 PASSED [0.0029s] [ 29%] 2025-12-04T13:35:25.8975613Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_float16 PASSED [0.0027s] [ 29%] 2025-12-04T13:35:25.8975715Z test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_int32 PASSED [0.0027s] [ 29%] 2025-12-04T13:35:25.8975827Z test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_bfloat16 PASSED [0.0891s] [ 29%] 2025-12-04T13:35:25.8975947Z test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_complex128 PASSED [0.0800s] [ 29%] 2025-12-04T13:35:25.8976068Z test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_float16 PASSED [0.0891s] [ 29%] 2025-12-04T13:35:25.8976176Z test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int8 PASSED [0.0854s] [ 29%] 2025-12-04T13:35:25.8976281Z test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_float16 PASSED [0.0224s] [ 29%] 2025-12-04T13:35:25.8976386Z test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_float64 PASSED [0.8124s] [ 29%] 2025-12-04T13:35:25.8976489Z test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int16 PASSED [0.0142s] [ 29%] 2025-12-04T13:35:25.8976589Z test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int32 PASSED [0.0117s] [ 29%] 2025-12-04T13:35:25.8976690Z test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int64 PASSED [0.0116s] [ 29%] 2025-12-04T13:35:25.8976791Z test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_uint8 PASSED [0.8299s] [ 29%] 2025-12-04T13:35:25.8976923Z test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_bfloat16 SKIPPED [0.0002s] (Skipped!) [ 29%] 2025-12-04T13:35:25.8977024Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int32 PASSED [0.8737s] [ 30%] 2025-12-04T13:35:25.8977125Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int64 PASSED [0.0476s] [ 30%] 2025-12-04T13:35:25.8977225Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_float64 PASSED [0.0598s] [ 30%] 2025-12-04T13:35:25.8977331Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_bfloat16 PASSED [0.0973s] [ 30%] 2025-12-04T13:35:25.8977429Z test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_int64 PASSED [0.0535s] [ 30%] 2025-12-04T13:35:25.8977533Z test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_bfloat16 PASSED [0.0337s] [ 30%] 2025-12-04T13:35:25.8977637Z test_ops.py::TestCommonCUDA::test_python_ref__refs_frexp_cuda_float16 PASSED [0.8629s] [ 30%] 2025-12-04T13:35:25.8977740Z test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_float32 PASSED [0.0521s] [ 30%] 2025-12-04T13:35:25.8977849Z test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_complex32 PASSED [0.8350s] [ 30%] 2025-12-04T13:35:25.8977951Z test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int8 PASSED [0.0050s] [ 30%] 2025-12-04T13:35:25.8978057Z test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_complex64 PASSED [0.0044s] [ 30%] 2025-12-04T13:35:25.8978213Z test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_float16 PASSED [0.8274s] [ 30%] 2025-12-04T13:35:25.8978316Z test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_float32 PASSED [0.0059s] [ 30%] 2025-12-04T13:35:25.8978420Z test_ops.py::TestCommonCUDA::test_python_ref__refs_hypot_cuda_float16 PASSED [0.0898s] [ 30%] 2025-12-04T13:35:25.8978518Z test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_float32 PASSED [0.9862s] [ 30%] 2025-12-04T13:35:25.8978643Z test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_int16 PASSED [0.2272s] [ 30%] 2025-12-04T13:35:25.8978765Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_bfloat16 XFAIL [0.0029s] [ 30%] 2025-12-04T13:35:25.8978869Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_bool XFAIL [0.0027s] [ 30%] 2025-12-04T13:35:25.8978979Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_bfloat16 XFAIL [0.8491s] [ 30%] 2025-12-04T13:35:25.8979082Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_bool XFAIL [0.8430s] [ 30%] 2025-12-04T13:35:25.8979189Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_float16 XFAIL [1.0452s] [ 30%] 2025-12-04T13:35:25.8979293Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_int16 XFAIL [1.3533s] [ 30%] 2025-12-04T13:35:25.8979405Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_complex128 XFAIL [1.3776s] [ 30%] 2025-12-04T13:35:25.8979516Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_complex64 XFAIL [1.3527s] [ 30%] 2025-12-04T13:35:25.8979638Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_float16 XFAIL [1.3415s] [ 30%] 2025-12-04T13:35:25.8979742Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_uint8 XFAIL [1.3932s] [ 30%] 2025-12-04T13:35:25.8979856Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_complex32 XFAIL [1.4125s] [ 30%] 2025-12-04T13:35:25.8979970Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_complex64 XFAIL [1.3931s] [ 30%] 2025-12-04T13:35:25.8980080Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_float16 XFAIL [1.3845s] [ 30%] 2025-12-04T13:35:25.8980187Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_int32 XFAIL [1.3930s] [ 30%] 2025-12-04T13:35:25.8980293Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_int8 XFAIL [1.4112s] [ 30%] 2025-12-04T13:35:25.8980400Z test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_uint8 XFAIL [1.4423s] [ 30%] 2025-12-04T13:35:25.8980508Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_bfloat16 PASSED [1.6277s] [ 30%] 2025-12-04T13:35:25.8980612Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_bool PASSED [0.0208s] [ 30%] 2025-12-04T13:35:25.8980717Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_float16 PASSED [0.0196s] [ 30%] 2025-12-04T13:35:25.8980822Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_float32 PASSED [1.3201s] [ 30%] 2025-12-04T13:35:25.8980929Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_float64 PASSED [0.0182s] [ 30%] 2025-12-04T13:35:25.8981033Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_bfloat16 PASSED [0.0225s] [ 30%] 2025-12-04T13:35:25.8981138Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_bfloat16 PASSED [0.0170s] [ 30%] 2025-12-04T13:35:25.8981245Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_complex64 PASSED [1.3333s] [ 30%] 2025-12-04T13:35:25.8981348Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int16 PASSED [0.0141s] [ 30%] 2025-12-04T13:35:25.8981456Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_float64 PASSED [0.0160s] [ 30%] 2025-12-04T13:35:25.8981562Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_float16 PASSED [0.0197s] [ 30%] 2025-12-04T13:35:25.8981669Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_float64 PASSED [0.0153s] [ 30%] 2025-12-04T13:35:25.8981773Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_int64 PASSED [0.0147s] [ 30%] 2025-12-04T13:35:25.8981879Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_bfloat16 PASSED [0.0215s] [ 30%] 2025-12-04T13:35:25.8981983Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_float64 PASSED [0.0175s] [ 30%] 2025-12-04T13:35:25.8982104Z test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_int8 PASSED [0.0150s] [ 30%] 2025-12-04T13:35:25.8982218Z test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_bfloat16 PASSED [1.3491s] [ 30%] 2025-12-04T13:35:25.8982318Z test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_uint8 PASSED [0.0065s] [ 30%] 2025-12-04T13:35:25.8982416Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_uint8 PASSED [0.3849s] [ 30%] 2025-12-04T13:35:25.8982520Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_bfloat16 PASSED [0.0327s] [ 30%] 2025-12-04T13:35:25.8982620Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_int32 PASSED [0.2545s] [ 30%] 2025-12-04T13:35:25.8982721Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_int64 PASSED [0.0204s] [ 30%] 2025-12-04T13:35:25.8982824Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_uint8 PASSED [1.3873s] [ 30%] 2025-12-04T13:35:25.8982940Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_complex64 PASSED [0.0101s] [ 30%] 2025-12-04T13:35:25.8983063Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_complex128 PASSED [0.0088s] [ 30%] 2025-12-04T13:35:25.8983197Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_bfloat16 PASSED [0.0527s] [ 30%] 2025-12-04T13:35:25.8983308Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_float32 PASSED [0.0860s] [ 30%] 2025-12-04T13:35:25.8983419Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_float64 PASSED [0.0877s] [ 30%] 2025-12-04T13:35:25.8983530Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_complex128 PASSED [0.1780s] [ 30%] 2025-12-04T13:35:25.8983640Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_float32 PASSED [1.5037s] [ 30%] 2025-12-04T13:35:25.8983759Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_complex128 PASSED [0.0234s] [ 30%] 2025-12-04T13:35:25.8983876Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_float32 PASSED [0.0204s] [ 30%] 2025-12-04T13:35:25.8983994Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_complex64 PASSED [0.0327s] [ 30%] 2025-12-04T13:35:25.8984106Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_float64 PASSED [0.0265s] [ 30%] 2025-12-04T13:35:25.8984216Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_bfloat16 PASSED [0.0470s] [ 30%] 2025-12-04T13:35:25.8984321Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float16 PASSED [1.3637s] [ 30%] 2025-12-04T13:35:25.8984428Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float32 PASSED [0.0457s] [ 30%] 2025-12-04T13:35:25.8984529Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_int8 XFAIL [0.0032s] [ 31%] 2025-12-04T13:35:25.8984656Z test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_float16 XFAIL [1.3095s] [ 31%] 2025-12-04T13:35:25.8984757Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_int8 PASSED [1.3345s] [ 31%] 2025-12-04T13:35:25.8984863Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_float64 PASSED [1.3286s] [ 31%] 2025-12-04T13:35:25.8984966Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_int32 PASSED [0.0237s] [ 31%] 2025-12-04T13:35:25.8985071Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_complex64 PASSED [0.3275s] [ 31%] 2025-12-04T13:35:25.8985172Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float64 PASSED [1.3568s] [ 31%] 2025-12-04T13:35:25.8985276Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_complex32 PASSED [0.5054s] [ 31%] 2025-12-04T13:35:25.8985375Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_int32 PASSED [0.0198s] [ 31%] 2025-12-04T13:35:25.8985562Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_bfloat16 SKIPPED [0.0002s] (TODO: RuntimeError: no _refs support for torch.rand_like) [ 31%] 2025-12-04T13:35:25.8985772Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_float64 SKIPPED [0.0001s] (TODO: RuntimeError: no _refs support for torch.rand_like) [ 31%] 2025-12-04T13:35:25.8985908Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_float16 PASSED [1.3344s] [ 31%] 2025-12-04T13:35:25.8986030Z test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_uint8 PASSED [1.3277s] [ 31%] 2025-12-04T13:35:25.8986139Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_float32 PASSED [0.1735s] [ 31%] 2025-12-04T13:35:25.8986245Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_int8 PASSED [0.0672s] [ 31%] 2025-12-04T13:35:25.8986355Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_float16 PASSED [0.0217s] [ 31%] 2025-12-04T13:35:25.8986465Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_float32 PASSED [0.0183s] [ 31%] 2025-12-04T13:35:25.8986575Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_float64 PASSED [0.0172s] [ 31%] 2025-12-04T13:35:25.8986686Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_bfloat16 PASSED [0.0930s] [ 31%] 2025-12-04T13:35:25.8986811Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_complex128 PASSED [1.4201s] [ 31%] 2025-12-04T13:35:25.8986924Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_float32 PASSED [0.0737s] [ 31%] 2025-12-04T13:35:25.8987033Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_int64 PASSED [0.0717s] [ 31%] 2025-12-04T13:35:25.8987140Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_int8 PASSED [0.0673s] [ 31%] 2025-12-04T13:35:25.8987248Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_bfloat16 PASSED [0.3688s] [ 31%] 2025-12-04T13:35:25.8987355Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_float64 PASSED [0.3334s] [ 31%] 2025-12-04T13:35:25.8987484Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_complex64 XFAIL [0.0127s] [ 31%] 2025-12-04T13:35:25.8987612Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_float32 XFAIL [0.0117s] [ 31%] 2025-12-04T13:35:25.8987722Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_bfloat16 PASSED [1.3319s] [ 31%] 2025-12-04T13:35:25.8987829Z test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_int8 PASSED [0.0129s] [ 31%] 2025-12-04T13:35:25.8987930Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_float32 PASSED [0.0499s] [ 31%] 2025-12-04T13:35:25.8988029Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int32 PASSED [0.0512s] [ 31%] 2025-12-04T13:35:25.8988165Z test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_uint8 PASSED [0.0480s] [ 31%] 2025-12-04T13:35:25.8988273Z test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_int16 PASSED [1.3295s] [ 31%] 2025-12-04T13:35:25.8988379Z test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_int32 PASSED [0.0092s] [ 31%] 2025-12-04T13:35:25.8988487Z test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_bfloat16 PASSED [0.0906s] [ 31%] 2025-12-04T13:35:25.8988595Z test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_float16 PASSED [0.0886s] [ 31%] 2025-12-04T13:35:25.8988701Z test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_float32 PASSED [0.0592s] [ 31%] 2025-12-04T13:35:25.8988804Z test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int8 PASSED [1.3859s] [ 31%] 2025-12-04T13:35:25.8988907Z test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_float32 PASSED [0.0159s] [ 31%] 2025-12-04T13:35:25.8989011Z test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_float64 PASSED [0.0138s] [ 31%] 2025-12-04T13:35:25.8989138Z test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_bool PASSED [0.0081s] [ 31%] 2025-12-04T13:35:25.8989278Z test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_complex128 PASSED [0.0097s] [ 31%] 2025-12-04T13:35:25.8989456Z test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_complex64 PASSED [0.0097s] [ 31%] 2025-12-04T13:35:25.8989602Z test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_float16 PASSED [0.0092s] [ 31%] 2025-12-04T13:35:25.8989729Z test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_int32 PASSED [0.0092s] [ 31%] 2025-12-04T13:35:25.8989837Z test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_int16 PASSED [0.0463s] [ 31%] 2025-12-04T13:35:25.8989940Z test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_int64 PASSED [0.0457s] [ 31%] 2025-12-04T13:35:25.8990044Z test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_uint8 PASSED [0.0433s] [ 31%] 2025-12-04T13:35:25.8990146Z test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_bool PASSED [1.3425s] [ 31%] 2025-12-04T13:35:25.8990254Z test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_float64 PASSED [0.0089s] [ 31%] 2025-12-04T13:35:25.8990356Z test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int32 PASSED [0.0055s] [ 31%] 2025-12-04T13:35:25.8990478Z test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_complex128 PASSED [0.0831s] [ 31%] 2025-12-04T13:35:25.8990579Z test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_complex32 XFAIL [0.0497s] [ 31%] 2025-12-04T13:35:25.8990690Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_bfloat16 PASSED [0.0498s] [ 31%] 2025-12-04T13:35:25.8990798Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_float64 PASSED [1.3597s] [ 31%] 2025-12-04T13:35:25.8990905Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_int64 PASSED [0.0166s] [ 31%] 2025-12-04T13:35:25.8991010Z test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_bool XFAIL [0.0030s] [ 31%] 2025-12-04T13:35:25.8991121Z test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_float32 XFAIL [0.0028s] [ 31%] 2025-12-04T13:35:25.8991225Z test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_int8 PASSED [1.3356s] [ 31%] 2025-12-04T13:35:25.8991346Z test_ops.py::TestCommonCUDA::test_python_ref__refs_native_layer_norm_cuda_float32 PASSED [0.0389s] [ 31%] 2025-12-04T13:35:25.8991449Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_complex64 PASSED [1.3733s] [ 31%] 2025-12-04T13:35:25.8991545Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_int32 PASSED [0.0510s] [ 31%] 2025-12-04T13:35:25.8991648Z test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_bfloat16 PASSED [0.0231s] [ 31%] 2025-12-04T13:35:25.8991751Z test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_complex32 PASSED [1.5510s] [ 31%] 2025-12-04T13:35:25.8991854Z test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float32 PASSED [0.0199s] [ 31%] 2025-12-04T13:35:25.8992012Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_bfloat16 SKIPPED [0.0002s] (Expected: empty is not comparable) [ 31%] 2025-12-04T13:35:25.8992174Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_complex128 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 31%] 2025-12-04T13:35:25.8992327Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_int8 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 31%] 2025-12-04T13:35:25.8992479Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_uint8 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 31%] 2025-12-04T13:35:25.8992656Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_bfloat16 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 32%] 2025-12-04T13:35:25.8992829Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_int16 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 32%] 2025-12-04T13:35:25.8992937Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float16 PASSED [1.3503s] [ 32%] 2025-12-04T13:35:25.8993065Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float32 PASSED [0.0086s] [ 32%] 2025-12-04T13:35:25.8993182Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_bool PASSED [1.3579s] [ 32%] 2025-12-04T13:35:25.8993290Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_float32 PASSED [0.0083s] [ 32%] 2025-12-04T13:35:25.8993394Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_float64 PASSED [1.3507s] [ 32%] 2025-12-04T13:35:25.8993504Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_uint8 PASSED [0.0074s] [ 32%] 2025-12-04T13:35:25.8993611Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_bool PASSED [1.3634s] [ 32%] 2025-12-04T13:35:25.8993719Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_float16 PASSED [0.0087s] [ 32%] 2025-12-04T13:35:25.8993828Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_float64 PASSED [1.3319s] [ 32%] 2025-12-04T13:35:25.8993933Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int8 PASSED [0.0076s] [ 32%] 2025-12-04T13:35:25.8994043Z test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_uint8 PASSED [1.3228s] [ 32%] 2025-12-04T13:35:25.8994233Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_float32 SKIPPED [0.0003s] (Expected: dropout is not comparable) [ 32%] 2025-12-04T13:35:25.8994354Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_celu_cuda_float64 PASSED [1.3590s] [ 32%] 2025-12-04T13:35:25.8994492Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_complex64 PASSED [0.0067s] [ 32%] 2025-12-04T13:35:25.8994628Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_float16 PASSED [0.0048s] [ 32%] 2025-12-04T13:35:25.8994759Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_int16 PASSED [1.3612s] [ 32%] 2025-12-04T13:35:25.8994892Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_int8 PASSED [0.0064s] [ 32%] 2025-12-04T13:35:25.8995024Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_uint8 PASSED [0.0045s] [ 32%] 2025-12-04T13:35:25.8995201Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_bfloat16 SKIPPED [0.0002s] (Expected: dropout is not comparable) [ 32%] 2025-12-04T13:35:25.8995372Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_float16 SKIPPED [0.0001s] (Expected: dropout is not comparable) [ 32%] 2025-12-04T13:35:25.8995544Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_float32 SKIPPED [0.0001s] (Expected: dropout is not comparable) [ 32%] 2025-12-04T13:35:25.8995715Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_float64 SKIPPED [0.0001s] (Expected: dropout is not comparable) [ 32%] 2025-12-04T13:35:25.8995834Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_float16 PASSED [1.4112s] [ 32%] 2025-12-04T13:35:25.8995953Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_float16 PASSED [0.0634s] [ 32%] 2025-12-04T13:35:25.8996070Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_float64 PASSED [0.0492s] [ 32%] 2025-12-04T13:35:25.8996199Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_group_norm_cuda_float16 PASSED [1.4631s] [ 32%] 2025-12-04T13:35:25.8996325Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardshrink_cuda_float64 PASSED [0.0427s] [ 32%] 2025-12-04T13:35:25.8996451Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_bfloat16 PASSED [0.0633s] [ 32%] 2025-12-04T13:35:25.8996574Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_float16 PASSED [0.0604s] [ 32%] 2025-12-04T13:35:25.8996696Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_int32 PASSED [0.0404s] [ 32%] 2025-12-04T13:35:25.8996835Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_int8 PASSED [0.0394s] [ 32%] 2025-12-04T13:35:25.8996973Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_huber_loss_cuda_float64 PASSED [0.0129s] [ 32%] 2025-12-04T13:35:25.8997096Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_complex64 PASSED [1.3289s] [ 32%] 2025-12-04T13:35:25.8997222Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_leaky_relu_cuda_float16 PASSED [0.0138s] [ 32%] 2025-12-04T13:35:25.8997370Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_complex128 PASSED [0.0136s] [ 32%] 2025-12-04T13:35:25.8997511Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_float64 PASSED [0.0437s] [ 32%] 2025-12-04T13:35:25.8997647Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_int64 PASSED [0.0401s] [ 32%] 2025-12-04T13:35:25.8997772Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mse_loss_cuda_float64 PASSED [0.0068s] [ 32%] 2025-12-04T13:35:25.8997908Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_float32 PASSED [0.0974s] [ 32%] 2025-12-04T13:35:25.8998042Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_int64 PASSED [0.0092s] [ 32%] 2025-12-04T13:35:25.8998210Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_int8 PASSED [0.0090s] [ 32%] 2025-12-04T13:35:25.8998342Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_uint8 PASSED [0.0090s] [ 32%] 2025-12-04T13:35:25.8998479Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_complex128 PASSED [0.0060s] [ 32%] 2025-12-04T13:35:25.8998606Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_uint8 PASSED [0.0052s] [ 32%] 2025-12-04T13:35:25.8998741Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_float32 PASSED [0.0054s] [ 32%] 2025-12-04T13:35:25.8998872Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_int32 PASSED [0.0054s] [ 32%] 2025-12-04T13:35:25.8999004Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_int16 PASSED [0.1087s] [ 32%] 2025-12-04T13:35:25.8999134Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_uint8 PASSED [0.1094s] [ 32%] 2025-12-04T13:35:25.8999255Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float32 PASSED [0.0451s] [ 32%] 2025-12-04T13:35:25.8999375Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float64 PASSED [0.0444s] [ 32%] 2025-12-04T13:35:25.8999494Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_uint8 PASSED [0.0335s] [ 32%] 2025-12-04T13:35:25.8999615Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_float64 PASSED [0.0252s] [ 32%] 2025-12-04T13:35:25.8999734Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_uint8 PASSED [0.0200s] [ 32%] 2025-12-04T13:35:25.8999853Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_selu_cuda_bfloat16 PASSED [0.0395s] [ 32%] 2025-12-04T13:35:25.8999984Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_float32 PASSED [1.3485s] [ 32%] 2025-12-04T13:35:25.9000126Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_complex128 PASSED [1.3287s] [ 32%] 2025-12-04T13:35:25.9000266Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_complex64 PASSED [1.3381s] [ 32%] 2025-12-04T13:35:25.9000405Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_bfloat16 PASSED [1.3030s] [ 32%] 2025-12-04T13:35:25.9000566Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int64 PASSED [1.2965s] [ 32%] 2025-12-04T13:35:25.9000704Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softplus_cuda_float64 PASSED [0.0386s] [ 32%] 2025-12-04T13:35:25.9000831Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_float32 PASSED [0.0527s] [ 32%] 2025-12-04T13:35:25.9000960Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_float64 PASSED [0.0232s] [ 32%] 2025-12-04T13:35:25.9001083Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_int8 PASSED [0.0230s] [ 32%] 2025-12-04T13:35:25.9001224Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_bfloat16 PASSED [0.0300s] [ 32%] 2025-12-04T13:35:25.9001366Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_complex64 PASSED [1.2826s] [ 32%] 2025-12-04T13:35:25.9001508Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_float32 PASSED [0.0200s] [ 32%] 2025-12-04T13:35:25.9001645Z test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_int64 PASSED [0.0195s] [ 32%] 2025-12-04T13:35:25.9001772Z test_ops.py::TestCommonCUDA::test_python_ref__refs_norm_cuda_complex64 PASSED [1.3044s] [ 33%] 2025-12-04T13:35:25.9001962Z test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_bfloat16 SKIPPED [0.0003s] (TODO: RuntimeError: no _refs support for torch.rand_like) [ 33%] 2025-12-04T13:35:25.9002154Z test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_float16 SKIPPED [0.0002s] (TODO: RuntimeError: no _refs support for torch.rand_like) [ 33%] 2025-12-04T13:35:25.9002262Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_complex128 PASSED [1.2459s] [ 33%] 2025-12-04T13:35:25.9002362Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int8 PASSED [0.0046s] [ 33%] 2025-12-04T13:35:25.9002478Z test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_bfloat16 PASSED [0.0296s] [ 33%] 2025-12-04T13:35:25.9002595Z test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_complex128 PASSED [0.0323s] [ 33%] 2025-12-04T13:35:25.9002707Z test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_int64 PASSED [0.0228s] [ 33%] 2025-12-04T13:35:25.9002814Z test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_bfloat16 PASSED [0.0247s] [ 33%] 2025-12-04T13:35:25.9002920Z test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_float16 PASSED [1.1249s] [ 33%] 2025-12-04T13:35:25.9003028Z test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_bfloat16 PASSED [0.0154s] [ 33%] 2025-12-04T13:35:25.9003135Z test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_float32 PASSED [0.7934s] [ 33%] 2025-12-04T13:35:25.9003233Z test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_int64 PASSED [0.0517s] [ 33%] 2025-12-04T13:35:25.9003334Z test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_int8 PASSED [0.0486s] [ 33%] 2025-12-04T13:35:25.9003433Z test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_bool PASSED [0.0180s] [ 33%] 2025-12-04T13:35:25.9003541Z test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_complex128 PASSED [0.0180s] [ 33%] 2025-12-04T13:35:25.9003643Z test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_float16 PASSED [0.0255s] [ 33%] 2025-12-04T13:35:25.9003750Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_float16 PASSED [0.0247s] [ 33%] 2025-12-04T13:35:25.9003852Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_int32 PASSED [0.0193s] [ 33%] 2025-12-04T13:35:25.9003957Z test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_complex32 PASSED [0.7668s] [ 33%] 2025-12-04T13:35:25.9004058Z test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_int16 PASSED [0.0050s] [ 33%] 2025-12-04T13:35:25.9004182Z test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex128 PASSED [0.0320s] [ 33%] 2025-12-04T13:35:25.9004296Z test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex64 PASSED [0.0303s] [ 33%] 2025-12-04T13:35:25.9004399Z test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_float64 PASSED [0.7765s] [ 33%] 2025-12-04T13:35:25.9004499Z test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int32 PASSED [0.0117s] [ 33%] 2025-12-04T13:35:25.9004598Z test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int8 PASSED [0.9429s] [ 33%] 2025-12-04T13:35:25.9004710Z test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_bfloat16 PASSED [0.1036s] [ 33%] 2025-12-04T13:35:25.9004816Z test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_float16 PASSED [0.0115s] [ 33%] 2025-12-04T13:35:25.9004921Z test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_float64 PASSED [0.0082s] [ 33%] 2025-12-04T13:35:25.9005029Z test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_complex128 PASSED [0.0258s] [ 33%] 2025-12-04T13:35:25.9005131Z test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_int8 PASSED [0.0220s] [ 33%] 2025-12-04T13:35:25.9005256Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex32 PASSED [1.3289s] [ 33%] 2025-12-04T13:35:25.9005366Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_float32 PASSED [0.0200s] [ 33%] 2025-12-04T13:35:25.9005472Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_int32 PASSED [0.0140s] [ 33%] 2025-12-04T13:35:25.9005578Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_float64 PASSED [0.0204s] [ 33%] 2025-12-04T13:35:25.9005682Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_int64 PASSED [0.0164s] [ 33%] 2025-12-04T13:35:25.9005787Z test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_uint8 PASSED [0.0162s] [ 33%] 2025-12-04T13:35:25.9005885Z test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_bool PASSED [0.0096s] [ 33%] 2025-12-04T13:35:25.9005992Z test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_complex128 PASSED [0.0109s] [ 33%] 2025-12-04T13:35:25.9006098Z test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_complex32 PASSED [0.0108s] [ 33%] 2025-12-04T13:35:25.9006200Z test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_float64 PASSED [0.0104s] [ 33%] 2025-12-04T13:35:25.9006297Z test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int8 PASSED [0.0092s] [ 33%] 2025-12-04T13:35:25.9006403Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_bfloat16 PASSED [0.0150s] [ 33%] 2025-12-04T13:35:25.9006507Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_complex64 PASSED [0.0158s] [ 33%] 2025-12-04T13:35:25.9006612Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_float32 PASSED [0.0149s] [ 33%] 2025-12-04T13:35:25.9006714Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_float64 PASSED [1.3023s] [ 33%] 2025-12-04T13:35:25.9006819Z test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_int16 PASSED [0.0139s] [ 33%] 2025-12-04T13:35:25.9006921Z test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_int64 PASSED [0.0119s] [ 33%] 2025-12-04T13:35:25.9007023Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int16 PASSED [0.0506s] [ 33%] 2025-12-04T13:35:25.9007123Z test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int32 PASSED [1.3431s] [ 33%] 2025-12-04T13:35:25.9007234Z test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_int32 PASSED [0.0093s] [ 33%] 2025-12-04T13:35:25.9007339Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_complex64 PASSED [0.0395s] [ 33%] 2025-12-04T13:35:25.9007436Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_int8 PASSED [0.0117s] [ 33%] 2025-12-04T13:35:25.9007544Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_float32 PASSED [0.0264s] [ 33%] 2025-12-04T13:35:25.9007647Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_uint8 PASSED [0.0265s] [ 33%] 2025-12-04T13:35:25.9007776Z test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_bfloat16 PASSED [0.0164s] [ 33%] 2025-12-04T13:35:25.9007888Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_bfloat16 PASSED [0.0208s] [ 33%] 2025-12-04T13:35:25.9007990Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_float16 PASSED [0.0206s] [ 33%] 2025-12-04T13:35:25.9008156Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_complex128 PASSED [1.7672s] [ 33%] 2025-12-04T13:35:25.9008257Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_int16 PASSED [0.0373s] [ 33%] 2025-12-04T13:35:25.9008355Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_int32 PASSED [0.0335s] [ 33%] 2025-12-04T13:35:25.9008454Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_int8 PASSED [0.0313s] [ 33%] 2025-12-04T13:35:25.9008556Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_float16 PASSED [1.2962s] [ 33%] 2025-12-04T13:35:25.9008660Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_float64 PASSED [0.0182s] [ 33%] 2025-12-04T13:35:25.9008759Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_int16 PASSED [0.0176s] [ 33%] 2025-12-04T13:35:25.9008876Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_int64 PASSED [0.0173s] [ 33%] 2025-12-04T13:35:25.9009002Z test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_complex128 PASSED [0.0099s] [ 33%] 2025-12-04T13:35:25.9009125Z test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_float64 PASSED [0.0084s] [ 33%] 2025-12-04T13:35:25.9009238Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_bool PASSED [0.2684s] [ 33%] 2025-12-04T13:35:25.9009349Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_bool PASSED [0.0507s] [ 34%] 2025-12-04T13:35:25.9009461Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_float32 PASSED [0.1586s] [ 34%] 2025-12-04T13:35:25.9009576Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_float64 PASSED [0.4074s] [ 34%] 2025-12-04T13:35:25.9009689Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_int32 PASSED [0.0190s] [ 34%] 2025-12-04T13:35:25.9009798Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_int64 PASSED [0.2213s] [ 34%] 2025-12-04T13:35:25.9009906Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int64 PASSED [1.4747s] [ 34%] 2025-12-04T13:35:25.9010011Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int8 PASSED [0.0200s] [ 34%] 2025-12-04T13:35:25.9010119Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_uint8 PASSED [0.0170s] [ 34%] 2025-12-04T13:35:25.9010232Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_int16 PASSED [0.2839s] [ 34%] 2025-12-04T13:35:25.9010346Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_uint8 PASSED [0.0471s] [ 34%] 2025-12-04T13:35:25.9010485Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_bfloat16 PASSED [1.2643s] [ 34%] 2025-12-04T13:35:25.9010624Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_float64 PASSED [1.2705s] [ 34%] 2025-12-04T13:35:25.9010755Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_int8 PASSED [1.2554s] [ 34%] 2025-12-04T13:35:25.9010890Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_uint8 PASSED [1.2635s] [ 34%] 2025-12-04T13:35:25.9011002Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int16 PASSED [0.0464s] [ 34%] 2025-12-04T13:35:25.9011114Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int32 PASSED [0.0434s] [ 34%] 2025-12-04T13:35:25.9011223Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int8 PASSED [1.2917s] [ 34%] 2025-12-04T13:35:25.9011392Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16 PASSED [0.0817s] [ 34%] 2025-12-04T13:35:25.9011546Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_float32 PASSED [0.0671s] [ 34%] 2025-12-04T13:35:25.9011691Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_float64 PASSED [0.0671s] [ 34%] 2025-12-04T13:35:25.9011830Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int64 PASSED [0.0711s] [ 34%] 2025-12-04T13:35:25.9011971Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_float64 PASSED [1.3510s] [ 34%] 2025-12-04T13:35:25.9012111Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_float16 PASSED [1.3721s] [ 34%] 2025-12-04T13:35:25.9012247Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_int16 PASSED [0.0744s] [ 34%] 2025-12-04T13:35:25.9012358Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_bool PASSED [0.0350s] [ 34%] 2025-12-04T13:35:25.9012468Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int64 PASSED [0.0293s] [ 34%] 2025-12-04T13:35:25.9012590Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_uint8 PASSED [0.0273s] [ 34%] 2025-12-04T13:35:25.9012699Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_bool PASSED [0.2313s] [ 34%] 2025-12-04T13:35:25.9012827Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_bool PASSED [1.3059s] [ 34%] 2025-12-04T13:35:25.9012961Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_complex128 PASSED [1.2640s] [ 34%] 2025-12-04T13:35:25.9013090Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_uint8 PASSED [1.2794s] [ 34%] 2025-12-04T13:35:25.9013221Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_float32 PASSED [0.1616s] [ 34%] 2025-12-04T13:35:25.9013351Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int32 PASSED [0.2528s] [ 34%] 2025-12-04T13:35:25.9013481Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int64 PASSED [0.0190s] [ 34%] 2025-12-04T13:35:25.9013608Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int8 PASSED [0.0175s] [ 34%] 2025-12-04T13:35:25.9013721Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_uint8 PASSED [0.1387s] [ 34%] 2025-12-04T13:35:25.9013833Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int32 PASSED [1.4129s] [ 34%] 2025-12-04T13:35:25.9013942Z test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int8 PASSED [0.0801s] [ 34%] 2025-12-04T13:35:25.9014055Z test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_bool PASSED [1.2857s] [ 34%] 2025-12-04T13:35:25.9014178Z test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_complex128 PASSED [0.0078s] [ 34%] 2025-12-04T13:35:25.9014297Z test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_float16 PASSED [0.8608s] [ 34%] 2025-12-04T13:35:25.9014412Z test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_int16 PASSED [0.0063s] [ 34%] 2025-12-04T13:35:25.9014524Z test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_uint8 PASSED [0.7681s] [ 34%] 2025-12-04T13:35:25.9014631Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_complex32 PASSED [0.2532s] [ 34%] 2025-12-04T13:35:25.9014733Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_float16 PASSED [0.0220s] [ 34%] 2025-12-04T13:35:25.9014835Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int64 PASSED [0.7935s] [ 34%] 2025-12-04T13:35:25.9014937Z test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_bool PASSED [0.0231s] [ 34%] 2025-12-04T13:35:25.9015065Z test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_complex128 PASSED [0.0337s] [ 34%] 2025-12-04T13:35:25.9015187Z test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_float64 PASSED [0.0199s] [ 34%] 2025-12-04T13:35:25.9015305Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_complex128 PASSED [0.0063s] [ 34%] 2025-12-04T13:35:25.9015417Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_float16 PASSED [0.0059s] [ 34%] 2025-12-04T13:35:25.9015525Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_int8 PASSED [0.0050s] [ 34%] 2025-12-04T13:35:25.9015634Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_uint8 PASSED [0.0050s] [ 34%] 2025-12-04T13:35:25.9015737Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_bool PASSED [0.0044s] [ 34%] 2025-12-04T13:35:25.9015847Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_complex128 PASSED [0.7773s] [ 34%] 2025-12-04T13:35:25.9015958Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_complex64 PASSED [0.0075s] [ 34%] 2025-12-04T13:35:25.9016063Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_int16 PASSED [0.0047s] [ 34%] 2025-12-04T13:35:25.9016179Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_uint8 PASSED [0.0045s] [ 34%] 2025-12-04T13:35:25.9016295Z test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_float64 PASSED [0.7795s] [ 34%] 2025-12-04T13:35:25.9016401Z test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_bfloat16 PASSED [0.0103s] [ 34%] 2025-12-04T13:35:25.9016501Z test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_bool PASSED [0.0071s] [ 34%] 2025-12-04T13:35:25.9016608Z test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_complex128 PASSED [0.0079s] [ 34%] 2025-12-04T13:35:25.9016712Z test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_float16 PASSED [0.0075s] [ 34%] 2025-12-04T13:35:25.9016814Z test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_bfloat16 PASSED [0.0125s] [ 34%] 2025-12-04T13:35:25.9016917Z test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_float64 PASSED [0.7781s] [ 34%] 2025-12-04T13:35:25.9017029Z test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_complex128 PASSED [0.0169s] [ 34%] 2025-12-04T13:35:25.9017139Z test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_complex64 PASSED [0.7749s] [ 34%] 2025-12-04T13:35:25.9017242Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_complex64 PASSED [0.0833s] [ 34%] 2025-12-04T13:35:25.9017345Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_float16 PASSED [0.0980s] [ 34%] 2025-12-04T13:35:25.9017442Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_bool PASSED [0.0099s] [ 35%] 2025-12-04T13:35:25.9017541Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_int64 PASSED [0.7691s] [ 35%] 2025-12-04T13:35:25.9017638Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_uint8 PASSED [0.0125s] [ 35%] 2025-12-04T13:35:25.9017746Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_bool PASSED [0.0095s] [ 35%] 2025-12-04T13:35:25.9017858Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_float64 PASSED [0.7784s] [ 35%] 2025-12-04T13:35:25.9017965Z test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_int16 PASSED [0.0114s] [ 35%] 2025-12-04T13:35:25.9018116Z test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_float16 PASSED [0.0040s] [ 35%] 2025-12-04T13:35:25.9018221Z test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_float32 PASSED [0.7651s] [ 35%] 2025-12-04T13:35:25.9018322Z test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_complex128 PASSED [0.0052s] [ 35%] 2025-12-04T13:35:25.9018425Z test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_complex64 PASSED [0.7646s] [ 35%] 2025-12-04T13:35:25.9018522Z test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_float32 PASSED [0.0050s] [ 35%] 2025-12-04T13:35:25.9018647Z test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_int8 PASSED [0.7657s] [ 35%] 2025-12-04T13:35:25.9018774Z test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_bfloat16 XFAIL [0.0047s] [ 35%] 2025-12-04T13:35:25.9018886Z test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_int32 XFAIL [0.7648s] [ 35%] 2025-12-04T13:35:25.9018994Z test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_int8 XFAIL [0.7576s] [ 35%] 2025-12-04T13:35:25.9019096Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_bfloat16 PASSED [0.8049s] [ 35%] 2025-12-04T13:35:25.9019194Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_bool PASSED [0.0211s] [ 35%] 2025-12-04T13:35:25.9019296Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float64 PASSED [0.7622s] [ 35%] 2025-12-04T13:35:25.9019393Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int8 PASSED [0.0199s] [ 35%] 2025-12-04T13:35:25.9019496Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_bfloat16 PASSED [0.0237s] [ 35%] 2025-12-04T13:35:25.9019595Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_bool PASSED [0.7792s] [ 35%] 2025-12-04T13:35:25.9019710Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_float64 PASSED [0.0194s] [ 35%] 2025-12-04T13:35:25.9019810Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int16 PASSED [0.0189s] [ 35%] 2025-12-04T13:35:25.9019908Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int32 PASSED [0.0185s] [ 35%] 2025-12-04T13:35:25.9020007Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_uint8 PASSED [0.0173s] [ 35%] 2025-12-04T13:35:25.9020103Z test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_bool PASSED [0.7622s] [ 35%] 2025-12-04T13:35:25.9020200Z test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_int64 PASSED [0.0162s] [ 35%] 2025-12-04T13:35:25.9020304Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_bfloat16 PASSED [0.7617s] [ 35%] 2025-12-04T13:35:25.9020413Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_complex64 PASSED [0.0045s] [ 35%] 2025-12-04T13:35:25.9020517Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_float64 PASSED [0.7825s] [ 35%] 2025-12-04T13:35:25.9020620Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int16 PASSED [0.0046s] [ 35%] 2025-12-04T13:35:25.9020720Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int64 PASSED [0.7393s] [ 35%] 2025-12-04T13:35:25.9020821Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_uint8 PASSED [0.0043s] [ 35%] 2025-12-04T13:35:25.9020934Z test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_int32 PASSED [0.0058s] [ 35%] 2025-12-04T13:35:25.9021046Z test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_int8 PASSED [0.0051s] [ 35%] 2025-12-04T13:35:25.9021155Z test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_complex64 PASSED [0.7503s] [ 35%] 2025-12-04T13:35:25.9021264Z test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_int16 PASSED [0.0060s] [ 35%] 2025-12-04T13:35:25.9021365Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_float64 PASSED [0.0118s] [ 35%] 2025-12-04T13:35:25.9021469Z test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_int8 PASSED [0.0103s] [ 35%] 2025-12-04T13:35:25.9021566Z test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_bool PASSED [0.0137s] [ 35%] 2025-12-04T13:35:25.9021670Z test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_float32 PASSED [0.0118s] [ 35%] 2025-12-04T13:35:25.9021770Z test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_float64 PASSED [0.0111s] [ 35%] 2025-12-04T13:35:25.9021871Z test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_int16 PASSED [0.0104s] [ 35%] 2025-12-04T13:35:25.9021969Z test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_int8 PASSED [0.0102s] [ 35%] 2025-12-04T13:35:25.9022074Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_bfloat16 PASSED [0.0225s] [ 35%] 2025-12-04T13:35:25.9022197Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_float16 PASSED [0.7975s] [ 35%] 2025-12-04T13:35:25.9022309Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int32 PASSED [0.0138s] [ 35%] 2025-12-04T13:35:25.9022408Z test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int8 PASSED [0.0112s] [ 35%] 2025-12-04T13:35:25.9022507Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_bool PASSED [0.0078s] [ 35%] 2025-12-04T13:35:25.9022616Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_complex128 PASSED [0.0103s] [ 35%] 2025-12-04T13:35:25.9022720Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_float64 PASSED [0.7507s] [ 35%] 2025-12-04T13:35:25.9022833Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_complex128 PASSED [0.0094s] [ 35%] 2025-12-04T13:35:25.9022943Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_complex32 PASSED [0.0071s] [ 35%] 2025-12-04T13:35:25.9023055Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_complex64 PASSED [0.0068s] [ 35%] 2025-12-04T13:35:25.9023174Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_uint8 PASSED [0.0056s] [ 35%] 2025-12-04T13:35:25.9023282Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_bool PASSED [0.0096s] [ 35%] 2025-12-04T13:35:25.9023391Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_float32 PASSED [0.0119s] [ 35%] 2025-12-04T13:35:25.9023504Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_float64 PASSED [0.7511s] [ 35%] 2025-12-04T13:35:25.9023608Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_float32 PASSED [0.0123s] [ 35%] 2025-12-04T13:35:25.9023726Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_bfloat16 PASSED [0.0065s] [ 35%] 2025-12-04T13:35:25.9023847Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_complex128 PASSED [0.0065s] [ 35%] 2025-12-04T13:35:25.9023965Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_float16 PASSED [0.0062s] [ 35%] 2025-12-04T13:35:25.9024081Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_float64 PASSED [0.0062s] [ 35%] 2025-12-04T13:35:25.9024193Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_int32 PASSED [0.0053s] [ 35%] 2025-12-04T13:35:25.9024303Z test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_bfloat16 PASSED [0.7478s] [ 35%] 2025-12-04T13:35:25.9024413Z test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_complex64 PASSED [0.0150s] [ 35%] 2025-12-04T13:35:25.9024521Z test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_float32 PASSED [0.7458s] [ 35%] 2025-12-04T13:35:25.9024625Z test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_bfloat16 PASSED [0.0048s] [ 35%] 2025-12-04T13:35:25.9024742Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_complex_cuda_float64 PASSED [0.7365s] [ 35%] 2025-12-04T13:35:25.9024851Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_complex64 PASSED [0.0193s] [ 36%] 2025-12-04T13:35:25.9024959Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int32 PASSED [0.0135s] [ 36%] 2025-12-04T13:35:25.9025060Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int8 PASSED [0.0133s] [ 36%] 2025-12-04T13:35:25.9025168Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_float32 PASSED [0.0058s] [ 36%] 2025-12-04T13:35:25.9025274Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_int64 PASSED [0.7473s] [ 36%] 2025-12-04T13:35:25.9025378Z test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_float64 PASSED [0.0229s] [ 36%] 2025-12-04T13:35:25.9025479Z test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_int8 PASSED [0.7439s] [ 36%] 2025-12-04T13:35:25.9025587Z test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_complex64 PASSED [0.0054s] [ 36%] 2025-12-04T13:35:25.9025708Z test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int16 PASSED [0.0035s] [ 36%] 2025-12-04T13:35:25.9025825Z test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_complex64 PASSED [0.0143s] [ 36%] 2025-12-04T13:35:25.9025930Z test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_uint8 PASSED [0.7470s] [ 36%] 2025-12-04T13:35:25.9026034Z test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_float32 PASSED [0.1228s] [ 36%] 2025-12-04T13:35:25.9026130Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_T_cuda PASSED [0.7434s] [ 36%] 2025-12-04T13:35:25.9026233Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_atan2_cuda PASSED [0.7319s] [ 36%] 2025-12-04T13:35:25.9026330Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_cat_cuda XFAIL [0.0063s] [ 36%] 2025-12-04T13:35:25.9026438Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_clamp_max_cuda XFAIL [0.0036s] [ 36%] 2025-12-04T13:35:25.9026716Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_diagonal_copy_cuda E1204 13:02:00.725000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.diagonal.default 2025-12-04T13:35:25.9026866Z E1204 13:02:00.725000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent call last): 2025-12-04T13:35:25.9027130Z E1204 13:02:00.725000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T13:35:25.9027257Z E1204 13:02:00.725000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] r = func(*args, **kwargs) 2025-12-04T13:35:25.9027473Z E1204 13:02:00.725000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T13:35:25.9027607Z E1204 13:02:00.725000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T13:35:25.9027776Z E1204 13:02:00.725000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] RuntimeError: diagonal dimensions cannot be identical 1, 1 2025-12-04T13:35:25.9027947Z E1204 13:02:00.728000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.diagonal.default 2025-12-04T13:35:25.9028118Z E1204 13:02:00.728000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent call last): 2025-12-04T13:35:25.9028367Z E1204 13:02:00.728000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T13:35:25.9028492Z E1204 13:02:00.728000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] r = func(*args, **kwargs) 2025-12-04T13:35:25.9028702Z E1204 13:02:00.728000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T13:35:25.9028837Z E1204 13:02:00.728000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T13:35:25.9029035Z E1204 13:02:00.728000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] IndexError: Dimension out of range (expected to be in range of [-2, 1], but got 10000) 2025-12-04T13:35:25.9029204Z E1204 13:02:00.729000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.diagonal.default 2025-12-04T13:35:25.9029336Z E1204 13:02:00.729000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent call last): 2025-12-04T13:35:25.9029581Z E1204 13:02:00.729000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T13:35:25.9029740Z E1204 13:02:00.729000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] r = func(*args, **kwargs) 2025-12-04T13:35:25.9029959Z E1204 13:02:00.729000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T13:35:25.9030095Z E1204 13:02:00.729000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T13:35:25.9030289Z E1204 13:02:00.729000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] IndexError: Dimension out of range (expected to be in range of [-2, 1], but got 10000) 2025-12-04T13:35:25.9030458Z E1204 13:02:00.730000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.diagonal.default 2025-12-04T13:35:25.9030587Z E1204 13:02:00.730000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent call last): 2025-12-04T13:35:25.9030835Z E1204 13:02:00.730000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T13:35:25.9030973Z E1204 13:02:00.730000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] r = func(*args, **kwargs) 2025-12-04T13:35:25.9031178Z E1204 13:02:00.730000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T13:35:25.9031310Z E1204 13:02:00.730000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T13:35:25.9031475Z E1204 13:02:00.730000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] RuntimeError: diagonal dimensions cannot be identical 1, 1 2025-12-04T13:35:25.9031641Z E1204 13:02:00.731000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.diagonal.default 2025-12-04T13:35:25.9031772Z E1204 13:02:00.731000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent call last): 2025-12-04T13:35:25.9032019Z E1204 13:02:00.731000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T13:35:25.9032139Z E1204 13:02:00.731000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] r = func(*args, **kwargs) 2025-12-04T13:35:25.9032345Z E1204 13:02:00.731000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T13:35:25.9032475Z E1204 13:02:00.731000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T13:35:25.9032672Z E1204 13:02:00.731000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] IndexError: Dimension out of range (expected to be in range of [-3, 2], but got 10000) 2025-12-04T13:35:25.9032838Z E1204 13:02:00.732000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.diagonal.default 2025-12-04T13:35:25.9032968Z E1204 13:02:00.732000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent call last): 2025-12-04T13:35:25.9033214Z E1204 13:02:00.732000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T13:35:25.9033333Z E1204 13:02:00.732000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] r = func(*args, **kwargs) 2025-12-04T13:35:25.9033540Z E1204 13:02:00.732000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T13:35:25.9033689Z E1204 13:02:00.732000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T13:35:25.9033893Z E1204 13:02:00.732000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] IndexError: Dimension out of range (expected to be in range of [-3, 2], but got 10000) 2025-12-04T13:35:25.9033936Z PASSED [0.7503s] [ 36%] 2025-12-04T13:35:25.9034061Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_div_floor_rounding_cuda PASSED [0.0018s] [ 36%] 2025-12-04T13:35:25.9034176Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_exponential_cuda PASSED [0.7475s] [ 36%] 2025-12-04T13:35:25.9034280Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_eye_cuda PASSED [0.0043s] [ 36%] 2025-12-04T13:35:25.9034386Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_fftn_cuda PASSED [0.7385s] [ 36%] 2025-12-04T13:35:25.9034495Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ihfft2_cuda PASSED [0.7350s] [ 36%] 2025-12-04T13:35:25.9034602Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ihfft_cuda PASSED [0.7558s] [ 36%] 2025-12-04T13:35:25.9034713Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfft_cuda PASSED [0.7478s] [ 36%] 2025-12-04T13:35:25.9034827Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fmin_cuda PASSED [0.7612s] [ 36%] 2025-12-04T13:35:25.9034933Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_geometric_cuda PASSED [0.7439s] [ 36%] 2025-12-04T13:35:25.9035040Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_igamma_cuda PASSED [0.7460s] [ 36%] 2025-12-04T13:35:25.9035149Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_linalg_cross_cuda XFAIL [0.0043s] [ 36%] 2025-12-04T13:35:25.9035259Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logical_xor_cuda PASSED [1.5285s] [ 36%] 2025-12-04T13:35:25.9035364Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_minimum_cuda XFAIL [0.0060s] [ 36%] 2025-12-04T13:35:25.9035650Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_poisson_nll_loss_cuda E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.mul.Tensor 2025-12-04T13:35:25.9035784Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent call last): 2025-12-04T13:35:25.9036033Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T13:35:25.9036154Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] r = func(*args, **kwargs) 2025-12-04T13:35:25.9036362Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T13:35:25.9036493Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T13:35:25.9036725Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T13:35:25.9036854Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] result = fn(*args, **kwargs) 2025-12-04T13:35:25.9037060Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_compile.py", line 54, in inner 2025-12-04T13:35:25.9037196Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return disable_fn(*args, **kwargs) 2025-12-04T13:35:25.9037418Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 1154, in _fn 2025-12-04T13:35:25.9037563Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return fn(*args, **kwargs) 2025-12-04T13:35:25.9037799Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 152, in _fn 2025-12-04T13:35:25.9037930Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] result = fn(**bound.arguments) 2025-12-04T13:35:25.9038177Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 1164, in _ref 2025-12-04T13:35:25.9038307Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] a, b = _maybe_broadcast(a, b) 2025-12-04T13:35:25.9038539Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 470, in _maybe_broadcast 2025-12-04T13:35:25.9038676Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] common_shape = _broadcast_shapes( 2025-12-04T13:35:25.9038928Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 458, in _broadcast_shapes 2025-12-04T13:35:25.9039040Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] torch._check( 2025-12-04T13:35:25.9039252Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T13:35:25.9039447Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T13:35:25.9039669Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T13:35:25.9039808Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] raise error_type(message_evaluated) 2025-12-04T13:35:25.9040130Z E1204 13:02:09.039000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] RuntimeError: Attempting to broadcast a dimension of length 4 at -1! Mismatching argument at index 1 had torch.Size([5, 4]); but expected shape should be broadcastable to [1, 5] 2025-12-04T13:35:25.9040171Z PASSED [0.7683s] [ 36%] 2025-12-04T13:35:25.9040296Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_prelu_cuda PASSED [0.7711s] [ 36%] 2025-12-04T13:35:25.9040582Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_triplet_margin_loss_cuda E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.sub.Tensor 2025-12-04T13:35:25.9040715Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent call last): 2025-12-04T13:35:25.9040966Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T13:35:25.9041088Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] r = func(*args, **kwargs) 2025-12-04T13:35:25.9041298Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T13:35:25.9041429Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T13:35:25.9041683Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T13:35:25.9041822Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] result = fn(*args, **kwargs) 2025-12-04T13:35:25.9042031Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_compile.py", line 54, in inner 2025-12-04T13:35:25.9042164Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return disable_fn(*args, **kwargs) 2025-12-04T13:35:25.9042387Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 1154, in _fn 2025-12-04T13:35:25.9042514Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return fn(*args, **kwargs) 2025-12-04T13:35:25.9042741Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 152, in _fn 2025-12-04T13:35:25.9042886Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] result = fn(**bound.arguments) 2025-12-04T13:35:25.9043100Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 1841, in sub 2025-12-04T13:35:25.9043229Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] a, b = _maybe_broadcast(a, b) 2025-12-04T13:35:25.9043462Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 470, in _maybe_broadcast 2025-12-04T13:35:25.9043597Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] common_shape = _broadcast_shapes( 2025-12-04T13:35:25.9043831Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 458, in _broadcast_shapes 2025-12-04T13:35:25.9043944Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] torch._check( 2025-12-04T13:35:25.9044157Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T13:35:25.9044351Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T13:35:25.9044572Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T13:35:25.9044711Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] raise error_type(message_evaluated) 2025-12-04T13:35:25.9045020Z E1204 13:02:09.820000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] RuntimeError: Attempting to broadcast a dimension of length 4 at -1! Mismatching argument at index 1 had torch.Size([3, 4]); but expected shape should be broadcastable to [3, 5] 2025-12-04T13:35:25.9045179Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.sub.Tensor 2025-12-04T13:35:25.9045311Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent call last): 2025-12-04T13:35:25.9045558Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T13:35:25.9045711Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] r = func(*args, **kwargs) 2025-12-04T13:35:25.9045929Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T13:35:25.9046060Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T13:35:25.9046290Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T13:35:25.9046417Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] result = fn(*args, **kwargs) 2025-12-04T13:35:25.9046624Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_compile.py", line 54, in inner 2025-12-04T13:35:25.9046759Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return disable_fn(*args, **kwargs) 2025-12-04T13:35:25.9046995Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 1154, in _fn 2025-12-04T13:35:25.9047119Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return fn(*args, **kwargs) 2025-12-04T13:35:25.9047347Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 152, in _fn 2025-12-04T13:35:25.9047476Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] result = fn(**bound.arguments) 2025-12-04T13:35:25.9047695Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 1841, in sub 2025-12-04T13:35:25.9047826Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] a, b = _maybe_broadcast(a, b) 2025-12-04T13:35:25.9048057Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 470, in _maybe_broadcast 2025-12-04T13:35:25.9048228Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] common_shape = _broadcast_shapes( 2025-12-04T13:35:25.9048459Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 458, in _broadcast_shapes 2025-12-04T13:35:25.9048571Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] torch._check( 2025-12-04T13:35:25.9048783Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T13:35:25.9048979Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T13:35:25.9049197Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T13:35:25.9049334Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] raise error_type(message_evaluated) 2025-12-04T13:35:25.9049638Z E1204 13:02:09.821000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] RuntimeError: Attempting to broadcast a dimension of length 5 at -1! Mismatching argument at index 1 had torch.Size([3, 5]); but expected shape should be broadcastable to [3, 4] 2025-12-04T13:35:25.9049827Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] failed while attempting to run meta for aten.sub.Tensor 2025-12-04T13:35:25.9049973Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] Traceback (most recent call last): 2025-12-04T13:35:25.9050221Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_subclasses/fake_tensor.py", line 2823, in _dispatch_impl 2025-12-04T13:35:25.9050345Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] r = func(*args, **kwargs) 2025-12-04T13:35:25.9050551Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_ops.py", line 836, in __call__ 2025-12-04T13:35:25.9050683Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return self._op(*args, **kwargs) 2025-12-04T13:35:25.9050911Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 315, in _fn 2025-12-04T13:35:25.9051055Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] result = fn(*args, **kwargs) 2025-12-04T13:35:25.9051261Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_compile.py", line 54, in inner 2025-12-04T13:35:25.9051394Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return disable_fn(*args, **kwargs) 2025-12-04T13:35:25.9051617Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_dynamo/eval_frame.py", line 1154, in _fn 2025-12-04T13:35:25.9051742Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] return fn(*args, **kwargs) 2025-12-04T13:35:25.9051970Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_prims_common/wrappers.py", line 152, in _fn 2025-12-04T13:35:25.9052100Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] result = fn(**bound.arguments) 2025-12-04T13:35:25.9052317Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 1841, in sub 2025-12-04T13:35:25.9052446Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] a, b = _maybe_broadcast(a, b) 2025-12-04T13:35:25.9052676Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 470, in _maybe_broadcast 2025-12-04T13:35:25.9052810Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] common_shape = _broadcast_shapes( 2025-12-04T13:35:25.9053044Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/_refs/__init__.py", line 458, in _broadcast_shapes 2025-12-04T13:35:25.9053154Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] torch._check( 2025-12-04T13:35:25.9053364Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1734, in _check 2025-12-04T13:35:25.9053556Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] _check_with(RuntimeError, cond, message) # pyrefly: ignore [bad-argument-type] 2025-12-04T13:35:25.9053793Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] File "/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/__init__.py", line 1716, in _check_with 2025-12-04T13:35:25.9053944Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] raise error_type(message_evaluated) 2025-12-04T13:35:25.9054250Z E1204 13:02:09.824000 893507 site-packages/torch/_subclasses/fake_tensor.py:2827] RuntimeError: Attempting to broadcast a dimension of length 5 at -1! Mismatching argument at index 1 had torch.Size([3, 5]); but expected shape should be broadcastable to [3, 4] 2025-12-04T13:35:25.9054291Z PASSED [0.0143s] [ 36%] 2025-12-04T13:35:25.9054397Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_pow_cuda PASSED [0.0023s] [ 36%] 2025-12-04T13:35:25.9054507Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_renorm_cuda PASSED [0.7632s] [ 36%] 2025-12-04T13:35:25.9054618Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_reshape_as_cuda PASSED [0.7429s] [ 36%] 2025-12-04T13:35:25.9054738Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_special_xlog1py_cuda PASSED [0.0031s] [ 36%] 2025-12-04T13:35:25.9054850Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_special_zeta_cuda PASSED [0.0026s] [ 36%] 2025-12-04T13:35:25.9054959Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_t_cuda PASSED [0.7455s] [ 36%] 2025-12-04T13:35:25.9055063Z test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_vdot_cuda PASSED [0.7370s] [ 36%] 2025-12-04T13:35:25.9055193Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_complex32 PASSED [0.7654s] [ 36%] 2025-12-04T13:35:25.9055321Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float16 PASSED [0.0101s] [ 36%] 2025-12-04T13:35:25.9055446Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float32 PASSED [0.0073s] [ 36%] 2025-12-04T13:35:25.9055570Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_int8 PASSED [0.0066s] [ 36%] 2025-12-04T13:35:25.9055731Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_bfloat16 PASSED [0.0756s] [ 36%] 2025-12-04T13:35:25.9055890Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_float64 PASSED [0.1001s] [ 36%] 2025-12-04T13:35:25.9056041Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_float64 PASSED [0.0889s] [ 36%] 2025-12-04T13:35:25.9056190Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_int16 PASSED [0.0859s] [ 36%] 2025-12-04T13:35:25.9056337Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_int32 PASSED [0.0907s] [ 36%] 2025-12-04T13:35:25.9056485Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_int8 PASSED [0.0822s] [ 36%] 2025-12-04T13:35:25.9056632Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_int16 PASSED [0.0814s] [ 36%] 2025-12-04T13:35:25.9056789Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_complex32 PASSED [0.1168s] [ 36%] 2025-12-04T13:35:25.9056944Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_complex64 PASSED [0.1124s] [ 36%] 2025-12-04T13:35:25.9057099Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_float32 PASSED [0.0998s] [ 36%] 2025-12-04T13:35:25.9057253Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_int16 PASSED [0.0818s] [ 36%] 2025-12-04T13:35:25.9057404Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_int32 PASSED [0.0810s] [ 36%] 2025-12-04T13:35:25.9057558Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_complex32 PASSED [0.1052s] [ 36%] 2025-12-04T13:35:25.9057724Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_int64 PASSED [0.0881s] [ 36%] 2025-12-04T13:35:25.9057887Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_complex_executor_aten_cuda_float32 PASSED [0.5022s] [ 36%] 2025-12-04T13:35:25.9058039Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_complex64 PASSED [0.1088s] [ 36%] 2025-12-04T13:35:25.9058229Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_float32 PASSED [0.0901s] [ 36%] 2025-12-04T13:35:25.9058382Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_complex32 PASSED [0.1035s] [ 36%] 2025-12-04T13:35:25.9058534Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_float64 PASSED [0.0885s] [ 36%] 2025-12-04T13:35:25.9058684Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_int8 PASSED [0.0766s] [ 36%] 2025-12-04T13:35:25.9058858Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int8 PASSED [0.0807s] [ 36%] 2025-12-04T13:35:25.9059003Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_uint8 PASSED [0.0866s] [ 36%] 2025-12-04T13:35:25.9059146Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_bool PASSED [0.1013s] [ 36%] 2025-12-04T13:35:25.9059299Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_complex128 PASSED [0.1061s] [ 36%] 2025-12-04T13:35:25.9059446Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_complex64 PASSED [0.0975s] [ 36%] 2025-12-04T13:35:25.9059595Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_float16 PASSED [0.0930s] [ 36%] 2025-12-04T13:35:25.9059742Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_bool PASSED [0.0988s] [ 37%] 2025-12-04T13:35:25.9059897Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_complex64 PASSED [0.0992s] [ 37%] 2025-12-04T13:35:25.9060046Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_float32 PASSED [0.0900s] [ 37%] 2025-12-04T13:35:25.9060193Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_int16 PASSED [0.0773s] [ 37%] 2025-12-04T13:35:25.9060338Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_int64 PASSED [0.0610s] [ 37%] 2025-12-04T13:35:25.9060489Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_polar_executor_aten_cuda_float32 PASSED [0.9494s] [ 37%] 2025-12-04T13:35:25.9060641Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_bfloat16 PASSED [0.0892s] [ 37%] 2025-12-04T13:35:25.9060796Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_complex64 PASSED [0.1011s] [ 37%] 2025-12-04T13:35:25.9060946Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_uint8 PASSED [0.0743s] [ 37%] 2025-12-04T13:35:25.9061074Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_bool PASSED [0.0733s] [ 37%] 2025-12-04T13:35:25.9061203Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_bool PASSED [0.1130s] [ 37%] 2025-12-04T13:35:25.9061333Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_int32 PASSED [0.0927s] [ 37%] 2025-12-04T13:35:25.9061462Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_int64 PASSED [0.0884s] [ 37%] 2025-12-04T13:35:25.9061619Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_uint8 PASSED [0.0847s] [ 37%] 2025-12-04T13:35:25.9061769Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_bfloat16 PASSED [0.1190s] [ 37%] 2025-12-04T13:35:25.9061902Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_float16 PASSED [0.1186s] [ 37%] 2025-12-04T13:35:25.9062033Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_int16 PASSED [0.0897s] [ 37%] 2025-12-04T13:35:25.9062162Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_float32 PASSED [0.3225s] [ 37%] 2025-12-04T13:35:25.9062290Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_int32 PASSED [0.3209s] [ 37%] 2025-12-04T13:35:25.9062427Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_bfloat16 PASSED [0.3339s] [ 37%] 2025-12-04T13:35:25.9062572Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_complex128 PASSED [0.2625s] [ 37%] 2025-12-04T13:35:25.9062722Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_float32 PASSED [0.2562s] [ 37%] 2025-12-04T13:35:25.9062859Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_float64 PASSED [0.2748s] [ 37%] 2025-12-04T13:35:25.9062999Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_complex64 PASSED [0.5553s] [ 37%] 2025-12-04T13:35:25.9063135Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_float16 PASSED [0.3466s] [ 37%] 2025-12-04T13:35:25.9063268Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_complex128 PASSED [1.6307s] [ 37%] 2025-12-04T13:35:25.9063400Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_float64 PASSED [0.0348s] [ 37%] 2025-12-04T13:35:25.9063545Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_complex32 PASSED [1.3616s] [ 37%] 2025-12-04T13:35:25.9063689Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_complex64 PASSED [0.0128s] [ 37%] 2025-12-04T13:35:25.9063826Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_int16 PASSED [0.0082s] [ 37%] 2025-12-04T13:35:25.9063959Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_int8 PASSED [1.3428s] [ 37%] 2025-12-04T13:35:25.9064089Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_float32 PASSED [0.0962s] [ 37%] 2025-12-04T13:35:25.9064220Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_float64 PASSED [0.0820s] [ 37%] 2025-12-04T13:35:25.9064348Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int64 PASSED [0.0424s] [ 37%] 2025-12-04T13:35:25.9064480Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_float32 PASSED [0.0443s] [ 37%] 2025-12-04T13:35:25.9064615Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_complex64 PASSED [0.0724s] [ 37%] 2025-12-04T13:35:25.9064743Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_float64 PASSED [0.0713s] [ 37%] 2025-12-04T13:35:25.9064870Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_int64 PASSED [0.0720s] [ 37%] 2025-12-04T13:35:25.9064995Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_int8 PASSED [0.0793s] [ 37%] 2025-12-04T13:35:25.9065131Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_float64 PASSED [0.0810s] [ 37%] 2025-12-04T13:35:25.9065279Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_bfloat16 PASSED [1.3722s] [ 37%] 2025-12-04T13:35:25.9065443Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_complex128 PASSED [0.0169s] [ 37%] 2025-12-04T13:35:25.9065595Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_complex64 PASSED [0.0136s] [ 37%] 2025-12-04T13:35:25.9065749Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_bool PASSED [0.0086s] [ 37%] 2025-12-04T13:35:25.9065908Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_float16 PASSED [0.0084s] [ 37%] 2025-12-04T13:35:25.9066063Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_int32 PASSED [1.3582s] [ 37%] 2025-12-04T13:35:25.9066215Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_complex64 PASSED [0.0235s] [ 37%] 2025-12-04T13:35:25.9066350Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_bfloat16 PASSED [0.1061s] [ 37%] 2025-12-04T13:35:25.9066481Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_bool PASSED [0.0950s] [ 37%] 2025-12-04T13:35:25.9066619Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int8 PASSED [0.0726s] [ 37%] 2025-12-04T13:35:25.9066755Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_bfloat16 PASSED [0.1044s] [ 37%] 2025-12-04T13:35:25.9066884Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_bool PASSED [0.0950s] [ 37%] 2025-12-04T13:35:25.9067021Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_complex128 PASSED [0.0827s] [ 37%] 2025-12-04T13:35:25.9067154Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_float16 PASSED [0.1117s] [ 37%] 2025-12-04T13:35:25.9067288Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_float64 PASSED [0.0691s] [ 37%] 2025-12-04T13:35:25.9067420Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_float32 PASSED [0.2734s] [ 37%] 2025-12-04T13:35:25.9067552Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_float16 PASSED [0.1044s] [ 37%] 2025-12-04T13:35:25.9067684Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_float64 PASSED [0.0682s] [ 37%] 2025-12-04T13:35:25.9067812Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int64 PASSED [0.0783s] [ 37%] 2025-12-04T13:35:25.9067939Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int8 PASSED [0.0726s] [ 37%] 2025-12-04T13:35:25.9068119Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_complex128 PASSED [0.2941s] [ 37%] 2025-12-04T13:35:25.9068257Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_complex32 PASSED [0.5031s] [ 37%] 2025-12-04T13:35:25.9068389Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_float32 PASSED [0.0693s] [ 37%] 2025-12-04T13:35:25.9068521Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_int64 PASSED [0.0777s] [ 37%] 2025-12-04T13:35:25.9068661Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_bfloat16 PASSED [0.0145s] [ 37%] 2025-12-04T13:35:25.9068805Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_complex128 PASSED [0.0137s] [ 37%] 2025-12-04T13:35:25.9068943Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_float32 PASSED [0.0134s] [ 37%] 2025-12-04T13:35:25.9069080Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_int16 PASSED [0.0136s] [ 37%] 2025-12-04T13:35:25.9069247Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_int64 PASSED [0.0130s] [ 37%] 2025-12-04T13:35:25.9069397Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_bool PASSED [0.0156s] [ 38%] 2025-12-04T13:35:25.9069533Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_int64 PASSED [0.0141s] [ 38%] 2025-12-04T13:35:25.9069670Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_uint8 PASSED [0.0161s] [ 38%] 2025-12-04T13:35:25.9069811Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_complex32 PASSED [0.0195s] [ 38%] 2025-12-04T13:35:25.9069948Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_float16 PASSED [0.0194s] [ 38%] 2025-12-04T13:35:25.9070084Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_int32 PASSED [0.0181s] [ 38%] 2025-12-04T13:35:25.9070222Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_int64 PASSED [0.3045s] [ 38%] 2025-12-04T13:35:25.9070370Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_left_shift_executor_aten_cuda_int8 PASSED [0.2921s] [ 38%] 2025-12-04T13:35:25.9070522Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int16 PASSED [0.0650s] [ 38%] 2025-12-04T13:35:25.9070659Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int32 PASSED [0.0650s] [ 38%] 2025-12-04T13:35:25.9070794Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int8 PASSED [0.0610s] [ 38%] 2025-12-04T13:35:25.9070930Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_int64 PASSED [0.2777s] [ 38%] 2025-12-04T13:35:25.9071079Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_int8 PASSED [0.2800s] [ 38%] 2025-12-04T13:35:25.9071218Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_int16 PASSED [0.3062s] [ 38%] 2025-12-04T13:35:25.9071355Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_int8 PASSED [0.2760s] [ 38%] 2025-12-04T13:35:25.9071500Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_complex32 PASSED [0.1019s] [ 38%] 2025-12-04T13:35:25.9071635Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_int16 PASSED [0.0473s] [ 38%] 2025-12-04T13:35:25.9071782Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_int32 PASSED [0.0305s] [ 38%] 2025-12-04T13:35:25.9071928Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_int64 PASSED [0.0307s] [ 38%] 2025-12-04T13:35:25.9072074Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_uint8 PASSED [0.0306s] [ 38%] 2025-12-04T13:35:25.9072217Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_float16 PASSED [0.0180s] [ 38%] 2025-12-04T13:35:25.9072360Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_float32 PASSED [0.0172s] [ 38%] 2025-12-04T13:35:25.9072499Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int32 PASSED [0.0167s] [ 38%] 2025-12-04T13:35:25.9072636Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int64 PASSED [1.4435s] [ 38%] 2025-12-04T13:35:25.9072774Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_uint8 PASSED [1.3705s] [ 38%] 2025-12-04T13:35:25.9072908Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_int16 XFAIL [0.0160s] [ 38%] 2025-12-04T13:35:25.9073058Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_bfloat16 PASSED [1.3972s] [ 38%] 2025-12-04T13:35:25.9073208Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_complex64 PASSED [0.0394s] [ 38%] 2025-12-04T13:35:25.9073339Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_float32 PASSED [0.0371s] [ 38%] 2025-12-04T13:35:25.9073464Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int8 PASSED [0.0363s] [ 38%] 2025-12-04T13:35:25.9073592Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_uint8 PASSED [0.0368s] [ 38%] 2025-12-04T13:35:25.9073725Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_float64 PASSED [0.0681s] [ 38%] 2025-12-04T13:35:25.9073855Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_int32 PASSED [0.0577s] [ 38%] 2025-12-04T13:35:25.9073983Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_uint8 PASSED [0.0544s] [ 38%] 2025-12-04T13:35:25.9074121Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_complex64 PASSED [0.0759s] [ 38%] 2025-12-04T13:35:25.9074263Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_int64 PASSED [0.0687s] [ 38%] 2025-12-04T13:35:25.9074392Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int16 PASSED [0.2044s] [ 38%] 2025-12-04T13:35:25.9074531Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_bfloat16 PASSED [0.6467s] [ 38%] 2025-12-04T13:35:25.9074667Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_float64 PASSED [0.4990s] [ 38%] 2025-12-04T13:35:25.9074801Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_bool PASSED [0.4516s] [ 38%] 2025-12-04T13:35:25.9074933Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_float32 PASSED [0.1356s] [ 38%] 2025-12-04T13:35:25.9075079Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_float16 PASSED [1.4882s] [ 38%] 2025-12-04T13:35:25.9075222Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_float32 PASSED [0.0164s] [ 38%] 2025-12-04T13:35:25.9075362Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_int64 PASSED [0.0128s] [ 38%] 2025-12-04T13:35:25.9075494Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_bfloat16 PASSED [0.0549s] [ 38%] 2025-12-04T13:35:25.9075630Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_complex64 PASSED [0.0905s] [ 38%] 2025-12-04T13:35:25.9075758Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_uint8 PASSED [0.0435s] [ 38%] 2025-12-04T13:35:25.9075901Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int32 PASSED [1.4124s] [ 38%] 2025-12-04T13:35:25.9076041Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int64 PASSED [0.0469s] [ 38%] 2025-12-04T13:35:25.9076194Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_complex128 PASSED [0.2544s] [ 38%] 2025-12-04T13:35:25.9076339Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_float16 PASSED [0.2553s] [ 38%] 2025-12-04T13:35:25.9076480Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_int16 PASSED [0.2504s] [ 38%] 2025-12-04T13:35:25.9076627Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_complex128 PASSED [0.1179s] [ 38%] 2025-12-04T13:35:25.9076766Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_float16 PASSED [0.1161s] [ 38%] 2025-12-04T13:35:25.9076927Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_float32 PASSED [0.1086s] [ 38%] 2025-12-04T13:35:25.9077074Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_int16 PASSED [0.1079s] [ 38%] 2025-12-04T13:35:25.9077212Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_float16 PASSED [2.1265s] [ 38%] 2025-12-04T13:35:25.9077344Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_bfloat16 PASSED [0.1118s] [ 38%] 2025-12-04T13:35:25.9077475Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_complex32 PASSED [0.5286s] [ 38%] 2025-12-04T13:35:25.9077603Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_int16 PASSED [0.0860s] [ 38%] 2025-12-04T13:35:25.9077739Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_complex64 PASSED [0.0888s] [ 38%] 2025-12-04T13:35:25.9077885Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_float32 PASSED [0.0596s] [ 38%] 2025-12-04T13:35:25.9078033Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_float16 PASSED [0.0780s] [ 38%] 2025-12-04T13:35:25.9078206Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_float32 PASSED [0.0647s] [ 38%] 2025-12-04T13:35:25.9078343Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_float64 PASSED [0.0664s] [ 38%] 2025-12-04T13:35:25.9078478Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int16 PASSED [0.0704s] [ 38%] 2025-12-04T13:35:25.9078612Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int32 PASSED [0.0783s] [ 38%] 2025-12-04T13:35:25.9078751Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int64 PASSED [0.0647s] [ 38%] 2025-12-04T13:35:25.9078884Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int8 PASSED [0.0719s] [ 38%] 2025-12-04T13:35:25.9079021Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_float32 PASSED [0.0265s] [ 39%] 2025-12-04T13:35:25.9079151Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_int8 PASSED [0.0279s] [ 39%] 2025-12-04T13:35:25.9079290Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_bfloat16 PASSED [0.1103s] [ 39%] 2025-12-04T13:35:25.9079425Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_float64 PASSED [0.0727s] [ 39%] 2025-12-04T13:35:25.9079559Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_uint8 PASSED [0.0770s] [ 39%] 2025-12-04T13:35:25.9079698Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_float32 PASSED [0.2325s] [ 39%] 2025-12-04T13:35:25.9079836Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_complex64 PASSED [0.0483s] [ 39%] 2025-12-04T13:35:25.9079968Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_float32 PASSED [0.0467s] [ 39%] 2025-12-04T13:35:25.9080098Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_uint8 PASSED [0.0465s] [ 39%] 2025-12-04T13:35:25.9080246Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_complex64 PASSED [0.0642s] [ 39%] 2025-12-04T13:35:25.9080382Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_bool PASSED [0.0497s] [ 39%] 2025-12-04T13:35:25.9080527Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_bool PASSED [0.0706s] [ 39%] 2025-12-04T13:35:25.9080677Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_float32 PASSED [0.0658s] [ 39%] 2025-12-04T13:35:25.9080855Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_float64 PASSED [0.0678s] [ 39%] 2025-12-04T13:35:25.9081015Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int32 PASSED [0.0653s] [ 39%] 2025-12-04T13:35:25.9081161Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_uint8 PASSED [0.0640s] [ 39%] 2025-12-04T13:35:25.9081298Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_bfloat16 PASSED [0.1160s] [ 39%] 2025-12-04T13:35:25.9081447Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_int32 PASSED [0.7536s] [ 39%] 2025-12-04T13:35:25.9081592Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_bool PASSED [0.3976s] [ 39%] 2025-12-04T13:35:25.9081740Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_int8 PASSED [0.4093s] [ 39%] 2025-12-04T13:35:25.9081891Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_float16 PASSED [0.5382s] [ 39%] 2025-12-04T13:35:25.9082053Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_int32 PASSED [0.2927s] [ 39%] 2025-12-04T13:35:25.9082200Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_uint8 PASSED [0.3030s] [ 39%] 2025-12-04T13:35:25.9082336Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_complex128 PASSED [0.0099s] [ 39%] 2025-12-04T13:35:25.9082467Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_float32 PASSED [0.0050s] [ 39%] 2025-12-04T13:35:25.9082604Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_complex32 PASSED [0.0095s] [ 39%] 2025-12-04T13:35:25.9082741Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_float16 PASSED [0.0092s] [ 39%] 2025-12-04T13:35:25.9082874Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_int8 PASSED [1.5675s] [ 39%] 2025-12-04T13:35:25.9083008Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_int16 PASSED [0.0163s] [ 39%] 2025-12-04T13:35:25.9083140Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_int64 PASSED [0.0128s] [ 39%] 2025-12-04T13:35:25.9083273Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_int8 PASSED [0.0131s] [ 39%] 2025-12-04T13:35:25.9083405Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_uint8 PASSED [0.0126s] [ 39%] 2025-12-04T13:35:25.9083584Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_bfloat16 SKIPPED [0.0001s] (Can't check result for empty) [ 39%] 2025-12-04T13:35:25.9083764Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_complex64 SKIPPED [0.0001s] (Can't check result for empty) [ 39%] 2025-12-04T13:35:25.9083938Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_float16 SKIPPED [0.0001s] (Can't check result for empty) [ 39%] 2025-12-04T13:35:25.9084110Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_float32 SKIPPED [0.0001s] (Can't check result for empty) [ 39%] 2025-12-04T13:35:25.9084284Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_float64 SKIPPED [0.0001s] (Can't check result for empty) [ 39%] 2025-12-04T13:35:25.9084518Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_uint8 SKIPPED [0.0001s] (Can't check result for empty_like) [ 39%] 2025-12-04T13:35:25.9084780Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_complex64 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 39%] 2025-12-04T13:35:25.9092489Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_float16 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 39%] 2025-12-04T13:35:25.9092633Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_int8 PASSED [1.6491s] [ 39%] 2025-12-04T13:35:25.9092778Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_complex64 PASSED [0.0313s] [ 39%] 2025-12-04T13:35:25.9092935Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_int16 PASSED [0.0268s] [ 39%] 2025-12-04T13:35:25.9093072Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_bfloat16 PASSED [0.1045s] [ 39%] 2025-12-04T13:35:25.9093201Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_float16 PASSED [0.1033s] [ 39%] 2025-12-04T13:35:25.9093331Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_float32 PASSED [0.0674s] [ 39%] 2025-12-04T13:35:25.9093460Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int8 PASSED [0.0722s] [ 39%] 2025-12-04T13:35:25.9093627Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_bool PASSED [0.0949s] [ 39%] 2025-12-04T13:35:25.9093764Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_float32 PASSED [0.0752s] [ 39%] 2025-12-04T13:35:25.9093897Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_int32 PASSED [0.0783s] [ 39%] 2025-12-04T13:35:25.9094030Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_bfloat16 PASSED [0.4268s] [ 39%] 2025-12-04T13:35:25.9094158Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int8 PASSED [0.2744s] [ 39%] 2025-12-04T13:35:25.9094289Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_uint8 PASSED [0.0816s] [ 39%] 2025-12-04T13:35:25.9094415Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_bool PASSED [0.1039s] [ 39%] 2025-12-04T13:35:25.9094546Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_float16 PASSED [0.1134s] [ 39%] 2025-12-04T13:35:25.9094674Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_float32 PASSED [0.0754s] [ 39%] 2025-12-04T13:35:25.9094802Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_float64 PASSED [0.0749s] [ 39%] 2025-12-04T13:35:25.9094928Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_int64 PASSED [0.0883s] [ 39%] 2025-12-04T13:35:25.9095063Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_bool PASSED [0.0092s] [ 39%] 2025-12-04T13:35:25.9095199Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_uint8 PASSED [0.0084s] [ 39%] 2025-12-04T13:35:25.9095330Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_bool PASSED [0.0204s] [ 39%] 2025-12-04T13:35:25.9095469Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_complex64 PASSED [0.0824s] [ 39%] 2025-12-04T13:35:25.9095602Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_float64 PASSED [0.0686s] [ 39%] 2025-12-04T13:35:25.9095731Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_float32 PASSED [0.4454s] [ 39%] 2025-12-04T13:35:25.9095858Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int64 PASSED [0.4124s] [ 39%] 2025-12-04T13:35:25.9095985Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_uint8 PASSED [0.4137s] [ 39%] 2025-12-04T13:35:25.9096156Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_complex32 PASSED [0.0226s] [ 39%] 2025-12-04T13:35:25.9096307Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_complex64 PASSED [1.5211s] [ 39%] 2025-12-04T13:35:25.9096444Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_float16 PASSED [0.0283s] [ 39%] 2025-12-04T13:35:25.9096577Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int16 PASSED [0.0296s] [ 40%] 2025-12-04T13:35:25.9096708Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int8 PASSED [0.0293s] [ 40%] 2025-12-04T13:35:25.9096841Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_uint8 PASSED [0.0293s] [ 40%] 2025-12-04T13:35:25.9096971Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_int64 PASSED [0.0285s] [ 40%] 2025-12-04T13:35:25.9097104Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_uint8 PASSED [0.0271s] [ 40%] 2025-12-04T13:35:25.9097241Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_complex32 PASSED [0.0253s] [ 40%] 2025-12-04T13:35:25.9097387Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_float64 PASSED [0.0360s] [ 40%] 2025-12-04T13:35:25.9097531Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_float32 PASSED [0.0257s] [ 40%] 2025-12-04T13:35:25.9097669Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int16 PASSED [0.0245s] [ 40%] 2025-12-04T13:35:25.9097806Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int32 PASSED [0.0239s] [ 40%] 2025-12-04T13:35:25.9097942Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int64 PASSED [0.0242s] [ 40%] 2025-12-04T13:35:25.9098130Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_uint8 PASSED [0.0247s] [ 40%] 2025-12-04T13:35:25.9098271Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_complex64 PASSED [0.0257s] [ 40%] 2025-12-04T13:35:25.9098409Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_float16 PASSED [0.0350s] [ 40%] 2025-12-04T13:35:25.9098543Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_int16 PASSED [0.0277s] [ 40%] 2025-12-04T13:35:25.9098677Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_int32 PASSED [0.0283s] [ 40%] 2025-12-04T13:35:25.9098816Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_complex128 PASSED [1.4454s] [ 40%] 2025-12-04T13:35:25.9098949Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int16 PASSED [0.0335s] [ 40%] 2025-12-04T13:35:25.9099081Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int8 PASSED [0.0280s] [ 40%] 2025-12-04T13:35:25.9099217Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_uint8 PASSED [0.0280s] [ 40%] 2025-12-04T13:35:25.9099350Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_bool PASSED [0.0331s] [ 40%] 2025-12-04T13:35:25.9099488Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_complex64 PASSED [0.0277s] [ 40%] 2025-12-04T13:35:25.9099621Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_int8 PASSED [0.0324s] [ 40%] 2025-12-04T13:35:25.9099759Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_complex32 PASSED [0.0283s] [ 40%] 2025-12-04T13:35:25.9099894Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int64 PASSED [0.0315s] [ 40%] 2025-12-04T13:35:25.9100086Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_uint8 PASSED [0.0312s] [ 40%] 2025-12-04T13:35:25.9100236Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_int64 PASSED [0.0332s] [ 40%] 2025-12-04T13:35:25.9100368Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_bool PASSED [0.0380s] [ 40%] 2025-12-04T13:35:25.9100507Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_complex64 PASSED [0.0325s] [ 40%] 2025-12-04T13:35:25.9100640Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_int32 PASSED [0.0373s] [ 40%] 2025-12-04T13:35:25.9100775Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_uint8 PASSED [0.0373s] [ 40%] 2025-12-04T13:35:25.9100921Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_bfloat16 PASSED [0.0253s] [ 40%] 2025-12-04T13:35:25.9101072Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_complex128 PASSED [0.0256s] [ 40%] 2025-12-04T13:35:25.9101231Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_complex64 PASSED [0.0253s] [ 40%] 2025-12-04T13:35:25.9101376Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_float16 PASSED [0.0244s] [ 40%] 2025-12-04T13:35:25.9101518Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_float32 PASSED [0.0239s] [ 40%] 2025-12-04T13:35:25.9101658Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_int32 PASSED [0.0234s] [ 40%] 2025-12-04T13:35:25.9101798Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_int64 PASSED [0.0235s] [ 40%] 2025-12-04T13:35:25.9101955Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_int32 SKIPPED [0.0001s] (Skipped!) [ 40%] 2025-12-04T13:35:25.9102091Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_int32 PASSED [0.0334s] [ 40%] 2025-12-04T13:35:25.9102226Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_uint8 PASSED [0.0327s] [ 40%] 2025-12-04T13:35:25.9102361Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_int16 PASSED [0.0436s] [ 40%] 2025-12-04T13:35:25.9102502Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_complex32 PASSED [1.8014s] [ 40%] 2025-12-04T13:35:25.9102637Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_int32 PASSED [0.0281s] [ 40%] 2025-12-04T13:35:25.9102772Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_float32 PASSED [0.0260s] [ 40%] 2025-12-04T13:35:25.9102907Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_int16 PASSED [0.0254s] [ 40%] 2025-12-04T13:35:25.9103040Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_int64 PASSED [0.0254s] [ 40%] 2025-12-04T13:35:25.9103175Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_bool PASSED [1.4223s] [ 40%] 2025-12-04T13:35:25.9103312Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_float64 PASSED [0.3541s] [ 40%] 2025-12-04T13:35:25.9103445Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int64 PASSED [0.0295s] [ 40%] 2025-12-04T13:35:25.9103580Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_float64 PASSED [0.0232s] [ 40%] 2025-12-04T13:35:25.9103712Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_uint8 PASSED [0.0266s] [ 40%] 2025-12-04T13:35:25.9103867Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_complex64 PASSED [0.0906s] [ 40%] 2025-12-04T13:35:25.9104013Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_float32 PASSED [0.1039s] [ 40%] 2025-12-04T13:35:25.9104147Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int32 PASSED [0.0936s] [ 40%] 2025-12-04T13:35:25.9104280Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int8 PASSED [0.0916s] [ 40%] 2025-12-04T13:35:25.9104407Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_bool PASSED [0.0203s] [ 40%] 2025-12-04T13:35:25.9104536Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int16 PASSED [0.0204s] [ 40%] 2025-12-04T13:35:25.9104666Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int32 PASSED [0.0196s] [ 40%] 2025-12-04T13:35:25.9104794Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int64 PASSED [0.0203s] [ 40%] 2025-12-04T13:35:25.9104930Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_bfloat16 PASSED [0.0061s] [ 40%] 2025-12-04T13:35:25.9105077Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_complex64 PASSED [0.0062s] [ 40%] 2025-12-04T13:35:25.9105210Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_int16 PASSED [0.0055s] [ 40%] 2025-12-04T13:35:25.9105356Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_complex128 PASSED [0.3369s] [ 40%] 2025-12-04T13:35:25.9105499Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_complex64 PASSED [0.4449s] [ 40%] 2025-12-04T13:35:25.9105641Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_float16 PASSED [0.4263s] [ 40%] 2025-12-04T13:35:25.9105778Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_int32 PASSED [0.4105s] [ 40%] 2025-12-04T13:35:25.9105923Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_float64 PASSED [1.7785s] [ 40%] 2025-12-04T13:35:25.9106061Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_int32 PASSED [0.7398s] [ 40%] 2025-12-04T13:35:25.9106194Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_float16 PASSED [0.4386s] [ 41%] 2025-12-04T13:35:25.9106323Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_int16 PASSED [0.2589s] [ 41%] 2025-12-04T13:35:25.9106453Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_int16 PASSED [0.2586s] [ 41%] 2025-12-04T13:35:25.9106582Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_float32 PASSED [0.3039s] [ 41%] 2025-12-04T13:35:25.9106712Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_uint8 PASSED [0.2797s] [ 41%] 2025-12-04T13:35:25.9106843Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_float32 PASSED [0.1439s] [ 41%] 2025-12-04T13:35:25.9106978Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_bfloat16 PASSED [0.1066s] [ 41%] 2025-12-04T13:35:25.9107110Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_float16 PASSED [0.1058s] [ 41%] 2025-12-04T13:35:25.9107252Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_bfloat16 PASSED [0.4092s] [ 41%] 2025-12-04T13:35:25.9107380Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_float16 PASSED [0.4098s] [ 41%] 2025-12-04T13:35:25.9107505Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_int64 PASSED [0.2819s] [ 41%] 2025-12-04T13:35:25.9107653Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_uint8 PASSED [2.1172s] [ 41%] 2025-12-04T13:35:25.9107860Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_float32 SKIPPED [0.0002s] (Expected: geometric is not comparable) [ 41%] 2025-12-04T13:35:25.9107989Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_bfloat16 PASSED [0.3921s] [ 41%] 2025-12-04T13:35:25.9108149Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int8 PASSED [0.2677s] [ 41%] 2025-12-04T13:35:25.9108290Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_bfloat16 PASSED [0.7337s] [ 41%] 2025-12-04T13:35:25.9108428Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_float32 PASSED [0.5899s] [ 41%] 2025-12-04T13:35:25.9108565Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_uint8 PASSED [0.6221s] [ 41%] 2025-12-04T13:35:25.9108697Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_bool PASSED [0.0095s] [ 41%] 2025-12-04T13:35:25.9108851Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_int32 PASSED [0.0094s] [ 41%] 2025-12-04T13:35:25.9108982Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_int64 PASSED [0.0084s] [ 41%] 2025-12-04T13:35:25.9109120Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_complex128 PASSED [0.0090s] [ 41%] 2025-12-04T13:35:25.9109253Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_float32 PASSED [0.0084s] [ 41%] 2025-12-04T13:35:25.9109379Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int64 PASSED [0.0837s] [ 41%] 2025-12-04T13:35:25.9109514Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_igammac_executor_aten_cuda_float64 PASSED [0.2974s] [ 41%] 2025-12-04T13:35:25.9109652Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_float16 PASSED [0.0282s] [ 41%] 2025-12-04T13:35:25.9109789Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_float64 PASSED [0.0278s] [ 41%] 2025-12-04T13:35:25.9109925Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int32 PASSED [0.0270s] [ 41%] 2025-12-04T13:35:25.9110064Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_float32 PASSED [0.0125s] [ 41%] 2025-12-04T13:35:25.9110201Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_uint8 PASSED [0.0109s] [ 41%] 2025-12-04T13:35:25.9110343Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_complex32 PASSED [0.0275s] [ 41%] 2025-12-04T13:35:25.9110483Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_complex64 PASSED [0.0270s] [ 41%] 2025-12-04T13:35:25.9110624Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_float64 PASSED [0.0269s] [ 41%] 2025-12-04T13:35:25.9110762Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int64 PASSED [0.0265s] [ 41%] 2025-12-04T13:35:25.9110900Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_bool PASSED [1.5660s] [ 41%] 2025-12-04T13:35:25.9111046Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_complex128 PASSED [0.0144s] [ 41%] 2025-12-04T13:35:25.9111179Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_bool PASSED [0.9330s] [ 41%] 2025-12-04T13:35:25.9111317Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_complex32 PASSED [0.1139s] [ 41%] 2025-12-04T13:35:25.9111488Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_float16 PASSED [0.1091s] [ 41%] 2025-12-04T13:35:25.9111639Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_float64 PASSED [0.0756s] [ 41%] 2025-12-04T13:35:25.9111775Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int32 PASSED [0.0847s] [ 41%] 2025-12-04T13:35:25.9111907Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int8 PASSED [0.0794s] [ 41%] 2025-12-04T13:35:25.9112044Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex128 PASSED [0.1909s] [ 41%] 2025-12-04T13:35:25.9112179Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex32 PASSED [0.2078s] [ 41%] 2025-12-04T13:35:25.9112312Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex64 PASSED [0.1886s] [ 41%] 2025-12-04T13:35:25.9112445Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_float16 PASSED [0.1126s] [ 41%] 2025-12-04T13:35:25.9112582Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_complex128 PASSED [0.0802s] [ 41%] 2025-12-04T13:35:25.9112734Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_float32 PASSED [0.0665s] [ 41%] 2025-12-04T13:35:25.9112871Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_float16 PASSED [0.0901s] [ 41%] 2025-12-04T13:35:25.9113111Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_istft_executor_aten_cuda_complex64 SKIPPED [0.0002s] (Expected: unfold_backward() got an unexpected keyword argument 'input_sizes') [ 41%] 2025-12-04T13:35:25.9113246Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_complex128 PASSED [0.0088s] [ 41%] 2025-12-04T13:35:25.9113377Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_float64 PASSED [0.0085s] [ 41%] 2025-12-04T13:35:25.9113505Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lcm_executor_aten_cuda_int64 PASSED [0.7810s] [ 41%] 2025-12-04T13:35:25.9113633Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_float32 PASSED [0.2999s] [ 41%] 2025-12-04T13:35:25.9113758Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int32 PASSED [0.2805s] [ 41%] 2025-12-04T13:35:25.9113882Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int8 PASSED [0.2754s] [ 41%] 2025-12-04T13:35:25.9114014Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_float16 PASSED [0.1618s] [ 41%] 2025-12-04T13:35:25.9114148Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_bfloat16 PASSED [0.5535s] [ 41%] 2025-12-04T13:35:25.9114278Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_bool PASSED [0.1073s] [ 41%] 2025-12-04T13:35:25.9114411Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_int32 PASSED [0.0889s] [ 41%] 2025-12-04T13:35:25.9114557Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_bfloat16 PASSED [1.5671s] [ 41%] 2025-12-04T13:35:25.9114703Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_complex128 PASSED [0.0344s] [ 41%] 2025-12-04T13:35:25.9114847Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_float64 PASSED [0.0306s] [ 41%] 2025-12-04T13:35:25.9114997Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_bfloat16 PASSED [0.0334s] [ 41%] 2025-12-04T13:35:25.9115141Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_int64 PASSED [0.0314s] [ 41%] 2025-12-04T13:35:25.9115285Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_int8 PASSED [0.0311s] [ 41%] 2025-12-04T13:35:25.9115458Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_float32 PASSED [0.3204s] [ 41%] 2025-12-04T13:35:25.9115613Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_float64 PASSED [0.1296s] [ 41%] 2025-12-04T13:35:25.9115766Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_complex128 PASSED [0.6012s] [ 41%] 2025-12-04T13:35:25.9115905Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_float64 PASSED [0.2195s] [ 42%] 2025-12-04T13:35:25.9116038Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int8 XFAIL [0.0057s] [ 42%] 2025-12-04T13:35:25.9116200Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_complex64 PASSED [2.4063s] [ 42%] 2025-12-04T13:35:25.9116361Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_float16 PASSED [1.0824s] [ 42%] 2025-12-04T13:35:25.9116530Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_float64 PASSED [0.9992s] [ 42%] 2025-12-04T13:35:25.9116660Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_bool PASSED [0.1078s] [ 42%] 2025-12-04T13:35:25.9116798Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_complex128 PASSED [0.0842s] [ 42%] 2025-12-04T13:35:25.9116926Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_int8 PASSED [0.0742s] [ 42%] 2025-12-04T13:35:25.9117059Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_bfloat16 PASSED [0.1177s] [ 42%] 2025-12-04T13:35:25.9117188Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int16 PASSED [0.0888s] [ 42%] 2025-12-04T13:35:25.9117320Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int64 PASSED [0.0889s] [ 42%] 2025-12-04T13:35:25.9117449Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int8 PASSED [0.0842s] [ 42%] 2025-12-04T13:35:25.9117576Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_uint8 PASSED [0.0841s] [ 42%] 2025-12-04T13:35:25.9117771Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_normal_executor_aten_cuda_bfloat16 SKIPPED [0.0001s] (Expected: log_normal is not comparable) [ 42%] 2025-12-04T13:35:25.9117964Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_normal_executor_aten_cuda_float64 SKIPPED [0.0001s] (Expected: log_normal is not comparable) [ 42%] 2025-12-04T13:35:25.9118161Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex32 PASSED [0.0616s] [ 42%] 2025-12-04T13:35:25.9118318Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex64 PASSED [0.0597s] [ 42%] 2025-12-04T13:35:25.9118474Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_float16 PASSED [0.0594s] [ 42%] 2025-12-04T13:35:25.9118618Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_bfloat16 PASSED [0.4837s] [ 42%] 2025-12-04T13:35:25.9118769Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_complex128 PASSED [0.4011s] [ 42%] 2025-12-04T13:35:25.9118907Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_int8 PASSED [0.0674s] [ 42%] 2025-12-04T13:35:25.9119045Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_uint8 PASSED [0.0663s] [ 42%] 2025-12-04T13:35:25.9119184Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_float16 PASSED [0.4926s] [ 42%] 2025-12-04T13:35:25.9119349Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_int32 PASSED [0.3648s] [ 42%] 2025-12-04T13:35:25.9119502Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_float32 PASSED [0.3782s] [ 42%] 2025-12-04T13:35:25.9119641Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_float16 PASSED [2.0702s] [ 42%] 2025-12-04T13:35:25.9119777Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_float32 PASSED [3.8397s] [ 42%] 2025-12-04T13:35:25.9119911Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int16 XFAIL [0.2609s] [ 42%] 2025-12-04T13:35:25.9120043Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int8 PASSED [2.2363s] [ 42%] 2025-12-04T13:35:25.9120205Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_float32 PASSED [10.7655s] [ 42%] 2025-12-04T13:35:25.9120342Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_int64 PASSED [0.0608s] [ 42%] 2025-12-04T13:35:25.9120487Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_bfloat16 PASSED [0.4094s] [ 42%] 2025-12-04T13:35:25.9120615Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_float16 PASSED [0.4021s] [ 42%] 2025-12-04T13:35:25.9120739Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_int16 PASSED [0.2799s] [ 42%] 2025-12-04T13:35:25.9120873Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int16 PASSED [0.2608s] [ 42%] 2025-12-04T13:35:25.9121005Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int32 PASSED [0.2624s] [ 42%] 2025-12-04T13:35:25.9121137Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_int64 PASSED [0.2633s] [ 42%] 2025-12-04T13:35:25.9121270Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_uint8 PASSED [0.2597s] [ 42%] 2025-12-04T13:35:25.9121406Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_complex64 PASSED [0.0578s] [ 42%] 2025-12-04T13:35:25.9121563Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_bfloat16 PASSED [0.0462s] [ 42%] 2025-12-04T13:35:25.9121721Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_bool PASSED [0.0395s] [ 42%] 2025-12-04T13:35:25.9121884Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_complex128 PASSED [0.0412s] [ 42%] 2025-12-04T13:35:25.9122043Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_float16 PASSED [0.0399s] [ 42%] 2025-12-04T13:35:25.9122203Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_float32 PASSED [0.0398s] [ 42%] 2025-12-04T13:35:25.9122359Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_int64 PASSED [0.0384s] [ 42%] 2025-12-04T13:35:25.9122492Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_int32 PASSED [0.2619s] [ 42%] 2025-12-04T13:35:25.9122630Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_complex128 PASSED [0.0257s] [ 42%] 2025-12-04T13:35:25.9122763Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_int32 PASSED [0.0245s] [ 42%] 2025-12-04T13:35:25.9122893Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_complex64 PASSED [0.3280s] [ 42%] 2025-12-04T13:35:25.9123040Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_complex32 PASSED [0.0642s] [ 42%] 2025-12-04T13:35:25.9123198Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_int16 PASSED [0.0672s] [ 42%] 2025-12-04T13:35:25.9123345Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_complex64 PASSED [0.1170s] [ 42%] 2025-12-04T13:35:25.9123477Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int64 PASSED [0.1120s] [ 42%] 2025-12-04T13:35:25.9123609Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_uint8 PASSED [0.1114s] [ 42%] 2025-12-04T13:35:25.9123740Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_complex128 PASSED [2.1902s] [ 42%] 2025-12-04T13:35:25.9123869Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_float16 PASSED [0.4078s] [ 42%] 2025-12-04T13:35:25.9123997Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_float32 PASSED [0.2857s] [ 42%] 2025-12-04T13:35:25.9124129Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_bfloat16 PASSED [0.1063s] [ 42%] 2025-12-04T13:35:25.9124280Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_float64 PASSED [0.0690s] [ 42%] 2025-12-04T13:35:25.9124406Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_int16 PASSED [0.0587s] [ 42%] 2025-12-04T13:35:25.9124531Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_int8 PASSED [0.0571s] [ 42%] 2025-12-04T13:35:25.9124715Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_int64 SKIPPED [0.0002s] (Can't check result for new_empty) [ 42%] 2025-12-04T13:35:25.9124896Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_uint8 SKIPPED [0.0001s] (Can't check result for new_empty) [ 42%] 2025-12-04T13:35:25.9125103Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_bfloat16 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 42%] 2025-12-04T13:35:25.9125314Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_complex128 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 42%] 2025-12-04T13:35:25.9125520Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_complex32 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 42%] 2025-12-04T13:35:25.9125723Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_float32 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 42%] 2025-12-04T13:35:25.9125925Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_float64 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 42%] 2025-12-04T13:35:25.9126128Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_int8 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 42%] 2025-12-04T13:35:25.9126269Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_complex64 PASSED [0.0244s] [ 43%] 2025-12-04T13:35:25.9126406Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_float32 PASSED [0.0228s] [ 43%] 2025-12-04T13:35:25.9126540Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int64 PASSED [0.0212s] [ 43%] 2025-12-04T13:35:25.9126672Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_bool PASSED [0.0218s] [ 43%] 2025-12-04T13:35:25.9126808Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_float16 PASSED [0.0229s] [ 43%] 2025-12-04T13:35:25.9126942Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_float32 PASSED [0.0210s] [ 43%] 2025-12-04T13:35:25.9127093Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_uint8 PASSED [0.0216s] [ 43%] 2025-12-04T13:35:25.9127241Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_float64 PASSED [0.0250s] [ 43%] 2025-12-04T13:35:25.9127375Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_int16 PASSED [0.0226s] [ 43%] 2025-12-04T13:35:25.9127512Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nextafter_executor_aten_cuda_float16 PASSED [0.2826s] [ 43%] 2025-12-04T13:35:25.9127725Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_bfloat16 SKIPPED [0.0002s] (Expected: dropout is not comparable) [ 43%] 2025-12-04T13:35:25.9127932Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_float16 SKIPPED [0.0001s] (Expected: dropout is not comparable) [ 43%] 2025-12-04T13:35:25.9128186Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_float64 SKIPPED [0.0001s] (Expected: dropout is not comparable) [ 43%] 2025-12-04T13:35:25.9128354Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_float32 PASSED [0.1521s] [ 43%] 2025-12-04T13:35:25.9128504Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_float64 PASSED [0.1510s] [ 43%] 2025-12-04T13:35:25.9128671Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_bfloat16 PASSED [0.0134s] [ 43%] 2025-12-04T13:35:25.9128839Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_complex128 PASSED [0.0131s] [ 43%] 2025-12-04T13:35:25.9129000Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_int8 PASSED [0.0134s] [ 43%] 2025-12-04T13:35:25.9129156Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_dropout_executor_aten_cuda_bfloat16 XFAIL [0.0279s] [ 43%] 2025-12-04T13:35:25.9129306Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_elu_executor_aten_cuda_float16 PASSED [1.6350s] [ 43%] 2025-12-04T13:35:25.9129452Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_elu_executor_aten_cuda_float32 PASSED [0.1727s] [ 43%] 2025-12-04T13:35:25.9129601Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_float32 PASSED [0.0655s] [ 43%] 2025-12-04T13:35:25.9129758Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_group_norm_executor_aten_cuda_bfloat16 PASSED [2.0776s] [ 43%] 2025-12-04T13:35:25.9129914Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_float64 PASSED [0.2409s] [ 43%] 2025-12-04T13:35:25.9130068Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_int64 PASSED [0.2188s] [ 43%] 2025-12-04T13:35:25.9130225Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_huber_loss_executor_aten_cuda_float64 PASSED [0.0642s] [ 43%] 2025-12-04T13:35:25.9130383Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_float64 PASSED [0.0551s] [ 43%] 2025-12-04T13:35:25.9130558Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_bfloat16 PASSED [0.0608s] [ 43%] 2025-12-04T13:35:25.9130733Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_complex128 PASSED [0.0599s] [ 43%] 2025-12-04T13:35:25.9130906Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_complex32 PASSED [0.0604s] [ 43%] 2025-12-04T13:35:25.9131101Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_int64 PASSED [0.0616s] [ 43%] 2025-12-04T13:35:25.9131266Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_nll_loss_executor_aten_cuda_float32 XFAIL [0.0557s] [ 43%] 2025-12-04T13:35:25.9131438Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_complex128 PASSED [0.0372s] [ 43%] 2025-12-04T13:35:25.9131603Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int64 PASSED [0.0375s] [ 43%] 2025-12-04T13:35:25.9131765Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_bfloat16 PASSED [0.0244s] [ 43%] 2025-12-04T13:35:25.9131921Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_bool PASSED [0.0235s] [ 43%] 2025-12-04T13:35:25.9132088Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_complex128 PASSED [0.0234s] [ 43%] 2025-12-04T13:35:25.9132246Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int64 PASSED [0.0233s] [ 43%] 2025-12-04T13:35:25.9132418Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_bool PASSED [0.0212s] [ 43%] 2025-12-04T13:35:25.9132581Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_float16 PASSED [0.0328s] [ 43%] 2025-12-04T13:35:25.9132745Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_float32 PASSED [0.0248s] [ 43%] 2025-12-04T13:35:25.9132906Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int16 PASSED [0.0234s] [ 43%] 2025-12-04T13:35:25.9133071Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_bfloat16 PASSED [0.6035s] [ 43%] 2025-12-04T13:35:25.9133233Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_int16 PASSED [0.5515s] [ 43%] 2025-12-04T13:35:25.9133385Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_prelu_executor_aten_cuda_float64 PASSED [0.4369s] [ 43%] 2025-12-04T13:35:25.9133537Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_bfloat16 PASSED [0.2599s] [ 43%] 2025-12-04T13:35:25.9133686Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_float16 PASSED [0.2588s] [ 43%] 2025-12-04T13:35:25.9133836Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_float64 PASSED [0.2196s] [ 43%] 2025-12-04T13:35:25.9133983Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_uint8 PASSED [0.1901s] [ 43%] 2025-12-04T13:35:25.9134133Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_bfloat16 PASSED [0.1570s] [ 43%] 2025-12-04T13:35:25.9134284Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_float16 PASSED [0.1532s] [ 43%] 2025-12-04T13:35:25.9134431Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_int64 PASSED [0.0955s] [ 43%] 2025-12-04T13:35:25.9134580Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_uint8 PASSED [0.0907s] [ 43%] 2025-12-04T13:35:25.9134727Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_selu_executor_aten_cuda_float32 PASSED [0.1566s] [ 43%] 2025-12-04T13:35:25.9134897Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_float16 PASSED [0.0445s] [ 43%] 2025-12-04T13:35:25.9135080Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_int8 PASSED [0.0439s] [ 43%] 2025-12-04T13:35:25.9135263Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_complex128 PASSED [0.0472s] [ 43%] 2025-12-04T13:35:25.9135420Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softshrink_executor_aten_cuda_bfloat16 PASSED [0.4599s] [ 43%] 2025-12-04T13:35:25.9135577Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_bfloat16 PASSED [0.1615s] [ 43%] 2025-12-04T13:35:25.9135730Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_int8 PASSED [0.1103s] [ 43%] 2025-12-04T13:35:25.9135902Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_complex64 PASSED [0.0951s] [ 43%] 2025-12-04T13:35:25.9136071Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_float32 PASSED [0.0865s] [ 43%] 2025-12-04T13:35:25.9136251Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int64 PASSED [0.0915s] [ 43%] 2025-12-04T13:35:25.9136417Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_uint8 PASSED [0.0900s] [ 43%] 2025-12-04T13:35:25.9136551Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_bfloat16 PASSED [0.1985s] [ 43%] 2025-12-04T13:35:25.9136684Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_float64 PASSED [0.1497s] [ 43%] 2025-12-04T13:35:25.9136819Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex128 PASSED [0.0066s] [ 43%] 2025-12-04T13:35:25.9136954Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex32 PASSED [0.0065s] [ 43%] 2025-12-04T13:35:25.9137090Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex64 PASSED [0.0072s] [ 43%] 2025-12-04T13:35:25.9137225Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_float16 PASSED [0.0063s] [ 44%] 2025-12-04T13:35:25.9137356Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_float32 PASSED [0.0063s] [ 44%] 2025-12-04T13:35:25.9137486Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_uint8 PASSED [0.0062s] [ 44%] 2025-12-04T13:35:25.9137633Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_complex128 PASSED [0.1744s] [ 44%] 2025-12-04T13:35:25.9137771Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_complex64 PASSED [0.1388s] [ 44%] 2025-12-04T13:35:25.9137906Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_float16 PASSED [0.1372s] [ 44%] 2025-12-04T13:35:25.9138043Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_float64 PASSED [0.1369s] [ 44%] 2025-12-04T13:35:25.9138219Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_int16 PASSED [0.1316s] [ 44%] 2025-12-04T13:35:25.9138352Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_int8 PASSED [0.1305s] [ 44%] 2025-12-04T13:35:25.9138490Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_bfloat16 PASSED [0.0524s] [ 44%] 2025-12-04T13:35:25.9138629Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_complex32 PASSED [0.0650s] [ 44%] 2025-12-04T13:35:25.9138769Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_complex64 PASSED [0.0652s] [ 44%] 2025-12-04T13:35:25.9138930Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int32 PASSED [1.8447s] [ 44%] 2025-12-04T13:35:25.9139084Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int64 PASSED [0.0457s] [ 44%] 2025-12-04T13:35:25.9139215Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_float16 PASSED [0.4808s] [ 44%] 2025-12-04T13:35:25.9139348Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_bfloat16 PASSED [0.1284s] [ 44%] 2025-12-04T13:35:25.9139481Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_float64 PASSED [0.0845s] [ 44%] 2025-12-04T13:35:25.9139611Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_int16 PASSED [0.1039s] [ 44%] 2025-12-04T13:35:25.9139739Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_uint8 PASSED [0.1024s] [ 44%] 2025-12-04T13:35:25.9139873Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_int8 PASSED [0.0767s] [ 44%] 2025-12-04T13:35:25.9140007Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_uint8 PASSED [0.0769s] [ 44%] 2025-12-04T13:35:25.9140212Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_bfloat16 SKIPPED [0.0001s] (make_traced() doesn't set seed properly!) [ 44%] 2025-12-04T13:35:25.9140345Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_bfloat16 PASSED [0.0086s] [ 44%] 2025-12-04T13:35:25.9140473Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_bool PASSED [0.0081s] [ 44%] 2025-12-04T13:35:25.9140608Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_complex32 PASSED [1.3851s] [ 44%] 2025-12-04T13:35:25.9140746Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_complex64 PASSED [0.0129s] [ 44%] 2025-12-04T13:35:25.9140880Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_float16 PASSED [0.0088s] [ 44%] 2025-12-04T13:35:25.9141016Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_complex128 PASSED [0.0894s] [ 44%] 2025-12-04T13:35:25.9141146Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_int16 PASSED [0.0457s] [ 44%] 2025-12-04T13:35:25.9141288Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_bfloat16 PASSED [0.1153s] [ 44%] 2025-12-04T13:35:25.9141424Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_bool PASSED [0.1037s] [ 44%] 2025-12-04T13:35:25.9141564Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_float16 PASSED [0.1150s] [ 44%] 2025-12-04T13:35:25.9141703Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int64 PASSED [0.0863s] [ 44%] 2025-12-04T13:35:25.9141841Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_float32 PASSED [0.3056s] [ 44%] 2025-12-04T13:35:25.9141975Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_bool PASSED [0.1307s] [ 44%] 2025-12-04T13:35:25.9142110Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_int16 PASSED [0.0897s] [ 44%] 2025-12-04T13:35:25.9142246Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_bfloat16 PASSED [0.1137s] [ 44%] 2025-12-04T13:35:25.9142385Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_complex128 PASSED [0.1144s] [ 44%] 2025-12-04T13:35:25.9142522Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_complex32 PASSED [0.1152s] [ 44%] 2025-12-04T13:35:25.9142650Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_bool PASSED [0.0523s] [ 44%] 2025-12-04T13:35:25.9142798Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_int16 PASSED [0.0512s] [ 44%] 2025-12-04T13:35:25.9142947Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_complex128 PASSED [0.0797s] [ 44%] 2025-12-04T13:35:25.9143079Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_float32 PASSED [0.0778s] [ 44%] 2025-12-04T13:35:25.9143208Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_int8 PASSED [0.0749s] [ 44%] 2025-12-04T13:35:25.9143338Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int64 PASSED [0.0583s] [ 44%] 2025-12-04T13:35:25.9143467Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int8 PASSED [0.0541s] [ 44%] 2025-12-04T13:35:25.9143598Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_bool PASSED [0.1052s] [ 44%] 2025-12-04T13:35:25.9143732Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_int32 PASSED [0.0868s] [ 44%] 2025-12-04T13:35:25.9143864Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_int8 PASSED [0.0814s] [ 44%] 2025-12-04T13:35:25.9144010Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_bfloat16 PASSED [0.4853s] [ 44%] 2025-12-04T13:35:25.9144155Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_select_scatter_executor_aten_cuda_int64 PASSED [0.0308s] [ 44%] 2025-12-04T13:35:25.9144295Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_bfloat16 PASSED [0.1103s] [ 44%] 2025-12-04T13:35:25.9144427Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_complex32 PASSED [0.3282s] [ 44%] 2025-12-04T13:35:25.9144555Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_float64 PASSED [0.0676s] [ 44%] 2025-12-04T13:35:25.9144694Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_bfloat16 PASSED [0.1656s] [ 44%] 2025-12-04T13:35:25.9144827Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_bool PASSED [0.1676s] [ 44%] 2025-12-04T13:35:25.9144967Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_complex32 PASSED [0.9101s] [ 44%] 2025-12-04T13:35:25.9145104Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_float32 PASSED [0.1286s] [ 44%] 2025-12-04T13:35:25.9145240Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_float64 PASSED [0.1275s] [ 44%] 2025-12-04T13:35:25.9145372Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_int64 PASSED [0.1496s] [ 44%] 2025-12-04T13:35:25.9145499Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_bool PASSED [0.0757s] [ 44%] 2025-12-04T13:35:25.9145631Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_float16 PASSED [0.1078s] [ 44%] 2025-12-04T13:35:25.9145762Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_int16 PASSED [0.0590s] [ 44%] 2025-12-04T13:35:25.9145893Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_bfloat16 PASSED [0.1041s] [ 44%] 2025-12-04T13:35:25.9146025Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_float16 PASSED [0.5727s] [ 44%] 2025-12-04T13:35:25.9146153Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int16 PASSED [0.0801s] [ 44%] 2025-12-04T13:35:25.9146309Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_complex64 PASSED [0.0444s] [ 44%] 2025-12-04T13:35:25.9146457Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_float64 PASSED [0.0409s] [ 44%] 2025-12-04T13:35:25.9146624Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_int64 PASSED [0.0883s] [ 44%] 2025-12-04T13:35:25.9146781Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_uint8 PASSED [0.0840s] [ 45%] 2025-12-04T13:35:25.9146926Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_int8 PASSED [0.0840s] [ 45%] 2025-12-04T13:35:25.9147073Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_uint8 PASSED [0.0836s] [ 45%] 2025-12-04T13:35:25.9147216Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_float64 PASSED [0.0771s] [ 45%] 2025-12-04T13:35:25.9147359Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_bfloat16 PASSED [0.4645s] [ 45%] 2025-12-04T13:35:25.9147497Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_int64 PASSED [0.2887s] [ 45%] 2025-12-04T13:35:25.9147636Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_uint8 PASSED [0.0798s] [ 45%] 2025-12-04T13:35:25.9147785Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_bool PASSED [0.1081s] [ 45%] 2025-12-04T13:35:25.9147926Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_int64 PASSED [0.0850s] [ 45%] 2025-12-04T13:35:25.9148109Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_bool PASSED [0.3457s] [ 45%] 2025-12-04T13:35:25.9148257Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_int64 PASSED [0.2913s] [ 45%] 2025-12-04T13:35:25.9148399Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_int8 PASSED [0.2719s] [ 45%] 2025-12-04T13:35:25.9148562Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_bool PASSED [0.0714s] [ 45%] 2025-12-04T13:35:25.9148724Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int64 PASSED [0.0606s] [ 45%] 2025-12-04T13:35:25.9148888Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_uint8 PASSED [0.0664s] [ 45%] 2025-12-04T13:35:25.9149035Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_float64 PASSED [0.2086s] [ 45%] 2025-12-04T13:35:25.9149176Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int16 PASSED [0.2062s] [ 45%] 2025-12-04T13:35:25.9149318Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int64 PASSED [0.2069s] [ 45%] 2025-12-04T13:35:25.9149456Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int8 PASSED [0.1954s] [ 45%] 2025-12-04T13:35:25.9149628Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_bfloat16 PASSED [0.4268s] [ 45%] 2025-12-04T13:35:25.9149799Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_float16 PASSED [0.4182s] [ 45%] 2025-12-04T13:35:25.9149967Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_int16 PASSED [0.3634s] [ 45%] 2025-12-04T13:35:25.9150135Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_float16 PASSED [0.4621s] [ 45%] 2025-12-04T13:35:25.9150307Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_float32 PASSED [0.4024s] [ 45%] 2025-12-04T13:35:25.9150473Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_int16 PASSED [0.3814s] [ 45%] 2025-12-04T13:35:25.9150643Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_bool PASSED [0.1093s] [ 45%] 2025-12-04T13:35:25.9150800Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_float32 PASSED [0.2084s] [ 45%] 2025-12-04T13:35:25.9150945Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_float64 PASSED [0.3990s] [ 45%] 2025-12-04T13:35:25.9151105Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_int64 PASSED [0.0445s] [ 45%] 2025-12-04T13:35:25.9151266Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_float32 PASSED [0.0796s] [ 45%] 2025-12-04T13:35:25.9151425Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_int8 PASSED [0.0841s] [ 45%] 2025-12-04T13:35:25.9151584Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_uint8 PASSED [0.0840s] [ 45%] 2025-12-04T13:35:25.9151748Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_bfloat16 PASSED [2.7731s] [ 45%] 2025-12-04T13:35:25.9151885Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_bool PASSED [0.3961s] [ 45%] 2025-12-04T13:35:25.9152029Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_float32 PASSED [1.0787s] [ 45%] 2025-12-04T13:35:25.9152172Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_bool PASSED [0.0153s] [ 45%] 2025-12-04T13:35:25.9152321Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_complex32 PASSED [0.0170s] [ 45%] 2025-12-04T13:35:25.9152467Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_float16 PASSED [0.0155s] [ 45%] 2025-12-04T13:35:25.9152616Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_float32 PASSED [0.0167s] [ 45%] 2025-12-04T13:35:25.9152760Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_int16 PASSED [0.0146s] [ 45%] 2025-12-04T13:35:25.9152896Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_complex128 PASSED [0.2929s] [ 45%] 2025-12-04T13:35:25.9153028Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_float64 PASSED [0.0687s] [ 45%] 2025-12-04T13:35:25.9153156Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int16 PASSED [0.0785s] [ 45%] 2025-12-04T13:35:25.9153288Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_int64 PASSED [0.0684s] [ 45%] 2025-12-04T13:35:25.9153428Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_int64 PASSED [0.0225s] [ 45%] 2025-12-04T13:35:25.9153568Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_complex32 PASSED [0.0185s] [ 45%] 2025-12-04T13:35:25.9153703Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_float64 PASSED [0.0177s] [ 45%] 2025-12-04T13:35:25.9153837Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_uint8 PASSED [0.0244s] [ 45%] 2025-12-04T13:35:25.9153987Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_bfloat16 PASSED [0.0148s] [ 45%] 2025-12-04T13:35:25.9154133Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int16 PASSED [0.0146s] [ 45%] 2025-12-04T13:35:25.9154277Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int8 PASSED [0.0140s] [ 45%] 2025-12-04T13:35:25.9154437Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_bfloat16 PASSED [0.0371s] [ 45%] 2025-12-04T13:35:25.9154585Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_complex128 PASSED [0.0367s] [ 45%] 2025-12-04T13:35:25.9154717Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int8 PASSED [0.0360s] [ 45%] 2025-12-04T13:35:25.9154847Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_uint8 PASSED [0.0354s] [ 45%] 2025-12-04T13:35:25.9154978Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_bfloat16 PASSED [0.0553s] [ 45%] 2025-12-04T13:35:25.9155118Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_complex128 PASSED [0.0583s] [ 45%] 2025-12-04T13:35:25.9155250Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_complex32 PASSED [0.5333s] [ 45%] 2025-12-04T13:35:25.9155383Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_complex64 PASSED [0.3314s] [ 45%] 2025-12-04T13:35:25.9155513Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_float32 PASSED [0.3246s] [ 45%] 2025-12-04T13:35:25.9155652Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int16 PASSED [0.2970s] [ 45%] 2025-12-04T13:35:25.9155778Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int64 PASSED [0.3042s] [ 45%] 2025-12-04T13:35:25.9155908Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_bfloat16 PASSED [0.0669s] [ 45%] 2025-12-04T13:35:25.9156039Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_complex128 PASSED [0.0466s] [ 45%] 2025-12-04T13:35:25.9156170Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_complex32 PASSED [0.0685s] [ 45%] 2025-12-04T13:35:25.9156301Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_complex64 PASSED [0.0451s] [ 45%] 2025-12-04T13:35:25.9156432Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_float16 PASSED [0.0666s] [ 45%] 2025-12-04T13:35:25.9156562Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_float64 PASSED [0.0455s] [ 45%] 2025-12-04T13:35:25.9156702Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_bfloat16 PASSED [0.0546s] [ 45%] 2025-12-04T13:35:25.9156845Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_complex128 PASSED [0.0418s] [ 45%] 2025-12-04T13:35:25.9156984Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_float32 PASSED [0.0406s] [ 46%] 2025-12-04T13:35:25.9157121Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_float64 PASSED [0.0405s] [ 46%] 2025-12-04T13:35:25.9157258Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_int32 PASSED [0.0469s] [ 46%] 2025-12-04T13:35:25.9157385Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_bool PASSED [0.0080s] [ 46%] 2025-12-04T13:35:25.9157513Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_float64 PASSED [0.0080s] [ 46%] 2025-12-04T13:35:25.9157638Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_int64 PASSED [1.5408s] [ 46%] 2025-12-04T13:35:25.9157788Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_complex128 PASSED [0.0263s] [ 46%] 2025-12-04T13:35:25.9157934Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_float64 PASSED [0.0227s] [ 46%] 2025-12-04T13:35:25.9158128Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_int32 PASSED [0.0211s] [ 46%] 2025-12-04T13:35:25.9158294Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_int8 PASSED [0.0206s] [ 46%] 2025-12-04T13:35:25.9158435Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int32 PASSED [0.0801s] [ 46%] 2025-12-04T13:35:25.9158571Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_complex128 PASSED [0.0845s] [ 46%] 2025-12-04T13:35:25.9158703Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_float64 PASSED [0.0700s] [ 46%] 2025-12-04T13:35:25.9158847Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_bfloat16 PASSED [0.0386s] [ 46%] 2025-12-04T13:35:25.9158989Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_float32 PASSED [0.0376s] [ 46%] 2025-12-04T13:35:25.9159129Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_int64 PASSED [0.0346s] [ 46%] 2025-12-04T13:35:25.9159267Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_int8 PASSED [0.0358s] [ 46%] 2025-12-04T13:35:25.9159419Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_int16 PASSED [0.0707s] [ 46%] 2025-12-04T13:35:25.9159554Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_bfloat16 PASSED [0.0066s] [ 46%] 2025-12-04T13:35:25.9159690Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_complex128 PASSED [0.0054s] [ 46%] 2025-12-04T13:35:25.9159823Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_float64 PASSED [0.0051s] [ 46%] 2025-12-04T13:35:25.9159953Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_int64 PASSED [1.4078s] [ 46%] 2025-12-04T13:35:25.9160099Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_float64 PASSED [0.0266s] [ 46%] 2025-12-04T13:35:25.9160229Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int64 PASSED [0.0518s] [ 46%] 2025-12-04T13:35:25.9160369Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_indices_executor_aten_cuda_int32 PASSED [0.1154s] [ 46%] 2025-12-04T13:35:25.9160495Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_bool PASSED [0.0508s] [ 46%] 2025-12-04T13:35:25.9160626Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_float64 PASSED [0.0597s] [ 46%] 2025-12-04T13:35:25.9160753Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int16 PASSED [0.0515s] [ 46%] 2025-12-04T13:35:25.9160890Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_bool PASSED [0.3959s] [ 46%] 2025-12-04T13:35:25.9161029Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_float16 PASSED [0.5007s] [ 46%] 2025-12-04T13:35:25.9161171Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_float32 PASSED [0.3128s] [ 46%] 2025-12-04T13:35:25.9161308Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int16 PASSED [0.4248s] [ 46%] 2025-12-04T13:35:25.9161445Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int64 PASSED [0.4137s] [ 46%] 2025-12-04T13:35:25.9161582Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int8 PASSED [0.4062s] [ 46%] 2025-12-04T13:35:25.9161714Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_float64 PASSED [0.0702s] [ 46%] 2025-12-04T13:35:25.9161848Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_float64 PASSED [0.0414s] [ 46%] 2025-12-04T13:35:25.9161984Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_bool PASSED [0.0245s] [ 46%] 2025-12-04T13:35:25.9162148Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_bfloat16 PASSED [0.0647s] [ 46%] 2025-12-04T13:35:25.9162299Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_float64 PASSED [0.0629s] [ 46%] 2025-12-04T13:35:25.9162437Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_int32 PASSED [0.0614s] [ 46%] 2025-12-04T13:35:25.9162573Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_complex128 PASSED [0.0525s] [ 46%] 2025-12-04T13:35:25.9162710Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_complex32 PASSED [0.0518s] [ 46%] 2025-12-04T13:35:25.9162844Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_float16 PASSED [0.0518s] [ 46%] 2025-12-04T13:35:25.9162980Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_float64 PASSED [0.0480s] [ 46%] 2025-12-04T13:35:25.9163112Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int16 PASSED [0.0488s] [ 46%] 2025-12-04T13:35:25.9163276Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_complex128 PASSED [0.0256s] [ 46%] 2025-12-04T13:35:25.9163420Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_int16 PASSED [0.0247s] [ 46%] 2025-12-04T13:35:25.9163553Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_complex64 PASSED [0.0345s] [ 46%] 2025-12-04T13:35:25.9163688Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_bfloat16 PASSED [0.0822s] [ 46%] 2025-12-04T13:35:25.9163823Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_complex64 PASSED [0.0534s] [ 46%] 2025-12-04T13:35:25.9163970Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_complex_executor_aten_cuda_float16 PASSED [1.5170s] [ 46%] 2025-12-04T13:35:25.9164104Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_complex32 PASSED [0.0876s] [ 46%] 2025-12-04T13:35:25.9164240Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_float32 PASSED [0.0913s] [ 46%] 2025-12-04T13:35:25.9164371Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_int16 PASSED [0.0802s] [ 46%] 2025-12-04T13:35:25.9164506Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_bool PASSED [0.0194s] [ 46%] 2025-12-04T13:35:25.9164640Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_int16 PASSED [0.0195s] [ 46%] 2025-12-04T13:35:25.9164774Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_uint8 PASSED [0.0194s] [ 46%] 2025-12-04T13:35:25.9164907Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_complex64 PASSED [1.5337s] [ 46%] 2025-12-04T13:35:25.9165047Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_complex128 PASSED [0.0137s] [ 46%] 2025-12-04T13:35:25.9165181Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_float64 PASSED [0.0097s] [ 46%] 2025-12-04T13:35:25.9165314Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_int32 PASSED [0.0090s] [ 46%] 2025-12-04T13:35:25.9165445Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_uint8 PASSED [0.0092s] [ 46%] 2025-12-04T13:35:25.9165576Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_bool PASSED [0.0099s] [ 46%] 2025-12-04T13:35:25.9165709Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_float64 PASSED [0.0100s] [ 46%] 2025-12-04T13:35:25.9165860Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_bfloat16 PASSED [0.0684s] [ 46%] 2025-12-04T13:35:25.9166002Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_float32 PASSED [0.0668s] [ 46%] 2025-12-04T13:35:25.9166132Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_uint8 PASSED [0.0509s] [ 46%] 2025-12-04T13:35:25.9166264Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_int16 PASSED [0.7217s] [ 46%] 2025-12-04T13:35:25.9166392Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_int8 PASSED [0.7204s] [ 46%] 2025-12-04T13:35:25.9166526Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_bfloat16 PASSED [0.0069s] [ 47%] 2025-12-04T13:35:25.9166659Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_complex64 PASSED [0.0066s] [ 47%] 2025-12-04T13:35:25.9166789Z test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_int8 PASSED [0.0062s] [ 47%] 2025-12-04T13:35:25.9166898Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_float32 PASSED [1.4951s] [ 47%] 2025-12-04T13:35:25.9167017Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_float64 PASSED [1.3945s] [ 47%] 2025-12-04T13:35:25.9167152Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_complex128 PASSED [1.4559s] [ 47%] 2025-12-04T13:35:25.9167283Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_float16 PASSED [1.4374s] [ 47%] 2025-12-04T13:35:25.9167409Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_complex32 PASSED [1.5395s] [ 47%] 2025-12-04T13:35:25.9167535Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_complex64 PASSED [1.5114s] [ 47%] 2025-12-04T13:35:25.9167658Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_uint8 PASSED [1.4916s] [ 47%] 2025-12-04T13:35:25.9167786Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_float16 PASSED [1.5425s] [ 47%] 2025-12-04T13:35:25.9167910Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_bool PASSED [1.5218s] [ 47%] 2025-12-04T13:35:25.9168038Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_complex32 PASSED [1.5144s] [ 47%] 2025-12-04T13:35:25.9168204Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_float16 PASSED [1.5199s] [ 47%] 2025-12-04T13:35:25.9168328Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_float32 PASSED [1.5007s] [ 47%] 2025-12-04T13:35:25.9168452Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_bfloat16 PASSED [1.5311s] [ 47%] 2025-12-04T13:35:25.9168578Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_complex128 PASSED [1.5264s] [ 47%] 2025-12-04T13:35:25.9168709Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_complex64 PASSED [1.5161s] [ 47%] 2025-12-04T13:35:25.9168830Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_int32 PASSED [1.5179s] [ 47%] 2025-12-04T13:35:25.9168958Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_complex_cuda_float16 PASSED [0.1447s] [ 47%] 2025-12-04T13:35:25.9169085Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_bfloat16 PASSED [0.0404s] [ 47%] 2025-12-04T13:35:25.9169209Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_int64 PASSED [0.0353s] [ 47%] 2025-12-04T13:35:25.9169334Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_float32 PASSED [0.0194s] [ 47%] 2025-12-04T13:35:25.9169459Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_bfloat16 PASSED [0.0399s] [ 47%] 2025-12-04T13:35:25.9169618Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_float16 PASSED [1.5003s] [ 47%] 2025-12-04T13:35:25.9169752Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int64 PASSED [1.5293s] [ 47%] 2025-12-04T13:35:25.9169877Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_bfloat16 PASSED [1.5383s] [ 47%] 2025-12-04T13:35:25.9169999Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_int64 PASSED [1.5217s] [ 47%] 2025-12-04T13:35:25.9170119Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_bool PASSED [1.5053s] [ 47%] 2025-12-04T13:35:25.9170242Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_float32 PASSED [1.5339s] [ 47%] 2025-12-04T13:35:25.9170353Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_bfloat16 PASSED [1.5268s] [ 47%] 2025-12-04T13:35:25.9170466Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_complex32 PASSED [1.5450s] [ 47%] 2025-12-04T13:35:25.9170573Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int32 PASSED [1.5267s] [ 47%] 2025-12-04T13:35:25.9170693Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_bool PASSED [1.5401s] [ 47%] 2025-12-04T13:35:25.9170805Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_complex32 PASSED [1.5601s] [ 47%] 2025-12-04T13:35:25.9170915Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_bfloat16 PASSED [1.5139s] [ 47%] 2025-12-04T13:35:25.9171024Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_bool PASSED [1.5043s] [ 47%] 2025-12-04T13:35:25.9171137Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_complex128 PASSED [1.5447s] [ 47%] 2025-12-04T13:35:25.9171247Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_float16 PASSED [1.5390s] [ 47%] 2025-12-04T13:35:25.9171353Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_int8 PASSED [1.5236s] [ 47%] 2025-12-04T13:35:25.9171464Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_complex128 PASSED [0.1392s] [ 47%] 2025-12-04T13:35:25.9171573Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_float64 PASSED [0.1287s] [ 47%] 2025-12-04T13:35:25.9171680Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_int8 PASSED [0.1171s] [ 47%] 2025-12-04T13:35:25.9171784Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_uint8 PASSED [0.1166s] [ 47%] 2025-12-04T13:35:25.9171898Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_float32 PASSED [0.0619s] [ 47%] 2025-12-04T13:35:25.9172007Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_int64 PASSED [0.0545s] [ 47%] 2025-12-04T13:35:25.9172117Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_complex64 PASSED [0.0078s] [ 47%] 2025-12-04T13:35:25.9172225Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_float32 PASSED [0.0100s] [ 47%] 2025-12-04T13:35:25.9172333Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_float64 PASSED [1.4775s] [ 47%] 2025-12-04T13:35:25.9172440Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_uint8 PASSED [1.4973s] [ 47%] 2025-12-04T13:35:25.9172561Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_complex32 PASSED [1.4800s] [ 47%] 2025-12-04T13:35:25.9172671Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_complex64 PASSED [1.5078s] [ 47%] 2025-12-04T13:35:25.9172788Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_complex128 PASSED [1.5315s] [ 47%] 2025-12-04T13:35:25.9172902Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_float64 PASSED [1.5286s] [ 47%] 2025-12-04T13:35:25.9173006Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_bool PASSED [1.5066s] [ 47%] 2025-12-04T13:35:25.9173112Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_int64 PASSED [1.4900s] [ 47%] 2025-12-04T13:35:25.9173238Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int8 PASSED [1.5010s] [ 47%] 2025-12-04T13:35:25.9173360Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_complex128 PASSED [1.5226s] [ 47%] 2025-12-04T13:35:25.9173463Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int32 PASSED [1.5043s] [ 47%] 2025-12-04T13:35:25.9173568Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int64 PASSED [1.4949s] [ 47%] 2025-12-04T13:35:25.9173678Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_float32 PASSED [0.0255s] [ 47%] 2025-12-04T13:35:25.9173788Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_float64 PASSED [0.0191s] [ 47%] 2025-12-04T13:35:25.9173896Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_int16 PASSED [0.0110s] [ 47%] 2025-12-04T13:35:25.9174005Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_int64 PASSED [0.0090s] [ 47%] 2025-12-04T13:35:25.9174125Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_int64 PASSED [0.0060s] [ 47%] 2025-12-04T13:35:25.9174257Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_complex128 PASSED [0.0060s] [ 47%] 2025-12-04T13:35:25.9174368Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_int64 PASSED [1.5053s] [ 47%] 2025-12-04T13:35:25.9174502Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_int32 PASSED [1.5079s] [ 47%] 2025-12-04T13:35:25.9174633Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_uint8 PASSED [1.4817s] [ 47%] 2025-12-04T13:35:25.9174761Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_complex32 PASSED [1.5060s] [ 48%] 2025-12-04T13:35:25.9174886Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_float64 PASSED [1.5107s] [ 48%] 2025-12-04T13:35:25.9175008Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_int16 PASSED [1.5141s] [ 48%] 2025-12-04T13:35:25.9175131Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_int64 PASSED [1.5045s] [ 48%] 2025-12-04T13:35:25.9175243Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_complex32 PASSED [1.5599s] [ 48%] 2025-12-04T13:35:25.9175351Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_float64 PASSED [1.5240s] [ 48%] 2025-12-04T13:35:25.9175457Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_int16 PASSED [1.5326s] [ 48%] 2025-12-04T13:35:25.9175564Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_int32 PASSED [1.5243s] [ 48%] 2025-12-04T13:35:25.9175670Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_uint8 PASSED [1.5238s] [ 48%] 2025-12-04T13:35:25.9175776Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_bool PASSED [0.1221s] [ 48%] 2025-12-04T13:35:25.9175887Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_float64 PASSED [0.0962s] [ 48%] 2025-12-04T13:35:25.9175996Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_uint8 PASSED [0.1060s] [ 48%] 2025-12-04T13:35:25.9176106Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_bfloat16 PASSED [0.0290s] [ 48%] 2025-12-04T13:35:25.9176214Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_float32 PASSED [0.0239s] [ 48%] 2025-12-04T13:35:25.9176326Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_complex32 PASSED [0.0422s] [ 48%] 2025-12-04T13:35:25.9176433Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_uint8 PASSED [0.0229s] [ 48%] 2025-12-04T13:35:25.9176547Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int16 PASSED [1.5071s] [ 48%] 2025-12-04T13:35:25.9176659Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int8 PASSED [1.5119s] [ 48%] 2025-12-04T13:35:25.9176798Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_bfloat16 PASSED [1.5184s] [ 48%] 2025-12-04T13:35:25.9176924Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_float16 PASSED [1.4975s] [ 48%] 2025-12-04T13:35:25.9177043Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_complex64 PASSED [1.5211s] [ 48%] 2025-12-04T13:35:25.9177155Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int32 PASSED [1.5107s] [ 48%] 2025-12-04T13:35:25.9177268Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_int32 PASSED [0.1182s] [ 48%] 2025-12-04T13:35:25.9177391Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int64 PASSED [0.1148s] [ 48%] 2025-12-04T13:35:25.9177515Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_int8 PASSED [0.0978s] [ 48%] 2025-12-04T13:35:25.9177628Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_int16 PASSED [0.1138s] [ 48%] 2025-12-04T13:35:25.9177740Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_bool PASSED [0.0119s] [ 48%] 2025-12-04T13:35:25.9177870Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_complex128 PASSED [0.0116s] [ 48%] 2025-12-04T13:35:25.9177986Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_float64 PASSED [0.0114s] [ 48%] 2025-12-04T13:35:25.9178132Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_int16 PASSED [0.0115s] [ 48%] 2025-12-04T13:35:25.9178256Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_shapes_cuda_float32 PASSED [0.0033s] [ 48%] 2025-12-04T13:35:25.9178384Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_complex128 PASSED [0.0104s] [ 48%] 2025-12-04T13:35:25.9178512Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_complex64 PASSED [1.5327s] [ 48%] 2025-12-04T13:35:25.9178634Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_int64 PASSED [1.5280s] [ 48%] 2025-12-04T13:35:25.9178758Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_uint8 PASSED [1.5264s] [ 48%] 2025-12-04T13:35:25.9178872Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_bool PASSED [1.5162s] [ 48%] 2025-12-04T13:35:25.9178991Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_float64 PASSED [1.5119s] [ 48%] 2025-12-04T13:35:25.9179106Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int32 PASSED [1.5164s] [ 48%] 2025-12-04T13:35:25.9179217Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int32 PASSED [0.1877s] [ 48%] 2025-12-04T13:35:25.9179323Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_float64 PASSED [0.0135s] [ 48%] 2025-12-04T13:35:25.9179433Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_bfloat16 PASSED [0.0095s] [ 48%] 2025-12-04T13:35:25.9179542Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_float32 PASSED [0.0261s] [ 48%] 2025-12-04T13:35:25.9179653Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_float64 PASSED [0.0259s] [ 48%] 2025-12-04T13:35:25.9179761Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_int64 PASSED [0.0240s] [ 48%] 2025-12-04T13:35:25.9179872Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_complex64 PASSED [0.0182s] [ 48%] 2025-12-04T13:35:25.9179983Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_float32 PASSED [0.0182s] [ 48%] 2025-12-04T13:35:25.9180093Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_float64 PASSED [0.0180s] [ 48%] 2025-12-04T13:35:25.9180204Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_float64 PASSED [0.0526s] [ 48%] 2025-12-04T13:35:25.9180311Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_int64 PASSED [0.0457s] [ 48%] 2025-12-04T13:35:25.9180451Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_int8 PASSED [0.0455s] [ 48%] 2025-12-04T13:35:25.9180577Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_float64 PASSED [0.0991s] [ 48%] 2025-12-04T13:35:25.9180690Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int32 PASSED [1.6088s] [ 48%] 2025-12-04T13:35:25.9180799Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_bool PASSED [1.6389s] [ 48%] 2025-12-04T13:35:25.9180914Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_float16 PASSED [0.1015s] [ 48%] 2025-12-04T13:35:25.9181023Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_float32 PASSED [1.5798s] [ 48%] 2025-12-04T13:35:25.9181131Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_int64 PASSED [1.5819s] [ 48%] 2025-12-04T13:35:25.9181248Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_float32 PASSED [1.5367s] [ 48%] 2025-12-04T13:35:25.9181369Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_float64 PASSED [1.5426s] [ 48%] 2025-12-04T13:35:25.9181484Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int64 PASSED [1.5388s] [ 48%] 2025-12-04T13:35:25.9181613Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_bfloat16 PASSED [1.5461s] [ 48%] 2025-12-04T13:35:25.9181719Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_int8 PASSED [1.5479s] [ 48%] 2025-12-04T13:35:25.9181836Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int32 PASSED [1.5572s] [ 48%] 2025-12-04T13:35:25.9181959Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_bfloat16 PASSED [0.0752s] [ 48%] 2025-12-04T13:35:25.9182079Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_int64 PASSED [0.0717s] [ 48%] 2025-12-04T13:35:25.9182201Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_complex128 PASSED [0.0329s] [ 48%] 2025-12-04T13:35:25.9182319Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_float16 PASSED [0.0326s] [ 48%] 2025-12-04T13:35:25.9182437Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_int64 PASSED [0.0302s] [ 48%] 2025-12-04T13:35:25.9182546Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int8 PASSED [0.2069s] [ 48%] 2025-12-04T13:35:25.9182652Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_bool PASSED [0.0304s] [ 48%] 2025-12-04T13:35:25.9182759Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_float32 PASSED [0.0255s] [ 49%] 2025-12-04T13:35:25.9182864Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_uint8 PASSED [1.5489s] [ 49%] 2025-12-04T13:35:25.9182968Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_bool PASSED [1.5467s] [ 49%] 2025-12-04T13:35:25.9183078Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_complex64 PASSED [1.5654s] [ 49%] 2025-12-04T13:35:25.9183186Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_float32 PASSED [1.5560s] [ 49%] 2025-12-04T13:35:25.9183293Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_int8 PASSED [1.5427s] [ 49%] 2025-12-04T13:35:25.9183417Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_complex128 PASSED [1.5339s] [ 49%] 2025-12-04T13:35:25.9183533Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_complex128 PASSED [1.5406s] [ 49%] 2025-12-04T13:35:25.9183648Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_complex64 PASSED [1.5296s] [ 49%] 2025-12-04T13:35:25.9183760Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_float32 PASSED [1.5370s] [ 49%] 2025-12-04T13:35:25.9183869Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_int16 PASSED [1.5525s] [ 49%] 2025-12-04T13:35:25.9183978Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_int32 PASSED [1.5402s] [ 49%] 2025-12-04T13:35:25.9184107Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_int64 PASSED [1.5451s] [ 49%] 2025-12-04T13:35:25.9184229Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_uint8 PASSED [1.5295s] [ 49%] 2025-12-04T13:35:25.9184342Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_complex64 PASSED [1.5171s] [ 49%] 2025-12-04T13:35:25.9184455Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_float64 PASSED [1.5418s] [ 49%] 2025-12-04T13:35:25.9184563Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_int32 PASSED [1.5492s] [ 49%] 2025-12-04T13:35:25.9184672Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_bfloat16 PASSED [1.5261s] [ 49%] 2025-12-04T13:35:25.9184780Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_float16 PASSED [1.5392s] [ 49%] 2025-12-04T13:35:25.9184899Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_complex128 PASSED [1.5644s] [ 49%] 2025-12-04T13:35:25.9185016Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_float64 PASSED [1.5723s] [ 49%] 2025-12-04T13:35:25.9185139Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int16 PASSED [1.5768s] [ 49%] 2025-12-04T13:35:25.9185252Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int32 PASSED [1.5567s] [ 49%] 2025-12-04T13:35:25.9185375Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex32 PASSED [1.5476s] [ 49%] 2025-12-04T13:35:25.9185494Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_float16 PASSED [1.5512s] [ 49%] 2025-12-04T13:35:25.9185612Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_float64 PASSED [1.5505s] [ 49%] 2025-12-04T13:35:25.9185730Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_int32 PASSED [1.5735s] [ 49%] 2025-12-04T13:35:25.9185849Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_uint8 PASSED [1.5739s] [ 49%] 2025-12-04T13:35:25.9185963Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_bfloat16 PASSED [1.5292s] [ 49%] 2025-12-04T13:35:25.9186079Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_complex32 PASSED [1.4992s] [ 49%] 2025-12-04T13:35:25.9186196Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_complex64 PASSED [1.5333s] [ 49%] 2025-12-04T13:35:25.9186307Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_float32 PASSED [1.5483s] [ 49%] 2025-12-04T13:35:25.9186419Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_uint8 PASSED [1.5391s] [ 49%] 2025-12-04T13:35:25.9186541Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_float16 PASSED [1.5640s] [ 49%] 2025-12-04T13:35:25.9186662Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_int32 PASSED [1.5468s] [ 49%] 2025-12-04T13:35:25.9186775Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_float16 PASSED [1.5589s] [ 49%] 2025-12-04T13:35:25.9186884Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int8 PASSED [1.5875s] [ 49%] 2025-12-04T13:35:25.9186993Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_uint8 PASSED [1.5472s] [ 49%] 2025-12-04T13:35:25.9187116Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_float16 PASSED [0.6899s] [ 49%] 2025-12-04T13:35:25.9187239Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_float32 PASSED [0.5837s] [ 49%] 2025-12-04T13:35:25.9187366Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_bfloat16 PASSED [0.1442s] [ 49%] 2025-12-04T13:35:25.9187496Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_complex128 PASSED [0.1260s] [ 49%] 2025-12-04T13:35:25.9187644Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_complex64 PASSED [0.1379s] [ 49%] 2025-12-04T13:35:25.9187778Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_int8 PASSED [0.1051s] [ 49%] 2025-12-04T13:35:25.9187901Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_float16 PASSED [0.1459s] [ 49%] 2025-12-04T13:35:25.9188014Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_complex128 PASSED [0.0056s] [ 49%] 2025-12-04T13:35:25.9188165Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_float16 PASSED [1.5605s] [ 49%] 2025-12-04T13:35:25.9188272Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_float64 PASSED [1.5381s] [ 49%] 2025-12-04T13:35:25.9188380Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int32 PASSED [1.5434s] [ 49%] 2025-12-04T13:35:25.9188487Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int8 PASSED [1.5504s] [ 49%] 2025-12-04T13:35:25.9188599Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_bfloat16 PASSED [1.5444s] [ 49%] 2025-12-04T13:35:25.9188711Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_float16 PASSED [1.5495s] [ 49%] 2025-12-04T13:35:25.9188837Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_int16 PASSED [1.5569s] [ 49%] 2025-12-04T13:35:25.9188947Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_float16 PASSED [0.0062s] [ 49%] 2025-12-04T13:35:25.9189053Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_int64 PASSED [0.0046s] [ 49%] 2025-12-04T13:35:25.9189171Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_complex64 PASSED [0.0198s] [ 49%] 2025-12-04T13:35:25.9189291Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_float16 PASSED [1.5483s] [ 49%] 2025-12-04T13:35:25.9189408Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_float64 PASSED [1.5325s] [ 49%] 2025-12-04T13:35:25.9189526Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_int64 PASSED [1.5215s] [ 49%] 2025-12-04T13:35:25.9189634Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_bfloat16 PASSED [0.1160s] [ 49%] 2025-12-04T13:35:25.9189743Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_complex32 PASSED [0.1491s] [ 49%] 2025-12-04T13:35:25.9189849Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_float32 PASSED [0.0985s] [ 49%] 2025-12-04T13:35:25.9189955Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int64 PASSED [0.1008s] [ 49%] 2025-12-04T13:35:25.9190062Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_float16 XFAIL [0.0039s] [ 49%] 2025-12-04T13:35:25.9190170Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_float32 XFAIL [1.5476s] [ 49%] 2025-12-04T13:35:25.9190276Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_float64 XFAIL [1.5438s] [ 49%] 2025-12-04T13:35:25.9190384Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int32 XFAIL [1.5263s] [ 49%] 2025-12-04T13:35:25.9190490Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int8 XFAIL [1.5593s] [ 49%] 2025-12-04T13:35:25.9190599Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_float16 PASSED [1.5779s] [ 49%] 2025-12-04T13:35:25.9190703Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int32 PASSED [0.0208s] [ 50%] 2025-12-04T13:35:25.9190810Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int8 PASSED [0.0201s] [ 50%] 2025-12-04T13:35:25.9190916Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_float32 PASSED [0.3060s] [ 50%] 2025-12-04T13:35:25.9191023Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_float64 PASSED [1.8671s] [ 50%] 2025-12-04T13:35:25.9191128Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int64 PASSED [1.5853s] [ 50%] 2025-12-04T13:35:25.9191267Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_bfloat16 PASSED [1.5819s] [ 50%] 2025-12-04T13:35:25.9191390Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_float32 PASSED [1.5563s] [ 50%] 2025-12-04T13:35:25.9191499Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_int64 PASSED [1.5681s] [ 50%] 2025-12-04T13:35:25.9191605Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_int8 PASSED [1.5559s] [ 50%] 2025-12-04T13:35:25.9191711Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_uint8 PASSED [1.5639s] [ 50%] 2025-12-04T13:35:25.9191818Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_float32 PASSED [1.5887s] [ 50%] 2025-12-04T13:35:25.9191928Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_bool PASSED [1.5663s] [ 50%] 2025-12-04T13:35:25.9192043Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_float64 PASSED [1.5566s] [ 50%] 2025-12-04T13:35:25.9192154Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_int16 PASSED [1.5599s] [ 50%] 2025-12-04T13:35:25.9192266Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_uint8 PASSED [1.5517s] [ 50%] 2025-12-04T13:35:25.9192391Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_uint8 PASSED [1.5724s] [ 50%] 2025-12-04T13:35:25.9192501Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_float32 PASSED [1.5634s] [ 50%] 2025-12-04T13:35:25.9192608Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_int64 PASSED [1.5740s] [ 50%] 2025-12-04T13:35:25.9192717Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_uint8 PASSED [1.5470s] [ 50%] 2025-12-04T13:35:25.9192827Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_bfloat16 PASSED [1.5979s] [ 50%] 2025-12-04T13:35:25.9192934Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_int8 PASSED [1.5704s] [ 50%] 2025-12-04T13:35:25.9193041Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_uint8 PASSED [1.5644s] [ 50%] 2025-12-04T13:35:25.9193161Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exponential_cuda_float64 PASSED [1.5560s] [ 50%] 2025-12-04T13:35:25.9193268Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_bfloat16 PASSED [0.0930s] [ 50%] 2025-12-04T13:35:25.9193373Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_int16 PASSED [0.0684s] [ 50%] 2025-12-04T13:35:25.9193484Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_float16 PASSED [0.0076s] [ 50%] 2025-12-04T13:35:25.9193595Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int64 PASSED [1.5345s] [ 50%] 2025-12-04T13:35:25.9193701Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_bool PASSED [1.5324s] [ 50%] 2025-12-04T13:35:25.9193818Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_complex128 PASSED [1.5237s] [ 50%] 2025-12-04T13:35:25.9193930Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_float16 PASSED [1.4911s] [ 50%] 2025-12-04T13:35:25.9194041Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_int8 PASSED [1.5120s] [ 50%] 2025-12-04T13:35:25.9194160Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_float16 PASSED [1.4936s] [ 50%] 2025-12-04T13:35:25.9194277Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_float64 PASSED [1.4959s] [ 50%] 2025-12-04T13:35:25.9194388Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_bool PASSED [1.4700s] [ 50%] 2025-12-04T13:35:25.9194499Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float16 PASSED [1.4805s] [ 50%] 2025-12-04T13:35:25.9194611Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float32 PASSED [1.5026s] [ 50%] 2025-12-04T13:35:25.9194720Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_int16 PASSED [1.4901s] [ 50%] 2025-12-04T13:35:25.9194851Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_int32 PASSED [1.4987s] [ 50%] 2025-12-04T13:35:25.9194977Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_uint8 PASSED [1.4939s] [ 50%] 2025-12-04T13:35:25.9195085Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_bool PASSED [1.4790s] [ 50%] 2025-12-04T13:35:25.9195202Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_complex128 PASSED [1.4778s] [ 50%] 2025-12-04T13:35:25.9195315Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_float32 PASSED [1.5093s] [ 50%] 2025-12-04T13:35:25.9195424Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_int16 PASSED [1.5155s] [ 50%] 2025-12-04T13:35:25.9195534Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_int64 PASSED [1.5016s] [ 50%] 2025-12-04T13:35:25.9195643Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_bool PASSED [1.5041s] [ 50%] 2025-12-04T13:35:25.9195759Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_complex32 PASSED [1.5292s] [ 50%] 2025-12-04T13:35:25.9195884Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_complex64 PASSED [1.4937s] [ 50%] 2025-12-04T13:35:25.9195996Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_float16 PASSED [1.4810s] [ 50%] 2025-12-04T13:35:25.9196114Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_float64 PASSED [1.4760s] [ 50%] 2025-12-04T13:35:25.9196230Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_int64 PASSED [1.4857s] [ 50%] 2025-12-04T13:35:25.9196340Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_bool PASSED [1.4717s] [ 50%] 2025-12-04T13:35:25.9196450Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_int8 PASSED [1.4792s] [ 50%] 2025-12-04T13:35:25.9196562Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_int64 PASSED [1.4768s] [ 50%] 2025-12-04T13:35:25.9196672Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_int8 PASSED [1.4800s] [ 50%] 2025-12-04T13:35:25.9196785Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_int16 PASSED [1.4807s] [ 50%] 2025-12-04T13:35:25.9196894Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_bool PASSED [1.4693s] [ 50%] 2025-12-04T13:35:25.9197004Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_bool PASSED [1.4878s] [ 50%] 2025-12-04T13:35:25.9197119Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_complex32 PASSED [1.4772s] [ 50%] 2025-12-04T13:35:25.9197230Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_uint8 PASSED [1.5064s] [ 50%] 2025-12-04T13:35:25.9197339Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_bool PASSED [1.4958s] [ 50%] 2025-12-04T13:35:25.9197459Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_complex64 PASSED [1.4996s] [ 50%] 2025-12-04T13:35:25.9197572Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_float32 PASSED [1.4818s] [ 50%] 2025-12-04T13:35:25.9197684Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int64 PASSED [1.5005s] [ 50%] 2025-12-04T13:35:25.9197794Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_bool PASSED [1.4830s] [ 50%] 2025-12-04T13:35:25.9197903Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_int64 PASSED [1.4742s] [ 50%] 2025-12-04T13:35:25.9198011Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_uint8 PASSED [1.4659s] [ 50%] 2025-12-04T13:35:25.9198168Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_float64 PASSED [1.4802s] [ 50%] 2025-12-04T13:35:25.9198412Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int8 PASSED [1.4858s] [ 50%] 2025-12-04T13:35:25.9198557Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_bool PASSED [1.4896s] [ 50%] 2025-12-04T13:35:25.9198682Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_complex128 PASSED [1.5264s] [ 51%] 2025-12-04T13:35:25.9198791Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_float16 PASSED [1.5196s] [ 51%] 2025-12-04T13:35:25.9198901Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_bfloat16 PASSED [0.0103s] [ 51%] 2025-12-04T13:35:25.9199012Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_complex128 PASSED [0.0079s] [ 51%] 2025-12-04T13:35:25.9199121Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_float32 PASSED [0.0075s] [ 51%] 2025-12-04T13:35:25.9199227Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_int64 PASSED [0.0074s] [ 51%] 2025-12-04T13:35:25.9199337Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_float32 PASSED [0.0032s] [ 51%] 2025-12-04T13:35:25.9199444Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int32 PASSED [0.0035s] [ 51%] 2025-12-04T13:35:25.9199553Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_uint8 PASSED [0.0035s] [ 51%] 2025-12-04T13:35:25.9199676Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_float16 PASSED [0.0035s] [ 51%] 2025-12-04T13:35:25.9199795Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_bfloat16 PASSED [0.1395s] [ 51%] 2025-12-04T13:35:25.9199912Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_float16 PASSED [0.1384s] [ 51%] 2025-12-04T13:35:25.9200022Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_float32 PASSED [1.5002s] [ 51%] 2025-12-04T13:35:25.9200138Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_float32 PASSED [0.5403s] [ 51%] 2025-12-04T13:35:25.9200247Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_bool PASSED [0.0922s] [ 51%] 2025-12-04T13:35:25.9200355Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int16 PASSED [1.5700s] [ 51%] 2025-12-04T13:35:25.9200463Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_float64 PASSED [1.5913s] [ 51%] 2025-12-04T13:35:25.9200570Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_float32 PASSED [1.5907s] [ 51%] 2025-12-04T13:35:25.9200678Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_int8 PASSED [1.5832s] [ 51%] 2025-12-04T13:35:25.9200784Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_float32 PASSED [1.5269s] [ 51%] 2025-12-04T13:35:25.9200896Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_bfloat16 PASSED [1.5272s] [ 51%] 2025-12-04T13:35:25.9201006Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_float64 PASSED [1.5206s] [ 51%] 2025-12-04T13:35:25.9201113Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_int32 PASSED [1.6852s] [ 51%] 2025-12-04T13:35:25.9201220Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_uint8 PASSED [0.0925s] [ 51%] 2025-12-04T13:35:25.9201326Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int32 PASSED [0.0984s] [ 51%] 2025-12-04T13:35:25.9201431Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_bool PASSED [0.0938s] [ 51%] 2025-12-04T13:35:25.9201534Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_int64 PASSED [0.0978s] [ 51%] 2025-12-04T13:35:25.9201637Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_uint8 PASSED [1.5687s] [ 51%] 2025-12-04T13:35:25.9201747Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_bool PASSED [0.1890s] [ 51%] 2025-12-04T13:35:25.9201857Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_uint8 PASSED [0.1759s] [ 51%] 2025-12-04T13:35:25.9201965Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_int32 PASSED [0.0034s] [ 51%] 2025-12-04T13:35:25.9202093Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_int8 PASSED [1.4791s] [ 51%] 2025-12-04T13:35:25.9202210Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_int64 PASSED [1.4572s] [ 51%] 2025-12-04T13:35:25.9202325Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hypot_cuda_bfloat16 PASSED [0.1113s] [ 51%] 2025-12-04T13:35:25.9202428Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_bool PASSED [0.0295s] [ 51%] 2025-12-04T13:35:25.9202539Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_igamma_cuda_float64 PASSED [0.0976s] [ 51%] 2025-12-04T13:35:25.9202648Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_igammac_cuda_float32 PASSED [0.0978s] [ 51%] 2025-12-04T13:35:25.9202759Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_imag_cuda_complex128 PASSED [0.0437s] [ 51%] 2025-12-04T13:35:25.9202867Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_imag_cuda_complex32 PASSED [0.0479s] [ 51%] 2025-12-04T13:35:25.9202978Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int64 PASSED [1.4870s] [ 51%] 2025-12-04T13:35:25.9203098Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_complex128 PASSED [1.4863s] [ 51%] 2025-12-04T13:35:25.9203226Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_float32 PASSED [1.4886s] [ 51%] 2025-12-04T13:35:25.9203338Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_int32 PASSED [1.5004s] [ 51%] 2025-12-04T13:35:25.9203456Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_complex32 PASSED [1.5109s] [ 51%] 2025-12-04T13:35:25.9203571Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_float32 PASSED [1.5058s] [ 51%] 2025-12-04T13:35:25.9203682Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int32 PASSED [1.4946s] [ 51%] 2025-12-04T13:35:25.9203790Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int8 PASSED [1.4938s] [ 51%] 2025-12-04T13:35:25.9203903Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_uint8 PASSED [1.5121s] [ 51%] 2025-12-04T13:35:25.9204025Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_complex128 PASSED [1.4864s] [ 51%] 2025-12-04T13:35:25.9204138Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_int8 PASSED [1.4940s] [ 51%] 2025-12-04T13:35:25.9204253Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_complex128 PASSED [0.3463s] [ 51%] 2025-12-04T13:35:25.9204364Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_bfloat16 PASSED [0.0320s] [ 51%] 2025-12-04T13:35:25.9204481Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_complex128 PASSED [0.0472s] [ 51%] 2025-12-04T13:35:25.9204595Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_complex32 PASSED [0.0465s] [ 51%] 2025-12-04T13:35:25.9204705Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_int32 PASSED [0.0253s] [ 51%] 2025-12-04T13:35:25.9204816Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_int8 PASSED [1.6230s] [ 51%] 2025-12-04T13:35:25.9204927Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float64 PASSED [1.5803s] [ 51%] 2025-12-04T13:35:25.9205034Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_uint8 PASSED [1.5785s] [ 51%] 2025-12-04T13:35:25.9205144Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_float64 PASSED [1.6074s] [ 51%] 2025-12-04T13:35:25.9205252Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int16 PASSED [1.5924s] [ 51%] 2025-12-04T13:35:25.9205359Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int32 PASSED [1.6297s] [ 51%] 2025-12-04T13:35:25.9205472Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_bfloat16 PASSED [1.5858s] [ 51%] 2025-12-04T13:35:25.9205587Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_float16 PASSED [1.5911s] [ 51%] 2025-12-04T13:35:25.9205721Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_float32 PASSED [1.5960s] [ 51%] 2025-12-04T13:35:25.9205844Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_int16 PASSED [1.5968s] [ 51%] 2025-12-04T13:35:25.9205957Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_bfloat16 PASSED [1.6000s] [ 51%] 2025-12-04T13:35:25.9206069Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_float32 PASSED [1.6071s] [ 51%] 2025-12-04T13:35:25.9206178Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_bfloat16 XFAIL [0.0053s] [ 51%] 2025-12-04T13:35:25.9206288Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_complex128 XFAIL [1.5598s] [ 51%] 2025-12-04T13:35:25.9206392Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_int8 XFAIL [1.6000s] [ 52%] 2025-12-04T13:35:25.9206498Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_float32 PASSED [1.6740s] [ 52%] 2025-12-04T13:35:25.9206611Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_complex128 PASSED [0.0378s] [ 52%] 2025-12-04T13:35:25.9206731Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_complex32 PASSED [0.0428s] [ 52%] 2025-12-04T13:35:25.9206843Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_float16 PASSED [0.0323s] [ 52%] 2025-12-04T13:35:25.9206949Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_int32 PASSED [1.6074s] [ 52%] 2025-12-04T13:35:25.9207064Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_int32 PASSED [1.5725s] [ 52%] 2025-12-04T13:35:25.9207180Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_int64 PASSED [1.5856s] [ 52%] 2025-12-04T13:35:25.9207298Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_uint8 PASSED [1.5838s] [ 52%] 2025-12-04T13:35:25.9207424Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_complex32 PASSED [1.6049s] [ 52%] 2025-12-04T13:35:25.9207551Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_float32 PASSED [1.5758s] [ 52%] 2025-12-04T13:35:25.9207670Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_uint8 PASSED [1.5608s] [ 52%] 2025-12-04T13:35:25.9207790Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_complex128 PASSED [0.0998s] [ 52%] 2025-12-04T13:35:25.9207905Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_float32 PASSED [0.0917s] [ 52%] 2025-12-04T13:35:25.9208022Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_complex64 PASSED [0.1578s] [ 52%] 2025-12-04T13:35:25.9208179Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svdvals_cuda_float64 PASSED [0.0289s] [ 52%] 2025-12-04T13:35:25.9208297Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vecdot_cuda_bfloat16 PASSED [0.0405s] [ 52%] 2025-12-04T13:35:25.9208420Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vecdot_cuda_complex128 PASSED [0.0408s] [ 52%] 2025-12-04T13:35:25.9208554Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_complex128 PASSED [0.1099s] [ 52%] 2025-12-04T13:35:25.9208668Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_int16 PASSED [0.0315s] [ 52%] 2025-12-04T13:35:25.9208778Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_int8 PASSED [0.0234s] [ 52%] 2025-12-04T13:35:25.9208893Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_uint8 PASSED [0.0154s] [ 52%] 2025-12-04T13:35:25.9209029Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_bfloat16 PASSED [0.1218s] [ 52%] 2025-12-04T13:35:25.9209162Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int64 PASSED [0.1149s] [ 52%] 2025-12-04T13:35:25.9209275Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_complex128 PASSED [1.9647s] [ 52%] 2025-12-04T13:35:25.9209423Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_int64 PASSED [1.6115s] [ 52%] 2025-12-04T13:35:25.9209548Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_complex64 PASSED [1.6058s] [ 52%] 2025-12-04T13:35:25.9209655Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_int32 PASSED [1.6014s] [ 52%] 2025-12-04T13:35:25.9209760Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_int64 PASSED [1.5866s] [ 52%] 2025-12-04T13:35:25.9209865Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_uint8 PASSED [1.5970s] [ 52%] 2025-12-04T13:35:25.9209973Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_bfloat16 PASSED [1.6043s] [ 52%] 2025-12-04T13:35:25.9210080Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_bool PASSED [1.5863s] [ 52%] 2025-12-04T13:35:25.9210187Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_float16 PASSED [1.5998s] [ 52%] 2025-12-04T13:35:25.9210297Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_uint8 PASSED [1.6248s] [ 52%] 2025-12-04T13:35:25.9210421Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_complex128 PASSED [1.9258s] [ 52%] 2025-12-04T13:35:25.9210532Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float16 PASSED [0.0365s] [ 52%] 2025-12-04T13:35:25.9210637Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float32 PASSED [1.5923s] [ 52%] 2025-12-04T13:35:25.9210746Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int16 PASSED [1.5996s] [ 52%] 2025-12-04T13:35:25.9210850Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int64 PASSED [1.6123s] [ 52%] 2025-12-04T13:35:25.9210968Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_float16 PASSED [1.6044s] [ 52%] 2025-12-04T13:35:25.9211084Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_float64 PASSED [1.5859s] [ 52%] 2025-12-04T13:35:25.9211222Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_bfloat16 PASSED [1.5991s] [ 52%] 2025-12-04T13:35:25.9211361Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_float16 PASSED [1.5791s] [ 52%] 2025-12-04T13:35:25.9211480Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_bfloat16 PASSED [1.5445s] [ 52%] 2025-12-04T13:35:25.9211593Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_float32 PASSED [0.2900s] [ 52%] 2025-12-04T13:35:25.9211713Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_complex64 PASSED [0.0428s] [ 52%] 2025-12-04T13:35:25.9211827Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int16 PASSED [1.5376s] [ 52%] 2025-12-04T13:35:25.9211938Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int32 PASSED [1.5284s] [ 52%] 2025-12-04T13:35:25.9212053Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_uint8 PASSED [1.5281s] [ 52%] 2025-12-04T13:35:25.9212168Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_float32 PASSED [0.1268s] [ 52%] 2025-12-04T13:35:25.9212291Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_complex128 PASSED [0.1297s] [ 52%] 2025-12-04T13:35:25.9212409Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_complex64 PASSED [0.1219s] [ 52%] 2025-12-04T13:35:25.9212527Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_float16 PASSED [0.1020s] [ 52%] 2025-12-04T13:35:25.9212644Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_float32 PASSED [0.0865s] [ 52%] 2025-12-04T13:35:25.9212756Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_int16 PASSED [1.6243s] [ 52%] 2025-12-04T13:35:25.9212866Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_int8 PASSED [0.1067s] [ 52%] 2025-12-04T13:35:25.9212998Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_int16 PASSED [0.1264s] [ 52%] 2025-12-04T13:35:25.9213126Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_complex128 PASSED [0.0269s] [ 52%] 2025-12-04T13:35:25.9213237Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_int8 PASSED [0.0151s] [ 52%] 2025-12-04T13:35:25.9213343Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_float16 PASSED [0.1092s] [ 52%] 2025-12-04T13:35:25.9213448Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_int8 PASSED [1.6134s] [ 52%] 2025-12-04T13:35:25.9213562Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_bool PASSED [1.5365s] [ 52%] 2025-12-04T13:35:25.9213672Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_float16 PASSED [0.1109s] [ 52%] 2025-12-04T13:35:25.9213782Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_int8 PASSED [0.0881s] [ 52%] 2025-12-04T13:35:25.9213891Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_float64 PASSED [1.5506s] [ 52%] 2025-12-04T13:35:25.9214044Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_bfloat16 PASSED [1.5261s] [ 52%] 2025-12-04T13:35:25.9214179Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_float32 PASSED [1.5360s] [ 52%] 2025-12-04T13:35:25.9214291Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_bfloat16 PASSED [0.1107s] [ 52%] 2025-12-04T13:35:25.9214398Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_int8 PASSED [0.0886s] [ 52%] 2025-12-04T13:35:25.9214512Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_complex64 PASSED [1.5278s] [ 53%] 2025-12-04T13:35:25.9214622Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_float32 PASSED [1.5373s] [ 53%] 2025-12-04T13:35:25.9214731Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_bfloat16 PASSED [0.1218s] [ 53%] 2025-12-04T13:35:25.9214837Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_bool PASSED [0.0958s] [ 53%] 2025-12-04T13:35:25.9214948Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_complex128 PASSED [0.1186s] [ 53%] 2025-12-04T13:35:25.9215055Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_float16 PASSED [0.1143s] [ 53%] 2025-12-04T13:35:25.9215161Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_float64 PASSED [0.1048s] [ 53%] 2025-12-04T13:35:25.9215270Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_int16 PASSED [1.6417s] [ 53%] 2025-12-04T13:35:25.9215376Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_int64 PASSED [0.1058s] [ 53%] 2025-12-04T13:35:25.9215479Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_uint8 PASSED [0.0989s] [ 53%] 2025-12-04T13:35:25.9215603Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_complex128 PASSED [0.0205s] [ 53%] 2025-12-04T13:35:25.9215723Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_complex64 PASSED [0.0201s] [ 53%] 2025-12-04T13:35:25.9215840Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_float32 PASSED [1.5557s] [ 53%] 2025-12-04T13:35:25.9215954Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_complex128 PASSED [1.5611s] [ 53%] 2025-12-04T13:35:25.9216064Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_complex64 PASSED [1.5517s] [ 53%] 2025-12-04T13:35:25.9216192Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_bfloat16 PASSED [0.0320s] [ 53%] 2025-12-04T13:35:25.9216316Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_float16 PASSED [0.0239s] [ 53%] 2025-12-04T13:35:25.9216439Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_float64 PASSED [0.0239s] [ 53%] 2025-12-04T13:35:25.9216564Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_int16 PASSED [0.0244s] [ 53%] 2025-12-04T13:35:25.9216704Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_complex128 PASSED [0.0057s] [ 53%] 2025-12-04T13:35:25.9216826Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int16 PASSED [0.0054s] [ 53%] 2025-12-04T13:35:25.9216947Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_uint8 PASSED [0.0054s] [ 53%] 2025-12-04T13:35:25.9217061Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_complex64 PASSED [0.0059s] [ 53%] 2025-12-04T13:35:25.9217173Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_int64 PASSED [0.0055s] [ 53%] 2025-12-04T13:35:25.9217283Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_int8 PASSED [0.0055s] [ 53%] 2025-12-04T13:35:25.9217399Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_bfloat16 PASSED [0.0056s] [ 53%] 2025-12-04T13:35:25.9217513Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_float16 PASSED [0.0054s] [ 53%] 2025-12-04T13:35:25.9217637Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_float64 PASSED [0.0054s] [ 53%] 2025-12-04T13:35:25.9217748Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int32 PASSED [0.0055s] [ 53%] 2025-12-04T13:35:25.9217860Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int8 PASSED [0.0054s] [ 53%] 2025-12-04T13:35:25.9217971Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nextafter_cuda_float64 PASSED [0.0950s] [ 53%] 2025-12-04T13:35:25.9218145Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_bfloat16 PASSED [0.0124s] [ 53%] 2025-12-04T13:35:25.9218284Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_float64 PASSED [0.0101s] [ 53%] 2025-12-04T13:35:25.9218423Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_bool PASSED [0.0078s] [ 53%] 2025-12-04T13:35:25.9218566Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_float32 PASSED [0.0075s] [ 53%] 2025-12-04T13:35:25.9218702Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_int16 PASSED [0.0075s] [ 53%] 2025-12-04T13:35:25.9218837Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_int64 PASSED [0.0074s] [ 53%] 2025-12-04T13:35:25.9218967Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_dropout_cuda_bfloat16 PASSED [0.0094s] [ 53%] 2025-12-04T13:35:25.9219099Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_dropout_cuda_float64 PASSED [1.5422s] [ 53%] 2025-12-04T13:35:25.9219224Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_glu_cuda_bfloat16 PASSED [0.0989s] [ 53%] 2025-12-04T13:35:25.9219363Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_float16 PASSED [1.5665s] [ 53%] 2025-12-04T13:35:25.9219500Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardshrink_cuda_bfloat16 PASSED [1.5705s] [ 53%] 2025-12-04T13:35:25.9219634Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardshrink_cuda_float64 PASSED [1.5716s] [ 53%] 2025-12-04T13:35:25.9219764Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_float16 PASSED [1.5868s] [ 53%] 2025-12-04T13:35:25.9219896Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_float64 PASSED [1.5847s] [ 53%] 2025-12-04T13:35:25.9220043Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hinge_embedding_loss_cuda_float32 PASSED [1.5848s] [ 53%] 2025-12-04T13:35:25.9220176Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_complex128 PASSED [1.5463s] [ 53%] 2025-12-04T13:35:25.9220340Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_layer_norm_cuda_bfloat16 PASSED [1.5370s] [ 53%] 2025-12-04T13:35:25.9220486Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_float16 PASSED [1.5439s] [ 53%] 2025-12-04T13:35:25.9220640Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_complex64 PASSED [1.5349s] [ 53%] 2025-12-04T13:35:25.9220785Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_uint8 PASSED [1.5574s] [ 53%] 2025-12-04T13:35:25.9220916Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_float16 PASSED [1.5606s] [ 53%] 2025-12-04T13:35:25.9221063Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_complex128 PASSED [1.5616s] [ 53%] 2025-12-04T13:35:25.9221207Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_float16 PASSED [1.5678s] [ 53%] 2025-12-04T13:35:25.9221349Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_float32 PASSED [1.5532s] [ 53%] 2025-12-04T13:35:25.9221504Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_int8 PASSED [1.5563s] [ 53%] 2025-12-04T13:35:25.9221641Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_float64 PASSED [0.0151s] [ 53%] 2025-12-04T13:35:25.9221774Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_int8 PASSED [0.0125s] [ 53%] 2025-12-04T13:35:25.9221912Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_float32 PASSED [0.0117s] [ 53%] 2025-12-04T13:35:25.9222051Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_uint8 PASSED [0.0115s] [ 53%] 2025-12-04T13:35:25.9222188Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int32 PASSED [0.0785s] [ 53%] 2025-12-04T13:35:25.9222327Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int8 PASSED [0.0718s] [ 53%] 2025-12-04T13:35:25.9222458Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_bfloat16 PASSED [0.1451s] [ 53%] 2025-12-04T13:35:25.9222581Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_int8 PASSED [0.0381s] [ 53%] 2025-12-04T13:35:25.9222707Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_uint8 PASSED [0.0380s] [ 53%] 2025-12-04T13:35:25.9222831Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_float64 PASSED [0.0374s] [ 53%] 2025-12-04T13:35:25.9222970Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_smooth_l1_loss_cuda_bfloat16 PASSED [0.0135s] [ 53%] 2025-12-04T13:35:25.9223106Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_smooth_l1_loss_cuda_float32 PASSED [0.0086s] [ 53%] 2025-12-04T13:35:25.9223254Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_bfloat16 PASSED [0.0066s] [ 53%] 2025-12-04T13:35:25.9223400Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_float32 PASSED [0.0064s] [ 54%] 2025-12-04T13:35:25.9223542Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_float64 PASSED [1.5452s] [ 54%] 2025-12-04T13:35:25.9223691Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_complex128 PASSED [1.5390s] [ 54%] 2025-12-04T13:35:25.9223829Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int8 PASSED [1.5367s] [ 54%] 2025-12-04T13:35:25.9223961Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_float16 PASSED [0.0842s] [ 54%] 2025-12-04T13:35:25.9224128Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_complex128 PASSED [1.5946s] [ 54%] 2025-12-04T13:35:25.9224272Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_float64 PASSED [1.5816s] [ 54%] 2025-12-04T13:35:25.9224407Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_bfloat16 PASSED [1.5841s] [ 54%] 2025-12-04T13:35:25.9224538Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_int16 PASSED [1.5882s] [ 54%] 2025-12-04T13:35:25.9224682Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_float32 PASSED [1.5587s] [ 54%] 2025-12-04T13:35:25.9224826Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_uint8 PASSED [1.5679s] [ 54%] 2025-12-04T13:35:25.9224938Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_complex64 PASSED [1.5905s] [ 54%] 2025-12-04T13:35:25.9225049Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_cuda_float16 PASSED [1.5538s] [ 54%] 2025-12-04T13:35:25.9225178Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_number_mean_cuda_bfloat16 PASSED [1.5676s] [ 54%] 2025-12-04T13:35:25.9225301Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_complex64 PASSED [1.5490s] [ 54%] 2025-12-04T13:35:25.9225409Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_int32 PASSED [1.5446s] [ 54%] 2025-12-04T13:35:25.9225518Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_uint8 PASSED [1.5356s] [ 54%] 2025-12-04T13:35:25.9225638Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_bfloat16 PASSED [1.5943s] [ 54%] 2025-12-04T13:35:25.9225760Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_float64 PASSED [1.6755s] [ 54%] 2025-12-04T13:35:25.9225876Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_int16 PASSED [1.6689s] [ 54%] 2025-12-04T13:35:25.9225990Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_bfloat16 PASSED [1.6766s] [ 54%] 2025-12-04T13:35:25.9226100Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_int32 PASSED [1.6821s] [ 54%] 2025-12-04T13:35:25.9226219Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_complex32 PASSED [1.6546s] [ 54%] 2025-12-04T13:35:25.9226330Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_float32 PASSED [1.6090s] [ 54%] 2025-12-04T13:35:25.9226440Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_int64 PASSED [1.6288s] [ 54%] 2025-12-04T13:35:25.9226549Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_bfloat16 PASSED [0.1220s] [ 54%] 2025-12-04T13:35:25.9226663Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_complex32 PASSED [0.1379s] [ 54%] 2025-12-04T13:35:25.9226774Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_complex64 PASSED [1.7551s] [ 54%] 2025-12-04T13:35:25.9226882Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_float32 PASSED [1.7373s] [ 54%] 2025-12-04T13:35:25.9226993Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex32 PASSED [1.6545s] [ 54%] 2025-12-04T13:35:25.9227104Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex64 PASSED [1.6505s] [ 54%] 2025-12-04T13:35:25.9227212Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_float16 PASSED [1.6579s] [ 54%] 2025-12-04T13:35:25.9227320Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int32 PASSED [1.6711s] [ 54%] 2025-12-04T13:35:25.9227432Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_complex128 PASSED [1.6487s] [ 54%] 2025-12-04T13:35:25.9227543Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_complex64 PASSED [1.6315s] [ 54%] 2025-12-04T13:35:25.9227654Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_complex64 PASSED [1.6403s] [ 54%] 2025-12-04T13:35:25.9227780Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_int8 PASSED [1.6573s] [ 54%] 2025-12-04T13:35:25.9227899Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_uint8 PASSED [1.6324s] [ 54%] 2025-12-04T13:35:25.9228006Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_float32 PASSED [1.6625s] [ 54%] 2025-12-04T13:35:25.9228223Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_float64 PASSED [1.6700s] [ 54%] 2025-12-04T13:35:25.9228334Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_int64 PASSED [1.6496s] [ 54%] 2025-12-04T13:35:25.9228447Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_uint8 PASSED [1.6668s] [ 54%] 2025-12-04T13:35:25.9228558Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_int8 PASSED [1.7434s] [ 54%] 2025-12-04T13:35:25.9228667Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_float16 PASSED [1.6652s] [ 54%] 2025-12-04T13:35:25.9228778Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_float32 PASSED [1.6515s] [ 54%] 2025-12-04T13:35:25.9228906Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_bool PASSED [0.0478s] [ 54%] 2025-12-04T13:35:25.9229018Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_complex128 PASSED [0.0459s] [ 54%] 2025-12-04T13:35:25.9229128Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_float64 PASSED [0.0453s] [ 54%] 2025-12-04T13:35:25.9229234Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_int8 PASSED [0.0456s] [ 54%] 2025-12-04T13:35:25.9229346Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_int64 PASSED [0.0381s] [ 54%] 2025-12-04T13:35:25.9229459Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_float32 PASSED [0.0447s] [ 54%] 2025-12-04T13:35:25.9229587Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int64 PASSED [1.6705s] [ 54%] 2025-12-04T13:35:25.9229724Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int8 PASSED [0.0472s] [ 54%] 2025-12-04T13:35:25.9229856Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_complex64 PASSED [0.0158s] [ 54%] 2025-12-04T13:35:25.9229981Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_complex128 PASSED [0.4474s] [ 54%] 2025-12-04T13:35:25.9230158Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_complex64 PASSED [0.4674s] [ 54%] 2025-12-04T13:35:25.9230293Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_bfloat16 PASSED [0.0927s] [ 54%] 2025-12-04T13:35:25.9230412Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_float32 PASSED [0.0869s] [ 54%] 2025-12-04T13:35:25.9230551Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int32 PASSED [0.0885s] [ 54%] 2025-12-04T13:35:25.9230665Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_float32 PASSED [0.0183s] [ 54%] 2025-12-04T13:35:25.9230817Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_int16 PASSED [0.0224s] [ 54%] 2025-12-04T13:35:25.9230939Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int32 PASSED [0.0499s] [ 54%] 2025-12-04T13:35:25.9231078Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_bfloat16 PASSED [1.6489s] [ 54%] 2025-12-04T13:35:25.9231202Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_bool PASSED [1.6723s] [ 54%] 2025-12-04T13:35:25.9231330Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_int64 PASSED [1.6836s] [ 54%] 2025-12-04T13:35:25.9231462Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_uint8 PASSED [1.6774s] [ 54%] 2025-12-04T13:35:25.9231603Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_complex64 PASSED [1.6838s] [ 54%] 2025-12-04T13:35:25.9231720Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int16 PASSED [1.6654s] [ 54%] 2025-12-04T13:35:25.9231893Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int32 PASSED [1.6862s] [ 54%] 2025-12-04T13:35:25.9232024Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int64 PASSED [1.6746s] [ 54%] 2025-12-04T13:35:25.9232164Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int8 PASSED [1.6689s] [ 55%] 2025-12-04T13:35:25.9232287Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_uint8 PASSED [1.6624s] [ 55%] 2025-12-04T13:35:25.9232431Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_float16 PASSED [1.6770s] [ 55%] 2025-12-04T13:35:25.9232551Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int32 PASSED [1.6695s] [ 55%] 2025-12-04T13:35:25.9232685Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_float16 PASSED [1.6678s] [ 55%] 2025-12-04T13:35:25.9232824Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_uint8 PASSED [1.6701s] [ 55%] 2025-12-04T13:35:25.9232976Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_complex64 PASSED [1.6589s] [ 55%] 2025-12-04T13:35:25.9233141Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_int16 PASSED [1.6454s] [ 55%] 2025-12-04T13:35:25.9233277Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_int32 PASSED [1.6798s] [ 55%] 2025-12-04T13:35:25.9233430Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_float64 PASSED [2.0269s] [ 55%] 2025-12-04T13:35:25.9233558Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_int32 PASSED [0.0308s] [ 55%] 2025-12-04T13:35:25.9233723Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_int64 PASSED [0.0246s] [ 55%] 2025-12-04T13:35:25.9233854Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_int8 PASSED [0.0212s] [ 55%] 2025-12-04T13:35:25.9233998Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_float16 PASSED [0.0670s] [ 55%] 2025-12-04T13:35:25.9234125Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_int8 PASSED [0.0366s] [ 55%] 2025-12-04T13:35:25.9234262Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_bool PASSED [1.6457s] [ 55%] 2025-12-04T13:35:25.9234397Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_int64 PASSED [1.6689s] [ 55%] 2025-12-04T13:35:25.9234543Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_bool PASSED [1.6782s] [ 55%] 2025-12-04T13:35:25.9234669Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_float16 PASSED [1.6647s] [ 55%] 2025-12-04T13:35:25.9234806Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int16 PASSED [1.6662s] [ 55%] 2025-12-04T13:35:25.9234934Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int8 PASSED [1.6739s] [ 55%] 2025-12-04T13:35:25.9235080Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_int32 PASSED [1.6732s] [ 55%] 2025-12-04T13:35:25.9235234Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_bool PASSED [1.7329s] [ 55%] 2025-12-04T13:35:25.9235367Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_int16 PASSED [1.7162s] [ 55%] 2025-12-04T13:35:25.9235537Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_complex128 PASSED [1.6425s] [ 55%] 2025-12-04T13:35:25.9235692Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_float32 PASSED [1.6414s] [ 55%] 2025-12-04T13:35:25.9235867Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int64 PASSED [1.6440s] [ 55%] 2025-12-04T13:35:25.9236001Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_bool PASSED [1.7057s] [ 55%] 2025-12-04T13:35:25.9236194Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_int32 PASSED [1.7175s] [ 55%] 2025-12-04T13:35:25.9236367Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16 PASSED [1.7143s] [ 55%] 2025-12-04T13:35:25.9236546Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_float16 PASSED [1.7072s] [ 55%] 2025-12-04T13:35:25.9236699Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_float32 PASSED [1.6904s] [ 55%] 2025-12-04T13:35:25.9236894Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int32 PASSED [1.6824s] [ 55%] 2025-12-04T13:35:25.9237063Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int64 PASSED [1.6949s] [ 55%] 2025-12-04T13:35:25.9237219Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_int64 PASSED [1.7133s] [ 55%] 2025-12-04T13:35:25.9237396Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_int8 PASSED [1.6978s] [ 55%] 2025-12-04T13:35:25.9237532Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_float16 PASSED [1.6810s] [ 55%] 2025-12-04T13:35:25.9237698Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_float32 PASSED [1.6831s] [ 55%] 2025-12-04T13:35:25.9237825Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_int16 PASSED [1.6838s] [ 55%] 2025-12-04T13:35:25.9237972Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_int32 PASSED [1.6714s] [ 55%] 2025-12-04T13:35:25.9238147Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_float64 PASSED [1.6550s] [ 55%] 2025-12-04T13:35:25.9238289Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_int64 PASSED [1.6700s] [ 55%] 2025-12-04T13:35:25.9238457Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_complex64 PASSED [1.6516s] [ 55%] 2025-12-04T13:35:25.9238632Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_float16 PASSED [1.6550s] [ 55%] 2025-12-04T13:35:25.9238789Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int64 PASSED [1.6768s] [ 55%] 2025-12-04T13:35:25.9238942Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_bfloat16 PASSED [0.1826s] [ 55%] 2025-12-04T13:35:25.9239075Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_int8 PASSED [0.1358s] [ 55%] 2025-12-04T13:35:25.9239223Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int64 PASSED [0.1215s] [ 55%] 2025-12-04T13:35:25.9239381Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_bfloat16 PASSED [0.0056s] [ 55%] 2025-12-04T13:35:25.9239525Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_complex128 PASSED [0.0054s] [ 55%] 2025-12-04T13:35:25.9239679Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_complex64 PASSED [0.0050s] [ 55%] 2025-12-04T13:35:25.9239814Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_float16 PASSED [0.0054s] [ 55%] 2025-12-04T13:35:25.9239966Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_int16 PASSED [0.0055s] [ 55%] 2025-12-04T13:35:25.9240090Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_bool PASSED [0.0284s] [ 55%] 2025-12-04T13:35:25.9240232Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_complex128 PASSED [0.0388s] [ 55%] 2025-12-04T13:35:25.9240352Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_float32 PASSED [0.0242s] [ 55%] 2025-12-04T13:35:25.9240489Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int16 PASSED [0.0200s] [ 55%] 2025-12-04T13:35:25.9240629Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int32 PASSED [0.0196s] [ 55%] 2025-12-04T13:35:25.9240811Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_complex64 PASSED [1.6707s] [ 55%] 2025-12-04T13:35:25.9240936Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_float32 PASSED [1.6663s] [ 55%] 2025-12-04T13:35:25.9241073Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_int32 PASSED [1.6626s] [ 55%] 2025-12-04T13:35:25.9241193Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_int64 PASSED [1.6760s] [ 55%] 2025-12-04T13:35:25.9241333Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_bfloat16 PASSED [1.6521s] [ 55%] 2025-12-04T13:35:25.9241504Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_complex32 PASSED [1.6676s] [ 55%] 2025-12-04T13:35:25.9241636Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_complex64 PASSED [1.6599s] [ 55%] 2025-12-04T13:35:25.9241784Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_float16 PASSED [1.6584s] [ 55%] 2025-12-04T13:35:25.9241912Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_bfloat16 PASSED [1.6610s] [ 55%] 2025-12-04T13:35:25.9242071Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_complex32 PASSED [1.6306s] [ 55%] 2025-12-04T13:35:25.9242222Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_complex64 PASSED [1.6585s] [ 55%] 2025-12-04T13:35:25.9242379Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_float16 PASSED [1.6763s] [ 56%] 2025-12-04T13:35:25.9242501Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_float64 PASSED [0.0133s] [ 56%] 2025-12-04T13:35:25.9242636Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_complex64 PASSED [1.6463s] [ 56%] 2025-12-04T13:35:25.9242761Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stft_cuda_float32 PASSED [1.6569s] [ 56%] 2025-12-04T13:35:25.9242903Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_float64 PASSED [1.6734s] [ 56%] 2025-12-04T13:35:25.9243030Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_int8 PASSED [1.6621s] [ 56%] 2025-12-04T13:35:25.9243171Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_int32 PASSED [1.6635s] [ 56%] 2025-12-04T13:35:25.9243299Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_uint8 PASSED [1.6696s] [ 56%] 2025-12-04T13:35:25.9243440Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_bfloat16 PASSED [1.6414s] [ 56%] 2025-12-04T13:35:25.9243555Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_bool PASSED [1.6513s] [ 56%] 2025-12-04T13:35:25.9243708Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_int64 PASSED [1.6589s] [ 56%] 2025-12-04T13:35:25.9243842Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_float64 PASSED [1.6593s] [ 56%] 2025-12-04T13:35:25.9243959Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_int16 PASSED [1.6572s] [ 56%] 2025-12-04T13:35:25.9244100Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_int32 PASSED [1.6697s] [ 56%] 2025-12-04T13:35:25.9244222Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_bool PASSED [1.6869s] [ 56%] 2025-12-04T13:35:25.9244387Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_float32 PASSED [1.6795s] [ 56%] 2025-12-04T13:35:25.9244505Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_bool PASSED [1.6965s] [ 56%] 2025-12-04T13:35:25.9244639Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_int16 PASSED [1.6845s] [ 56%] 2025-12-04T13:35:25.9244767Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_complex128 PASSED [1.6786s] [ 56%] 2025-12-04T13:35:25.9244898Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_complex64 PASSED [1.7026s] [ 56%] 2025-12-04T13:35:25.9245058Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_float16 PASSED [1.6983s] [ 56%] 2025-12-04T13:35:25.9245208Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_int8 PASSED [1.6710s] [ 56%] 2025-12-04T13:35:25.9245338Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_bool PASSED [1.6564s] [ 56%] 2025-12-04T13:35:25.9245471Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_int16 PASSED [1.6833s] [ 56%] 2025-12-04T13:35:25.9245595Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_complex64 PASSED [1.6431s] [ 56%] 2025-12-04T13:35:25.9245747Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_float32 PASSED [1.6701s] [ 56%] 2025-12-04T13:35:25.9245891Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_bfloat16 PASSED [1.6568s] [ 56%] 2025-12-04T13:35:25.9246037Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_float16 PASSED [1.6736s] [ 56%] 2025-12-04T13:35:25.9246181Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_float32 PASSED [1.6764s] [ 56%] 2025-12-04T13:35:25.9246321Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_int16 PASSED [1.6447s] [ 56%] 2025-12-04T13:35:25.9246481Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_uint8 PASSED [1.6534s] [ 56%] 2025-12-04T13:35:25.9246610Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_int32 PASSED [1.6536s] [ 56%] 2025-12-04T13:35:25.9246740Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_bool PASSED [1.6623s] [ 56%] 2025-12-04T13:35:25.9246874Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_complex128 PASSED [1.7702s] [ 56%] 2025-12-04T13:35:25.9247015Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_float16 PASSED [1.7651s] [ 56%] 2025-12-04T13:35:25.9247138Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_float64 PASSED [0.1140s] [ 56%] 2025-12-04T13:35:25.9247301Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_int32 PASSED [0.1210s] [ 56%] 2025-12-04T13:35:25.9247421Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int32 PASSED [0.0228s] [ 56%] 2025-12-04T13:35:25.9247566Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_complex32 PASSED [0.0099s] [ 56%] 2025-12-04T13:35:25.9247692Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_int32 PASSED [0.0096s] [ 56%] 2025-12-04T13:35:25.9247825Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_copy_cuda_int8 PASSED [0.0096s] [ 56%] 2025-12-04T13:35:25.9247957Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_bfloat16 PASSED [0.0177s] [ 56%] 2025-12-04T13:35:25.9248204Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_int8 PASSED [0.0177s] [ 56%] 2025-12-04T13:35:25.9248335Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_complex128 PASSED [0.0094s] [ 56%] 2025-12-04T13:35:25.9248479Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_complex32 PASSED [0.0093s] [ 56%] 2025-12-04T13:35:25.9248617Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_float32 PASSED [0.0093s] [ 56%] 2025-12-04T13:35:25.9248755Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_int64 PASSED [0.0094s] [ 56%] 2025-12-04T13:35:25.9248898Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_bool PASSED [1.6769s] [ 56%] 2025-12-04T13:35:25.9249026Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_float32 PASSED [1.6749s] [ 56%] 2025-12-04T13:35:25.9249163Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex128 PASSED [1.6694s] [ 56%] 2025-12-04T13:35:25.9249286Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_float64 PASSED [1.6681s] [ 56%] 2025-12-04T13:35:25.9249459Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int8 PASSED [1.6829s] [ 56%] 2025-12-04T13:35:25.9249613Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_bfloat16 PASSED [1.6571s] [ 56%] 2025-12-04T13:35:25.9249762Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_complex64 PASSED [1.6333s] [ 56%] 2025-12-04T13:35:25.9249897Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_float16 PASSED [1.6720s] [ 56%] 2025-12-04T13:35:25.9250049Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_int32 PASSED [1.6416s] [ 56%] 2025-12-04T13:35:25.9250173Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_int8 PASSED [1.6517s] [ 56%] 2025-12-04T13:35:25.9250337Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_complex64 PASSED [1.6636s] [ 56%] 2025-12-04T13:35:25.9250460Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_complex128 PASSED [1.6656s] [ 56%] 2025-12-04T13:35:25.9250601Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_bfloat16 PASSED [1.6939s] [ 56%] 2025-12-04T13:35:25.9250764Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_float16 PASSED [1.6987s] [ 56%] 2025-12-04T13:35:25.9250893Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_float32 PASSED [1.6813s] [ 56%] 2025-12-04T13:35:25.9251060Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_complex_cuda_float64 PASSED [1.6730s] [ 56%] 2025-12-04T13:35:25.9251188Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_complex128 PASSED [1.6985s] [ 56%] 2025-12-04T13:35:25.9251330Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_float64 PASSED [1.6968s] [ 56%] 2025-12-04T13:35:25.9251439Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_complex128 2025-12-04T13:35:25.9251443Z 2025-12-04T13:35:25.9251636Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_ops/test_ops-6b6e7e1c3695e7c0.xml - 2025-12-04T13:35:25.9251725Z !!!!!!!!!!!!!!!!!!!!!!!!!!!!!! KeyboardInterrupt !!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! 2025-12-04T13:35:25.9251914Z /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/testing/_internal/common_utils.py:2653: KeyboardInterrupt 2025-12-04T13:35:25.9252009Z (to show a full traceback on KeyboardInterrupt use --full-trace) 2025-12-04T13:35:25.9252114Z ========== 3349 passed, 525 skipped, 61 xfailed in 1793.22s (0:29:53) ========== 2025-12-04T13:35:25.9252177Z Command took >30min, returning 124 2025-12-04T13:35:25.9252246Z Got exit code 124 2025-12-04T13:35:25.9252307Z Retrying single test... 2025-12-04T13:35:25.9252462Z Test results will be stored in test-reports/python-pytest/test_ops/test_ops-2f965dc3facbfc50.xml 2025-12-04T13:35:25.9252535Z ============================= test session starts ============================== 2025-12-04T13:35:25.9252674Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T13:35:25.9252725Z cachedir: .pytest_cache 2025-12-04T13:35:25.9252927Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T13:35:25.9252995Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T13:35:25.9253061Z configfile: pytest.ini 2025-12-04T13:35:25.9253239Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T13:35:25.9253344Z collecting ... collected 33666 items / 6910 deselected / 26756 selected 2025-12-04T13:35:25.9253551Z stepcurrent: skipping 3935 already run items. Running only test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_complex128 2025-12-04T13:35:25.9253634Z Running 1 items in this shard 2025-12-04T13:35:25.9253637Z 2025-12-04T13:35:25.9253786Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_complex128 PASSED [0.1116s] [100%] 2025-12-04T13:35:25.9253788Z 2025-12-04T13:35:25.9253985Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_ops/test_ops-2f965dc3facbfc50.xml - 2025-12-04T13:35:25.9254088Z ====================== 1 passed, 6910 deselected in 2.26s ====================== 2025-12-04T13:35:25.9254154Z Got exit code 0 2025-12-04T13:35:25.9254270Z Test succeeded in new process, continuing with the rest of the tests 2025-12-04T13:35:25.9254399Z Test results will be stored in test-reports/python-pytest/test_ops/test_ops-7a7b1e95275579c0.xml 2025-12-04T13:35:25.9254483Z ============================= test session starts ============================== 2025-12-04T13:35:25.9254605Z platform linux -- Python 3.10.14, pytest-7.3.2, pluggy-1.6.0 -- /opt/conda/envs/py_3.10/bin/python 2025-12-04T13:35:25.9254678Z cachedir: .pytest_cache 2025-12-04T13:35:25.9254852Z hypothesis profile 'pytorch_ci' -> database=None, max_examples=50, derandomize=True, suppress_health_check=[HealthCheck.too_slow] 2025-12-04T13:35:25.9254926Z rootdir: /var/lib/jenkins/pytorch 2025-12-04T13:35:25.9254979Z configfile: pytest.ini 2025-12-04T13:35:25.9255166Z plugins: hypothesis-6.56.4, cpp-2.3.0, flakefinder-1.1.0, rerunfailures-14.0, subtests-0.13.1, xdist-3.3.1, xdoctest-1.3.0, typeguard-4.3.0 2025-12-04T13:35:25.9255274Z collecting ... collected 33666 items / 3936 deselected / 29730 selected 2025-12-04T13:35:25.9255374Z stepcurrent: skipping 3936 already run items. 2025-12-04T13:35:25.9255435Z Running 2975 items in this shard 2025-12-04T13:35:25.9255437Z 2025-12-04T13:35:25.9255588Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_complex64 PASSED [0.1108s] [ 0%] 2025-12-04T13:35:25.9255716Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_float32 PASSED [0.0517s] [ 0%] 2025-12-04T13:35:25.9255858Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_bfloat16 PASSED [0.0876s] [ 0%] 2025-12-04T13:35:25.9255988Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_float16 PASSED [0.0446s] [ 0%] 2025-12-04T13:35:25.9256132Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_float64 PASSED [0.0450s] [ 0%] 2025-12-04T13:35:25.9256268Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_int64 PASSED [0.8982s] [ 0%] 2025-12-04T13:35:25.9256394Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_uint8 PASSED [0.7557s] [ 0%] 2025-12-04T13:35:25.9256525Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_bfloat16 PASSED [0.0185s] [ 0%] 2025-12-04T13:35:25.9256662Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_complex128 PASSED [0.0065s] [ 0%] 2025-12-04T13:35:25.9256810Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_complex32 PASSED [0.7529s] [ 0%] 2025-12-04T13:35:25.9256935Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_float64 PASSED [0.7425s] [ 0%] 2025-12-04T13:35:25.9257072Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_int64 PASSED [0.0091s] [ 0%] 2025-12-04T13:35:25.9257199Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_complex128 PASSED [0.0919s] [ 0%] 2025-12-04T13:35:25.9257341Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_complex32 PASSED [0.7724s] [ 0%] 2025-12-04T13:35:25.9257468Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_float32 PASSED [0.7863s] [ 0%] 2025-12-04T13:35:25.9257601Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_int16 PASSED [0.0224s] [ 0%] 2025-12-04T13:35:25.9257726Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_bool PASSED [0.2301s] [ 0%] 2025-12-04T13:35:25.9257861Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_float32 PASSED [0.1688s] [ 0%] 2025-12-04T13:35:25.9257976Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_float64 PASSED [0.2008s] [ 0%] 2025-12-04T13:35:25.9258159Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_int8 PASSED [0.1676s] [ 0%] 2025-12-04T13:35:25.9258319Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_complex128 PASSED [0.7741s] [ 0%] 2025-12-04T13:35:25.9258473Z test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_uint8 PASSED [0.7648s] [ 0%] 2025-12-04T13:35:25.9258620Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_bfloat16 PASSED [0.8158s] [ 0%] 2025-12-04T13:35:25.9258744Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_float16 PASSED [0.7646s] [ 0%] 2025-12-04T13:35:25.9258906Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_float32 PASSED [0.7530s] [ 0%] 2025-12-04T13:35:25.9259038Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_int16 PASSED [0.7561s] [ 0%] 2025-12-04T13:35:25.9259178Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_int32 PASSED [0.7532s] [ 0%] 2025-12-04T13:35:25.9259305Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_uint8 PASSED [0.7475s] [ 0%] 2025-12-04T13:35:25.9259475Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_bfloat16 PASSED [0.7666s] [ 0%] 2025-12-04T13:35:25.9259654Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_int32 PASSED [0.7601s] [ 1%] 2025-12-04T13:35:25.9259829Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_int16 PASSED [0.7675s] [ 1%] 2025-12-04T13:35:25.9259977Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_uint8 PASSED [0.7623s] [ 1%] 2025-12-04T13:35:25.9260146Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_bfloat16 PASSED [0.7734s] [ 1%] 2025-12-04T13:35:25.9260297Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_float16 PASSED [0.0136s] [ 1%] 2025-12-04T13:35:25.9260466Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_float32 PASSED [0.7620s] [ 1%] 2025-12-04T13:35:25.9260646Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_int16 PASSED [0.0119s] [ 1%] 2025-12-04T13:35:25.9260796Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_int32 PASSED [0.7660s] [ 1%] 2025-12-04T13:35:25.9260959Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_bool PASSED [0.0189s] [ 1%] 2025-12-04T13:35:25.9261117Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_complex128 PASSED [0.8258s] [ 1%] 2025-12-04T13:35:25.9261297Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_bfloat16 PASSED [0.0192s] [ 1%] 2025-12-04T13:35:25.9261457Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_float16 PASSED [0.8112s] [ 1%] 2025-12-04T13:35:25.9261620Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_int16 PASSED [0.0169s] [ 1%] 2025-12-04T13:35:25.9261770Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_int64 PASSED [0.7800s] [ 1%] 2025-12-04T13:35:25.9261933Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_bfloat16 PASSED [0.0131s] [ 1%] 2025-12-04T13:35:25.9262077Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_bool PASSED [0.7823s] [ 1%] 2025-12-04T13:35:25.9262261Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_float32 PASSED [0.0132s] [ 1%] 2025-12-04T13:35:25.9262409Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int32 PASSED [0.7696s] [ 1%] 2025-12-04T13:35:25.9262570Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int64 PASSED [0.0118s] [ 1%] 2025-12-04T13:35:25.9262762Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_uint8 PASSED [0.7683s] [ 1%] 2025-12-04T13:35:25.9262922Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_complex64 PASSED [0.0288s] [ 1%] 2025-12-04T13:35:25.9263100Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_bool PASSED [0.7665s] [ 1%] 2025-12-04T13:35:25.9263254Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_complex128 PASSED [0.0258s] [ 1%] 2025-12-04T13:35:25.9263416Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_int8 PASSED [0.7852s] [ 1%] 2025-12-04T13:35:25.9263568Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_uint8 PASSED [0.0116s] [ 1%] 2025-12-04T13:35:25.9263721Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_bool PASSED [0.7703s] [ 1%] 2025-12-04T13:35:25.9263889Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_complex128 PASSED [0.0252s] [ 1%] 2025-12-04T13:35:25.9264070Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_uint8 PASSED [0.7647s] [ 1%] 2025-12-04T13:35:25.9264202Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_int64 PASSED [0.0238s] [ 1%] 2025-12-04T13:35:25.9264360Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_complex64 PASSED [0.0961s] [ 1%] 2025-12-04T13:35:25.9264495Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_float16 PASSED [0.0171s] [ 2%] 2025-12-04T13:35:25.9264648Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_int64 PASSED [0.0149s] [ 2%] 2025-12-04T13:35:25.9264804Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_float32 PASSED [0.0520s] [ 2%] 2025-12-04T13:35:25.9264941Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_int16 PASSED [0.0392s] [ 2%] 2025-12-04T13:35:25.9265100Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_bfloat16 PASSED [0.0577s] [ 2%] 2025-12-04T13:35:25.9265246Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_complex128 PASSED [0.7987s] [ 2%] 2025-12-04T13:35:25.9265401Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_float32 PASSED [0.0179s] [ 2%] 2025-12-04T13:35:25.9265547Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_bool PASSED [0.7645s] [ 2%] 2025-12-04T13:35:25.9265714Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_complex128 PASSED [0.0042s] [ 2%] 2025-12-04T13:35:25.9265861Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_complex64 PASSED [0.7613s] [ 2%] 2025-12-04T13:35:25.9266020Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_int16 PASSED [0.0038s] [ 2%] 2025-12-04T13:35:25.9266152Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_int32 PASSED [0.7625s] [ 2%] 2025-12-04T13:35:25.9266316Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_bfloat16 PASSED [0.0209s] [ 2%] 2025-12-04T13:35:25.9266458Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_complex128 PASSED [0.7656s] [ 2%] 2025-12-04T13:35:25.9266606Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_complex64 PASSED [0.0077s] [ 2%] 2025-12-04T13:35:25.9266739Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_float16 PASSED [0.7561s] [ 2%] 2025-12-04T13:35:25.9266879Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_float64 PASSED [0.0077s] [ 2%] 2025-12-04T13:35:25.9267040Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_int8 PASSED [0.7567s] [ 2%] 2025-12-04T13:35:25.9267207Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_float16 PASSED [0.0109s] [ 2%] 2025-12-04T13:35:25.9267371Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_float64 PASSED [0.7744s] [ 2%] 2025-12-04T13:35:25.9267506Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_bool PASSED [0.0156s] [ 2%] 2025-12-04T13:35:25.9267650Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float16 PASSED [0.0130s] [ 2%] 2025-12-04T13:35:25.9267801Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float32 PASSED [0.0096s] [ 2%] 2025-12-04T13:35:25.9267951Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_int64 PASSED [0.0073s] [ 2%] 2025-12-04T13:35:25.9268130Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_bfloat16 PASSED [0.0164s] [ 2%] 2025-12-04T13:35:25.9268396Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_float16 PASSED [0.0105s] [ 2%] 2025-12-04T13:35:25.9268529Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_bfloat16 PASSED [0.7706s] [ 2%] 2025-12-04T13:35:25.9268709Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_complex128 PASSED [0.0078s] [ 2%] 2025-12-04T13:35:25.9268846Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_float32 PASSED [0.7652s] [ 2%] 2025-12-04T13:35:25.9268993Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_float32 PASSED [0.0161s] [ 2%] 2025-12-04T13:35:25.9269124Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_int32 PASSED [0.0078s] [ 3%] 2025-12-04T13:35:25.9269288Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_float16 PASSED [0.7560s] [ 3%] 2025-12-04T13:35:25.9269448Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_int32 PASSED [0.0046s] [ 3%] 2025-12-04T13:35:25.9269609Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_bool PASSED [0.7524s] [ 3%] 2025-12-04T13:35:25.9269776Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_int8 PASSED [0.0044s] [ 3%] 2025-12-04T13:35:25.9269931Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_uint8 PASSED [0.7506s] [ 3%] 2025-12-04T13:35:25.9270097Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_float64 PASSED [0.0062s] [ 3%] 2025-12-04T13:35:25.9274428Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int16 PASSED [0.7717s] [ 3%] 2025-12-04T13:35:25.9274579Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int8 PASSED [0.0052s] [ 3%] 2025-12-04T13:35:25.9274706Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_float16 PASSED [0.0207s] [ 3%] 2025-12-04T13:35:25.9274827Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int64 PASSED [0.7701s] [ 3%] 2025-12-04T13:35:25.9274953Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int32 PASSED [0.0201s] [ 3%] 2025-12-04T13:35:25.9275072Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int8 PASSED [0.0125s] [ 3%] 2025-12-04T13:35:25.9275193Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_uint8 PASSED [0.0572s] [ 3%] 2025-12-04T13:35:25.9275315Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_bfloat16 PASSED [0.0186s] [ 3%] 2025-12-04T13:35:25.9275436Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_float64 PASSED [0.7813s] [ 3%] 2025-12-04T13:35:25.9275559Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_bfloat16 PASSED [0.0225s] [ 3%] 2025-12-04T13:35:25.9275733Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex64 PASSED [0.2138s] [ 3%] 2025-12-04T13:35:25.9275871Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_float16 PASSED [0.0154s] [ 3%] 2025-12-04T13:35:25.9275990Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int32 PASSED [0.7686s] [ 3%] 2025-12-04T13:35:25.9276115Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_bool PASSED [0.0053s] [ 3%] 2025-12-04T13:35:25.9276239Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int8 PASSED [0.7806s] [ 3%] 2025-12-04T13:35:25.9276370Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_bfloat16 PASSED [0.0063s] [ 3%] 2025-12-04T13:35:25.9276494Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_bool PASSED [0.0041s] [ 3%] 2025-12-04T13:35:25.9276631Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_complex128 PASSED [0.7654s] [ 3%] 2025-12-04T13:35:25.9276754Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_bool PASSED [0.0056s] [ 3%] 2025-12-04T13:35:25.9276898Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_complex128 PASSED [0.7664s] [ 3%] 2025-12-04T13:35:25.9277031Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_complex32 PASSED [0.0069s] [ 3%] 2025-12-04T13:35:25.9277168Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_complex64 PASSED [0.7536s] [ 3%] 2025-12-04T13:35:25.9277293Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_int8 PASSED [0.0055s] [ 4%] 2025-12-04T13:35:25.9277422Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int64 PASSED [0.7971s] [ 4%] 2025-12-04T13:35:25.9277550Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_uint8 PASSED [0.0385s] [ 4%] 2025-12-04T13:35:25.9277691Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_int16 PASSED [0.0509s] [ 4%] 2025-12-04T13:35:25.9277821Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_int16 PASSED [0.0382s] [ 4%] 2025-12-04T13:35:25.9277948Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_int64 PASSED [0.0373s] [ 4%] 2025-12-04T13:35:25.9278112Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int64 PASSED [0.8029s] [ 4%] 2025-12-04T13:35:25.9278238Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int8 PASSED [0.0385s] [ 4%] 2025-12-04T13:35:25.9278366Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_uint8 PASSED [0.0367s] [ 4%] 2025-12-04T13:35:25.9278495Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_float64 PASSED [0.0041s] [ 4%] 2025-12-04T13:35:25.9278641Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_complex64 PASSED [0.7692s] [ 4%] 2025-12-04T13:35:25.9278771Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_bool PASSED [0.7527s] [ 4%] 2025-12-04T13:35:25.9278903Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int64 PASSED [0.7615s] [ 4%] 2025-12-04T13:35:25.9279032Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_uint8 PASSED [0.7738s] [ 4%] 2025-12-04T13:35:25.9279151Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_int64 PASSED [0.0083s] [ 4%] 2025-12-04T13:35:25.9279328Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cauchy_cuda_bfloat16 SKIPPED [0.0002s] (Expected: cauchy is not comparable) [ 4%] 2025-12-04T13:35:25.9279452Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_float64 PASSED [0.0254s] [ 4%] 2025-12-04T13:35:25.9279596Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int16 PASSED [0.0108s] [ 4%] 2025-12-04T13:35:25.9279728Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int64 PASSED [0.7725s] [ 4%] 2025-12-04T13:35:25.9279843Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int8 PASSED [0.0122s] [ 4%] 2025-12-04T13:35:25.9279969Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_complex64 PASSED [0.0168s] [ 4%] 2025-12-04T13:35:25.9280091Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_float64 PASSED [0.7828s] [ 4%] 2025-12-04T13:35:25.9280215Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_float32 PASSED [0.0422s] [ 4%] 2025-12-04T13:35:25.9280340Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_int64 PASSED [0.0254s] [ 4%] 2025-12-04T13:35:25.9280469Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_float64 PASSED [0.0351s] [ 4%] 2025-12-04T13:35:25.9280608Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int16 PASSED [0.0249s] [ 4%] 2025-12-04T13:35:25.9280731Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int64 PASSED [0.7848s] [ 4%] 2025-12-04T13:35:25.9280855Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_bfloat16 PASSED [0.0283s] [ 4%] 2025-12-04T13:35:25.9280974Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_bool PASSED [0.0198s] [ 4%] 2025-12-04T13:35:25.9281101Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_complex128 PASSED [0.0264s] [ 4%] 2025-12-04T13:35:25.9281223Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_float64 PASSED [0.0250s] [ 5%] 2025-12-04T13:35:25.9281344Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_int32 PASSED [0.0192s] [ 5%] 2025-12-04T13:35:25.9281476Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_float16 PASSED [0.7727s] [ 5%] 2025-12-04T13:35:25.9281606Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_int16 PASSED [0.0047s] [ 5%] 2025-12-04T13:35:25.9281728Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_complex64 PASSED [0.0346s] [ 5%] 2025-12-04T13:35:25.9281848Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_int16 PASSED [0.7958s] [ 5%] 2025-12-04T13:35:25.9281981Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_bfloat16 PASSED [0.0142s] [ 5%] 2025-12-04T13:35:25.9282120Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_complex128 PASSED [0.0281s] [ 5%] 2025-12-04T13:35:25.9282258Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_complex64 PASSED [0.7858s] [ 5%] 2025-12-04T13:35:25.9282392Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_float64 PASSED [0.0142s] [ 5%] 2025-12-04T13:35:25.9282524Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_int64 PASSED [0.7850s] [ 5%] 2025-12-04T13:35:25.9282654Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_uint8 PASSED [0.0098s] [ 5%] 2025-12-04T13:35:25.9282787Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_bool PASSED [0.0163s] [ 5%] 2025-12-04T13:35:25.9282918Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_uint8 PASSED [0.0158s] [ 5%] 2025-12-04T13:35:25.9283052Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_complex128 PASSED [0.0219s] [ 5%] 2025-12-04T13:35:25.9283181Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_float64 PASSED [0.7868s] [ 5%] 2025-12-04T13:35:25.9283330Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_float16 PASSED [0.0856s] [ 5%] 2025-12-04T13:35:25.9283464Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_int16 PASSED [0.0705s] [ 5%] 2025-12-04T13:35:25.9283587Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_int64 PASSED [0.0717s] [ 5%] 2025-12-04T13:35:25.9283709Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_int8 PASSED [0.0692s] [ 5%] 2025-12-04T13:35:25.9283834Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_complex128 PASSED [0.3798s] [ 5%] 2025-12-04T13:35:25.9283954Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_float16 PASSED [0.8038s] [ 5%] 2025-12-04T13:35:25.9284071Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_uint8 PASSED [0.0163s] [ 5%] 2025-12-04T13:35:25.9284197Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_complex128 PASSED [0.3786s] [ 5%] 2025-12-04T13:35:25.9284338Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_float64 PASSED [0.0228s] [ 5%] 2025-12-04T13:35:25.9284474Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_complex128 PASSED [0.7796s] [ 5%] 2025-12-04T13:35:25.9284607Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_float16 PASSED [0.0079s] [ 5%] 2025-12-04T13:35:25.9284737Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_int32 PASSED [0.7809s] [ 5%] 2025-12-04T13:35:25.9284862Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_bfloat16 PASSED [0.0251s] [ 5%] 2025-12-04T13:35:25.9284987Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_float32 PASSED [0.7724s] [ 5%] 2025-12-04T13:35:25.9285113Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_float64 PASSED [0.0099s] [ 6%] 2025-12-04T13:35:25.9285237Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_int32 PASSED [0.7648s] [ 6%] 2025-12-04T13:35:25.9285362Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_float64 PASSED [0.0157s] [ 6%] 2025-12-04T13:35:25.9285484Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_int8 PASSED [0.0113s] [ 6%] 2025-12-04T13:35:25.9285606Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_bfloat16 PASSED [0.7658s] [ 6%] 2025-12-04T13:35:25.9285729Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_complex32 PASSED [0.0085s] [ 6%] 2025-12-04T13:35:25.9285849Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_float32 PASSED [0.7569s] [ 6%] 2025-12-04T13:35:25.9285969Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_int32 PASSED [0.0068s] [ 6%] 2025-12-04T13:35:25.9286089Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_uint8 PASSED [0.7665s] [ 6%] 2025-12-04T13:35:25.9286217Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_int64 PASSED [0.0108s] [ 6%] 2025-12-04T13:35:25.9286342Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_uint8 PASSED [0.7658s] [ 6%] 2025-12-04T13:35:25.9286479Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_bfloat16 PASSED [0.0112s] [ 6%] 2025-12-04T13:35:25.9286611Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_float32 PASSED [0.0094s] [ 6%] 2025-12-04T13:35:25.9286742Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_complex32 PASSED [0.7831s] [ 6%] 2025-12-04T13:35:25.9286867Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_float32 PASSED [0.0109s] [ 6%] 2025-12-04T13:35:25.9287025Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_bfloat16 PASSED [0.0082s] [ 6%] 2025-12-04T13:35:25.9287175Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_complex64 PASSED [0.0082s] [ 6%] 2025-12-04T13:35:25.9287308Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_int8 PASSED [0.0063s] [ 6%] 2025-12-04T13:35:25.9287442Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_uint8 PASSED [0.0061s] [ 6%] 2025-12-04T13:35:25.9287566Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_float16 PASSED [0.0201s] [ 6%] 2025-12-04T13:35:25.9287690Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_float32 PASSED [0.0170s] [ 6%] 2025-12-04T13:35:25.9287812Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_int16 PASSED [0.7886s] [ 6%] 2025-12-04T13:35:25.9288144Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_bfloat16 SKIPPED [0.0014s] (Skipped _refs.div.floor_rounding with bfloat16Divide by 0: _refs produces NaN, torch produces +/-inf) [ 6%] 2025-12-04T13:35:25.9288304Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_float32 PASSED [0.2925s] [ 6%] 2025-12-04T13:35:25.9288439Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_int64 PASSED [0.0852s] [ 6%] 2025-12-04T13:35:25.9288582Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_bfloat16 PASSED [0.0593s] [ 6%] 2025-12-04T13:35:25.9288723Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_float32 PASSED [0.0568s] [ 6%] 2025-12-04T13:35:25.9288858Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_int16 PASSED [0.0555s] [ 6%] 2025-12-04T13:35:25.9288998Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_int32 PASSED [0.0556s] [ 6%] 2025-12-04T13:35:25.9289137Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int64 PASSED [0.0430s] [ 6%] 2025-12-04T13:35:25.9289274Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int8 PASSED [0.0420s] [ 7%] 2025-12-04T13:35:25.9289396Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_bfloat16 PASSED [0.7846s] [ 7%] 2025-12-04T13:35:25.9289519Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_complex128 PASSED [0.0046s] [ 7%] 2025-12-04T13:35:25.9289648Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_complex128 PASSED [0.7635s] [ 7%] 2025-12-04T13:35:25.9289772Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_complex64 PASSED [0.0049s] [ 7%] 2025-12-04T13:35:25.9289897Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_float16 PASSED [0.0042s] [ 7%] 2025-12-04T13:35:25.9290070Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_bfloat16 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 7%] 2025-12-04T13:35:25.9290241Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_float16 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 7%] 2025-12-04T13:35:25.9290406Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_int8 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 7%] 2025-12-04T13:35:25.9290580Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_float32 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 7%] 2025-12-04T13:35:25.9290751Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_int64 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 7%] 2025-12-04T13:35:25.9290966Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_bfloat16 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 7%] 2025-12-04T13:35:25.9291166Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int16 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 7%] 2025-12-04T13:35:25.9291352Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int64 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 7%] 2025-12-04T13:35:25.9291537Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int8 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 7%] 2025-12-04T13:35:25.9291721Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_uint8 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 7%] 2025-12-04T13:35:25.9291838Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_bool PASSED [0.0357s] [ 7%] 2025-12-04T13:35:25.9291957Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int16 PASSED [0.0367s] [ 7%] 2025-12-04T13:35:25.9292084Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int32 PASSED [0.0366s] [ 7%] 2025-12-04T13:35:25.9292204Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_int16 PASSED [0.7633s] [ 7%] 2025-12-04T13:35:25.9292322Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_int64 PASSED [0.0234s] [ 7%] 2025-12-04T13:35:25.9292443Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_bfloat16 PASSED [0.3240s] [ 7%] 2025-12-04T13:35:25.9292564Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_float32 PASSED [0.0177s] [ 7%] 2025-12-04T13:35:25.9292684Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_int16 PASSED [0.7828s] [ 7%] 2025-12-04T13:35:25.9292809Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_bfloat16 PASSED [0.0219s] [ 7%] 2025-12-04T13:35:25.9292933Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_complex64 PASSED [0.3893s] [ 7%] 2025-12-04T13:35:25.9293054Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_float64 PASSED [0.0162s] [ 7%] 2025-12-04T13:35:25.9293188Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_complex128 PASSED [0.7668s] [ 7%] 2025-12-04T13:35:25.9293315Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_float64 PASSED [0.0043s] [ 7%] 2025-12-04T13:35:25.9293441Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_int16 PASSED [0.7599s] [ 8%] 2025-12-04T13:35:25.9293567Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_bool PASSED [0.0051s] [ 8%] 2025-12-04T13:35:25.9293697Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_float32 PASSED [0.7615s] [ 8%] 2025-12-04T13:35:25.9293825Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_uint8 PASSED [0.0053s] [ 8%] 2025-12-04T13:35:25.9293948Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_int32 PASSED [0.0049s] [ 8%] 2025-12-04T13:35:25.9294067Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_uint8 PASSED [0.0046s] [ 8%] 2025-12-04T13:35:25.9294190Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_float64 PASSED [0.7767s] [ 8%] 2025-12-04T13:35:25.9294309Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int16 PASSED [0.0153s] [ 8%] 2025-12-04T13:35:25.9294428Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int8 PASSED [0.0126s] [ 8%] 2025-12-04T13:35:25.9294614Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exponential_cuda_float32 SKIPPED [0.0001s] (Expected: exponential is not comparable) [ 8%] 2025-12-04T13:35:25.9294753Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_int32 PASSED [0.7783s] [ 8%] 2025-12-04T13:35:25.9294878Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_int8 PASSED [0.0225s] [ 8%] 2025-12-04T13:35:25.9295004Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_float32 PASSED [3.6674s] [ 8%] 2025-12-04T13:35:25.9295128Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_float64 PASSED [3.6606s] [ 8%] 2025-12-04T13:35:25.9295250Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_bool PASSED [2.3292s] [ 8%] 2025-12-04T13:35:25.9295375Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_float32 PASSED [1.2960s] [ 8%] 2025-12-04T13:35:25.9295499Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_float64 PASSED [1.1734s] [ 8%] 2025-12-04T13:35:25.9295622Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int64 PASSED [1.2973s] [ 8%] 2025-12-04T13:35:25.9295745Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_bool PASSED [2.0987s] [ 8%] 2025-12-04T13:35:25.9295879Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_float32 PASSED [1.3300s] [ 8%] 2025-12-04T13:35:25.9296004Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_float64 PASSED [2.0764s] [ 8%] 2025-12-04T13:35:25.9296127Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_int8 PASSED [1.2636s] [ 8%] 2025-12-04T13:35:25.9296259Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_float64 PASSED [0.0105s] [ 8%] 2025-12-04T13:35:25.9296387Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_int32 PASSED [1.2399s] [ 8%] 2025-12-04T13:35:25.9296510Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int8 PASSED [1.4958s] [ 8%] 2025-12-04T13:35:25.9296639Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_complex32 PASSED [2.6634s] [ 8%] 2025-12-04T13:35:25.9296762Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int32 PASSED [1.8569s] [ 8%] 2025-12-04T13:35:25.9296884Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int8 PASSED [1.2390s] [ 8%] 2025-12-04T13:35:25.9297007Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int64 PASSED [0.0091s] [ 8%] 2025-12-04T13:35:25.9297132Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int8 PASSED [1.2321s] [ 8%] 2025-12-04T13:35:25.9297254Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int16 PASSED [0.0067s] [ 9%] 2025-12-04T13:35:25.9297377Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int64 PASSED [1.2524s] [ 9%] 2025-12-04T13:35:25.9297503Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_float64 PASSED [0.0081s] [ 9%] 2025-12-04T13:35:25.9297628Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_bool PASSED [1.2656s] [ 9%] 2025-12-04T13:35:25.9297758Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_complex64 PASSED [0.0081s] [ 9%] 2025-12-04T13:35:25.9297886Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_bool PASSED [1.2470s] [ 9%] 2025-12-04T13:35:25.9298022Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_complex32 PASSED [0.0061s] [ 9%] 2025-12-04T13:35:25.9298199Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_complex64 PASSED [1.2348s] [ 9%] 2025-12-04T13:35:25.9298332Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_float16 PASSED [0.0058s] [ 9%] 2025-12-04T13:35:25.9298692Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_float16 PASSED [4.1746s] [ 9%] 2025-12-04T13:35:25.9298840Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_float64 PASSED [2.1698s] [ 9%] 2025-12-04T13:35:25.9298965Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_uint8 PASSED [1.5477s] [ 9%] 2025-12-04T13:35:25.9299092Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_float16 PASSED [1.5514s] [ 9%] 2025-12-04T13:35:25.9299215Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int32 PASSED [1.2466s] [ 9%] 2025-12-04T13:35:25.9299362Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_float16 SKIPPED [0.0003s] (Skipped!) [ 9%] 2025-12-04T13:35:25.9299488Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_int32 PASSED [1.2473s] [ 9%] 2025-12-04T13:35:25.9299622Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_complex32 PASSED [0.8159s] [ 9%] 2025-12-04T13:35:25.9299758Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_int16 PASSED [1.6790s] [ 9%] 2025-12-04T13:35:25.9299883Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_int32 PASSED [1.1997s] [ 9%] 2025-12-04T13:35:25.9300014Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_complex64 PASSED [0.0076s] [ 9%] 2025-12-04T13:35:25.9300137Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_int16 PASSED [1.2457s] [ 9%] 2025-12-04T13:35:25.9300260Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_int8 PASSED [0.0073s] [ 9%] 2025-12-04T13:35:25.9300391Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_complex32 PASSED [0.8641s] [ 9%] 2025-12-04T13:35:25.9300520Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_float32 PASSED [0.7583s] [ 9%] 2025-12-04T13:35:25.9300645Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_int64 PASSED [0.0082s] [ 9%] 2025-12-04T13:35:25.9300770Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_int16 PASSED [0.7628s] [ 9%] 2025-12-04T13:35:25.9300896Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_float64 PASSED [0.4070s] [ 9%] 2025-12-04T13:35:25.9301020Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_bfloat16 PASSED [0.0316s] [ 9%] 2025-12-04T13:35:25.9301144Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_complex64 PASSED [0.0231s] [ 9%] 2025-12-04T13:35:25.9301264Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int16 PASSED [0.0048s] [ 9%] 2025-12-04T13:35:25.9301386Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_int64 PASSED [0.0026s] [ 10%] 2025-12-04T13:35:25.9301507Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_bool PASSED [0.0025s] [ 10%] 2025-12-04T13:35:25.9301635Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_complex128 PASSED [0.0028s] [ 10%] 2025-12-04T13:35:25.9301759Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_float32 PASSED [0.0026s] [ 10%] 2025-12-04T13:35:25.9301879Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int32 PASSED [0.0026s] [ 10%] 2025-12-04T13:35:25.9301999Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_uint8 PASSED [0.0025s] [ 10%] 2025-12-04T13:35:25.9302131Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_bfloat16 PASSED [0.0947s] [ 10%] 2025-12-04T13:35:25.9302258Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int64 PASSED [0.8141s] [ 10%] 2025-12-04T13:35:25.9302400Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_float64 PASSED [0.0198s] [ 10%] 2025-12-04T13:35:25.9302529Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int8 PASSED [0.0108s] [ 10%] 2025-12-04T13:35:25.9302661Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_float16 PASSED [0.2295s] [ 10%] 2025-12-04T13:35:25.9302790Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_int32 PASSED [0.0519s] [ 10%] 2025-12-04T13:35:25.9302908Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_int8 PASSED [0.0321s] [ 10%] 2025-12-04T13:35:25.9303024Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_bool PASSED [0.0305s] [ 10%] 2025-12-04T13:35:25.9303142Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_int32 PASSED [0.0322s] [ 10%] 2025-12-04T13:35:25.9303259Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int64 PASSED [0.0400s] [ 10%] 2025-12-04T13:35:25.9303376Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int8 PASSED [0.0399s] [ 10%] 2025-12-04T13:35:25.9303502Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_uint8 PASSED [0.0399s] [ 10%] 2025-12-04T13:35:25.9303626Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frexp_cuda_float32 PASSED [0.0185s] [ 10%] 2025-12-04T13:35:25.9303741Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int32 PASSED [0.0365s] [ 10%] 2025-12-04T13:35:25.9303857Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int8 PASSED [0.1685s] [ 10%] 2025-12-04T13:35:25.9303971Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_uint8 PASSED [0.0354s] [ 10%] 2025-12-04T13:35:25.9304089Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_float32 PASSED [0.0374s] [ 10%] 2025-12-04T13:35:25.9304215Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_int8 PASSED [0.0588s] [ 10%] 2025-12-04T13:35:25.9304343Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_bfloat16 PASSED [1.2971s] [ 10%] 2025-12-04T13:35:25.9304469Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_complex32 PASSED [0.0075s] [ 10%] 2025-12-04T13:35:25.9304590Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_int32 PASSED [1.2579s] [ 10%] 2025-12-04T13:35:25.9304717Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_complex32 PASSED [0.0055s] [ 10%] 2025-12-04T13:35:25.9304839Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_float32 PASSED [1.2530s] [ 10%] 2025-12-04T13:35:25.9304961Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int32 PASSED [0.0050s] [ 10%] 2025-12-04T13:35:25.9305080Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_float64 PASSED [0.3133s] [ 11%] 2025-12-04T13:35:25.9305206Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_igammac_cuda_float64 PASSED [0.0626s] [ 11%] 2025-12-04T13:35:25.9305337Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_complex64 PASSED [1.2827s] [ 11%] 2025-12-04T13:35:25.9305462Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_float64 PASSED [0.0070s] [ 11%] 2025-12-04T13:35:25.9305584Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_int8 PASSED [1.2501s] [ 11%] 2025-12-04T13:35:25.9305716Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_complex32 PASSED [0.0054s] [ 11%] 2025-12-04T13:35:25.9305846Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_complex64 PASSED [1.2657s] [ 11%] 2025-12-04T13:35:25.9305991Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_bool PASSED [0.0056s] [ 11%] 2025-12-04T13:35:25.9306130Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_complex32 PASSED [1.2714s] [ 11%] 2025-12-04T13:35:25.9306263Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_complex64 PASSED [0.0067s] [ 11%] 2025-12-04T13:35:25.9306387Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_int32 PASSED [1.2425s] [ 11%] 2025-12-04T13:35:25.9306523Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_complex128 PASSED [0.0054s] [ 11%] 2025-12-04T13:35:25.9306651Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_uint8 PASSED [1.2844s] [ 11%] 2025-12-04T13:35:25.9306775Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_float16 PASSED [0.1266s] [ 11%] 2025-12-04T13:35:25.9306899Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_float32 PASSED [0.1092s] [ 11%] 2025-12-04T13:35:25.9307025Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_float64 PASSED [0.1078s] [ 11%] 2025-12-04T13:35:25.9307162Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_bfloat16 PASSED [1.2788s] [ 11%] 2025-12-04T13:35:25.9307283Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_bool PASSED [0.0145s] [ 11%] 2025-12-04T13:35:25.9307412Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_complex32 PASSED [0.0305s] [ 11%] 2025-12-04T13:35:25.9307536Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_float16 PASSED [0.0157s] [ 11%] 2025-12-04T13:35:25.9307659Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int16 PASSED [1.2841s] [ 11%] 2025-12-04T13:35:25.9307780Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_float16 PASSED [0.0140s] [ 11%] 2025-12-04T13:35:25.9307905Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_float32 PASSED [0.0115s] [ 11%] 2025-12-04T13:35:25.9308026Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_int16 PASSED [1.2895s] [ 11%] 2025-12-04T13:35:25.9308175Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_int64 PASSED [0.0119s] [ 11%] 2025-12-04T13:35:25.9308294Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_int8 PASSED [0.0106s] [ 11%] 2025-12-04T13:35:25.9308417Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_bool PASSED [1.2704s] [ 11%] 2025-12-04T13:35:25.9308544Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_float16 PASSED [0.0137s] [ 11%] 2025-12-04T13:35:25.9308671Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_float64 PASSED [0.0112s] [ 11%] 2025-12-04T13:35:25.9308795Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_int16 PASSED [1.2687s] [ 12%] 2025-12-04T13:35:25.9308919Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_uint8 PASSED [0.0115s] [ 12%] 2025-12-04T13:35:25.9309048Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_bfloat16 PASSED [0.0116s] [ 12%] 2025-12-04T13:35:25.9309173Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_float64 PASSED [1.2671s] [ 12%] 2025-12-04T13:35:25.9309297Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int32 PASSED [0.0119s] [ 12%] 2025-12-04T13:35:25.9309420Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_uint8 PASSED [0.0094s] [ 12%] 2025-12-04T13:35:25.9309548Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex128 PASSED [1.2729s] [ 12%] 2025-12-04T13:35:25.9309669Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_bfloat16 PASSED [0.0081s] [ 12%] 2025-12-04T13:35:25.9309819Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_float32 PASSED [1.2436s] [ 12%] 2025-12-04T13:35:25.9309951Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_float64 PASSED [0.0054s] [ 12%] 2025-12-04T13:35:25.9310074Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_bfloat16 PASSED [1.2869s] [ 12%] 2025-12-04T13:35:25.9310198Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_complex128 PASSED [0.0207s] [ 12%] 2025-12-04T13:35:25.9310319Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_float32 PASSED [0.0085s] [ 12%] 2025-12-04T13:35:25.9310443Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_bfloat16 PASSED [0.0196s] [ 12%] 2025-12-04T13:35:25.9310576Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_float32 PASSED [1.2514s] [ 12%] 2025-12-04T13:35:25.9310706Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_int64 PASSED [0.0055s] [ 12%] 2025-12-04T13:35:25.9310852Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_bool PASSED [1.2691s] [ 12%] 2025-12-04T13:35:25.9310986Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_float16 PASSED [0.0086s] [ 12%] 2025-12-04T13:35:25.9311119Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_uint8 PASSED [1.2466s] [ 12%] 2025-12-04T13:35:25.9311258Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_float64 PASSED [0.3024s] [ 12%] 2025-12-04T13:35:25.9311387Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_float64 PASSED [0.0477s] [ 12%] 2025-12-04T13:35:25.9311520Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svd_cuda_complex128 PASSED [1.5834s] [ 12%] 2025-12-04T13:35:25.9311658Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svdvals_cuda_complex128 PASSED [0.0229s] [ 12%] 2025-12-04T13:35:25.9311794Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svdvals_cuda_float32 PASSED [0.1142s] [ 12%] 2025-12-04T13:35:25.9311926Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_bfloat16 PASSED [1.2807s] [ 12%] 2025-12-04T13:35:25.9312053Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_float32 PASSED [0.0214s] [ 12%] 2025-12-04T13:35:25.9312174Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int32 XFAIL [0.0100s] [ 12%] 2025-12-04T13:35:25.9312298Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_bfloat16 PASSED [1.2969s] [ 12%] 2025-12-04T13:35:25.9312417Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int8 PASSED [0.0170s] [ 12%] 2025-12-04T13:35:25.9312545Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_complex128 PASSED [0.0283s] [ 12%] 2025-12-04T13:35:25.9312672Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_complex64 PASSED [0.0273s] [ 13%] 2025-12-04T13:35:25.9312794Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_int16 PASSED [1.2881s] [ 13%] 2025-12-04T13:35:25.9312915Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_float32 PASSED [0.0198s] [ 13%] 2025-12-04T13:35:25.9313033Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_int16 PASSED [0.0155s] [ 13%] 2025-12-04T13:35:25.9313148Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_int8 PASSED [0.0141s] [ 13%] 2025-12-04T13:35:25.9313264Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_bool PASSED [1.2822s] [ 13%] 2025-12-04T13:35:25.9313406Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_bool PASSED [1.2864s] [ 13%] 2025-12-04T13:35:25.9313577Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_complex128 PASSED [1.2471s] [ 13%] 2025-12-04T13:35:25.9313732Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_float32 PASSED [1.2742s] [ 13%] 2025-12-04T13:35:25.9313874Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_int16 PASSED [1.2711s] [ 13%] 2025-12-04T13:35:25.9314004Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp2_cuda_float16 PASSED [0.0167s] [ 13%] 2025-12-04T13:35:25.9314133Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_complex64 XFAIL [0.0357s] [ 13%] 2025-12-04T13:35:25.9314262Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_float32 PASSED [1.3058s] [ 13%] 2025-12-04T13:35:25.9314397Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_complex128 PASSED [0.0470s] [ 13%] 2025-12-04T13:35:25.9314528Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_float32 PASSED [0.0311s] [ 13%] 2025-12-04T13:35:25.9314666Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_uint8 PASSED [1.2568s] [ 13%] 2025-12-04T13:35:25.9314792Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_bool PASSED [0.0371s] [ 13%] 2025-12-04T13:35:25.9314925Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_complex64 PASSED [0.0553s] [ 13%] 2025-12-04T13:35:25.9315053Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_float16 PASSED [0.0409s] [ 13%] 2025-12-04T13:35:25.9315178Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_int16 PASSED [0.0371s] [ 13%] 2025-12-04T13:35:25.9315311Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_complex128 PASSED [0.0458s] [ 13%] 2025-12-04T13:35:25.9315446Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_complex64 PASSED [0.0455s] [ 13%] 2025-12-04T13:35:25.9315578Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_float16 PASSED [0.0327s] [ 13%] 2025-12-04T13:35:25.9315705Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_int32 PASSED [0.0303s] [ 13%] 2025-12-04T13:35:25.9315837Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_complex128 PASSED [0.1151s] [ 13%] 2025-12-04T13:35:25.9315963Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_float32 PASSED [0.1113s] [ 13%] 2025-12-04T13:35:25.9316117Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_complex128 PASSED [0.4409s] [ 13%] 2025-12-04T13:35:25.9316268Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_float16 PASSED [0.4958s] [ 13%] 2025-12-04T13:35:25.9316413Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_int32 XFAIL [0.0478s] [ 13%] 2025-12-04T13:35:25.9316545Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_bfloat16 PASSED [1.3156s] [ 13%] 2025-12-04T13:35:25.9316670Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_int32 PASSED [0.0058s] [ 14%] 2025-12-04T13:35:25.9316795Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_uint8 PASSED [0.0056s] [ 14%] 2025-12-04T13:35:25.9316911Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_bool PASSED [0.0359s] [ 14%] 2025-12-04T13:35:25.9317032Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float64 PASSED [0.0384s] [ 14%] 2025-12-04T13:35:25.9317146Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int8 PASSED [0.0353s] [ 14%] 2025-12-04T13:35:25.9317297Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_bfloat16 PASSED [1.2746s] [ 14%] 2025-12-04T13:35:25.9317440Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_complex128 PASSED [0.0080s] [ 14%] 2025-12-04T13:35:25.9317574Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_complex64 PASSED [1.2887s] [ 14%] 2025-12-04T13:35:25.9317706Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_float16 PASSED [0.0077s] [ 14%] 2025-12-04T13:35:25.9317831Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int8 PASSED [1.2768s] [ 14%] 2025-12-04T13:35:25.9317954Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int16 PASSED [0.0354s] [ 14%] 2025-12-04T13:35:25.9318112Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int32 PASSED [0.0330s] [ 14%] 2025-12-04T13:35:25.9318238Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_uint8 PASSED [0.0319s] [ 14%] 2025-12-04T13:35:25.9318360Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float16 PASSED [1.2922s] [ 14%] 2025-12-04T13:35:25.9318496Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float64 PASSED [0.0136s] [ 14%] 2025-12-04T13:35:25.9318641Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_int32 PASSED [0.0066s] [ 14%] 2025-12-04T13:35:25.9318793Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_int64 PASSED [1.2555s] [ 14%] 2025-12-04T13:35:25.9318918Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_bfloat16 PASSED [0.0511s] [ 14%] 2025-12-04T13:35:25.9319044Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float16 PASSED [0.0482s] [ 14%] 2025-12-04T13:35:25.9319169Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float32 PASSED [0.0447s] [ 14%] 2025-12-04T13:35:25.9319294Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_int32 PASSED [1.2867s] [ 14%] 2025-12-04T13:35:25.9319421Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_bfloat16 PASSED [0.0071s] [ 14%] 2025-12-04T13:35:25.9319546Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_float16 PASSED [1.2501s] [ 14%] 2025-12-04T13:35:25.9319668Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_int8 PASSED [0.0062s] [ 14%] 2025-12-04T13:35:25.9319787Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_bfloat16 PASSED [0.0558s] [ 14%] 2025-12-04T13:35:25.9319907Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_float64 PASSED [0.0518s] [ 14%] 2025-12-04T13:35:25.9320034Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_float64 PASSED [0.0159s] [ 14%] 2025-12-04T13:35:25.9320162Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int16 PASSED [1.2918s] [ 14%] 2025-12-04T13:35:25.9320296Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_bfloat16 PASSED [0.0109s] [ 14%] 2025-12-04T13:35:25.9320425Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int16 PASSED [0.0075s] [ 14%] 2025-12-04T13:35:25.9320551Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int64 PASSED [1.2652s] [ 15%] 2025-12-04T13:35:25.9320672Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_bool PASSED [0.0135s] [ 15%] 2025-12-04T13:35:25.9320798Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_complex32 PASSED [1.2817s] [ 15%] 2025-12-04T13:35:25.9320922Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_float16 PASSED [0.0181s] [ 15%] 2025-12-04T13:35:25.9321068Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_float64 PASSED [1.2784s] [ 15%] 2025-12-04T13:35:25.9321201Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_int16 PASSED [0.0136s] [ 15%] 2025-12-04T13:35:25.9321320Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_int8 PASSED [1.2626s] [ 15%] 2025-12-04T13:35:25.9321436Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_bool PASSED [0.0374s] [ 15%] 2025-12-04T13:35:25.9321555Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float32 PASSED [0.0381s] [ 15%] 2025-12-04T13:35:25.9321675Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float64 PASSED [0.0376s] [ 15%] 2025-12-04T13:35:25.9321789Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_int16 PASSED [0.0354s] [ 15%] 2025-12-04T13:35:25.9321908Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_int32 PASSED [0.0105s] [ 15%] 2025-12-04T13:35:25.9322027Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_uint8 PASSED [1.2577s] [ 15%] 2025-12-04T13:35:25.9322214Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_bfloat16 SKIPPED [0.0003s] (Expected: empty is not comparable) [ 15%] 2025-12-04T13:35:25.9322392Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_complex64 SKIPPED [0.0002s] (Expected: empty is not comparable) [ 15%] 2025-12-04T13:35:25.9322590Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_complex128 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 15%] 2025-12-04T13:35:25.9322785Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_float16 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 15%] 2025-12-04T13:35:25.9322911Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_int32 PASSED [1.2893s] [ 15%] 2025-12-04T13:35:25.9323041Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_bfloat16 PASSED [0.0066s] [ 15%] 2025-12-04T13:35:25.9323170Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_complex64 PASSED [1.2566s] [ 15%] 2025-12-04T13:35:25.9323294Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_int64 PASSED [0.0057s] [ 15%] 2025-12-04T13:35:25.9323421Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_bfloat16 PASSED [1.2546s] [ 15%] 2025-12-04T13:35:25.9323553Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_complex32 PASSED [0.0068s] [ 15%] 2025-12-04T13:35:25.9323751Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_alpha_dropout_cuda_float64 SKIPPED [0.0002s] (Expected: dropout is not comparable) [ 15%] 2025-12-04T13:35:25.9323902Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_float16 PASSED [1.3236s] [ 15%] 2025-12-04T13:35:25.9324051Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_float64 PASSED [1.2742s] [ 15%] 2025-12-04T13:35:25.9324198Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardshrink_cuda_float64 PASSED [0.0251s] [ 15%] 2025-12-04T13:35:25.9324343Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_float64 PASSED [0.0161s] [ 15%] 2025-12-04T13:35:25.9324503Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_float16 PASSED [1.2869s] [ 15%] 2025-12-04T13:35:25.9324663Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_float32 PASSED [0.0130s] [ 16%] 2025-12-04T13:35:25.9324809Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_complex128 PASSED [1.2791s] [ 16%] 2025-12-04T13:35:25.9324969Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_float32 PASSED [0.0068s] [ 16%] 2025-12-04T13:35:25.9325123Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_leaky_relu_cuda_float16 PASSED [1.2795s] [ 16%] 2025-12-04T13:35:25.9325270Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_leaky_relu_cuda_float32 PASSED [0.0075s] [ 16%] 2025-12-04T13:35:25.9325431Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_float64 PASSED [1.2879s] [ 16%] 2025-12-04T13:35:25.9325592Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int64 PASSED [1.2535s] [ 16%] 2025-12-04T13:35:25.9325752Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_float64 PASSED [0.0171s] [ 16%] 2025-12-04T13:35:25.9325911Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_int64 PASSED [1.2785s] [ 16%] 2025-12-04T13:35:25.9326063Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mish_cuda_float64 PASSED [0.0397s] [ 16%] 2025-12-04T13:35:25.9326207Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_bfloat16 PASSED [0.0439s] [ 16%] 2025-12-04T13:35:25.9326352Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_float16 PASSED [0.0313s] [ 16%] 2025-12-04T13:35:25.9326504Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int32 PASSED [1.2732s] [ 16%] 2025-12-04T13:35:25.9326657Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int8 PASSED [1.2723s] [ 16%] 2025-12-04T13:35:25.9326799Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pdist_cuda_float32 PASSED [2.6602s] [ 16%] 2025-12-04T13:35:25.9326952Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_bfloat16 PASSED [0.0042s] [ 16%] 2025-12-04T13:35:25.9327103Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_float64 PASSED [0.0038s] [ 16%] 2025-12-04T13:35:25.9327254Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_bool PASSED [0.0035s] [ 16%] 2025-12-04T13:35:25.9327405Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_float16 PASSED [0.0035s] [ 16%] 2025-12-04T13:35:25.9327558Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_uint8 PASSED [0.0034s] [ 16%] 2025-12-04T13:35:25.9327710Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_float64 PASSED [0.0322s] [ 16%] 2025-12-04T13:35:25.9327865Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_uint8 PASSED [0.0342s] [ 16%] 2025-12-04T13:35:25.9328007Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_float16 PASSED [0.0604s] [ 16%] 2025-12-04T13:35:25.9328181Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_float32 PASSED [0.0500s] [ 16%] 2025-12-04T13:35:25.9328319Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_int8 PASSED [1.2760s] [ 16%] 2025-12-04T13:35:25.9328454Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_int8 PASSED [0.0136s] [ 16%] 2025-12-04T13:35:25.9328594Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_bfloat16 PASSED [0.0252s] [ 16%] 2025-12-04T13:35:25.9328731Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_float16 PASSED [1.2824s] [ 16%] 2025-12-04T13:35:25.9328915Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_smooth_l1_loss_cuda_float32 PASSED [0.0282s] [ 16%] 2025-12-04T13:35:25.9329077Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_smooth_l1_loss_cuda_float64 PASSED [1.2650s] [ 16%] 2025-12-04T13:35:25.9329235Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_bfloat16 PASSED [1.2715s] [ 17%] 2025-12-04T13:35:25.9329395Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_complex128 PASSED [1.2758s] [ 17%] 2025-12-04T13:35:25.9329553Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_float16 PASSED [1.2294s] [ 17%] 2025-12-04T13:35:25.9329706Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_int16 PASSED [1.2571s] [ 17%] 2025-12-04T13:35:25.9329867Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_complex128 PASSED [1.2520s] [ 17%] 2025-12-04T13:35:25.9330026Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softplus_cuda_float16 PASSED [1.2834s] [ 17%] 2025-12-04T13:35:25.9330175Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softshrink_cuda_float16 PASSED [0.0268s] [ 17%] 2025-12-04T13:35:25.9330319Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_int8 PASSED [0.0135s] [ 17%] 2025-12-04T13:35:25.9330463Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_uint8 PASSED [1.2580s] [ 17%] 2025-12-04T13:35:25.9330607Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_uint8 PASSED [0.0265s] [ 17%] 2025-12-04T13:35:25.9330763Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_uint8 PASSED [1.2641s] [ 17%] 2025-12-04T13:35:25.9330890Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_complex64 PASSED [0.0161s] [ 17%] 2025-12-04T13:35:25.9331066Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_cuda_bfloat16 SKIPPED [0.0002s] (Expected: normal is not comparable) [ 17%] 2025-12-04T13:35:25.9331254Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_number_mean_cuda_float32 SKIPPED [0.0001s] (Expected: normal is not comparable) [ 17%] 2025-12-04T13:35:25.9331373Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_int8 PASSED [0.0029s] [ 17%] 2025-12-04T13:35:25.9331508Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_float32 PASSED [0.0203s] [ 17%] 2025-12-04T13:35:25.9331631Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int32 PASSED [0.0177s] [ 17%] 2025-12-04T13:35:25.9331756Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int8 PASSED [0.0176s] [ 17%] 2025-12-04T13:35:25.9331887Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_complex32 PASSED [0.0248s] [ 17%] 2025-12-04T13:35:25.9332017Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_complex64 PASSED [1.2870s] [ 17%] 2025-12-04T13:35:25.9332142Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_int16 PASSED [0.0107s] [ 17%] 2025-12-04T13:35:25.9332264Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_bfloat16 PASSED [0.0537s] [ 17%] 2025-12-04T13:35:25.9332386Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_uint8 PASSED [0.0360s] [ 17%] 2025-12-04T13:35:25.9332509Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_float16 PASSED [0.0213s] [ 17%] 2025-12-04T13:35:25.9332647Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int64 PASSED [0.0120s] [ 17%] 2025-12-04T13:35:25.9332777Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_bool PASSED [0.0140s] [ 17%] 2025-12-04T13:35:25.9332905Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_float32 PASSED [1.2936s] [ 17%] 2025-12-04T13:35:25.9333029Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_int32 PASSED [0.0151s] [ 17%] 2025-12-04T13:35:25.9333155Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_complex32 PASSED [1.2512s] [ 17%] 2025-12-04T13:35:25.9333276Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_float64 PASSED [0.0055s] [ 17%] 2025-12-04T13:35:25.9333397Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_bool PASSED [1.2655s] [ 18%] 2025-12-04T13:35:25.9333516Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_int16 PASSED [0.0051s] [ 18%] 2025-12-04T13:35:25.9333640Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_bfloat16 PASSED [1.2578s] [ 18%] 2025-12-04T13:35:25.9333768Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_bool PASSED [0.0133s] [ 18%] 2025-12-04T13:35:25.9333888Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_int32 PASSED [1.2505s] [ 18%] 2025-12-04T13:35:25.9334005Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_uint8 PASSED [0.0112s] [ 18%] 2025-12-04T13:35:25.9334134Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_int16 PASSED [0.0155s] [ 18%] 2025-12-04T13:35:25.9334261Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_int64 PASSED [0.0419s] [ 18%] 2025-12-04T13:35:25.9334389Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_complex128 PASSED [1.2637s] [ 18%] 2025-12-04T13:35:25.9334516Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_float64 PASSED [0.0061s] [ 18%] 2025-12-04T13:35:25.9334637Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_uint8 PASSED [0.0114s] [ 18%] 2025-12-04T13:35:25.9334773Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_complex128 PASSED [1.2704s] [ 18%] 2025-12-04T13:35:25.9334902Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_float32 PASSED [0.0123s] [ 18%] 2025-12-04T13:35:25.9335033Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_float64 PASSED [1.2771s] [ 18%] 2025-12-04T13:35:25.9335162Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_complex128 PASSED [0.0216s] [ 18%] 2025-12-04T13:35:25.9335288Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_float64 PASSED [0.0176s] [ 18%] 2025-12-04T13:35:25.9335410Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_int64 PASSED [0.0141s] [ 18%] 2025-12-04T13:35:25.9335534Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_int8 PASSED [0.0140s] [ 18%] 2025-12-04T13:35:25.9335656Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_bfloat16 PASSED [1.2757s] [ 18%] 2025-12-04T13:35:25.9335775Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_bool PASSED [0.0077s] [ 18%] 2025-12-04T13:35:25.9335894Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_uint8 PASSED [1.2706s] [ 18%] 2025-12-04T13:35:25.9336021Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_complex64 PASSED [0.0145s] [ 18%] 2025-12-04T13:35:25.9336140Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int16 PASSED [1.2728s] [ 18%] 2025-12-04T13:35:25.9336259Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int8 PASSED [0.0110s] [ 18%] 2025-12-04T13:35:25.9336395Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_int8 PASSED [0.0109s] [ 18%] 2025-12-04T13:35:25.9336527Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_uint8 PASSED [1.2590s] [ 18%] 2025-12-04T13:35:25.9336651Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_float32 PASSED [0.0190s] [ 18%] 2025-12-04T13:35:25.9336771Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_int64 PASSED [0.0289s] [ 18%] 2025-12-04T13:35:25.9336905Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_float64 PASSED [1.2727s] [ 18%] 2025-12-04T13:35:25.9337038Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_uint8 PASSED [0.0055s] [ 18%] 2025-12-04T13:35:25.9337162Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_complex32 PASSED [1.2664s] [ 19%] 2025-12-04T13:35:25.9337283Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_float16 PASSED [1.2959s] [ 19%] 2025-12-04T13:35:25.9337402Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_int8 PASSED [0.0111s] [ 19%] 2025-12-04T13:35:25.9337528Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_uint8 PASSED [0.0093s] [ 19%] 2025-12-04T13:35:25.9337658Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_complex32 PASSED [0.0401s] [ 19%] 2025-12-04T13:35:25.9337783Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_float32 PASSED [1.2869s] [ 19%] 2025-12-04T13:35:25.9337902Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_bool PASSED [0.0152s] [ 19%] 2025-12-04T13:35:25.9338023Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_int64 PASSED [0.0110s] [ 19%] 2025-12-04T13:35:25.9338185Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_complex128 PASSED [0.2244s] [ 19%] 2025-12-04T13:35:25.9338308Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_complex32 PASSED [1.5101s] [ 19%] 2025-12-04T13:35:25.9338431Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_float32 PASSED [0.0233s] [ 19%] 2025-12-04T13:35:25.9338547Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_uint8 PASSED [0.0127s] [ 19%] 2025-12-04T13:35:25.9338664Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_int8 PASSED [1.2976s] [ 19%] 2025-12-04T13:35:25.9338780Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_bool PASSED [0.0247s] [ 19%] 2025-12-04T13:35:25.9338905Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_complex32 PASSED [0.4437s] [ 19%] 2025-12-04T13:35:25.9339029Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_complex64 PASSED [0.0288s] [ 19%] 2025-12-04T13:35:25.9339151Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_float16 PASSED [1.2741s] [ 19%] 2025-12-04T13:35:25.9339271Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_int64 PASSED [0.0167s] [ 19%] 2025-12-04T13:35:25.9339389Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_int8 PASSED [0.0124s] [ 19%] 2025-12-04T13:35:25.9339507Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_uint8 PASSED [1.2947s] [ 19%] 2025-12-04T13:35:25.9339649Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_bfloat16 PASSED [1.2517s] [ 19%] 2025-12-04T13:35:25.9339786Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_bool PASSED [1.2789s] [ 19%] 2025-12-04T13:35:25.9339929Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_complex128 PASSED [1.2589s] [ 19%] 2025-12-04T13:35:25.9340094Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int16 PASSED [1.2751s] [ 19%] 2025-12-04T13:35:25.9340244Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int32 PASSED [1.2704s] [ 19%] 2025-12-04T13:35:25.9340381Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_uint8 PASSED [0.0205s] [ 19%] 2025-12-04T13:35:25.9340513Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_int16 PASSED [0.0171s] [ 19%] 2025-12-04T13:35:25.9340647Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_uint8 PASSED [0.0136s] [ 19%] 2025-12-04T13:35:25.9340777Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_int16 PASSED [1.2896s] [ 19%] 2025-12-04T13:35:25.9340911Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_int32 PASSED [0.0224s] [ 20%] 2025-12-04T13:35:25.9341041Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_bool PASSED [0.0192s] [ 20%] 2025-12-04T13:35:25.9341169Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int64 PASSED [0.0139s] [ 20%] 2025-12-04T13:35:25.9341316Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_uint8 PASSED [1.2936s] [ 20%] 2025-12-04T13:35:25.9341441Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_int64 PASSED [0.0208s] [ 20%] 2025-12-04T13:35:25.9341579Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_float32 PASSED [0.3356s] [ 20%] 2025-12-04T13:35:25.9341712Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_uint8 PASSED [1.2707s] [ 20%] 2025-12-04T13:35:25.9341865Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_bool PASSED [1.2770s] [ 20%] 2025-12-04T13:35:25.9342024Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_complex32 PASSED [1.2561s] [ 20%] 2025-12-04T13:35:25.9342177Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int64 PASSED [1.2878s] [ 20%] 2025-12-04T13:35:25.9342307Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_bool PASSED [0.0214s] [ 20%] 2025-12-04T13:35:25.9342469Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16 PASSED [0.0245s] [ 20%] 2025-12-04T13:35:25.9342626Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int32 PASSED [0.0212s] [ 20%] 2025-12-04T13:35:25.9342786Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int16 PASSED [0.0210s] [ 20%] 2025-12-04T13:35:25.9342919Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_bfloat16 PASSED [1.3118s] [ 20%] 2025-12-04T13:35:25.9343050Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_bool PASSED [0.0191s] [ 20%] 2025-12-04T13:35:25.9343181Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int32 PASSED [0.0140s] [ 20%] 2025-12-04T13:35:25.9343308Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int8 PASSED [1.2905s] [ 20%] 2025-12-04T13:35:25.9343442Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_float32 PASSED [0.0241s] [ 20%] 2025-12-04T13:35:25.9343594Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_complex64 PASSED [0.0047s] [ 20%] 2025-12-04T13:35:25.9343745Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_float64 PASSED [1.2599s] [ 20%] 2025-12-04T13:35:25.9343892Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_int16 PASSED [1.2447s] [ 20%] 2025-12-04T13:35:25.9344060Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_uint8 PASSED [1.2530s] [ 20%] 2025-12-04T13:35:25.9344224Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_float64 PASSED [0.4483s] [ 20%] 2025-12-04T13:35:25.9344373Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int64 PASSED [0.0177s] [ 20%] 2025-12-04T13:35:25.9344519Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int8 PASSED [0.0134s] [ 20%] 2025-12-04T13:35:25.9344668Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_uint8 PASSED [1.2708s] [ 20%] 2025-12-04T13:35:25.9344804Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_float16 PASSED [0.0549s] [ 20%] 2025-12-04T13:35:25.9344937Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_uint8 PASSED [1.3205s] [ 20%] 2025-12-04T13:35:25.9345076Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_complex32 PASSED [0.0067s] [ 20%] 2025-12-04T13:35:25.9345227Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_complex64 PASSED [1.2807s] [ 21%] 2025-12-04T13:35:25.9345362Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_int32 PASSED [0.0051s] [ 21%] 2025-12-04T13:35:25.9345495Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_uint8 PASSED [1.2585s] [ 21%] 2025-12-04T13:35:25.9345618Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int32 PASSED [0.0159s] [ 21%] 2025-12-04T13:35:25.9345739Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int8 PASSED [0.0129s] [ 21%] 2025-12-04T13:35:25.9345871Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_bool PASSED [0.0042s] [ 21%] 2025-12-04T13:35:25.9346007Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_complex64 PASSED [1.2810s] [ 21%] 2025-12-04T13:35:25.9346134Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_float16 PASSED [0.0074s] [ 21%] 2025-12-04T13:35:25.9346266Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_bool PASSED [0.0042s] [ 21%] 2025-12-04T13:35:25.9346408Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_complex128 PASSED [0.0047s] [ 21%] 2025-12-04T13:35:25.9346543Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_float64 PASSED [1.2725s] [ 21%] 2025-12-04T13:35:25.9346678Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int64 PASSED [0.0058s] [ 21%] 2025-12-04T13:35:25.9346798Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_bool PASSED [0.0045s] [ 21%] 2025-12-04T13:35:25.9346927Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_complex64 PASSED [0.0050s] [ 21%] 2025-12-04T13:35:25.9347049Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_int32 PASSED [0.0040s] [ 21%] 2025-12-04T13:35:25.9347173Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_bfloat16 PASSED [0.0068s] [ 21%] 2025-12-04T13:35:25.9347301Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_complex64 PASSED [1.2847s] [ 21%] 2025-12-04T13:35:25.9347428Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_float16 PASSED [0.0122s] [ 21%] 2025-12-04T13:35:25.9347553Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_float32 PASSED [1.2828s] [ 21%] 2025-12-04T13:35:25.9347672Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stft_cuda_float64 PASSED [0.3567s] [ 21%] 2025-12-04T13:35:25.9347812Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_float16 PASSED [0.0739s] [ 21%] 2025-12-04T13:35:25.9347942Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_float32 PASSED [0.0519s] [ 21%] 2025-12-04T13:35:25.9348095Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_float64 PASSED [0.0514s] [ 21%] 2025-12-04T13:35:25.9348212Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_int64 PASSED [1.3580s] [ 21%] 2025-12-04T13:35:25.9348329Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_bool PASSED [0.0098s] [ 21%] 2025-12-04T13:35:25.9348451Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_complex64 PASSED [0.0099s] [ 21%] 2025-12-04T13:35:25.9348569Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_int16 PASSED [0.0075s] [ 21%] 2025-12-04T13:35:25.9348703Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_complex64 PASSED [1.2840s] [ 21%] 2025-12-04T13:35:25.9348831Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int16 PASSED [0.0078s] [ 21%] 2025-12-04T13:35:25.9348973Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int64 PASSED [0.0059s] [ 21%] 2025-12-04T13:35:25.9349101Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_uint8 PASSED [1.2714s] [ 22%] 2025-12-04T13:35:25.9349222Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_int64 PASSED [0.0047s] [ 22%] 2025-12-04T13:35:25.9349342Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_int8 PASSED [1.2954s] [ 22%] 2025-12-04T13:35:25.9349461Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_bfloat16 PASSED [0.0061s] [ 22%] 2025-12-04T13:35:25.9349582Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_complex64 PASSED [1.2869s] [ 22%] 2025-12-04T13:35:25.9349716Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_int16 PASSED [0.0706s] [ 22%] 2025-12-04T13:35:25.9349848Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_int64 PASSED [0.0064s] [ 22%] 2025-12-04T13:35:25.9349971Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_complex32 PASSED [0.0357s] [ 22%] 2025-12-04T13:35:25.9350096Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_complex128 PASSED [1.3374s] [ 22%] 2025-12-04T13:35:25.9350221Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_complex32 PASSED [0.0328s] [ 22%] 2025-12-04T13:35:25.9350341Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_float16 PASSED [0.0161s] [ 22%] 2025-12-04T13:35:25.9350463Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_float64 PASSED [0.0151s] [ 22%] 2025-12-04T13:35:25.9350597Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_bfloat16 PASSED [1.3229s] [ 22%] 2025-12-04T13:35:25.9350735Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_complex128 PASSED [0.0105s] [ 22%] 2025-12-04T13:35:25.9350864Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_int64 PASSED [1.2943s] [ 22%] 2025-12-04T13:35:25.9350982Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_bool PASSED [0.0143s] [ 22%] 2025-12-04T13:35:25.9351104Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_complex128 PASSED [0.0118s] [ 22%] 2025-12-04T13:35:25.9351226Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_complex64 PASSED [1.3004s] [ 22%] 2025-12-04T13:35:25.9351347Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_float16 PASSED [0.0144s] [ 22%] 2025-12-04T13:35:25.9351494Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_int16 PASSED [0.0117s] [ 22%] 2025-12-04T13:35:25.9351623Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_int8 PASSED [1.2901s] [ 22%] 2025-12-04T13:35:25.9351747Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_bfloat16 PASSED [0.0043s] [ 22%] 2025-12-04T13:35:25.9351874Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_complex128 PASSED [1.2824s] [ 22%] 2025-12-04T13:35:25.9351997Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_float32 PASSED [0.0042s] [ 22%] 2025-12-04T13:35:25.9352130Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_bool PASSED [1.2840s] [ 22%] 2025-12-04T13:35:25.9352267Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_complex64 PASSED [0.0071s] [ 22%] 2025-12-04T13:35:25.9352400Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_uint8 PASSED [1.2800s] [ 22%] 2025-12-04T13:35:25.9352529Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_float16 PASSED [0.0064s] [ 22%] 2025-12-04T13:35:25.9352668Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_float32 PASSED [1.2895s] [ 22%] 2025-12-04T13:35:25.9352791Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int8 PASSED [0.0058s] [ 22%] 2025-12-04T13:35:25.9352914Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_float32 PASSED [1.3065s] [ 23%] 2025-12-04T13:35:25.9353043Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_indices_cuda_int32 PASSED [0.0060s] [ 23%] 2025-12-04T13:35:25.9353162Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_bool PASSED [1.2670s] [ 23%] 2025-12-04T13:35:25.9353283Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_float32 PASSED [0.0073s] [ 23%] 2025-12-04T13:35:25.9353406Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_float64 PASSED [1.2895s] [ 23%] 2025-12-04T13:35:25.9353527Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_int16 PASSED [0.0062s] [ 23%] 2025-12-04T13:35:25.9353646Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_int64 PASSED [1.2654s] [ 23%] 2025-12-04T13:35:25.9353763Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_uint8 PASSED [0.0060s] [ 23%] 2025-12-04T13:35:25.9353893Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_indices_cuda_int32 PASSED [1.2935s] [ 23%] 2025-12-04T13:35:25.9354020Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_indices_cuda_int64 PASSED [0.0061s] [ 23%] 2025-12-04T13:35:25.9354146Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_bool PASSED [0.0531s] [ 23%] 2025-12-04T13:35:25.9354282Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_complex128 PASSED [0.0674s] [ 23%] 2025-12-04T13:35:25.9354414Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_complex32 XFAIL [0.0202s] [ 23%] 2025-12-04T13:35:25.9354542Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int8 PASSED [1.3624s] [ 23%] 2025-12-04T13:35:25.9354664Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_float64 PASSED [0.0149s] [ 23%] 2025-12-04T13:35:25.9354798Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_complex64 PASSED [1.2944s] [ 23%] 2025-12-04T13:35:25.9354928Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_copy_cuda_float64 PASSED [0.0088s] [ 23%] 2025-12-04T13:35:25.9355051Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_float16 PASSED [1.3029s] [ 23%] 2025-12-04T13:35:25.9355171Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int8 PASSED [0.0064s] [ 23%] 2025-12-04T13:35:25.9355314Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_bool PASSED [0.0044s] [ 23%] 2025-12-04T13:35:25.9355457Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_int64 PASSED [1.3030s] [ 23%] 2025-12-04T13:35:25.9355587Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_uint8 PASSED [0.0085s] [ 23%] 2025-12-04T13:35:25.9355715Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_complex128 PASSED [1.3011s] [ 23%] 2025-12-04T13:35:25.9355840Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_complex32 PASSED [0.0109s] [ 23%] 2025-12-04T13:35:25.9355964Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_complex64 PASSED [1.2614s] [ 23%] 2025-12-04T13:35:25.9356088Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_float16 PASSED [0.0107s] [ 23%] 2025-12-04T13:35:25.9356210Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_int16 PASSED [1.2641s] [ 23%] 2025-12-04T13:35:25.9356352Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_complex32 PASSED [0.0081s] [ 23%] 2025-12-04T13:35:25.9356478Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_float32 PASSED [0.0057s] [ 23%] 2025-12-04T13:35:25.9356602Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_int32 PASSED [0.0046s] [ 24%] 2025-12-04T13:35:25.9356726Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_complex64 PASSED [1.2440s] [ 24%] 2025-12-04T13:35:25.9356855Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_complex128 PASSED [0.0149s] [ 24%] 2025-12-04T13:35:25.9356980Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_float16 PASSED [1.2531s] [ 24%] 2025-12-04T13:35:25.9357104Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_bfloat16 PASSED [0.0044s] [ 24%] 2025-12-04T13:35:25.9357231Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_complex128 PASSED [1.2332s] [ 24%] 2025-12-04T13:35:25.9357352Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_float32 PASSED [0.0042s] [ 24%] 2025-12-04T13:35:25.9357472Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_float64 PASSED [1.2293s] [ 24%] 2025-12-04T13:35:25.9357608Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_complex_cuda_float16 PASSED [0.0052s] [ 24%] 2025-12-04T13:35:25.9357744Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_complex_cuda_float64 PASSED [1.2446s] [ 24%] 2025-12-04T13:35:25.9357871Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_float32 PASSED [0.0061s] [ 24%] 2025-12-04T13:35:25.9357997Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_int8 PASSED [1.2376s] [ 24%] 2025-12-04T13:35:25.9358156Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_bfloat16 PASSED [0.0203s] [ 24%] 2025-12-04T13:35:25.9358281Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_complex32 PASSED [0.0185s] [ 24%] 2025-12-04T13:35:25.9358400Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_int16 PASSED [0.0140s] [ 24%] 2025-12-04T13:35:25.9358519Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_uint8 PASSED [0.0140s] [ 24%] 2025-12-04T13:35:25.9358646Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_complex32 PASSED [1.2334s] [ 24%] 2025-12-04T13:35:25.9358777Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_complex64 PASSED [0.0052s] [ 24%] 2025-12-04T13:35:25.9358899Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_int32 PASSED [1.2353s] [ 24%] 2025-12-04T13:35:25.9359051Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_complex128 PASSED [0.0054s] [ 24%] 2025-12-04T13:35:25.9359187Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_float64 PASSED [0.0036s] [ 24%] 2025-12-04T13:35:25.9359307Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int16 PASSED [1.2186s] [ 24%] 2025-12-04T13:35:25.9359429Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int32 PASSED [0.0051s] [ 24%] 2025-12-04T13:35:25.9359549Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int64 PASSED [0.0035s] [ 24%] 2025-12-04T13:35:25.9359669Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_uint8 PASSED [1.2166s] [ 24%] 2025-12-04T13:35:25.9359788Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_bool PASSED [0.0109s] [ 24%] 2025-12-04T13:35:25.9359916Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_complex32 PASSED [0.0107s] [ 24%] 2025-12-04T13:35:25.9360037Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_float32 PASSED [1.2553s] [ 24%] 2025-12-04T13:35:25.9360178Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_int32 PASSED [0.0109s] [ 24%] 2025-12-04T13:35:25.9360296Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_int64 PASSED [0.0086s] [ 24%] 2025-12-04T13:35:25.9360415Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_int8 PASSED [1.2727s] [ 25%] 2025-12-04T13:35:25.9360533Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int64 PASSED [0.0497s] [ 25%] 2025-12-04T13:35:25.9360658Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_bfloat16 PASSED [1.2335s] [ 25%] 2025-12-04T13:35:25.9360782Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_complex32 PASSED [0.0048s] [ 25%] 2025-12-04T13:35:25.9360905Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_float16 PASSED [1.2827s] [ 25%] 2025-12-04T13:35:25.9361026Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_int64 PASSED [0.0041s] [ 25%] 2025-12-04T13:35:25.9361144Z test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_uint8 PASSED [1.2184s] [ 25%] 2025-12-04T13:35:25.9361245Z test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_all_cuda PASSED [0.0061s] [ 25%] 2025-12-04T13:35:25.9361354Z test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_logsumexp_cuda PASSED [1.2257s] [ 25%] 2025-12-04T13:35:25.9361477Z test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_min_reduction_with_dim_cuda PASSED [0.0037s] [ 25%] 2025-12-04T13:35:25.9361580Z test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_nansum_cuda PASSED [1.2317s] [ 25%] 2025-12-04T13:35:25.9361680Z test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_prod_cuda PASSED [0.0079s] [ 25%] 2025-12-04T13:35:25.9361778Z test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_var_cuda PASSED [1.2384s] [ 25%] 2025-12-04T13:35:25.9361891Z test_ops.py::TestCommonCUDA::test_reduction_ops_reduce_var_unbiased_cuda PASSED [0.0035s] [ 25%] 2025-12-04T13:35:25.9362008Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_H_cuda_complex64 PASSED [1.2245s] [ 25%] 2025-12-04T13:35:25.9362125Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_T_cuda_complex64 PASSED [1.2498s] [ 25%] 2025-12-04T13:35:25.9362239Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_T_cuda_float32 PASSED [0.0042s] [ 25%] 2025-12-04T13:35:25.9362369Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager___getitem___cuda_complex64 PASSED [1.2799s] [ 25%] 2025-12-04T13:35:25.9362495Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager___getitem___cuda_float32 PASSED [0.0264s] [ 25%] 2025-12-04T13:35:25.9362618Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager___radd___cuda_complex64 PASSED [0.0172s] [ 25%] 2025-12-04T13:35:25.9362755Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager___radd___cuda_float32 PASSED [1.2750s] [ 25%] 2025-12-04T13:35:25.9362892Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmatmul___cuda_complex64 PASSED [1.4699s] [ 25%] 2025-12-04T13:35:25.9363010Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmul___cuda_float32 PASSED [1.2307s] [ 25%] 2025-12-04T13:35:25.9363151Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager__batch_norm_with_update_cuda_float32 PASSED [0.0274s] [ 25%] 2025-12-04T13:35:25.9363279Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager__chunk_cat_cuda_complex64 PASSED [1.2487s] [ 25%] 2025-12-04T13:35:25.9363402Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager__chunk_cat_cuda_float32 PASSED [0.0052s] [ 25%] 2025-12-04T13:35:25.9363543Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager__segment_reduce_lengths_cuda_float32 PASSED [0.0635s] [ 25%] 2025-12-04T13:35:25.9363684Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager__segment_reduce_offsets_cuda_float32 PASSED [0.0297s] [ 25%] 2025-12-04T13:35:25.9363832Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager__unsafe_masked_index_cuda_float32 PASSED [1.2392s] [ 25%] 2025-12-04T13:35:25.9363990Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager__unsafe_masked_index_put_accumulate_cuda_float32 PASSED [0.0089s] [ 25%] 2025-12-04T13:35:25.9364105Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_abs_cuda_float32 PASSED [1.2454s] [ 26%] 2025-12-04T13:35:25.9364224Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_acos_cuda_complex64 PASSED [1.2891s] [ 26%] 2025-12-04T13:35:25.9364360Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addr_cuda_complex64 SKIPPED [0.0003s] (Skipped!) [ 26%] 2025-12-04T13:35:25.9364474Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_any_cuda_float32 PASSED [1.2227s] [ 26%] 2025-12-04T13:35:25.9364595Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argmin_cuda_float32 PASSED [0.0122s] [ 26%] 2025-12-04T13:35:25.9364740Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_partial_views_cuda_complex64 XFAIL [0.0073s] [ 26%] 2025-12-04T13:35:25.9364859Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asin_cuda_complex64 PASSED [1.2608s] [ 26%] 2025-12-04T13:35:25.9364978Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asinh_cuda_complex64 PASSED [1.2402s] [ 26%] 2025-12-04T13:35:25.9365105Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_3d_cuda_complex64 PASSED [1.2461s] [ 26%] 2025-12-04T13:35:25.9365227Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_baddbmm_cuda_complex64 PASSED [1.2755s] [ 26%] 2025-12-04T13:35:25.9365346Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bmm_cuda_complex64 PASSED [1.2357s] [ 26%] 2025-12-04T13:35:25.9365480Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_tensors_cuda_float32 PASSED [0.0035s] [ 26%] 2025-12-04T13:35:25.9365608Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_to_cuda_float32 PASSED [1.2181s] [ 26%] 2025-12-04T13:35:25.9365739Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cartesian_prod_cuda_float32 PASSED [0.0052s] [ 26%] 2025-12-04T13:35:25.9365852Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cat_cuda_float32 PASSED [1.2288s] [ 26%] 2025-12-04T13:35:25.9365972Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cdouble_cuda_float32 PASSED [0.0116s] [ 26%] 2025-12-04T13:35:25.9366087Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cfloat_cuda_float32 PASSED [1.2364s] [ 26%] 2025-12-04T13:35:25.9366208Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_cuda_float32 PASSED [1.5415s] [ 26%] 2025-12-04T13:35:25.9366338Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_inverse_cuda_float32 PASSED [1.2601s] [ 26%] 2025-12-04T13:35:25.9366477Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_clamp_min_cuda_float32 PASSED [0.0131s] [ 26%] 2025-12-04T13:35:25.9366613Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_column_stack_cuda_complex64 PASSED [1.2741s] [ 26%] 2025-12-04T13:35:25.9366737Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_corrcoef_cuda_complex64 PASSED [1.3674s] [ 26%] 2025-12-04T13:35:25.9366866Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_count_nonzero_cuda_complex64 PASSED [1.2573s] [ 26%] 2025-12-04T13:35:25.9366993Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_count_nonzero_cuda_float32 PASSED [0.0073s] [ 26%] 2025-12-04T13:35:25.9367122Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_copy_cuda_float32 PASSED [1.2648s] [ 26%] 2025-12-04T13:35:25.9367240Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diff_cuda_complex64 PASSED [1.4128s] [ 26%] 2025-12-04T13:35:25.9367380Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_no_rounding_mode_cuda_complex64 PASSED [0.0482s] [ 26%] 2025-12-04T13:35:25.9367516Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_trunc_rounding_cuda_float32 PASSED [1.2831s] [ 26%] 2025-12-04T13:35:25.9367643Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_erfc_cuda_float32 PASSED [0.0082s] [ 26%] 2025-12-04T13:35:25.9367761Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exp_cuda_complex64 PASSED [1.2685s] [ 26%] 2025-12-04T13:35:25.9367876Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_exp_cuda_float32 PASSED [0.0069s] [ 27%] 2025-12-04T13:35:25.9368002Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_copy_cuda_complex64 PASSED [0.0086s] [ 27%] 2025-12-04T13:35:25.9368161Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_copy_cuda_float32 PASSED [1.2328s] [ 27%] 2025-12-04T13:35:25.9368282Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_cuda_complex64 PASSED [1.2409s] [ 27%] 2025-12-04T13:35:25.9368414Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eye_cuda_float32 SKIPPED [0.0003s] (Skipped!) [ 27%] 2025-12-04T13:35:25.9368537Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifft2_cuda_float32 PASSED [1.2934s] [ 27%] 2025-12-04T13:35:25.9368660Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifft_cuda_complex64 PASSED [1.8644s] [ 27%] 2025-12-04T13:35:25.9368783Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftn_cuda_complex64 PASSED [1.2349s] [ 27%] 2025-12-04T13:35:25.9368914Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftshift_cuda_complex64 PASSED [1.2579s] [ 27%] 2025-12-04T13:35:25.9369034Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_rfft2_cuda_float32 PASSED [0.4370s] [ 27%] 2025-12-04T13:35:25.9369153Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fill_cuda_complex64 PASSED [1.2793s] [ 27%] 2025-12-04T13:35:25.9369274Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flipud_cuda_complex64 PASSED [1.2679s] [ 27%] 2025-12-04T13:35:25.9369402Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_float_power_cuda_complex64 PASSED [1.2787s] [ 27%] 2025-12-04T13:35:25.9369528Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_floor_divide_cuda_float32 PASSED [1.2622s] [ 27%] 2025-12-04T13:35:25.9369644Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fmax_cuda_float32 PASSED [0.0108s] [ 27%] 2025-12-04T13:35:25.9369764Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_geometric_cuda_float32 XFAIL [0.0101s] [ 27%] 2025-12-04T13:35:25.9369887Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gradient_cuda_complex64 PASSED [1.2460s] [ 27%] 2025-12-04T13:35:25.9370005Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hsplit_cuda_float32 PASSED [0.0049s] [ 27%] 2025-12-04T13:35:25.9370125Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hstack_cuda_complex64 PASSED [1.2421s] [ 27%] 2025-12-04T13:35:25.9370286Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hstack_cuda_float32 PASSED [0.0044s] [ 27%] 2025-12-04T13:35:25.9370425Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_copy_cuda_complex64 PASSED [0.0114s] [ 27%] 2025-12-04T13:35:25.9370546Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_put_cuda_float32 PASSED [1.2594s] [ 27%] 2025-12-04T13:35:25.9370679Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_reduce_amax_cuda_float32 PASSED [0.0160s] [ 27%] 2025-12-04T13:35:25.9370811Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_reduce_mean_cuda_float32 PASSED [1.2695s] [ 27%] 2025-12-04T13:35:25.9370939Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_select_cuda_complex64 PASSED [1.2457s] [ 27%] 2025-12-04T13:35:25.9371057Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_int_cuda_complex64 PASSED [1.2830s] [ 27%] 2025-12-04T13:35:25.9371177Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isclose_cuda_float32 PASSED [0.0099s] [ 27%] 2025-12-04T13:35:25.9371314Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_binary_cuda_complex64 PASSED [0.0094s] [ 27%] 2025-12-04T13:35:25.9371444Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ldexp_cuda_float32 XFAIL [0.0052s] [ 27%] 2025-12-04T13:35:25.9371560Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lerp_cuda_float32 PASSED [1.2490s] [ 28%] 2025-12-04T13:35:25.9371677Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lgamma_cuda_float32 PASSED [0.0080s] [ 28%] 2025-12-04T13:35:25.9371813Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cholesky_ex_cuda_float32 PASSED [1.2172s] [ 28%] 2025-12-04T13:35:25.9371939Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cond_cuda_complex64 PASSED [1.2682s] [ 28%] 2025-12-04T13:35:25.9372065Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cross_cuda_float32 PASSED [1.2466s] [ 28%] 2025-12-04T13:35:25.9372191Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_det_cuda_complex64 PASSED [1.3505s] [ 28%] 2025-12-04T13:35:25.9372316Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_det_cuda_float32 PASSED [1.3140s] [ 28%] 2025-12-04T13:35:25.9372447Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvals_cuda_complex64 PASSED [1.4238s] [ 28%] 2025-12-04T13:35:25.9372686Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_householder_product_cuda_complex64 SKIPPED [0.0012s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 28%] 2025-12-04T13:35:25.9372825Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_factor_cuda_complex64 PASSED [1.3207s] [ 28%] 2025-12-04T13:35:25.9372962Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_factor_ex_cuda_float32 PASSED [1.2242s] [ 28%] 2025-12-04T13:35:25.9373089Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_cuda_float32 PASSED [0.2161s] [ 28%] 2025-12-04T13:35:25.9373225Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_factor_cuda_complex64 PASSED [1.3142s] [ 28%] 2025-12-04T13:35:25.9373364Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_power_cuda_complex64 PASSED [1.4056s] [ 28%] 2025-12-04T13:35:25.9373512Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_rank_hermitian_cuda_float32 PASSED [1.3359s] [ 28%] 2025-12-04T13:35:25.9373668Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_subgradients_at_zero_cuda_complex64 PASSED [1.2919s] [ 28%] 2025-12-04T13:35:25.9373795Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_cuda_complex64 PASSED [1.3096s] [ 28%] 2025-12-04T13:35:25.9373920Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_cuda_float32 PASSED [0.0212s] [ 28%] 2025-12-04T13:35:25.9374160Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_singular_cuda_complex64 SKIPPED [0.0008s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 28%] 2025-12-04T13:35:25.9374299Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_slogdet_cuda_float32 PASSED [1.2700s] [ 28%] 2025-12-04T13:35:25.9374426Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_cuda_complex64 PASSED [1.3156s] [ 28%] 2025-12-04T13:35:25.9374556Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vecdot_cuda_complex64 PASSED [1.2856s] [ 28%] 2025-12-04T13:35:25.9374676Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linspace_cuda_float32 XFAIL [0.0092s] [ 28%] 2025-12-04T13:35:25.9374816Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linspace_tensor_overload_cuda_float32 XFAIL [0.0265s] [ 28%] 2025-12-04T13:35:25.9374935Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_cuda_complex64 PASSED [1.2554s] [ 28%] 2025-12-04T13:35:25.9375051Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_cuda_float32 PASSED [1.2506s] [ 28%] 2025-12-04T13:35:25.9375179Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logaddexp_cuda_complex64 PASSED [1.2792s] [ 28%] 2025-12-04T13:35:25.9375306Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logit_cuda_float32 PASSED [0.0361s] [ 28%] 2025-12-04T13:35:25.9375447Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logspace_tensor_overload_cuda_float32 XFAIL [0.1572s] [ 28%] 2025-12-04T13:35:25.9375570Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_amax_cuda_float32 PASSED [1.3115s] [ 28%] 2025-12-04T13:35:25.9375697Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_argmax_cuda_float32 PASSED [0.0197s] [ 29%] 2025-12-04T13:35:25.9375828Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_cumprod_cuda_complex64 PASSED [1.2943s] [ 29%] 2025-12-04T13:35:25.9375958Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_cumprod_cuda_float32 PASSED [0.0104s] [ 29%] 2025-12-04T13:35:25.9376090Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_logaddexp_cuda_float32 PASSED [0.0073s] [ 29%] 2025-12-04T13:35:25.9376215Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_norm_cuda_float32 PASSED [0.1184s] [ 29%] 2025-12-04T13:35:25.9376348Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_normalize_cuda_complex64 PASSED [1.3107s] [ 29%] 2025-12-04T13:35:25.9376477Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_prod_cuda_complex64 PASSED [0.9591s] [ 29%] 2025-12-04T13:35:25.9376608Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_scatter_cuda_complex64 PASSED [0.1209s] [ 29%] 2025-12-04T13:35:25.9376736Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_softmax_cuda_float32 PASSED [0.0169s] [ 29%] 2025-12-04T13:35:25.9376863Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_softmin_cuda_float32 PASSED [1.3021s] [ 29%] 2025-12-04T13:35:25.9376990Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_var_cuda_complex64 PASSED [0.1011s] [ 29%] 2025-12-04T13:35:25.9377139Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_list_of_tensors_cuda_complex64 PASSED [0.0072s] [ 29%] 2025-12-04T13:35:25.9377280Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_list_of_tensors_cuda_float32 PASSED [1.2467s] [ 29%] 2025-12-04T13:35:25.9377428Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_variadic_tensors_cuda_complex64 PASSED [1.2625s] [ 29%] 2025-12-04T13:35:25.9377564Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_min_reduction_no_dim_cuda_float32 PASSED [0.0058s] [ 29%] 2025-12-04T13:35:25.9377706Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_native_dropout_backward_cuda_float32 PASSED [1.2903s] [ 29%] 2025-12-04T13:35:25.9377820Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ne_cuda_float32 PASSED [0.0072s] [ 29%] 2025-12-04T13:35:25.9377952Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_neg_cuda_float32 PASSED [1.2516s] [ 29%] 2025-12-04T13:35:25.9378145Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_cuda_complex64 SKIPPED [0.0003s] (Skipped!) [ 29%] 2025-12-04T13:35:25.9378284Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 29%] 2025-12-04T13:35:25.9378440Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_max_pool2d_cuda_float32 PASSED [1.2706s] [ 29%] 2025-12-04T13:35:25.9378600Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_batch_norm_without_cudnn_cuda_float32 PASSED [1.2368s] [ 29%] 2025-12-04T13:35:25.9378739Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_bilinear_cuda_float32 PASSED [0.0297s] [ 29%] 2025-12-04T13:35:25.9378894Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_channel_shuffle_cuda_complex64 PASSED [1.2634s] [ 29%] 2025-12-04T13:35:25.9379223Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv3d_cuda_complex64 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 52800, provided ptr: 0x7644e403c000 size: 11008 2025-12-04T13:35:25.9379429Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 52800, provided ptr: 0x7644e403c000 size: 11008 2025-12-04T13:35:25.9379628Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 52800, provided ptr: 0x7644e4042c00 size: 11008 2025-12-04T13:35:25.9379810Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 52800, provided ptr: 0x7644e4042c00 size: 11008 2025-12-04T13:35:25.9380016Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 52800, provided ptr: 0x7644e4042c00 size: 11008 2025-12-04T13:35:25.9380207Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 52800, provided ptr: 0x7644e4042c00 size: 11008 2025-12-04T13:35:25.9380404Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 337920, provided ptr: 0x7644e403fe00 size: 12544 2025-12-04T13:35:25.9380586Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 337920, provided ptr: 0x7644e403fe00 size: 12544 2025-12-04T13:35:25.9380781Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 337920, provided ptr: 0x7644e4057000 size: 12544 2025-12-04T13:35:25.9380966Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 337920, provided ptr: 0x7644e4057000 size: 12544 2025-12-04T13:35:25.9381169Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback WTI] Solver , workspace required: 337920, provided ptr: 0x7644e4057000 size: 12544 2025-12-04T13:35:25.9381363Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 337920, provided ptr: 0x7644e4057000 size: 12544 2025-12-04T13:35:25.9381404Z PASSED [1.6463s] [ 29%] 2025-12-04T13:35:25.9381549Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_dropout2d_cuda_float32 PASSED [1.2987s] [ 29%] 2025-12-04T13:35:25.9381683Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_elu_cuda_float32 PASSED [1.2581s] [ 29%] 2025-12-04T13:35:25.9381859Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_feature_alpha_dropout_with_train_cuda_float32 PASSED [0.0131s] [ 29%] 2025-12-04T13:35:25.9382036Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_feature_alpha_dropout_without_train_cuda_complex64 PASSED [0.0274s] [ 29%] 2025-12-04T13:35:25.9382224Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_fractional_max_pool2d_cuda_float32 PASSED [1.2928s] [ 29%] 2025-12-04T13:35:25.9382393Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_bilinear_cuda_float32 PASSED [0.0161s] [ 30%] 2025-12-04T13:35:25.9382560Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_nearest-exact_cuda_float32 PASSED [1.2631s] [ 30%] 2025-12-04T13:35:25.9382697Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_l1_loss_cuda_float32 PASSED [1.2642s] [ 30%] 2025-12-04T13:35:25.9382840Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_leaky_relu_cuda_float32 PASSED [0.0096s] [ 30%] 2025-12-04T13:35:25.9382976Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_linear_cuda_float32 PASSED [1.2800s] [ 30%] 2025-12-04T13:35:25.9383135Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_local_response_norm_cuda_float32 PASSED [0.0163s] [ 30%] 2025-12-04T13:35:25.9383278Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_pool2d_cuda_float32 PASSED [0.2625s] [ 30%] 2025-12-04T13:35:25.9383434Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool1d_cuda_float32 PASSED [0.0548s] [ 30%] 2025-12-04T13:35:25.9383581Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_circular_cuda_float32 PASSED [1.2525s] [ 30%] 2025-12-04T13:35:25.9383728Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_constant_cuda_complex64 PASSED [0.0369s] [ 30%] 2025-12-04T13:35:25.9383880Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pairwise_distance_cuda_float32 PASSED [1.2725s] [ 30%] 2025-12-04T13:35:25.9384029Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_shuffle_cuda_complex64 PASSED [1.2378s] [ 30%] 2025-12-04T13:35:25.9384183Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_unshuffle_cuda_complex64 PASSED [1.2331s] [ 30%] 2025-12-04T13:35:25.9384320Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_relu6_cuda_float32 PASSED [0.0199s] [ 30%] 2025-12-04T13:35:25.9384484Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_scaled_dot_product_attention_cuda_float32 PASSED [1.3205s] [ 30%] 2025-12-04T13:35:25.9384630Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_smooth_l1_loss_cuda_float32 PASSED [1.2554s] [ 30%] 2025-12-04T13:35:25.9384781Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_soft_margin_loss_cuda_float32 PASSED [0.0072s] [ 30%] 2025-12-04T13:35:25.9384937Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softmin_with_dtype_cuda_complex64 PASSED [0.0079s] [ 30%] 2025-12-04T13:35:25.9385086Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_tanhshrink_cuda_complex64 PASSED [1.2519s] [ 30%] 2025-12-04T13:35:25.9385244Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_triplet_margin_loss_cuda_complex64 PASSED [1.2406s] [ 30%] 2025-12-04T13:35:25.9385378Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_in_place_cuda_complex64 XFAIL [0.0065s] [ 30%] 2025-12-04T13:35:25.9385509Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_in_place_cuda_float32 XFAIL [1.2560s] [ 30%] 2025-12-04T13:35:25.9385658Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_number_mean_cuda_float32 SKIPPED [0.0003s] (Skipped!) [ 30%] 2025-12-04T13:35:25.9385781Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ormqr_cuda_complex64 PASSED [1.8624s] [ 30%] 2025-12-04T13:35:25.9385909Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pca_lowrank_cuda_complex64 PASSED [0.0827s] [ 30%] 2025-12-04T13:35:25.9386030Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_permute_cuda_float32 PASSED [1.2502s] [ 30%] 2025-12-04T13:35:25.9386171Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polar_cuda_float32 PASSED [0.0077s] [ 30%] 2025-12-04T13:35:25.9386341Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_1_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 30%] 2025-12-04T13:35:25.9386497Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_2_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 30%] 2025-12-04T13:35:25.9386622Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_positive_cuda_complex64 PASSED [1.2584s] [ 30%] 2025-12-04T13:35:25.9386742Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_prod_cuda_complex64 PASSED [1.3207s] [ 31%] 2025-12-04T13:35:25.9386857Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_put_cuda_float32 PASSED [0.0312s] [ 31%] 2025-12-04T13:35:25.9386979Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rand_like_cuda_float32 PASSED [1.2388s] [ 31%] 2025-12-04T13:35:25.9387106Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randint_like_cuda_float32 PASSED [0.0082s] [ 31%] 2025-12-04T13:35:25.9387238Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randn_cuda_complex64 XFAIL [0.0041s] [ 31%] 2025-12-04T13:35:25.9387356Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_real_cuda_complex64 PASSED [1.2531s] [ 31%] 2025-12-04T13:35:25.9387494Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_repeat_interleave_cuda_complex64 PASSED [1.2504s] [ 31%] 2025-12-04T13:35:25.9387614Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reshape_cuda_float32 PASSED [1.2739s] [ 31%] 2025-12-04T13:35:25.9387737Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resize_as__cuda_float32 PASSED [0.0047s] [ 31%] 2025-12-04T13:35:25.9387864Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resolve_conj_cuda_complex64 PASSED [1.2412s] [ 31%] 2025-12-04T13:35:25.9387982Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_roll_cuda_float32 PASSED [0.0127s] [ 31%] 2025-12-04T13:35:25.9388140Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rot90_cuda_complex64 PASSED [1.3133s] [ 31%] 2025-12-04T13:35:25.9388294Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_round_decimals_3_cuda_float32 SKIPPED [0.0003s] (Skipped!) [ 31%] 2025-12-04T13:35:25.9388438Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scalar_tensor_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 31%] 2025-12-04T13:35:25.9388566Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_add_cuda_complex64 PASSED [1.2836s] [ 31%] 2025-12-04T13:35:25.9388686Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_cuda_float32 PASSED [0.0158s] [ 31%] 2025-12-04T13:35:25.9388825Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_amin_cuda_float32 PASSED [1.3189s] [ 31%] 2025-12-04T13:35:25.9388962Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_mean_cuda_float32 PASSED [0.0372s] [ 31%] 2025-12-04T13:35:25.9389100Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_sum_cuda_float32 PASSED [1.2733s] [ 31%] 2025-12-04T13:35:25.9389222Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_select_cuda_complex64 PASSED [1.2737s] [ 31%] 2025-12-04T13:35:25.9389339Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sgn_cuda_float32 PASSED [0.0065s] [ 31%] 2025-12-04T13:35:25.9389456Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sign_cuda_float32 PASSED [1.2921s] [ 31%] 2025-12-04T13:35:25.9389623Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_general_cosine_cuda_float32 SKIPPED [0.0003s] (Skipped!) [ 31%] 2025-12-04T13:35:25.9389790Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_general_hamming_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 31%] 2025-12-04T13:35:25.9389903Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sin_cuda_float32 PASSED [1.2636s] [ 31%] 2025-12-04T13:35:25.9390050Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sinh_cuda_complex64 PASSED [1.2658s] [ 31%] 2025-12-04T13:35:25.9390178Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sinh_cuda_float32 PASSED [1.2522s] [ 31%] 2025-12-04T13:35:25.9390295Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_slice_cuda_float32 PASSED [0.0050s] [ 31%] 2025-12-04T13:35:25.9390409Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sort_cuda_float32 PASSED [1.4390s] [ 31%] 2025-12-04T13:35:25.9390556Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sparse_mm_reduce_cuda_float32 SKIPPED [0.0003s] (Skipped!) [ 32%] 2025-12-04T13:35:25.9390713Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sparse_sampled_addmm_cuda_complex64 SKIPPED [0.0002s] (Skipped!) [ 32%] 2025-12-04T13:35:25.9390851Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_j0_cuda_float32 PASSED [1.2523s] [ 32%] 2025-12-04T13:35:25.9390986Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_y0_cuda_float32 PASSED [0.0050s] [ 32%] 2025-12-04T13:35:25.9391150Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_hermite_polynomial_h_cuda_float32 PASSED [1.2327s] [ 32%] 2025-12-04T13:35:25.9391294Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_modified_bessel_i0_cuda_float32 PASSED [1.2871s] [ 32%] 2025-12-04T13:35:25.9391426Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_ndtri_cuda_float32 PASSED [1.2357s] [ 32%] 2025-12-04T13:35:25.9391579Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_scaled_modified_bessel_k1_cuda_float32 PASSED [0.0046s] [ 32%] 2025-12-04T13:35:25.9391743Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_shifted_chebyshev_polynomial_u_cuda_float32 PASSED [1.2561s] [ 32%] 2025-12-04T13:35:25.9391905Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_shifted_chebyshev_polynomial_v_cuda_float32 PASSED [1.2751s] [ 32%] 2025-12-04T13:35:25.9392067Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_shifted_chebyshev_polynomial_w_cuda_float32 PASSED [1.2719s] [ 32%] 2025-12-04T13:35:25.9392202Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_with_sizes_cuda_float32 PASSED [1.2411s] [ 32%] 2025-12-04T13:35:25.9392326Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_square_cuda_complex64 PASSED [1.2595s] [ 32%] 2025-12-04T13:35:25.9392460Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_multiple_cuda_float32 PASSED [0.0101s] [ 32%] 2025-12-04T13:35:25.9392574Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_cuda_float32 PASSED [1.2634s] [ 32%] 2025-12-04T13:35:25.9392701Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_unbiased_cuda_float32 PASSED [0.0062s] [ 32%] 2025-12-04T13:35:25.9392819Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_stft_cuda_complex64 PASSED [0.6932s] [ 32%] 2025-12-04T13:35:25.9392939Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_t_cuda_float32 PASSED [1.2793s] [ 32%] 2025-12-04T13:35:25.9393069Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_take_along_dim_cuda_float32 PASSED [0.0086s] [ 32%] 2025-12-04T13:35:25.9393186Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_take_cuda_float32 PASSED [1.2966s] [ 32%] 2025-12-04T13:35:25.9393304Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tanh_cuda_complex64 PASSED [1.4806s] [ 32%] 2025-12-04T13:35:25.9393420Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tile_cuda_float32 PASSED [1.2602s] [ 32%] 2025-12-04T13:35:25.9393541Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_sparse_cuda_float32 PASSED [0.0716s] [ 32%] 2025-12-04T13:35:25.9393658Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapz_cuda_float32 PASSED [1.2702s] [ 32%] 2025-12-04T13:35:25.9393795Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triu_cuda_complex64 PASSED [1.2525s] [ 32%] 2025-12-04T13:35:25.9393928Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unbind_cuda_complex64 PASSED [1.2436s] [ 32%] 2025-12-04T13:35:25.9394057Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsafe_chunk_cuda_float32 PASSED [0.0056s] [ 32%] 2025-12-04T13:35:25.9394186Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsqueeze_cuda_complex64 PASSED [0.0244s] [ 32%] 2025-12-04T13:35:25.9394307Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_vdot_cuda_complex64 PASSED [1.2656s] [ 32%] 2025-12-04T13:35:25.9394433Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_copy_cuda_complex64 PASSED [1.2519s] [ 32%] 2025-12-04T13:35:25.9394557Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_copy_cuda_float32 PASSED [0.0055s] [ 33%] 2025-12-04T13:35:25.9394674Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_xlogy_cuda_float32 PASSED [1.2623s] [ 33%] 2025-12-04T13:35:25.9394793Z test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zero__cuda_float32 PASSED [0.0065s] [ 33%] 2025-12-04T13:35:25.9394924Z test_ops.py::TestCompositeComplianceCUDA::test_backward___rdiv___cuda_float32 PASSED [0.1032s] [ 33%] 2025-12-04T13:35:25.9395069Z test_ops.py::TestCompositeComplianceCUDA::test_backward__batch_norm_with_update_cuda_float32 PASSED [0.6694s] [ 33%] 2025-12-04T13:35:25.9395213Z test_ops.py::TestCompositeComplianceCUDA::test_backward__softmax_backward_data_cuda_float32 PASSED [1.2645s] [ 33%] 2025-12-04T13:35:25.9395331Z test_ops.py::TestCompositeComplianceCUDA::test_backward_add_cuda_float32 PASSED [0.0592s] [ 33%] 2025-12-04T13:35:25.9395449Z test_ops.py::TestCompositeComplianceCUDA::test_backward_addbmm_cuda_float32 PASSED [0.1774s] [ 33%] 2025-12-04T13:35:25.9395567Z test_ops.py::TestCompositeComplianceCUDA::test_backward_addr_cuda_float32 PASSED [0.0774s] [ 33%] 2025-12-04T13:35:25.9395690Z test_ops.py::TestCompositeComplianceCUDA::test_backward_alias_copy_cuda_float32 PASSED [1.2364s] [ 33%] 2025-12-04T13:35:25.9395837Z test_ops.py::TestCompositeComplianceCUDA::test_backward_as_strided_partial_views_cuda_float32 PASSED [0.0155s] [ 33%] 2025-12-04T13:35:25.9395972Z test_ops.py::TestCompositeComplianceCUDA::test_backward_as_strided_scatter_cuda_float32 PASSED [1.2906s] [ 33%] 2025-12-04T13:35:25.9396091Z test_ops.py::TestCompositeComplianceCUDA::test_backward_asinh_cuda_float32 PASSED [0.0087s] [ 33%] 2025-12-04T13:35:25.9396208Z test_ops.py::TestCompositeComplianceCUDA::test_backward_atan_cuda_float32 PASSED [1.2602s] [ 33%] 2025-12-04T13:35:25.9396327Z test_ops.py::TestCompositeComplianceCUDA::test_backward_baddbmm_cuda_float32 PASSED [0.1371s] [ 33%] 2025-12-04T13:35:25.9396448Z test_ops.py::TestCompositeComplianceCUDA::test_backward_bernoulli_cuda_float32 PASSED [1.2551s] [ 33%] 2025-12-04T13:35:25.9396565Z test_ops.py::TestCompositeComplianceCUDA::test_backward_ceil_cuda_float32 PASSED [0.0069s] [ 33%] 2025-12-04T13:35:25.9396684Z test_ops.py::TestCompositeComplianceCUDA::test_backward_cfloat_cuda_float32 PASSED [0.0155s] [ 33%] 2025-12-04T13:35:25.9396802Z test_ops.py::TestCompositeComplianceCUDA::test_backward_complex_cuda_float32 PASSED [0.0679s] [ 33%] 2025-12-04T13:35:25.9396919Z test_ops.py::TestCompositeComplianceCUDA::test_backward_conj_cuda_float32 PASSED [0.0076s] [ 33%] 2025-12-04T13:35:25.9397050Z test_ops.py::TestCompositeComplianceCUDA::test_backward_constant_pad_nd_cuda_float32 PASSED [0.1047s] [ 33%] 2025-12-04T13:35:25.9397172Z test_ops.py::TestCompositeComplianceCUDA::test_backward_copysign_cuda_float32 PASSED [0.0909s] [ 33%] 2025-12-04T13:35:25.9397293Z test_ops.py::TestCompositeComplianceCUDA::test_backward_corrcoef_cuda_float32 PASSED [0.0706s] [ 33%] 2025-12-04T13:35:25.9397411Z test_ops.py::TestCompositeComplianceCUDA::test_backward_cos_cuda_float32 PASSED [0.0143s] [ 33%] 2025-12-04T13:35:25.9397530Z test_ops.py::TestCompositeComplianceCUDA::test_backward_cross_cuda_float32 PASSED [0.0304s] [ 33%] 2025-12-04T13:35:25.9397672Z test_ops.py::TestCompositeComplianceCUDA::test_backward_diagflat_cuda_float32 PASSED [0.0170s] [ 33%] 2025-12-04T13:35:25.9397818Z test_ops.py::TestCompositeComplianceCUDA::test_backward_div_floor_rounding_cuda_float32 PASSED [0.0596s] [ 33%] 2025-12-04T13:35:25.9397937Z test_ops.py::TestCompositeComplianceCUDA::test_backward_dsplit_cuda_float32 PASSED [0.0460s] [ 33%] 2025-12-04T13:35:25.9398052Z test_ops.py::TestCompositeComplianceCUDA::test_backward_erfc_cuda_float32 PASSED [1.2445s] [ 33%] 2025-12-04T13:35:25.9398198Z test_ops.py::TestCompositeComplianceCUDA::test_backward_exp2_cuda_float32 PASSED [0.0139s] [ 33%] 2025-12-04T13:35:25.9398319Z test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_fft2_cuda_float32 PASSED [0.0253s] [ 34%] 2025-12-04T13:35:25.9398445Z test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_hfft2_cuda_float32 PASSED [0.3632s] [ 34%] 2025-12-04T13:35:25.9398564Z test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_hfft_cuda_float32 PASSED [0.3794s] [ 34%] 2025-12-04T13:35:25.9398688Z test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ihfftn_cuda_float32 PASSED [0.0787s] [ 34%] 2025-12-04T13:35:25.9398834Z test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_irfftn_cuda_float32 PASSED [0.2098s] [ 34%] 2025-12-04T13:35:25.9398957Z test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_rfft2_cuda_float32 PASSED [1.2721s] [ 34%] 2025-12-04T13:35:25.9399078Z test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_rfftn_cuda_float32 PASSED [0.0412s] [ 34%] 2025-12-04T13:35:25.9399196Z test_ops.py::TestCompositeComplianceCUDA::test_backward_flipud_cuda_float32 PASSED [0.0072s] [ 34%] 2025-12-04T13:35:25.9399313Z test_ops.py::TestCompositeComplianceCUDA::test_backward_floor_cuda_float32 PASSED [0.0053s] [ 34%] 2025-12-04T13:35:25.9399428Z test_ops.py::TestCompositeComplianceCUDA::test_backward_fmax_cuda_float32 PASSED [0.1284s] [ 34%] 2025-12-04T13:35:25.9399544Z test_ops.py::TestCompositeComplianceCUDA::test_backward_fmod_cuda_float32 PASSED [0.0981s] [ 34%] 2025-12-04T13:35:25.9399662Z test_ops.py::TestCompositeComplianceCUDA::test_backward_gather_cuda_float32 PASSED [0.0520s] [ 34%] 2025-12-04T13:35:25.9399795Z test_ops.py::TestCompositeComplianceCUDA::test_backward_grid_sampler_2d_cuda_float32 PASSED [0.1167s] [ 34%] 2025-12-04T13:35:25.9399912Z test_ops.py::TestCompositeComplianceCUDA::test_backward_hsplit_cuda_float32 PASSED [0.0439s] [ 34%] 2025-12-04T13:35:25.9400036Z test_ops.py::TestCompositeComplianceCUDA::test_backward_index_add_cuda_float32 PASSED [0.1299s] [ 34%] 2025-12-04T13:35:25.9400161Z test_ops.py::TestCompositeComplianceCUDA::test_backward_index_fill_cuda_float32 PASSED [0.0687s] [ 34%] 2025-12-04T13:35:25.9400296Z test_ops.py::TestCompositeComplianceCUDA::test_backward_index_reduce_amax_cuda_float32 PASSED [0.2461s] [ 34%] 2025-12-04T13:35:25.9400431Z test_ops.py::TestCompositeComplianceCUDA::test_backward_index_reduce_amin_cuda_float32 PASSED [0.2343s] [ 34%] 2025-12-04T13:35:25.9400568Z test_ops.py::TestCompositeComplianceCUDA::test_backward_index_reduce_mean_cuda_float32 PASSED [0.2187s] [ 34%] 2025-12-04T13:35:25.9400690Z test_ops.py::TestCompositeComplianceCUDA::test_backward_kthvalue_cuda_float32 PASSED [0.0453s] [ 34%] 2025-12-04T13:35:25.9400810Z test_ops.py::TestCompositeComplianceCUDA::test_backward_ldexp_cuda_float32 PASSED [0.0952s] [ 34%] 2025-12-04T13:35:25.9400948Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cholesky_ex_cuda_float32 PASSED [0.6048s] [ 34%] 2025-12-04T13:35:25.9401077Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_det_cuda_float32 PASSED [0.0512s] [ 34%] 2025-12-04T13:35:25.9401213Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lu_factor_ex_cuda_float32 PASSED [0.3623s] [ 34%] 2025-12-04T13:35:25.9401352Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_matrix_norm_cuda_float32 PASSED [0.4563s] [ 34%] 2025-12-04T13:35:25.9401480Z test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_svdvals_cuda_float32 PASSED [0.2038s] [ 34%] 2025-12-04T13:35:25.9401625Z test_ops.py::TestCompositeComplianceCUDA::test_backward_log1p_cuda_float32 PASSED [0.0085s] [ 34%] 2025-12-04T13:35:25.9401783Z test_ops.py::TestCompositeComplianceCUDA::test_backward_log_softmax_with_dtype_cuda_float32 PASSED [0.0350s] [ 34%] 2025-12-04T13:35:25.9401908Z test_ops.py::TestCompositeComplianceCUDA::test_backward_logaddexp2_cuda_float32 PASSED [0.0189s] [ 34%] 2025-12-04T13:35:25.9402027Z test_ops.py::TestCompositeComplianceCUDA::test_backward_logdet_cuda_float32 PASSED [0.0723s] [ 34%] 2025-12-04T13:35:25.9402142Z test_ops.py::TestCompositeComplianceCUDA::test_backward_lu_cuda_float32 PASSED [0.3812s] [ 35%] 2025-12-04T13:35:25.9402270Z test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_amax_cuda_float32 PASSED [1.3122s] [ 35%] 2025-12-04T13:35:25.9402396Z test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_amin_cuda_float32 PASSED [1.1948s] [ 35%] 2025-12-04T13:35:25.9402522Z test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_fill_cuda_float32 PASSED [0.0949s] [ 35%] 2025-12-04T13:35:25.9402659Z test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_log_softmax_cuda_float32 PASSED [0.1448s] [ 35%] 2025-12-04T13:35:25.9402805Z test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_logaddexp_cuda_float32 PASSED [0.6572s] [ 35%] 2025-12-04T13:35:25.9402929Z test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_mean_cuda_float32 PASSED [0.7763s] [ 35%] 2025-12-04T13:35:25.9403056Z test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_norm_cuda_float32 PASSED [5.2741s] [ 35%] 2025-12-04T13:35:25.9403210Z test_ops.py::TestCompositeComplianceCUDA::test_backward_max_pool2d_with_indices_backward_cuda_float32 PASSED [2.4048s] [ 35%] 2025-12-04T13:35:25.9403329Z test_ops.py::TestCompositeComplianceCUDA::test_backward_mode_cuda_float32 PASSED [0.2099s] [ 35%] 2025-12-04T13:35:25.9403469Z test_ops.py::TestCompositeComplianceCUDA::test_backward_mvlgamma_mvlgamma_p_5_cuda_float32 PASSED [0.0405s] [ 35%] 2025-12-04T13:35:25.9403606Z test_ops.py::TestCompositeComplianceCUDA::test_backward_native_batch_norm_cuda_float32 PASSED [0.5700s] [ 35%] 2025-12-04T13:35:25.9403742Z test_ops.py::TestCompositeComplianceCUDA::test_backward_native_layer_norm_cuda_float32 PASSED [0.1660s] [ 35%] 2025-12-04T13:35:25.9403904Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_avg_pool2d_cuda_float32 PASSED [0.0319s] [ 35%] 2025-12-04T13:35:25.9404061Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_binary_cross_entropy_cuda_float32 PASSED [0.1249s] [ 35%] 2025-12-04T13:35:25.9404216Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_channel_shuffle_cuda_float32 PASSED [0.0094s] [ 35%] 2025-12-04T13:35:25.9404368Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv_transpose3d_cuda_float32 PASSED [0.2194s] [ 35%] 2025-12-04T13:35:25.9404510Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_dropout3d_cuda_float32 PASSED [0.0379s] [ 35%] 2025-12-04T13:35:25.9404660Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_embedding_bag_cuda_float32 PASSED [0.4802s] [ 35%] 2025-12-04T13:35:25.9404804Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_embedding_cuda_float32 PASSED [0.0163s] [ 35%] 2025-12-04T13:35:25.9404985Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_feature_alpha_dropout_without_train_cuda_float32 PASSED [0.0201s] [ 35%] 2025-12-04T13:35:25.9405136Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_gaussian_nll_loss_cuda_float32 XFAIL [0.0064s] [ 35%] 2025-12-04T13:35:25.9405296Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_bicubic_cuda_float32 PASSED [1.3309s] [ 35%] 2025-12-04T13:35:25.9405440Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_pool3d_cuda_float32 PASSED [1.1905s] [ 35%] 2025-12-04T13:35:25.9405617Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pairwise_distance_cuda_float32 PASSED [0.0798s] [ 35%] 2025-12-04T13:35:25.9405762Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_relu_cuda_float32 PASSED [0.0112s] [ 35%] 2025-12-04T13:35:25.9405899Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_rrelu_cuda_float32 PASSED [0.0202s] [ 35%] 2025-12-04T13:35:25.9406085Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_scaled_dot_product_attention_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 35%] 2025-12-04T13:35:25.9406228Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softplus_cuda_float32 PASSED [0.0070s] [ 35%] 2025-12-04T13:35:25.9406370Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_tanhshrink_cuda_float32 PASSED [0.0100s] [ 36%] 2025-12-04T13:35:25.9407496Z test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_unfold_cuda_float32 PASSED [0.2698s] [ 36%] 2025-12-04T13:35:25.9407625Z test_ops.py::TestCompositeComplianceCUDA::test_backward_norm_inf_cuda_float32 PASSED [0.0221s] [ 36%] 2025-12-04T13:35:25.9407747Z test_ops.py::TestCompositeComplianceCUDA::test_backward_ormqr_cuda_float32 PASSED [6.8641s] [ 36%] 2025-12-04T13:35:25.9407891Z test_ops.py::TestCompositeComplianceCUDA::test_backward_pca_lowrank_cuda_float32 PASSED [3.1206s] [ 36%] 2025-12-04T13:35:25.9408012Z test_ops.py::TestCompositeComplianceCUDA::test_backward_permute_cuda_float32 PASSED [0.0199s] [ 36%] 2025-12-04T13:35:25.9408267Z test_ops.py::TestCompositeComplianceCUDA::test_backward_quantile_cuda_float32 PASSED [0.9050s] [ 36%] 2025-12-04T13:35:25.9408385Z test_ops.py::TestCompositeComplianceCUDA::test_backward_rot90_cuda_float32 PASSED [1.3682s] [ 36%] 2025-12-04T13:35:25.9408506Z test_ops.py::TestCompositeComplianceCUDA::test_backward_rsqrt_cuda_float32 PASSED [0.0124s] [ 36%] 2025-12-04T13:35:25.9408621Z test_ops.py::TestCompositeComplianceCUDA::test_backward_rsub_cuda_float32 PASSED [0.0534s] [ 36%] 2025-12-04T13:35:25.9408763Z test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_reduce_sum_cuda_float32 PASSED [0.2447s] [ 36%] 2025-12-04T13:35:25.9408895Z test_ops.py::TestCompositeComplianceCUDA::test_backward_select_scatter_cuda_float32 PASSED [0.0252s] [ 36%] 2025-12-04T13:35:25.9409017Z test_ops.py::TestCompositeComplianceCUDA::test_backward_softmax_cuda_float32 PASSED [0.0153s] [ 36%] 2025-12-04T13:35:25.9409146Z test_ops.py::TestCompositeComplianceCUDA::test_backward_special_erfcx_cuda_float32 PASSED [0.0098s] [ 36%] 2025-12-04T13:35:25.9409274Z test_ops.py::TestCompositeComplianceCUDA::test_backward_special_i1e_cuda_float32 PASSED [0.0165s] [ 36%] 2025-12-04T13:35:25.9409406Z test_ops.py::TestCompositeComplianceCUDA::test_backward_split_list_args_cuda_float32 PASSED [0.0191s] [ 36%] 2025-12-04T13:35:25.9409536Z test_ops.py::TestCompositeComplianceCUDA::test_backward_squeeze_copy_cuda_float32 PASSED [0.0136s] [ 36%] 2025-12-04T13:35:25.9409669Z test_ops.py::TestCompositeComplianceCUDA::test_backward_squeeze_multiple_cuda_float32 PASSED [0.0103s] [ 36%] 2025-12-04T13:35:25.9409789Z test_ops.py::TestCompositeComplianceCUDA::test_backward_std_cuda_float32 PASSED [0.0651s] [ 36%] 2025-12-04T13:35:25.9409907Z test_ops.py::TestCompositeComplianceCUDA::test_backward_sum_cuda_float32 PASSED [1.3186s] [ 36%] 2025-12-04T13:35:25.9410032Z test_ops.py::TestCompositeComplianceCUDA::test_backward_svd_lowrank_cuda_float32 PASSED [5.6025s] [ 36%] 2025-12-04T13:35:25.9410149Z test_ops.py::TestCompositeComplianceCUDA::test_backward_tanh_cuda_float32 PASSED [0.0102s] [ 36%] 2025-12-04T13:35:25.9410273Z test_ops.py::TestCompositeComplianceCUDA::test_backward_true_divide_cuda_float32 PASSED [0.0896s] [ 36%] 2025-12-04T13:35:25.9410398Z test_ops.py::TestCompositeComplianceCUDA::test_backward_unbind_copy_cuda_float32 PASSED [0.2175s] [ 36%] 2025-12-04T13:35:25.9410525Z test_ops.py::TestCompositeComplianceCUDA::test_backward_unsafe_split_cuda_float32 PASSED [0.0145s] [ 36%] 2025-12-04T13:35:25.9410640Z test_ops.py::TestCompositeComplianceCUDA::test_backward_var_cuda_float32 PASSED [0.0517s] [ 36%] 2025-12-04T13:35:25.9410788Z test_ops.py::TestCompositeComplianceCUDA::test_backward_view_as_cuda_float32 PASSED [0.0173s] [ 36%] 2025-12-04T13:35:25.9410925Z test_ops.py::TestCompositeComplianceCUDA::test_backward_vsplit_cuda_float32 PASSED [0.0461s] [ 36%] 2025-12-04T13:35:25.9411043Z test_ops.py::TestCompositeComplianceCUDA::test_backward_vstack_cuda_float32 PASSED [0.0132s] [ 36%] 2025-12-04T13:35:25.9411163Z test_ops.py::TestCompositeComplianceCUDA::test_backward_where_cuda_float32 PASSED [0.0962s] [ 36%] 2025-12-04T13:35:25.9411282Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input___radd___cuda_float32 PASSED [0.0074s] [ 37%] 2025-12-04T13:35:25.9411402Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rpow___cuda_float32 PASSED [0.0081s] [ 37%] 2025-12-04T13:35:25.9411545Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input__batch_norm_with_update_cuda_float32 PASSED [0.7705s] [ 37%] 2025-12-04T13:35:25.9411689Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input__segment_reduce_lengths_cuda_float32 PASSED [0.0594s] [ 37%] 2025-12-04T13:35:25.9411832Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input__softmax_backward_data_cuda_float32 PASSED [0.7526s] [ 37%] 2025-12-04T13:35:25.9411962Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_all_cuda_float32 PASSED [0.0070s] [ 37%] 2025-12-04T13:35:25.9412080Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_aminmax_cuda_float32 PASSED [0.0124s] [ 37%] 2025-12-04T13:35:25.9412200Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_angle_cuda_float32 PASSED [0.7478s] [ 37%] 2025-12-04T13:35:25.9412314Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_any_cuda_float32 PASSED [0.0070s] [ 37%] 2025-12-04T13:35:25.9412434Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argmin_cuda_float32 PASSED [0.0048s] [ 37%] 2025-12-04T13:35:25.9412565Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_as_strided_copy_cuda_float32 PASSED [0.7513s] [ 37%] 2025-12-04T13:35:25.9412690Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atleast_1d_cuda_float32 PASSED [0.0064s] [ 37%] 2025-12-04T13:35:25.9412812Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_baddbmm_cuda_float32 PASSED [0.7555s] [ 37%] 2025-12-04T13:35:25.9412931Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bfloat16_cuda_float32 PASSED [0.0058s] [ 37%] 2025-12-04T13:35:25.9413048Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bool_cuda_float32 PASSED [0.7516s] [ 37%] 2025-12-04T13:35:25.9413171Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bucketize_cuda_float32 PASSED [0.0172s] [ 37%] 2025-12-04T13:35:25.9413303Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cartesian_prod_cuda_float32 PASSED [0.0046s] [ 37%] 2025-12-04T13:35:25.9413420Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cdist_cuda_float32 PASSED [0.1539s] [ 37%] 2025-12-04T13:35:25.9413539Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cdouble_cuda_float32 PASSED [0.7630s] [ 37%] 2025-12-04T13:35:25.9413670Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cholesky_solve_cuda_float32 PASSED [0.0140s] [ 37%] 2025-12-04T13:35:25.9413794Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_clamp_min_cuda_float32 PASSED [0.0080s] [ 37%] 2025-12-04T13:35:25.9413910Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_conj_cuda_float32 PASSED [0.7522s] [ 37%] 2025-12-04T13:35:25.9414031Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_copysign_cuda_float32 PASSED [0.0091s] [ 37%] 2025-12-04T13:35:25.9414157Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_count_nonzero_cuda_float32 PASSED [0.0062s] [ 37%] 2025-12-04T13:35:25.9414277Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cumsum_cuda_float32 PASSED [0.7639s] [ 37%] 2025-12-04T13:35:25.9414396Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diagflat_cuda_float32 PASSED [0.0058s] [ 37%] 2025-12-04T13:35:25.9414550Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_div_no_rounding_mode_cuda_float32 PASSED [0.0084s] [ 37%] 2025-12-04T13:35:25.9414703Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_div_trunc_rounding_cuda_float32 PASSED [0.0076s] [ 37%] 2025-12-04T13:35:25.9414823Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_einsum_cuda_float32 PASSED [0.7559s] [ 37%] 2025-12-04T13:35:25.9414938Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_equal_cuda_float32 PASSED [0.0052s] [ 37%] 2025-12-04T13:35:25.9415055Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_exp2_cuda_float32 PASSED [0.7380s] [ 38%] 2025-12-04T13:35:25.9415178Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_expand_as_cuda_float32 PASSED [0.0054s] [ 38%] 2025-12-04T13:35:25.9415292Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_eye_cuda_float32 PASSED [0.0133s] [ 38%] 2025-12-04T13:35:25.9415411Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fft2_cuda_float32 PASSED [0.7261s] [ 38%] 2025-12-04T13:35:25.9415532Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ifft2_cuda_float32 PASSED [0.0074s] [ 38%] 2025-12-04T13:35:25.9415661Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ifftshift_cuda_float32 PASSED [0.7242s] [ 38%] 2025-12-04T13:35:25.9415795Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ihfft2_cuda_float32 PASSED [0.0081s] [ 38%] 2025-12-04T13:35:25.9415918Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_irfftn_cuda_float32 PASSED [0.7394s] [ 38%] 2025-12-04T13:35:25.9416039Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_rfftn_cuda_float32 PASSED [0.0088s] [ 38%] 2025-12-04T13:35:25.9416158Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_flipud_cuda_float32 PASSED [0.0031s] [ 38%] 2025-12-04T13:35:25.9416275Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fmax_cuda_float32 PASSED [0.0078s] [ 38%] 2025-12-04T13:35:25.9416392Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fmin_cuda_float32 PASSED [0.0076s] [ 38%] 2025-12-04T13:35:25.9416510Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_frexp_cuda_float32 PASSED [0.7343s] [ 38%] 2025-12-04T13:35:25.9416637Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_geometric_cuda_float32 PASSED [0.0040s] [ 38%] 2025-12-04T13:35:25.9416786Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_grid_sampler_3d_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 38%] 2025-12-04T13:35:25.9416903Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_gt_cuda_float32 PASSED [0.0047s] [ 38%] 2025-12-04T13:35:25.9417026Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_copy_cuda_float32 PASSED [0.7215s] [ 38%] 2025-12-04T13:35:25.9417143Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_inner_cuda_float32 PASSED [0.0051s] [ 38%] 2025-12-04T13:35:25.9417259Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_item_cuda_float32 PASSED [0.7211s] [ 38%] 2025-12-04T13:35:25.9417418Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_4inputs_with_extra_args_cuda_float32 PASSED [0.0094s] [ 38%] 2025-12-04T13:35:25.9417574Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_binary_return_by_ref_cuda_float32 PASSED [0.0063s] [ 38%] 2025-12-04T13:35:25.9417693Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ldexp_cuda_float32 PASSED [0.0080s] [ 38%] 2025-12-04T13:35:25.9417812Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lgamma_cuda_float32 PASSED [0.7338s] [ 38%] 2025-12-04T13:35:25.9417936Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigh_cuda_float32 PASSED [0.0191s] [ 38%] 2025-12-04T13:35:25.9418118Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigvalsh_cuda_float32 PASSED [0.0080s] [ 38%] 2025-12-04T13:35:25.9418256Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_ldl_factor_ex_cuda_float32 PASSED [0.0033s] [ 38%] 2025-12-04T13:35:25.9418403Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lstsq_grad_oriented_cuda_float32 PASSED [0.8703s] [ 38%] 2025-12-04T13:35:25.9418563Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_matrix_norm_cuda_float32 PASSED [0.0463s] [ 38%] 2025-12-04T13:35:25.9419160Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_matrix_rank_hermitian_cuda_float32 PASSED [0.7277s] [ 38%] 2025-12-04T13:35:25.9419292Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_multi_dot_cuda_float32 PASSED [0.0128s] [ 38%] 2025-12-04T13:35:25.9419425Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_tensorinv_cuda_float32 PASSED [0.7401s] [ 39%] 2025-12-04T13:35:25.9419546Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linspace_cuda_float32 PASSED [0.0100s] [ 39%] 2025-12-04T13:35:25.9419691Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linspace_tensor_overload_cuda_float32 PASSED [0.0447s] [ 39%] 2025-12-04T13:35:25.9419809Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log10_cuda_float32 PASSED [0.0038s] [ 39%] 2025-12-04T13:35:25.9419927Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log1p_cuda_float32 PASSED [0.7249s] [ 39%] 2025-12-04T13:35:25.9420045Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log2_cuda_float32 PASSED [0.0045s] [ 39%] 2025-12-04T13:35:25.9420184Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logaddexp2_cuda_float32 PASSED [0.7306s] [ 39%] 2025-12-04T13:35:25.9420310Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_and_cuda_float32 PASSED [0.0057s] [ 39%] 2025-12-04T13:35:25.9420434Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_or_cuda_float32 PASSED [0.0047s] [ 39%] 2025-12-04T13:35:25.9420551Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logit_cuda_float32 PASSED [0.7264s] [ 39%] 2025-12-04T13:35:25.9420672Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logsumexp_cuda_float32 PASSED [0.0114s] [ 39%] 2025-12-04T13:35:25.9420787Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lt_cuda_float32 PASSED [0.0047s] [ 39%] 2025-12-04T13:35:25.9420911Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_std_cuda_float32 PASSED [0.0587s] [ 39%] 2025-12-04T13:35:25.9421033Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_maximum_cuda_float32 PASSED [0.0076s] [ 39%] 2025-12-04T13:35:25.9421175Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_meshgrid_list_of_tensors_cuda_float32 PASSED [0.7351s] [ 39%] 2025-12-04T13:35:25.9421298Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_min_binary_cuda_float32 PASSED [0.0096s] [ 39%] 2025-12-04T13:35:25.9421434Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_min_reduction_no_dim_cuda_float32 PASSED [0.7277s] [ 39%] 2025-12-04T13:35:25.9421548Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mm_cuda_float32 PASSED [0.0060s] [ 39%] 2025-12-04T13:35:25.9421663Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mode_cuda_float32 PASSED [0.7249s] [ 39%] 2025-12-04T13:35:25.9421781Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_msort_cuda_float32 PASSED [0.0049s] [ 39%] 2025-12-04T13:35:25.9421923Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mvlgamma_mvlgamma_p_1_cuda_float32 PASSED [0.7313s] [ 39%] 2025-12-04T13:35:25.9422066Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mvlgamma_mvlgamma_p_5_cuda_float32 PASSED [0.0085s] [ 39%] 2025-12-04T13:35:25.9422197Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_empty_strided_cuda_float32 PASSED [0.0048s] [ 39%] 2025-12-04T13:35:25.9422357Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_max_pool2d_cuda_float32 PASSED [0.0130s] [ 39%] 2025-12-04T13:35:25.9422505Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_alpha_dropout_cuda_float32 PASSED [0.7412s] [ 39%] 2025-12-04T13:35:25.9422649Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_avg_pool2d_cuda_float32 PASSED [0.0066s] [ 39%] 2025-12-04T13:35:25.9422831Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_batch_norm_without_cudnn_cuda_float32 PASSED [0.0123s] [ 39%] 2025-12-04T13:35:25.9423015Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_binary_cross_entropy_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 39%] 2025-12-04T13:35:25.9423154Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv3d_cuda_float32 PASSED [0.0192s] [ 39%] 2025-12-04T13:35:25.9423292Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_ctc_loss_cuda_float32 PASSED [0.7463s] [ 40%] 2025-12-04T13:35:25.9423427Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_elu_cuda_float32 PASSED [0.0049s] [ 40%] 2025-12-04T13:35:25.9423598Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_feature_alpha_dropout_with_train_cuda_float32 PASSED [0.7427s] [ 40%] 2025-12-04T13:35:25.9423742Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hardshrink_cuda_float32 PASSED [0.0061s] [ 40%] 2025-12-04T13:35:25.9423889Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hardsigmoid_cuda_float32 PASSED [0.7639s] [ 40%] 2025-12-04T13:35:25.9424055Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_area_cuda_float32 PASSED [0.0193s] [ 40%] 2025-12-04T13:35:25.9424191Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_kl_div_cuda_float32 PASSED [0.0133s] [ 40%] 2025-12-04T13:35:25.9424328Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_l1_loss_cuda_float32 PASSED [0.7548s] [ 40%] 2025-12-04T13:35:25.9424495Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multi_head_attention_forward_cuda_float32 PASSED [0.0119s] [ 40%] 2025-12-04T13:35:25.9424659Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multilabel_margin_loss_cuda_float32 PASSED [0.7579s] [ 40%] 2025-12-04T13:35:25.9424814Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_nll_loss_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 40%] 2025-12-04T13:35:25.9424962Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pixel_shuffle_cuda_float32 PASSED [0.0056s] [ 40%] 2025-12-04T13:35:25.9425116Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pixel_unshuffle_cuda_float32 PASSED [0.0042s] [ 40%] 2025-12-04T13:35:25.9425267Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_soft_margin_loss_cuda_float32 PASSED [0.7361s] [ 40%] 2025-12-04T13:35:25.9425408Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softplus_cuda_float32 PASSED [0.0048s] [ 40%] 2025-12-04T13:35:25.9425552Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softshrink_cuda_float32 PASSED [0.7420s] [ 40%] 2025-12-04T13:35:25.9425697Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_tanhshrink_cuda_float32 PASSED [0.0052s] [ 40%] 2025-12-04T13:35:25.9425853Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_triplet_margin_loss_cuda_float32 PASSED [0.7500s] [ 40%] 2025-12-04T13:35:25.9426028Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_triplet_margin_with_distance_loss_cuda_float32 PASSED [0.0116s] [ 40%] 2025-12-04T13:35:25.9426167Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_unfold_cuda_float32 PASSED [0.0678s] [ 40%] 2025-12-04T13:35:25.9426321Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_upsample_bilinear_cuda_float32 PASSED [0.0041s] [ 40%] 2025-12-04T13:35:25.9426472Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_upsample_nearest_cuda_float32 PASSED [0.7677s] [ 40%] 2025-12-04T13:35:25.9426592Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_norm_cuda_float32 PASSED [0.0278s] [ 40%] 2025-12-04T13:35:25.9426713Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_norm_inf_cuda_float32 PASSED [0.7993s] [ 40%] 2025-12-04T13:35:25.9426845Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_normal_in_place_cuda_float32 PASSED [0.0042s] [ 40%] 2025-12-04T13:35:25.9426990Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_pca_lowrank_cuda_float32 PASSED [0.0749s] [ 40%] 2025-12-04T13:35:25.9427129Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_permute_copy_cuda_float32 PASSED [0.7740s] [ 40%] 2025-12-04T13:35:25.9427274Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_0_cuda_float32 PASSED [0.0117s] [ 40%] 2025-12-04T13:35:25.9427419Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_3_cuda_float32 PASSED [0.0064s] [ 40%] 2025-12-04T13:35:25.9427563Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_4_cuda_float32 PASSED [0.7897s] [ 40%] 2025-12-04T13:35:25.9427680Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_prod_cuda_float32 PASSED [0.0250s] [ 41%] 2025-12-04T13:35:25.9427797Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_real_cuda_float32 PASSED [0.7935s] [ 41%] 2025-12-04T13:35:25.9427917Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_resize__cuda_float32 PASSED [0.0040s] [ 41%] 2025-12-04T13:35:25.9428045Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_select_cuda_float32 PASSED [0.7951s] [ 41%] 2025-12-04T13:35:25.9428198Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_short_cuda_float32 PASSED [0.0044s] [ 41%] 2025-12-04T13:35:25.9428317Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sigmoid_cuda_float32 PASSED [0.8048s] [ 41%] 2025-12-04T13:35:25.9428459Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_gaussian_cuda_float32 PASSED [0.0048s] [ 41%] 2025-12-04T13:35:25.9428612Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_general_hamming_cuda_float32 PASSED [0.0046s] [ 41%] 2025-12-04T13:35:25.9428753Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_nuttall_cuda_float32 PASSED [0.8011s] [ 41%] 2025-12-04T13:35:25.9428872Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sinh_cuda_float32 PASSED [0.0041s] [ 41%] 2025-12-04T13:35:25.9429013Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sparse_sampled_addmm_cuda_float32 PASSED [0.8468s] [ 41%] 2025-12-04T13:35:25.9429146Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_airy_ai_cuda_float32 PASSED [0.7881s] [ 41%] 2025-12-04T13:35:25.9429300Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_chebyshev_polynomial_w_cuda_float32 PASSED [0.0087s] [ 41%] 2025-12-04T13:35:25.9429455Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_laguerre_polynomial_l_cuda_float32 PASSED [0.0062s] [ 41%] 2025-12-04T13:35:25.9429572Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_stft_cuda_float32 PASSED [1.1735s] [ 41%] 2025-12-04T13:35:25.9429689Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tanh_cuda_float32 PASSED [0.8008s] [ 41%] 2025-12-04T13:35:25.9429812Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_to_sparse_cuda_float32 PASSED [0.0063s] [ 41%] 2025-12-04T13:35:25.9429929Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trace_cuda_float32 PASSED [0.7956s] [ 41%] 2025-12-04T13:35:25.9430054Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trapezoid_cuda_float32 PASSED [0.0107s] [ 41%] 2025-12-04T13:35:25.9430170Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_triu_cuda_float32 PASSED [0.8174s] [ 41%] 2025-12-04T13:35:25.9430306Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unique_consecutive_cuda_float32 PASSED [0.1017s] [ 41%] 2025-12-04T13:35:25.9430425Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unique_cuda_float32 PASSED [0.1627s] [ 41%] 2025-12-04T13:35:25.9430552Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_var_unbiased_cuda_float32 PASSED [0.7969s] [ 41%] 2025-12-04T13:35:25.9430668Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vdot_cuda_float32 PASSED [0.0041s] [ 41%] 2025-12-04T13:35:25.9430783Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_view_cuda_float32 PASSED [0.8032s] [ 41%] 2025-12-04T13:35:25.9430935Z test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vsplit_cuda_float32 PASSED [0.0051s] [ 41%] 2025-12-04T13:35:25.9431072Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___radd___cuda_float32 PASSED [0.1436s] [ 41%] 2025-12-04T13:35:25.9431238Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__chunk_cat_cuda_float32 SKIPPED [0.0011s] (Does not support autograd) [ 41%] 2025-12-04T13:35:25.9431382Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__softmax_backward_data_cuda_float32 PASSED [0.0482s] [ 41%] 2025-12-04T13:35:25.9431499Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_acos_cuda_float32 PASSED [0.8113s] [ 41%] 2025-12-04T13:35:25.9431618Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_acosh_cuda_float32 PASSED [0.0120s] [ 42%] 2025-12-04T13:35:25.9431735Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addmv_cuda_float32 PASSED [0.3637s] [ 42%] 2025-12-04T13:35:25.9431853Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addr_cuda_float32 PASSED [0.2525s] [ 42%] 2025-12-04T13:35:25.9432027Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_arange_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 42%] 2025-12-04T13:35:25.9432188Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argmax_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 42%] 2025-12-04T13:35:25.9432347Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argmin_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 42%] 2025-12-04T13:35:25.9432465Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_bmm_cuda_float32 PASSED [0.0154s] [ 42%] 2025-12-04T13:35:25.9432602Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_broadcast_tensors_cuda_float32 PASSED [0.4691s] [ 42%] 2025-12-04T13:35:25.9432732Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_broadcast_to_cuda_float32 PASSED [0.0188s] [ 42%] 2025-12-04T13:35:25.9432895Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cauchy_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 42%] 2025-12-04T13:35:25.9433019Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cdouble_cuda_float32 PASSED [0.8207s] [ 42%] 2025-12-04T13:35:25.9433136Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ceil_cuda_float32 PASSED [0.0064s] [ 42%] 2025-12-04T13:35:25.9433297Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_chalf_cuda_float32 SKIPPED [0.0011s] (Does not support forward_ad) [ 42%] 2025-12-04T13:35:25.9433455Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_char_cuda_float32 SKIPPED [0.0011s] (Does not support autograd) [ 42%] 2025-12-04T13:35:25.9433584Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_column_stack_cuda_float32 PASSED [0.0119s] [ 42%] 2025-12-04T13:35:25.9433715Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_combinations_cuda_float32 PASSED [0.1168s] [ 42%] 2025-12-04T13:35:25.9433834Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_conj_cuda_float32 PASSED [0.0100s] [ 42%] 2025-12-04T13:35:25.9433959Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_corrcoef_cuda_float32 PASSED [0.0512s] [ 42%] 2025-12-04T13:35:25.9434076Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cos_cuda_float32 PASSED [0.0092s] [ 42%] 2025-12-04T13:35:25.9434247Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_count_nonzero_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 42%] 2025-12-04T13:35:25.9434372Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diag_embed_cuda_float32 PASSED [0.0385s] [ 42%] 2025-12-04T13:35:25.9434509Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_floor_rounding_cuda_float32 PASSED [0.0991s] [ 42%] 2025-12-04T13:35:25.9434647Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_no_rounding_mode_cuda_float32 PASSED [0.1091s] [ 42%] 2025-12-04T13:35:25.9434836Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_permuted_cuda_float32 SKIPPED [0.0011s] (Does not support autograd) [ 42%] 2025-12-04T13:35:25.9434967Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_erfinv_cuda_float32 PASSED [0.0053s] [ 42%] 2025-12-04T13:35:25.9435084Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_exp_cuda_float32 PASSED [0.0083s] [ 42%] 2025-12-04T13:35:25.9435205Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expand_cuda_float32 PASSED [0.0230s] [ 42%] 2025-12-04T13:35:25.9435323Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expm1_cuda_float32 PASSED [0.0045s] [ 42%] 2025-12-04T13:35:25.9435491Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_exponential_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 42%] 2025-12-04T13:35:25.9435616Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfftn_cuda_float32 PASSED [0.0350s] [ 42%] 2025-12-04T13:35:25.9435742Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ifftn_cuda_float32 PASSED [0.0292s] [ 43%] 2025-12-04T13:35:25.9435868Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ihfftn_cuda_float32 PASSED [0.0337s] [ 43%] 2025-12-04T13:35:25.9436002Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_irfft_cuda_float32 PASSED [0.0240s] [ 43%] 2025-12-04T13:35:25.9436124Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_rfftn_cuda_float32 PASSED [0.0260s] [ 43%] 2025-12-04T13:35:25.9436286Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_full_like_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 43%] 2025-12-04T13:35:25.9436446Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_igamma_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 43%] 2025-12-04T13:35:25.9436571Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_add_cuda_float32 PASSED [0.2027s] [ 43%] 2025-12-04T13:35:25.9436697Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_copy_cuda_float32 PASSED [0.0656s] [ 43%] 2025-12-04T13:35:25.9436861Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isclose_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 43%] 2025-12-04T13:35:25.9437026Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isfinite_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 43%] 2025-12-04T13:35:25.9437185Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isreal_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 43%] 2025-12-04T13:35:25.9437341Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_item_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 43%] 2025-12-04T13:35:25.9437531Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_binary_return_by_ref_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 43%] 2025-12-04T13:35:25.9437655Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_kthvalue_cuda_float32 PASSED [0.0309s] [ 43%] 2025-12-04T13:35:25.9437776Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ldexp_cuda_float32 PASSED [0.1292s] [ 43%] 2025-12-04T13:35:25.9437907Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_cross_cuda_float32 PASSED [0.0384s] [ 43%] 2025-12-04T13:35:25.9438033Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_det_cuda_float32 PASSED [0.0461s] [ 43%] 2025-12-04T13:35:25.9438197Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_eigh_cuda_float32 PASSED [0.0588s] [ 43%] 2025-12-04T13:35:25.9438325Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_inv_ex_cuda_float32 PASSED [0.0352s] [ 43%] 2025-12-04T13:35:25.9438514Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_matrix_rank_hermitian_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 43%] 2025-12-04T13:35:25.9438641Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_norm_cuda_float32 PASSED [0.4500s] [ 43%] 2025-12-04T13:35:25.9438773Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_slogdet_cuda_float32 PASSED [0.0488s] [ 43%] 2025-12-04T13:35:25.9439000Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linspace_tensor_overload_cuda_float32 SKIPPED [0.0013s] (Does not support autograd) [ 43%] 2025-12-04T13:35:25.9439137Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logdet_cuda_float32 PASSED [0.0420s] [ 43%] 2025-12-04T13:35:25.9439256Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logit_cuda_float32 PASSED [0.0157s] [ 43%] 2025-12-04T13:35:25.9439424Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_argmax_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 43%] 2025-12-04T13:35:25.9439551Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_mean_cuda_float32 PASSED [0.9129s] [ 43%] 2025-12-04T13:35:25.9439681Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_softmin_cuda_float32 PASSED [0.2395s] [ 43%] 2025-12-04T13:35:25.9439806Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_var_cuda_float32 PASSED [1.2898s] [ 43%] 2025-12-04T13:35:25.9439927Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_matmul_cuda_float32 PASSED [0.2667s] [ 44%] 2025-12-04T13:35:25.9440066Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_matrix_exp_cuda_float32 PASSED [0.0303s] [ 44%] 2025-12-04T13:35:25.9440224Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_max_pool2d_with_indices_backward_cuda_float32 PASSED [5.8673s] [ 44%] 2025-12-04T13:35:25.9440370Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_meshgrid_list_of_tensors_cuda_float32 PASSED [0.0559s] [ 44%] 2025-12-04T13:35:25.9440512Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_5_cuda_float32 PASSED [0.0324s] [ 44%] 2025-12-04T13:35:25.9440633Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nansum_cuda_float32 PASSED [0.0999s] [ 44%] 2025-12-04T13:35:25.9440768Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_native_batch_norm_cuda_float32 PASSED [4.4675s] [ 44%] 2025-12-04T13:35:25.9440930Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_full_cuda_float32 SKIPPED [0.0011s] (Does not support autograd) [ 44%] 2025-12-04T13:35:25.9441091Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_ones_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 44%] 2025-12-04T13:35:25.9441248Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_avg_pool1d_cuda_float32 PASSED [0.0203s] [ 44%] 2025-12-04T13:35:25.9441405Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_avg_pool2d_cuda_float32 PASSED [0.0262s] [ 44%] 2025-12-04T13:35:25.9441560Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_avg_pool3d_cuda_float32 PASSED [0.0305s] [ 44%] 2025-12-04T13:35:25.9441714Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_max_pool1d_cuda_float32 PASSED [0.0329s] [ 44%] 2025-12-04T13:35:25.9441860Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_avg_pool1d_cuda_float32 PASSED [0.0304s] [ 44%] 2025-12-04T13:35:25.9442005Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_avg_pool2d_cuda_float32 PASSED [0.7701s] [ 44%] 2025-12-04T13:35:25.9442150Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_batch_norm_cuda_float32 PASSED [3.5841s] [ 44%] 2025-12-04T13:35:25.9442308Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_binary_cross_entropy_cuda_float32 PASSED [0.2227s] [ 44%] 2025-12-04T13:35:25.9442447Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv3d_cuda_float32 PASSED [1.6849s] [ 44%] 2025-12-04T13:35:25.9442631Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_ctc_loss_cuda_float32 SKIPPED [0.0014s] (Does not support forward_ad) [ 44%] 2025-12-04T13:35:25.9442775Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_dropout3d_cuda_float32 PASSED [0.0596s] [ 44%] 2025-12-04T13:35:25.9442937Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_dropout_cuda_float32 PASSED [0.0639s] [ 44%] 2025-12-04T13:35:25.9443099Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_gaussian_nll_loss_cuda_float32 XFAIL [0.0163s] [ 44%] 2025-12-04T13:35:25.9443243Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_group_norm_cuda_float32 PASSED [4.0324s] [ 44%] 2025-12-04T13:35:25.9443386Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_huber_loss_cuda_float32 PASSED [0.1270s] [ 44%] 2025-12-04T13:35:25.9443539Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_area_cuda_float32 PASSED [0.0642s] [ 44%] 2025-12-04T13:35:25.9443698Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_bilinear_cuda_float32 PASSED [0.0630s] [ 44%] 2025-12-04T13:35:25.9443855Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_nearest_cuda_float32 PASSED [0.0569s] [ 44%] 2025-12-04T13:35:25.9444000Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_pool2d_cuda_float32 PASSED [4.8288s] [ 44%] 2025-12-04T13:35:25.9444162Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool3d_grad_cuda_float32 PASSED [0.0533s] [ 44%] 2025-12-04T13:35:25.9444298Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_mish_cuda_float32 PASSED [0.0122s] [ 44%] 2025-12-04T13:35:25.9444464Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_multilabel_soft_margin_loss_cuda_float32 PASSED [0.1800s] [ 45%] 2025-12-04T13:35:25.9444643Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pdist_cuda_float32 SKIPPED [0.0012s] (Does not support forward_ad) [ 45%] 2025-12-04T13:35:25.9444796Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pixel_unshuffle_cuda_float32 PASSED [0.0123s] [ 45%] 2025-12-04T13:35:25.9444938Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softmin_cuda_float32 PASSED [0.0285s] [ 45%] 2025-12-04T13:35:25.9445081Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_threshold_cuda_float32 PASSED [0.0122s] [ 45%] 2025-12-04T13:35:25.9445255Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_triplet_margin_with_distance_loss_cuda_float32 PASSED [1.2573s] [ 45%] 2025-12-04T13:35:25.9445394Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_unfold_cuda_float32 PASSED [0.3663s] [ 45%] 2025-12-04T13:35:25.9445550Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_upsample_bilinear_cuda_float32 PASSED [0.7576s] [ 45%] 2025-12-04T13:35:25.9445713Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ones_like_cuda_float32 SKIPPED [0.0014s] (Does not support autograd) [ 45%] 2025-12-04T13:35:25.9445837Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_pinverse_cuda_float32 PASSED [0.0709s] [ 45%] 2025-12-04T13:35:25.9445958Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_pow_cuda_float32 PASSED [0.1646s] [ 45%] 2025-12-04T13:35:25.9446120Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rand_like_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 45%] 2025-12-04T13:35:25.9446283Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_randint_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 45%] 2025-12-04T13:35:25.9446447Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_randint_like_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 45%] 2025-12-04T13:35:25.9446568Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_renorm_cuda_float32 PASSED [0.0214s] [ 45%] 2025-12-04T13:35:25.9446729Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resize__cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 45%] 2025-12-04T13:35:25.9446861Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resolve_conj_cuda_float32 PASSED [0.0058s] [ 45%] 2025-12-04T13:35:25.9447003Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_roll_cuda_float32 PASSED [0.0389s] [ 45%] 2025-12-04T13:35:25.9447148Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_round_decimals_3_cuda_float32 PASSED [0.0087s] [ 45%] 2025-12-04T13:35:25.9447265Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rsqrt_cuda_float32 PASSED [0.0092s] [ 45%] 2025-12-04T13:35:25.9447385Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rsub_cuda_float32 PASSED [0.1221s] [ 45%] 2025-12-04T13:35:25.9447521Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_amin_cuda_float32 PASSED [0.8041s] [ 45%] 2025-12-04T13:35:25.9447637Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sgn_cuda_float32 PASSED [0.7379s] [ 45%] 2025-12-04T13:35:25.9447794Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_short_cuda_float32 SKIPPED [0.0014s] (Does not support autograd) [ 45%] 2025-12-04T13:35:25.9447918Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sigmoid_cuda_float32 PASSED [0.0102s] [ 45%] 2025-12-04T13:35:25.9448037Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sign_cuda_float32 PASSED [0.0048s] [ 45%] 2025-12-04T13:35:25.9448328Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_cosine_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 45%] 2025-12-04T13:35:25.9448509Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_hamming_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 45%] 2025-12-04T13:35:25.9448686Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_kaiser_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 45%] 2025-12-04T13:35:25.9448864Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sparse_sampled_addmm_cuda_float32 SKIPPED [0.0011s] (Does not support forward_ad) [ 45%] 2025-12-04T13:35:25.9449036Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_y1_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 46%] 2025-12-04T13:35:25.9449229Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_chebyshev_polynomial_u_cuda_float32 SKIPPED [0.0009s] (Does not support autograd) [ 46%] 2025-12-04T13:35:25.9449365Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_log_ndtr_cuda_float32 PASSED [0.0111s] [ 46%] 2025-12-04T13:35:25.9449497Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_ndtri_cuda_float32 PASSED [0.7403s] [ 46%] 2025-12-04T13:35:25.9449691Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_scaled_modified_bessel_k0_cuda_float32 SKIPPED [0.0013s] (Does not support autograd) [ 46%] 2025-12-04T13:35:25.9449893Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_w_cuda_float32 SKIPPED [0.0010s] (Does not support autograd) [ 46%] 2025-12-04T13:35:25.9450026Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_xlog1py_cuda_float32 PASSED [0.1450s] [ 46%] 2025-12-04T13:35:25.9450156Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_squeeze_copy_cuda_float32 PASSED [0.0218s] [ 46%] 2025-12-04T13:35:25.9450275Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_std_cuda_float32 PASSED [0.0539s] [ 46%] 2025-12-04T13:35:25.9450401Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_std_unbiased_cuda_float32 PASSED [0.0093s] [ 46%] 2025-12-04T13:35:25.9450519Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sum_cuda_float32 PASSED [0.0510s] [ 46%] 2025-12-04T13:35:25.9450643Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unflatten_cuda_float32 PASSED [0.0252s] [ 46%] 2025-12-04T13:35:25.9450764Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unfold_cuda_float32 PASSED [0.0551s] [ 46%] 2025-12-04T13:35:25.9450895Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unsqueeze_copy_cuda_float32 PASSED [0.0235s] [ 46%] 2025-12-04T13:35:25.9451039Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_cuda_float32 PASSED [0.0456s] [ 46%] 2025-12-04T13:35:25.9451183Z test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_mean_unbiased_cuda_float32 PASSED [0.0109s] [ 46%] 2025-12-04T13:35:25.9451312Z test_ops.py::TestCompositeComplianceCUDA::test_operator___rmatmul___cuda_float32 PASSED [0.0297s] [ 46%] 2025-12-04T13:35:25.9451431Z test_ops.py::TestCompositeComplianceCUDA::test_operator___rsub___cuda_float32 PASSED [0.0125s] [ 46%] 2025-12-04T13:35:25.9451578Z test_ops.py::TestCompositeComplianceCUDA::test_operator__segment_reduce_lengths_cuda_float32 PASSED [0.1619s] [ 46%] 2025-12-04T13:35:25.9451695Z test_ops.py::TestCompositeComplianceCUDA::test_operator_abs_cuda_float32 PASSED [0.7422s] [ 46%] 2025-12-04T13:35:25.9451812Z test_ops.py::TestCompositeComplianceCUDA::test_operator_acos_cuda_float32 PASSED [0.0054s] [ 46%] 2025-12-04T13:35:25.9451930Z test_ops.py::TestCompositeComplianceCUDA::test_operator_acosh_cuda_float32 PASSED [0.7610s] [ 46%] 2025-12-04T13:35:25.9452050Z test_ops.py::TestCompositeComplianceCUDA::test_operator_addbmm_cuda_float32 PASSED [0.0247s] [ 46%] 2025-12-04T13:35:25.9452167Z test_ops.py::TestCompositeComplianceCUDA::test_operator_addr_cuda_float32 PASSED [0.0151s] [ 46%] 2025-12-04T13:35:25.9452301Z test_ops.py::TestCompositeComplianceCUDA::test_operator_argwhere_cuda_float32 PASSED [0.0063s] [ 46%] 2025-12-04T13:35:25.9452414Z test_ops.py::TestCompositeComplianceCUDA::test_operator_asin_cuda_float32 PASSED [0.7475s] [ 46%] 2025-12-04T13:35:25.9452532Z test_ops.py::TestCompositeComplianceCUDA::test_operator_asinh_cuda_float32 PASSED [0.0043s] [ 46%] 2025-12-04T13:35:25.9452648Z test_ops.py::TestCompositeComplianceCUDA::test_operator_atan2_cuda_float32 PASSED [0.0129s] [ 46%] 2025-12-04T13:35:25.9452772Z test_ops.py::TestCompositeComplianceCUDA::test_operator_atleast_2d_cuda_float32 PASSED [0.0082s] [ 46%] 2025-12-04T13:35:25.9452895Z test_ops.py::TestCompositeComplianceCUDA::test_operator_bernoulli_cuda_float32 PASSED [0.7510s] [ 46%] 2025-12-04T13:35:25.9453030Z test_ops.py::TestCompositeComplianceCUDA::test_operator_broadcast_tensors_cuda_float32 PASSED [0.0149s] [ 47%] 2025-12-04T13:35:25.9453161Z test_ops.py::TestCompositeComplianceCUDA::test_operator_broadcast_to_cuda_float32 PASSED [0.0079s] [ 47%] 2025-12-04T13:35:25.9453278Z test_ops.py::TestCompositeComplianceCUDA::test_operator_ceil_cuda_float32 PASSED [0.7327s] [ 47%] 2025-12-04T13:35:25.9453395Z test_ops.py::TestCompositeComplianceCUDA::test_operator_chalf_cuda_float32 PASSED [0.0081s] [ 47%] 2025-12-04T13:35:25.9453519Z test_ops.py::TestCompositeComplianceCUDA::test_operator_clamp_max_cuda_float32 PASSED [0.0128s] [ 47%] 2025-12-04T13:35:25.9453636Z test_ops.py::TestCompositeComplianceCUDA::test_operator_clone_cuda_float32 PASSED [0.7360s] [ 47%] 2025-12-04T13:35:25.9453756Z test_ops.py::TestCompositeComplianceCUDA::test_operator_cumprod_cuda_float32 PASSED [0.0116s] [ 47%] 2025-12-04T13:35:25.9453897Z test_ops.py::TestCompositeComplianceCUDA::test_operator_cumulative_trapezoid_cuda_float32 PASSED [0.0214s] [ 47%] 2025-12-04T13:35:25.9454017Z test_ops.py::TestCompositeComplianceCUDA::test_operator_deg2rad_cuda_float32 PASSED [0.0030s] [ 47%] 2025-12-04T13:35:25.9454135Z test_ops.py::TestCompositeComplianceCUDA::test_operator_diag_cuda_float32 PASSED [0.0145s] [ 47%] 2025-12-04T13:35:25.9454264Z test_ops.py::TestCompositeComplianceCUDA::test_operator_diagonal_copy_cuda_float32 PASSED [0.0137s] [ 47%] 2025-12-04T13:35:25.9454386Z test_ops.py::TestCompositeComplianceCUDA::test_operator_diagonal_cuda_float32 PASSED [0.0135s] [ 47%] 2025-12-04T13:35:25.9454501Z test_ops.py::TestCompositeComplianceCUDA::test_operator_dist_cuda_float32 PASSED [0.0651s] [ 47%] 2025-12-04T13:35:25.9454637Z test_ops.py::TestCompositeComplianceCUDA::test_operator_div_floor_rounding_cuda_float32 PASSED [0.0133s] [ 47%] 2025-12-04T13:35:25.9454819Z test_ops.py::TestCompositeComplianceCUDA::test_operator_empty_like_cuda_float32 SKIPPED [0.0001s] (Expected: empty_like is not comparable) [ 47%] 2025-12-04T13:35:25.9455018Z test_ops.py::TestCompositeComplianceCUDA::test_operator_empty_strided_cuda_float32 SKIPPED [0.0001s] (Expected: empty is not comparable) [ 47%] 2025-12-04T13:35:25.9455145Z test_ops.py::TestCompositeComplianceCUDA::test_operator_erfc_cuda_float32 PASSED [0.0042s] [ 47%] 2025-12-04T13:35:25.9455272Z test_ops.py::TestCompositeComplianceCUDA::test_operator_exponential_cuda_float32 PASSED [0.0127s] [ 47%] 2025-12-04T13:35:25.9455389Z test_ops.py::TestCompositeComplianceCUDA::test_operator_eye_cuda_float32 PASSED [0.0490s] [ 47%] 2025-12-04T13:35:25.9455515Z test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_fftshift_cuda_float32 PASSED [0.0062s] [ 47%] 2025-12-04T13:35:25.9458567Z test_ops.py::TestCompositeComplianceCUDA::test_operator_flipud_cuda_float32 PASSED [0.0035s] [ 47%] 2025-12-04T13:35:25.9458698Z test_ops.py::TestCompositeComplianceCUDA::test_operator_float_cuda_float32 PASSED [0.0054s] [ 47%] 2025-12-04T13:35:25.9458827Z test_ops.py::TestCompositeComplianceCUDA::test_operator_float_power_cuda_float32 PASSED [0.0152s] [ 47%] 2025-12-04T13:35:25.9458948Z test_ops.py::TestCompositeComplianceCUDA::test_operator_frac_cuda_float32 PASSED [0.7609s] [ 47%] 2025-12-04T13:35:25.9459099Z test_ops.py::TestCompositeComplianceCUDA::test_operator_frexp_cuda_float32 PASSED [0.0063s] [ 47%] 2025-12-04T13:35:25.9459222Z test_ops.py::TestCompositeComplianceCUDA::test_operator_full_like_cuda_float32 PASSED [0.0083s] [ 47%] 2025-12-04T13:35:25.9459339Z test_ops.py::TestCompositeComplianceCUDA::test_operator_ge_cuda_float32 PASSED [0.0089s] [ 47%] 2025-12-04T13:35:25.9459488Z test_ops.py::TestCompositeComplianceCUDA::test_operator_grid_sampler_3d_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 47%] 2025-12-04T13:35:25.9459615Z test_ops.py::TestCompositeComplianceCUDA::test_operator_hash_tensor_cuda_float32 PASSED [0.0123s] [ 47%] 2025-12-04T13:35:25.9459734Z test_ops.py::TestCompositeComplianceCUDA::test_operator_igammac_cuda_float32 PASSED [0.0127s] [ 48%] 2025-12-04T13:35:25.9459880Z test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_unary_cuda_float32 SKIPPED [0.0001s] (skip) [ 48%] 2025-12-04T13:35:25.9459999Z test_ops.py::TestCompositeComplianceCUDA::test_operator_le_cuda_float32 PASSED [0.0088s] [ 48%] 2025-12-04T13:35:25.9460138Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_cholesky_ex_cuda_float32 PASSED [0.0175s] [ 48%] 2025-12-04T13:35:25.9460266Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_cross_cuda_float32 PASSED [0.0059s] [ 48%] 2025-12-04T13:35:25.9460406Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_pinv_hermitian_cuda_float32 PASSED [0.0166s] [ 48%] 2025-12-04T13:35:25.9460549Z test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_solve_triangular_cuda_float32 PASSED [0.1827s] [ 48%] 2025-12-04T13:35:25.9460675Z test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_or_cuda_float32 PASSED [0.0091s] [ 48%] 2025-12-04T13:35:25.9460793Z test_ops.py::TestCompositeComplianceCUDA::test_operator_logit_cuda_float32 PASSED [0.0051s] [ 48%] 2025-12-04T13:35:25.9460920Z test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_amin_cuda_float32 PASSED [0.1657s] [ 48%] 2025-12-04T13:35:25.9461054Z test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_argmax_cuda_float32 PASSED [0.1082s] [ 48%] 2025-12-04T13:35:25.9461188Z test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_logaddexp_cuda_float32 PASSED [0.0704s] [ 48%] 2025-12-04T13:35:25.9461318Z test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_median_cuda_float32 PASSED [0.0301s] [ 48%] 2025-12-04T13:35:25.9461448Z test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_softmax_cuda_float32 PASSED [0.0397s] [ 48%] 2025-12-04T13:35:25.9461573Z test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_sum_cuda_float32 PASSED [0.1652s] [ 48%] 2025-12-04T13:35:25.9461728Z test_ops.py::TestCompositeComplianceCUDA::test_operator_max_pool2d_with_indices_backward_cuda_float32 PASSED [1.7933s] [ 48%] 2025-12-04T13:35:25.9461872Z test_ops.py::TestCompositeComplianceCUDA::test_operator_mean_cuda_float32 PASSED [0.0179s] [ 48%] 2025-12-04T13:35:25.9462015Z test_ops.py::TestCompositeComplianceCUDA::test_operator_median_cuda_float32 PASSED [0.0143s] [ 48%] 2025-12-04T13:35:25.9462137Z test_ops.py::TestCompositeComplianceCUDA::test_operator_minimum_cuda_float32 PASSED [0.0123s] [ 48%] 2025-12-04T13:35:25.9462252Z test_ops.py::TestCompositeComplianceCUDA::test_operator_mm_cuda_float32 PASSED [0.0056s] [ 48%] 2025-12-04T13:35:25.9462379Z test_ops.py::TestCompositeComplianceCUDA::test_operator_multinomial_cuda_float32 PASSED [0.7815s] [ 48%] 2025-12-04T13:35:25.9462496Z test_ops.py::TestCompositeComplianceCUDA::test_operator_narrow_cuda_float32 XFAIL [0.0050s] [ 48%] 2025-12-04T13:35:25.9462633Z test_ops.py::TestCompositeComplianceCUDA::test_operator_native_layer_norm_cuda_float32 PASSED [0.7850s] [ 48%] 2025-12-04T13:35:25.9462755Z test_ops.py::TestCompositeComplianceCUDA::test_operator_new_ones_cuda_float32 PASSED [0.0084s] [ 48%] 2025-12-04T13:35:25.9462880Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nextafter_cuda_float32 PASSED [0.0127s] [ 48%] 2025-12-04T13:35:25.9463038Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_adaptive_max_pool3d_cuda_float32 PASSED [0.7620s] [ 48%] 2025-12-04T13:35:25.9463192Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_bilinear_cuda_float32 PASSED [0.1332s] [ 48%] 2025-12-04T13:35:25.9463344Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_channel_shuffle_cuda_float32 PASSED [0.0047s] [ 48%] 2025-12-04T13:35:25.9463483Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv1d_cuda_float32 PASSED [0.0404s] [ 48%] 2025-12-04T13:35:25.9463802Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv2d_cuda_float32 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 2400, provided ptr: 0x763bab000a00 size: 1024 2025-12-04T13:35:25.9463990Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 2400, provided ptr: 0x763bab000a00 size: 1024 2025-12-04T13:35:25.9464034Z PASSED [0.1150s] [ 48%] 2025-12-04T13:35:25.9464190Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv_transpose1d_cuda_float32 PASSED [0.0802s] [ 49%] 2025-12-04T13:35:25.9464340Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_cross_entropy_cuda_float32 PASSED [0.0483s] [ 49%] 2025-12-04T13:35:25.9464481Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_ctc_loss_cuda_float32 PASSED [0.0871s] [ 49%] 2025-12-04T13:35:25.9464616Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_elu_cuda_float32 PASSED [0.0043s] [ 49%] 2025-12-04T13:35:25.9464768Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_gaussian_nll_loss_cuda_float32 XFAIL [0.0029s] [ 49%] 2025-12-04T13:35:25.9464903Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_gelu_cuda_float32 PASSED [0.7728s] [ 49%] 2025-12-04T13:35:25.9465051Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hardshrink_cuda_float32 PASSED [0.7461s] [ 49%] 2025-12-04T13:35:25.9465195Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hardswish_cuda_float32 PASSED [0.0158s] [ 49%] 2025-12-04T13:35:25.9465338Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_huber_loss_cuda_float32 PASSED [0.0138s] [ 49%] 2025-12-04T13:35:25.9465494Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_linear_cuda_float32 PASSED [0.0227s] [ 49%] 2025-12-04T13:35:25.9465631Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_linear_cuda_float32 PASSED [0.0432s] [ 49%] 2025-12-04T13:35:25.9465775Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_logsigmoid_cuda_float32 PASSED [0.0044s] [ 49%] 2025-12-04T13:35:25.9465928Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool2d_grad_cuda_float32 PASSED [0.0811s] [ 49%] 2025-12-04T13:35:25.9466100Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool3d_grad_cuda_float32 PASSED [0.0282s] [ 49%] 2025-12-04T13:35:25.9466279Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multilabel_soft_margin_loss_cuda_float32 PASSED [0.0194s] [ 49%] 2025-12-04T13:35:25.9466416Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pdist_cuda_float32 PASSED [0.0082s] [ 49%] 2025-12-04T13:35:25.9466567Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pixel_unshuffle_cuda_float32 PASSED [0.0052s] [ 49%] 2025-12-04T13:35:25.9466703Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_rrelu_cuda_float32 PASSED [0.0098s] [ 49%] 2025-12-04T13:35:25.9466855Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_soft_margin_loss_cuda_float32 PASSED [0.0098s] [ 49%] 2025-12-04T13:35:25.9466997Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softsign_cuda_float32 PASSED [0.0047s] [ 49%] 2025-12-04T13:35:25.9467153Z test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_upsample_bilinear_cuda_float32 PASSED [0.0054s] [ 49%] 2025-12-04T13:35:25.9467286Z test_ops.py::TestCompositeComplianceCUDA::test_operator_norm_nuc_cuda_float32 PASSED [0.0112s] [ 49%] 2025-12-04T13:35:25.9467417Z test_ops.py::TestCompositeComplianceCUDA::test_operator_normal_in_place_cuda_float32 PASSED [0.0051s] [ 49%] 2025-12-04T13:35:25.9467560Z test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_0_cuda_float32 PASSED [0.0096s] [ 49%] 2025-12-04T13:35:25.9467705Z test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_3_cuda_float32 PASSED [0.0095s] [ 49%] 2025-12-04T13:35:25.9467822Z test_ops.py::TestCompositeComplianceCUDA::test_operator_prod_cuda_float32 PASSED [0.0325s] [ 49%] 2025-12-04T13:35:25.9467940Z test_ops.py::TestCompositeComplianceCUDA::test_operator_put_cuda_float32 PASSED [0.0578s] [ 49%] 2025-12-04T13:35:25.9468096Z test_ops.py::TestCompositeComplianceCUDA::test_operator_quantile_cuda_float32 PASSED [0.2164s] [ 49%] 2025-12-04T13:35:25.9468222Z test_ops.py::TestCompositeComplianceCUDA::test_operator_rand_like_cuda_float32 PASSED [0.0138s] [ 49%] 2025-12-04T13:35:25.9468351Z test_ops.py::TestCompositeComplianceCUDA::test_operator_randint_like_cuda_float32 PASSED [0.0239s] [ 49%] 2025-12-04T13:35:25.9468486Z test_ops.py::TestCompositeComplianceCUDA::test_operator_repeat_interleave_cuda_float32 PASSED [0.0090s] [ 50%] 2025-12-04T13:35:25.9468614Z test_ops.py::TestCompositeComplianceCUDA::test_operator_resolve_conj_cuda_float32 PASSED [0.0033s] [ 50%] 2025-12-04T13:35:25.9468751Z test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_reduce_amax_cuda_float32 PASSED [0.0506s] [ 50%] 2025-12-04T13:35:25.9468867Z test_ops.py::TestCompositeComplianceCUDA::test_operator_sgn_cuda_float32 PASSED [0.0029s] [ 50%] 2025-12-04T13:35:25.9468985Z test_ops.py::TestCompositeComplianceCUDA::test_operator_short_cuda_float32 PASSED [0.0046s] [ 50%] 2025-12-04T13:35:25.9469130Z test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_blackman_cuda_float32 PASSED [0.0130s] [ 50%] 2025-12-04T13:35:25.9469281Z test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_general_cosine_cuda_float32 PASSED [0.0128s] [ 50%] 2025-12-04T13:35:25.9469398Z test_ops.py::TestCompositeComplianceCUDA::test_operator_sinc_cuda_float32 PASSED [0.0052s] [ 50%] 2025-12-04T13:35:25.9469512Z test_ops.py::TestCompositeComplianceCUDA::test_operator_sinh_cuda_float32 PASSED [0.0029s] [ 50%] 2025-12-04T13:35:25.9469633Z test_ops.py::TestCompositeComplianceCUDA::test_operator_softmax_cuda_float32 PASSED [0.0070s] [ 50%] 2025-12-04T13:35:25.9469788Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_chebyshev_polynomial_t_cuda_float32 PASSED [0.0135s] [ 50%] 2025-12-04T13:35:25.9469941Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_chebyshev_polynomial_u_cuda_float32 PASSED [0.0135s] [ 50%] 2025-12-04T13:35:25.9470101Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_entr_cuda_float32 PASSED [0.7516s] [ 50%] 2025-12-04T13:35:25.9470263Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_i0_cuda_float32 PASSED [0.0058s] [ 50%] 2025-12-04T13:35:25.9470410Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_k1_cuda_float32 PASSED [0.7515s] [ 50%] 2025-12-04T13:35:25.9470540Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_ndtr_cuda_float32 PASSED [0.0069s] [ 50%] 2025-12-04T13:35:25.9470696Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_scaled_modified_bessel_k1_cuda_float32 PASSED [0.0044s] [ 50%] 2025-12-04T13:35:25.9470860Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_shifted_chebyshev_polynomial_v_cuda_float32 PASSED [0.0126s] [ 50%] 2025-12-04T13:35:25.9470993Z test_ops.py::TestCompositeComplianceCUDA::test_operator_special_xlog1py_cuda_float32 PASSED [0.0123s] [ 50%] 2025-12-04T13:35:25.9471113Z test_ops.py::TestCompositeComplianceCUDA::test_operator_split_cuda_float32 PASSED [0.7452s] [ 50%] 2025-12-04T13:35:25.9471248Z test_ops.py::TestCompositeComplianceCUDA::test_operator_squeeze_cuda_float32 PASSED [0.0101s] [ 50%] 2025-12-04T13:35:25.9471374Z test_ops.py::TestCompositeComplianceCUDA::test_operator_std_unbiased_cuda_float32 PASSED [0.7372s] [ 50%] 2025-12-04T13:35:25.9471490Z test_ops.py::TestCompositeComplianceCUDA::test_operator_t_cuda_float32 PASSED [0.0058s] [ 50%] 2025-12-04T13:35:25.9471606Z test_ops.py::TestCompositeComplianceCUDA::test_operator_take_cuda_float32 PASSED [0.0126s] [ 50%] 2025-12-04T13:35:25.9471731Z test_ops.py::TestCompositeComplianceCUDA::test_operator_transpose_cuda_float32 PASSED [0.0084s] [ 50%] 2025-12-04T13:35:25.9471863Z test_ops.py::TestCompositeComplianceCUDA::test_operator_triangular_solve_cuda_float32 PASSED [0.0331s] [ 50%] 2025-12-04T13:35:25.9471980Z test_ops.py::TestCompositeComplianceCUDA::test_operator_tril_cuda_float32 PASSED [0.0086s] [ 50%] 2025-12-04T13:35:25.9472096Z test_ops.py::TestCompositeComplianceCUDA::test_operator_triu_cuda_float32 PASSED [0.0081s] [ 50%] 2025-12-04T13:35:25.9472217Z test_ops.py::TestCompositeComplianceCUDA::test_operator_unbind_cuda_float32 PASSED [0.0147s] [ 50%] 2025-12-04T13:35:25.9472340Z test_ops.py::TestCompositeComplianceCUDA::test_operator_unflatten_cuda_float32 PASSED [0.0096s] [ 50%] 2025-12-04T13:35:25.9472468Z test_ops.py::TestCompositeComplianceCUDA::test_operator_unfold_copy_cuda_float32 PASSED [0.7584s] [ 51%] 2025-12-04T13:35:25.9472599Z test_ops.py::TestCompositeComplianceCUDA::test_operator_unsqueeze_copy_cuda_float32 PASSED [0.0108s] [ 51%] 2025-12-04T13:35:25.9472726Z test_ops.py::TestCompositeComplianceCUDA::test_operator_var_unbiased_cuda_float32 PASSED [0.7392s] [ 51%] 2025-12-04T13:35:25.9472855Z test_ops.py::TestCompositeComplianceCUDA::test_operator_view_as_complex_cuda_float32 PASSED [0.0043s] [ 51%] 2025-12-04T13:35:25.9472973Z test_ops.py::TestCompositeComplianceCUDA::test_operator_view_cuda_float32 PASSED [0.0083s] [ 51%] 2025-12-04T13:35:25.9473097Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rdiv___cuda_float32 PASSED [0.0037s] [ 51%] 2025-12-04T13:35:25.9473219Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rmod___cuda_float32 PASSED [0.0034s] [ 51%] 2025-12-04T13:35:25.9473341Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rmul___cuda_float32 PASSED [0.0033s] [ 51%] 2025-12-04T13:35:25.9473459Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_add_cuda_float32 PASSED [0.7508s] [ 51%] 2025-12-04T13:35:25.9473580Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addbmm_cuda_float32 PASSED [0.0046s] [ 51%] 2025-12-04T13:35:25.9473702Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addcdiv_cuda_float32 PASSED [0.7378s] [ 51%] 2025-12-04T13:35:25.9473823Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addmm_cuda_float32 PASSED [0.0045s] [ 51%] 2025-12-04T13:35:25.9473979Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addmm_decomposed_cuda_float32 PASSED [0.7370s] [ 51%] 2025-12-04T13:35:25.9474108Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addr_cuda_float32 PASSED [0.0040s] [ 51%] 2025-12-04T13:35:25.9474230Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_allclose_cuda_float32 PASSED [0.7359s] [ 51%] 2025-12-04T13:35:25.9474350Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_angle_cuda_float32 PASSED [0.0029s] [ 51%] 2025-12-04T13:35:25.9474469Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_argmin_cuda_float32 PASSED [0.7345s] [ 51%] 2025-12-04T13:35:25.9474596Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bernoulli_cuda_float32 PASSED [0.0040s] [ 51%] 2025-12-04T13:35:25.9474717Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cauchy_cuda_float32 PASSED [0.7431s] [ 51%] 2025-12-04T13:35:25.9474838Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cdouble_cuda_float32 PASSED [0.0034s] [ 51%] 2025-12-04T13:35:25.9474959Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cfloat_cuda_float32 PASSED [0.7671s] [ 51%] 2025-12-04T13:35:25.9475107Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cholesky_inverse_cuda_float32 PASSED [0.0071s] [ 51%] 2025-12-04T13:35:25.9475226Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_clone_cuda_float32 PASSED [0.7363s] [ 51%] 2025-12-04T13:35:25.9475347Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_complex_cuda_float32 PASSED [0.0040s] [ 51%] 2025-12-04T13:35:25.9475481Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_constant_pad_nd_cuda_float32 PASSED [0.7448s] [ 51%] 2025-12-04T13:35:25.9475610Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_contiguous_cuda_float32 PASSED [0.0030s] [ 51%] 2025-12-04T13:35:25.9475733Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_corrcoef_cuda_float32 PASSED [0.7346s] [ 51%] 2025-12-04T13:35:25.9475851Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cos_cuda_float32 PASSED [0.0032s] [ 51%] 2025-12-04T13:35:25.9475975Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cummin_cuda_float32 PASSED [0.7558s] [ 51%] 2025-12-04T13:35:25.9476095Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cumsum_cuda_float32 PASSED [0.0034s] [ 52%] 2025-12-04T13:35:25.9476214Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diff_cuda_float32 PASSED [0.7439s] [ 52%] 2025-12-04T13:35:25.9476334Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_digamma_cuda_float32 PASSED [0.0031s] [ 52%] 2025-12-04T13:35:25.9476454Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_double_cuda_float32 PASSED [0.7384s] [ 52%] 2025-12-04T13:35:25.9476572Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dstack_cuda_float32 PASSED [0.0032s] [ 52%] 2025-12-04T13:35:25.9476691Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_einsum_cuda_float32 PASSED [0.7264s] [ 52%] 2025-12-04T13:35:25.9476820Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_empty_like_cuda_float32 PASSED [0.0037s] [ 52%] 2025-12-04T13:35:25.9476946Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_fft2_cuda_float32 PASSED [0.7350s] [ 52%] 2025-12-04T13:35:25.9477070Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifft_cuda_float32 PASSED [0.0044s] [ 52%] 2025-12-04T13:35:25.9477204Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifftshift_cuda_float32 PASSED [0.7245s] [ 52%] 2025-12-04T13:35:25.9477330Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_irfft2_cuda_float32 PASSED [0.0039s] [ 52%] 2025-12-04T13:35:25.9477449Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fill_cuda_float32 PASSED [0.7282s] [ 52%] 2025-12-04T13:35:25.9477568Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_flipud_cuda_float32 PASSED [0.0030s] [ 52%] 2025-12-04T13:35:25.9477687Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_floor_cuda_float32 PASSED [0.7338s] [ 52%] 2025-12-04T13:35:25.9477830Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fmax_cuda_float32 PASSED [0.0043s] [ 52%] 2025-12-04T13:35:25.9477958Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_frac_cuda_float32 PASSED [0.7302s] [ 52%] 2025-12-04T13:35:25.9478115Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_frexp_cuda_float32 PASSED [0.0033s] [ 52%] 2025-12-04T13:35:25.9478250Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_grid_sampler_2d_cuda_float32 PASSED [0.7359s] [ 52%] 2025-12-04T13:35:25.9478369Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hypot_cuda_float32 PASSED [0.0044s] [ 52%] 2025-12-04T13:35:25.9478488Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_igamma_cuda_float32 PASSED [0.0035s] [ 52%] 2025-12-04T13:35:25.9478615Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_copy_cuda_float32 PASSED [0.7257s] [ 52%] 2025-12-04T13:35:25.9478741Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_put_cuda_float32 PASSED [0.0041s] [ 52%] 2025-12-04T13:35:25.9478880Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_reduce_amax_cuda_float32 PASSED [0.7400s] [ 52%] 2025-12-04T13:35:25.9479031Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_reduce_mean_cuda_float32 PASSED [0.0043s] [ 52%] 2025-12-04T13:35:25.9479168Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_reduce_prod_cuda_float32 PASSED [0.7327s] [ 52%] 2025-12-04T13:35:25.9479297Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_select_cuda_float32 PASSED [0.0033s] [ 52%] 2025-12-04T13:35:25.9479447Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_2inputs_2outputs_cuda_float32 PASSED [0.0057s] [ 52%] 2025-12-04T13:35:25.9479605Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_4inputs_with_extra_args_cuda_float32 PASSED [0.0037s] [ 52%] 2025-12-04T13:35:25.9479741Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_unary_cuda_float32 PASSED [0.0032s] [ 52%] 2025-12-04T13:35:25.9479882Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_ldl_factor_ex_cuda_float32 PASSED [0.0030s] [ 52%] 2025-12-04T13:35:25.9480103Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_pinv_singular_cuda_float32 SKIPPED [0.0006s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 53%] 2025-12-04T13:35:25.9480237Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_slogdet_cuda_float32 PASSED [0.7405s] [ 53%] 2025-12-04T13:35:25.9480368Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_solve_cuda_float32 PASSED [0.0119s] [ 53%] 2025-12-04T13:35:25.9480514Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_solve_triangular_cuda_float32 PASSED [0.0223s] [ 53%] 2025-12-04T13:35:25.9480648Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_tensorinv_cuda_float32 PASSED [0.7376s] [ 53%] 2025-12-04T13:35:25.9480781Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_vecdot_cuda_float32 PASSED [0.0085s] [ 53%] 2025-12-04T13:35:25.9480918Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_vector_norm_cuda_float32 PASSED [0.7553s] [ 53%] 2025-12-04T13:35:25.9481045Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_normal_cuda_float32 PASSED [0.0102s] [ 53%] 2025-12-04T13:35:25.9481173Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_softmax_cuda_float32 PASSED [0.7343s] [ 53%] 2025-12-04T13:35:25.9481319Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_softmax_with_dtype_cuda_float32 PASSED [0.0037s] [ 53%] 2025-12-04T13:35:25.9481448Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logical_not_cuda_float32 PASSED [0.7348s] [ 53%] 2025-12-04T13:35:25.9481575Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logsumexp_cuda_float32 PASSED [0.0056s] [ 53%] 2025-12-04T13:35:25.9481699Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lu_unpack_cuda_float32 PASSED [0.0106s] [ 53%] 2025-12-04T13:35:25.9481840Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mT_cuda_float32 PASSED [0.7458s] [ 53%] 2025-12-04T13:35:25.9481989Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_logaddexp_cuda_float32 PASSED [0.0057s] [ 53%] 2025-12-04T13:35:25.9482125Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_logsumexp_cuda_float32 PASSED [0.7524s] [ 53%] 2025-12-04T13:35:25.9482258Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_normalize_cuda_float32 PASSED [0.0086s] [ 53%] 2025-12-04T13:35:25.9482390Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_scatter_cuda_float32 PASSED [0.7299s] [ 53%] 2025-12-04T13:35:25.9482520Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_softmax_cuda_float32 PASSED [0.0074s] [ 53%] 2025-12-04T13:35:25.9482678Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_pool2d_with_indices_backward_cuda_float32 PASSED [0.9339s] [ 53%] 2025-12-04T13:35:25.9482806Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nanmedian_cuda_float32 PASSED [0.0050s] [ 53%] 2025-12-04T13:35:25.9482934Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ne_cuda_float32 PASSED [0.0035s] [ 53%] 2025-12-04T13:35:25.9483095Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_max_pool1d_cuda_float32 PASSED [0.7438s] [ 53%] 2025-12-04T13:35:25.9483253Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_max_pool3d_cuda_float32 PASSED [0.0053s] [ 53%] 2025-12-04T13:35:25.9483414Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_binary_cross_entropy_cuda_float32 PASSED [0.7387s] [ 53%] 2025-12-04T13:35:25.9483569Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv_transpose1d_cuda_float32 PASSED [0.0057s] [ 53%] 2025-12-04T13:35:25.9483716Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_dropout3d_cuda_float32 PASSED [0.7634s] [ 53%] 2025-12-04T13:35:25.9483856Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_gelu_cuda_float32 PASSED [0.0039s] [ 53%] 2025-12-04T13:35:25.9484015Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_bicubic_cuda_float32 PASSED [0.7414s] [ 53%] 2025-12-04T13:35:25.9484163Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_leaky_relu_cuda_float32 PASSED [0.0040s] [ 53%] 2025-12-04T13:35:25.9484305Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_linear_cuda_float32 PASSED [0.7420s] [ 54%] 2025-12-04T13:35:25.9484467Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_replicate_negative_cuda_float32 PASSED [0.0114s] [ 54%] 2025-12-04T13:35:25.9484610Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_rms_norm_cuda_float32 PASSED [0.8015s] [ 54%] 2025-12-04T13:35:25.9484751Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_rrelu_cuda_float32 PASSED [0.0043s] [ 54%] 2025-12-04T13:35:25.9484922Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_scaled_dot_product_attention_cuda_float32 PASSED [0.8184s] [ 54%] 2025-12-04T13:35:25.9485066Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softmin_cuda_float32 PASSED [0.0039s] [ 54%] 2025-12-04T13:35:25.9485224Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softmin_with_dtype_cuda_float32 PASSED [0.8013s] [ 54%] 2025-12-04T13:35:25.9485368Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softplus_cuda_float32 PASSED [0.0034s] [ 54%] 2025-12-04T13:35:25.9485495Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nonzero_cuda_float32 PASSED [0.0063s] [ 54%] 2025-12-04T13:35:25.9485655Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nonzero_static_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 54%] 2025-12-04T13:35:25.9485797Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_norm_fro_cuda_float32 PASSED [0.7989s] [ 54%] 2025-12-04T13:35:25.9485933Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_norm_nuc_cuda_float32 PASSED [0.0043s] [ 54%] 2025-12-04T13:35:25.9486057Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_permute_cuda_float32 PASSED [0.7957s] [ 54%] 2025-12-04T13:35:25.9486181Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_positive_cuda_float32 PASSED [0.0030s] [ 54%] 2025-12-04T13:35:25.9486312Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_randint_like_cuda_float32 PASSED [0.8069s] [ 54%] 2025-12-04T13:35:25.9486432Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_randn_cuda_float32 PASSED [0.0034s] [ 54%] 2025-12-04T13:35:25.9486552Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_repeat_cuda_float32 PASSED [0.0071s] [ 54%] 2025-12-04T13:35:25.9486684Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_resolve_conj_cuda_float32 PASSED [0.8052s] [ 54%] 2025-12-04T13:35:25.9486804Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_rot90_cuda_float32 PASSED [0.0071s] [ 54%] 2025-12-04T13:35:25.9486937Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_cuda_float32 PASSED [0.8011s] [ 54%] 2025-12-04T13:35:25.9487058Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_cuda_float32 PASSED [0.0082s] [ 54%] 2025-12-04T13:35:25.9487206Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_bartlett_cuda_float32 PASSED [0.7972s] [ 54%] 2025-12-04T13:35:25.9487348Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_cosine_cuda_float32 PASSED [0.0042s] [ 54%] 2025-12-04T13:35:25.9487491Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_hamming_cuda_float32 PASSED [0.8006s] [ 54%] 2025-12-04T13:35:25.9487611Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sort_cuda_float32 PASSED [0.0085s] [ 54%] 2025-12-04T13:35:25.9487747Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_airy_ai_cuda_float32 PASSED [0.8006s] [ 54%] 2025-12-04T13:35:25.9487877Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_i1_cuda_float32 PASSED [0.0045s] [ 54%] 2025-12-04T13:35:25.9488047Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_polygamma_special_polygamma_n_0_cuda_float32 PASSED [0.7827s] [ 54%] 2025-12-04T13:35:25.9488281Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_scaled_modified_bessel_k0_cuda_float32 PASSED [0.0049s] [ 54%] 2025-12-04T13:35:25.9488446Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_shifted_chebyshev_polynomial_t_cuda_float32 PASSED [0.0056s] [ 54%] 2025-12-04T13:35:25.9488611Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_shifted_chebyshev_polynomial_u_cuda_float32 PASSED [0.0036s] [ 55%] 2025-12-04T13:35:25.9488746Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_list_args_cuda_float32 PASSED [0.0075s] [ 55%] 2025-12-04T13:35:25.9488884Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_with_sizes_cuda_float32 PASSED [0.8111s] [ 55%] 2025-12-04T13:35:25.9489010Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_std_mean_cuda_float32 PASSED [0.0046s] [ 55%] 2025-12-04T13:35:25.9489148Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_std_mean_unbiased_cuda_float32 PASSED [0.8087s] [ 55%] 2025-12-04T13:35:25.9489264Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_t_cuda_float32 PASSED [0.0050s] [ 55%] 2025-12-04T13:35:25.9489394Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tensor_split_cuda_float32 PASSED [0.8182s] [ 55%] 2025-12-04T13:35:25.9489554Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_torch_ops_aten__safe_softmax_default_cuda_float32 PASSED [0.0048s] [ 55%] 2025-12-04T13:35:25.9489686Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_transpose_copy_cuda_float32 PASSED [0.8084s] [ 55%] 2025-12-04T13:35:25.9489846Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_transpose_cuda_float32 PASSED [0.0082s] [ 55%] 2025-12-04T13:35:25.9489978Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_trapz_cuda_float32 PASSED [0.7948s] [ 55%] 2025-12-04T13:35:25.9490106Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unflatten_cuda_float32 PASSED [0.0088s] [ 55%] 2025-12-04T13:35:25.9490227Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_uniform_cuda_float32 PASSED [0.7974s] [ 55%] 2025-12-04T13:35:25.9490367Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unique_consecutive_cuda_float32 PASSED [0.0595s] [ 55%] 2025-12-04T13:35:25.9490487Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_vstack_cuda_float32 PASSED [0.8031s] [ 55%] 2025-12-04T13:35:25.9490607Z test_ops.py::TestCompositeComplianceCUDA::test_view_replay_zeros_cuda_float32 PASSED [0.0031s] [ 55%] 2025-12-04T13:35:25.9490715Z test_ops.py::TestMathBitsCUDA::test_conj_view___getitem___cuda_complex64 PASSED [0.8342s] [ 55%] 2025-12-04T13:35:25.9490825Z test_ops.py::TestMathBitsCUDA::test_conj_view___rmatmul___cuda_complex64 PASSED [0.0286s] [ 55%] 2025-12-04T13:35:25.9490967Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_bfloat16_cuda_complex64 PASSED [0.8102s] [ 55%] 2025-12-04T13:35:25.9491091Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_char_cuda_complex64 PASSED [0.0046s] [ 55%] 2025-12-04T13:35:25.9491198Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_addr_cuda_complex64 PASSED [0.8085s] [ 55%] 2025-12-04T13:35:25.9491311Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_alias_copy_cuda_complex64 PASSED [0.0044s] [ 55%] 2025-12-04T13:35:25.9491416Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_any_cuda_complex64 PASSED [0.8231s] [ 55%] 2025-12-04T13:35:25.9491603Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_as_strided_partial_views_cuda_complex64 SKIPPED [0.0002s] (Errors when storage_offset is included) [ 55%] 2025-12-04T13:35:25.9491710Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atan_cuda_complex64 PASSED [0.8031s] [ 55%] 2025-12-04T13:35:25.9491827Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_column_stack_cuda_complex64 PASSED [0.8043s] [ 55%] 2025-12-04T13:35:25.9491934Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_conj_cuda_complex64 PASSED [0.0050s] [ 55%] 2025-12-04T13:35:25.9492095Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_empty_like_cuda_complex64 SKIPPED [0.0002s] (Expected: empty is not comparable) [ 55%] 2025-12-04T13:35:25.9492201Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_exp2_cuda_complex64 PASSED [0.0053s] [ 55%] 2025-12-04T13:35:25.9492306Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_exp_cuda_complex64 PASSED [0.8085s] [ 56%] 2025-12-04T13:35:25.9492416Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_fft_cuda_complex64 PASSED [0.0072s] [ 56%] 2025-12-04T13:35:25.9492526Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_hfftn_cuda_complex64 PASSED [0.8238s] [ 56%] 2025-12-04T13:35:25.9492636Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fliplr_cuda_complex64 PASSED [0.0042s] [ 56%] 2025-12-04T13:35:25.9492744Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_hsplit_cuda_complex64 PASSED [0.7965s] [ 56%] 2025-12-04T13:35:25.9492852Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_hstack_cuda_complex64 PASSED [0.0049s] [ 56%] 2025-12-04T13:35:25.9492962Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_index_add_cuda_complex64 PASSED [0.7995s] [ 56%] 2025-12-04T13:35:25.9493073Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isclose_cuda_complex64 PASSED [0.0289s] [ 56%] 2025-12-04T13:35:25.9493177Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_item_cuda_complex64 PASSED [0.0034s] [ 56%] 2025-12-04T13:35:25.9493299Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_diagonal_cuda_complex64 PASSED [0.8291s] [ 56%] 2025-12-04T13:35:25.9493405Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_log_cuda_complex64 PASSED [0.0051s] [ 56%] 2025-12-04T13:35:25.9493546Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logical_or_cuda_complex64 PASSED [0.0067s] [ 56%] 2025-12-04T13:35:25.9493665Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logspace_cuda_complex64 XFAIL [0.0022s] [ 56%] 2025-12-04T13:35:25.9493798Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logspace_tensor_overload_cuda_complex64 XFAIL [0.0033s] [ 56%] 2025-12-04T13:35:25.9493933Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_meshgrid_variadic_tensors_cuda_complex64 PASSED [0.7966s] [ 56%] 2025-12-04T13:35:25.9494074Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_channel_shuffle_cuda_complex64 PASSED [0.8202s] [ 56%] 2025-12-04T13:35:25.9494214Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_pixel_unshuffle_cuda_complex64 PASSED [0.0049s] [ 56%] 2025-12-04T13:35:25.9494358Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_softmax_with_dtype_cuda_complex64 PASSED [0.8287s] [ 56%] 2025-12-04T13:35:25.9494494Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_tanhshrink_cuda_complex64 PASSED [0.0045s] [ 56%] 2025-12-04T13:35:25.9494639Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_triplet_margin_loss_cuda_complex64 PASSED [0.8106s] [ 56%] 2025-12-04T13:35:25.9494761Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_permute_cuda_complex64 PASSED [0.0058s] [ 56%] 2025-12-04T13:35:25.9494874Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_reciprocal_cuda_complex64 PASSED [0.0037s] [ 56%] 2025-12-04T13:35:25.9494982Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_rsub_cuda_complex64 PASSED [0.8103s] [ 56%] 2025-12-04T13:35:25.9495092Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sigmoid_cuda_complex64 PASSED [0.0062s] [ 56%] 2025-12-04T13:35:25.9495229Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_special_softmax_with_dtype_cuda_complex64 PASSED [0.8001s] [ 56%] 2025-12-04T13:35:25.9495334Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_stft_cuda_complex64 PASSED [0.0164s] [ 56%] 2025-12-04T13:35:25.9495443Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sub_cuda_complex64 PASSED [0.0082s] [ 56%] 2025-12-04T13:35:25.9495550Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sum_cuda_complex64 PASSED [0.0104s] [ 56%] 2025-12-04T13:35:25.9495659Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_t_copy_cuda_complex64 PASSED [0.8064s] [ 56%] 2025-12-04T13:35:25.9495763Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_triu_cuda_complex64 PASSED [0.0070s] [ 56%] 2025-12-04T13:35:25.9495877Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_true_divide_cuda_complex64 PASSED [0.0076s] [ 57%] 2025-12-04T13:35:25.9495988Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unflatten_cuda_complex64 PASSED [0.0052s] [ 57%] 2025-12-04T13:35:25.9496099Z test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unsqueeze_cuda_complex64 PASSED [0.0064s] [ 57%] 2025-12-04T13:35:25.9496241Z test_ops.py::TestMathBitsCUDA::test_conj_view__unsafe_masked_index_put_accumulate_cuda_complex64 PASSED [0.8310s] [ 57%] 2025-12-04T13:35:25.9496347Z test_ops.py::TestMathBitsCUDA::test_conj_view_addmv_cuda_complex64 PASSED [0.0147s] [ 57%] 2025-12-04T13:35:25.9496448Z test_ops.py::TestMathBitsCUDA::test_conj_view_any_cuda_complex64 PASSED [0.8060s] [ 57%] 2025-12-04T13:35:25.9496554Z test_ops.py::TestMathBitsCUDA::test_conj_view_argwhere_cuda_complex64 PASSED [0.0051s] [ 57%] 2025-12-04T13:35:25.9496722Z test_ops.py::TestMathBitsCUDA::test_conj_view_as_strided_partial_views_cuda_complex64 SKIPPED [0.0002s] (Test changes in memory layout) [ 57%] 2025-12-04T13:35:25.9496828Z test_ops.py::TestMathBitsCUDA::test_conj_view_block_diag_cuda_complex64 PASSED [0.8172s] [ 57%] 2025-12-04T13:35:25.9496943Z test_ops.py::TestMathBitsCUDA::test_conj_view_broadcast_tensors_cuda_complex64 PASSED [0.0043s] [ 57%] 2025-12-04T13:35:25.9497052Z test_ops.py::TestMathBitsCUDA::test_conj_view_broadcast_to_cuda_complex64 PASSED [0.8111s] [ 57%] 2025-12-04T13:35:25.9497174Z test_ops.py::TestMathBitsCUDA::test_conj_view_cfloat_cuda_complex64 PASSED [0.0099s] [ 57%] 2025-12-04T13:35:25.9497285Z test_ops.py::TestMathBitsCUDA::test_conj_view_chalf_cuda_complex64 PASSED [0.8118s] [ 57%] 2025-12-04T13:35:25.9497392Z test_ops.py::TestMathBitsCUDA::test_conj_view_cholesky_cuda_complex64 PASSED [0.0404s] [ 57%] 2025-12-04T13:35:25.9497507Z test_ops.py::TestMathBitsCUDA::test_conj_view_cholesky_inverse_cuda_complex64 PASSED [0.0256s] [ 57%] 2025-12-04T13:35:25.9497620Z test_ops.py::TestMathBitsCUDA::test_conj_view_constant_pad_nd_cuda_complex64 PASSED [0.8495s] [ 57%] 2025-12-04T13:35:25.9497723Z test_ops.py::TestMathBitsCUDA::test_conj_view_corrcoef_cuda_complex64 PASSED [0.0193s] [ 57%] 2025-12-04T13:35:25.9497847Z test_ops.py::TestMathBitsCUDA::test_conj_view_cumulative_trapezoid_cuda_complex64 PASSED [0.8180s] [ 57%] 2025-12-04T13:35:25.9497958Z test_ops.py::TestMathBitsCUDA::test_conj_view_diagonal_copy_cuda_complex64 PASSED [0.0215s] [ 57%] 2025-12-04T13:35:25.9498128Z test_ops.py::TestMathBitsCUDA::test_conj_view_diagonal_cuda_complex64 PASSED [0.8166s] [ 57%] 2025-12-04T13:35:25.9498244Z test_ops.py::TestMathBitsCUDA::test_conj_view_diagonal_scatter_cuda_complex64 PASSED [0.0245s] [ 57%] 2025-12-04T13:35:25.9498365Z test_ops.py::TestMathBitsCUDA::test_conj_view_dist_cuda_complex64 PASSED [0.8642s] [ 57%] 2025-12-04T13:35:25.9498463Z test_ops.py::TestMathBitsCUDA::test_conj_view_eq_cuda_complex64 PASSED [0.0059s] [ 57%] 2025-12-04T13:35:25.9498567Z test_ops.py::TestMathBitsCUDA::test_conj_view_fft_fftn_cuda_complex64 PASSED [0.8403s] [ 57%] 2025-12-04T13:35:25.9498669Z test_ops.py::TestMathBitsCUDA::test_conj_view_fft_hfft_cuda_complex64 PASSED [0.8228s] [ 57%] 2025-12-04T13:35:25.9498780Z test_ops.py::TestMathBitsCUDA::test_conj_view_fft_ifftshift_cuda_complex64 PASSED [0.8016s] [ 57%] 2025-12-04T13:35:25.9498884Z test_ops.py::TestMathBitsCUDA::test_conj_view_fft_irfft_cuda_complex64 PASSED [0.0111s] [ 57%] 2025-12-04T13:35:25.9498988Z test_ops.py::TestMathBitsCUDA::test_conj_view_flipud_cuda_complex64 PASSED [0.0043s] [ 57%] 2025-12-04T13:35:25.9499092Z test_ops.py::TestMathBitsCUDA::test_conj_view_gather_cuda_complex64 PASSED [0.8179s] [ 57%] 2025-12-04T13:35:25.9499193Z test_ops.py::TestMathBitsCUDA::test_conj_view_geqrf_cuda_complex64 PASSED [0.0256s] [ 57%] 2025-12-04T13:35:25.9499296Z test_ops.py::TestMathBitsCUDA::test_conj_view_gradient_cuda_complex64 PASSED [0.8103s] [ 58%] 2025-12-04T13:35:25.9499395Z test_ops.py::TestMathBitsCUDA::test_conj_view_hsplit_cuda_complex64 PASSED [0.0051s] [ 58%] 2025-12-04T13:35:25.9499495Z test_ops.py::TestMathBitsCUDA::test_conj_view_hstack_cuda_complex64 PASSED [0.8196s] [ 58%] 2025-12-04T13:35:25.9499598Z test_ops.py::TestMathBitsCUDA::test_conj_view_isclose_cuda_complex64 PASSED [0.0103s] [ 58%] 2025-12-04T13:35:25.9499702Z test_ops.py::TestMathBitsCUDA::test_conj_view_isfinite_cuda_complex64 PASSED [0.8112s] [ 58%] 2025-12-04T13:35:25.9499831Z test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_2inputs_2outputs_cuda_complex64 XFAIL [0.0061s] [ 58%] 2025-12-04T13:35:25.9499932Z test_ops.py::TestMathBitsCUDA::test_conj_view_kron_cuda_complex64 PASSED [1.5882s] [ 58%] 2025-12-04T13:35:25.9500031Z test_ops.py::TestMathBitsCUDA::test_conj_view_lerp_cuda_complex64 PASSED [0.0436s] [ 58%] 2025-12-04T13:35:25.9500141Z test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_eig_cuda_complex64 PASSED [0.0769s] [ 58%] 2025-12-04T13:35:25.9500257Z test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_ldl_factor_cuda_complex64 PASSED [1.2989s] [ 58%] 2025-12-04T13:35:25.9500363Z test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lu_cuda_complex64 PASSED [0.0306s] [ 58%] 2025-12-04T13:35:25.9500490Z test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_solve_triangular_cuda_complex64 PASSED [0.4232s] [ 58%] 2025-12-04T13:35:25.9500603Z test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_svdvals_cuda_complex64 PASSED [0.0344s] [ 58%] 2025-12-04T13:35:25.9500719Z test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vector_norm_cuda_complex64 PASSED [0.1580s] [ 58%] 2025-12-04T13:35:25.9500870Z test_ops.py::TestMathBitsCUDA::test_conj_view_logspace_tensor_overload_cuda_complex64 XFAIL [0.0032s] [ 58%] 2025-12-04T13:35:25.9500990Z test_ops.py::TestMathBitsCUDA::test_conj_view_logsumexp_cuda_complex64 PASSED [2.5481s] [ 58%] 2025-12-04T13:35:25.9501105Z test_ops.py::TestMathBitsCUDA::test_conj_view_masked_cumprod_cuda_complex64 PASSED [0.0313s] [ 58%] 2025-12-04T13:35:25.9501215Z test_ops.py::TestMathBitsCUDA::test_conj_view_masked_cumsum_cuda_complex64 PASSED [0.0239s] [ 58%] 2025-12-04T13:35:25.9501324Z test_ops.py::TestMathBitsCUDA::test_conj_view_masked_fill_cuda_complex64 PASSED [0.0158s] [ 58%] 2025-12-04T13:35:25.9501437Z test_ops.py::TestMathBitsCUDA::test_conj_view_masked_logsumexp_cuda_complex64 PASSED [0.1184s] [ 58%] 2025-12-04T13:35:25.9501545Z test_ops.py::TestMathBitsCUDA::test_conj_view_masked_mean_cuda_complex64 PASSED [0.1113s] [ 58%] 2025-12-04T13:35:25.9501657Z test_ops.py::TestMathBitsCUDA::test_conj_view_masked_scatter_cuda_complex64 PASSED [0.0105s] [ 58%] 2025-12-04T13:35:25.9501764Z test_ops.py::TestMathBitsCUDA::test_conj_view_masked_sum_cuda_complex64 PASSED [0.0986s] [ 58%] 2025-12-04T13:35:25.9501873Z test_ops.py::TestMathBitsCUDA::test_conj_view_matrix_exp_cuda_complex64 PASSED [1.2725s] [ 58%] 2025-12-04T13:35:25.9501990Z test_ops.py::TestMathBitsCUDA::test_conj_view_narrow_copy_cuda_complex64 PASSED [0.0056s] [ 58%] 2025-12-04T13:35:25.9502128Z test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv_transpose2d_cuda_complex64 PASSED [1.3147s] [ 58%] 2025-12-04T13:35:25.9502286Z test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_feature_alpha_dropout_without_train_cuda_complex64 PASSED [0.0217s] [ 58%] 2025-12-04T13:35:25.9502407Z test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_l1_loss_cuda_complex64 PASSED [1.2898s] [ 58%] 2025-12-04T13:35:25.9502535Z test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_replicate_cuda_complex64 PASSED [0.0178s] [ 58%] 2025-12-04T13:35:25.9502666Z test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pixel_shuffle_cuda_complex64 PASSED [0.0070s] [ 58%] 2025-12-04T13:35:25.9502798Z test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pixel_unshuffle_cuda_complex64 PASSED [0.0068s] [ 59%] 2025-12-04T13:35:25.9502937Z test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_softmin_with_dtype_cuda_complex64 PASSED [0.0080s] [ 59%] 2025-12-04T13:35:25.9503092Z test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_triplet_margin_with_distance_loss_cuda_complex64 PASSED [1.2620s] [ 59%] 2025-12-04T13:35:25.9503237Z test_ops.py::TestMathBitsCUDA::test_conj_view_nonzero_static_cuda_complex64 SKIPPED [0.0012s] (Only runs on cpu) [ 59%] 2025-12-04T13:35:25.9503338Z test_ops.py::TestMathBitsCUDA::test_conj_view_norm_cuda_complex64 PASSED [1.3014s] [ 59%] 2025-12-04T13:35:25.9503444Z test_ops.py::TestMathBitsCUDA::test_conj_view_norm_nuc_cuda_complex64 PASSED [0.0090s] [ 59%] 2025-12-04T13:35:25.9503555Z test_ops.py::TestMathBitsCUDA::test_conj_view_normal_in_place_cuda_complex64 XFAIL [0.0039s] [ 59%] 2025-12-04T13:35:25.9503664Z test_ops.py::TestMathBitsCUDA::test_conj_view_pinverse_cuda_complex64 PASSED [0.0199s] [ 59%] 2025-12-04T13:35:25.9503763Z test_ops.py::TestMathBitsCUDA::test_conj_view_put_cuda_complex64 PASSED [1.2722s] [ 59%] 2025-12-04T13:35:25.9503860Z test_ops.py::TestMathBitsCUDA::test_conj_view_qr_cuda_complex64 PASSED [0.0285s] [ 59%] 2025-12-04T13:35:25.9503963Z test_ops.py::TestMathBitsCUDA::test_conj_view_repeat_cuda_complex64 PASSED [1.2928s] [ 59%] 2025-12-04T13:35:25.9504080Z test_ops.py::TestMathBitsCUDA::test_conj_view_repeat_interleave_cuda_complex64 PASSED [0.0115s] [ 59%] 2025-12-04T13:35:25.9504183Z test_ops.py::TestMathBitsCUDA::test_conj_view_reshape_cuda_complex64 PASSED [1.2776s] [ 59%] 2025-12-04T13:35:25.9504288Z test_ops.py::TestMathBitsCUDA::test_conj_view_resize_as__cuda_complex64 PASSED [0.0057s] [ 59%] 2025-12-04T13:35:25.9504400Z test_ops.py::TestMathBitsCUDA::test_conj_view_resolve_conj_cuda_complex64 PASSED [1.2597s] [ 59%] 2025-12-04T13:35:25.9504525Z test_ops.py::TestMathBitsCUDA::test_conj_view_scatter_add_cuda_complex64 PASSED [0.0172s] [ 59%] 2025-12-04T13:35:25.9504640Z test_ops.py::TestMathBitsCUDA::test_conj_view_scatter_cuda_complex64 PASSED [1.2984s] [ 59%] 2025-12-04T13:35:25.9504744Z test_ops.py::TestMathBitsCUDA::test_conj_view_select_cuda_complex64 PASSED [0.0117s] [ 59%] 2025-12-04T13:35:25.9504884Z test_ops.py::TestMathBitsCUDA::test_conj_view_sparse_sampled_addmm_cuda_complex64 SKIPPED [0.0002s] (Skipped!) [ 59%] 2025-12-04T13:35:25.9504993Z test_ops.py::TestMathBitsCUDA::test_conj_view_squeeze_copy_cuda_complex64 PASSED [1.2748s] [ 59%] 2025-12-04T13:35:25.9505097Z test_ops.py::TestMathBitsCUDA::test_conj_view_std_mean_cuda_complex64 PASSED [0.0115s] [ 59%] 2025-12-04T13:35:25.9505195Z test_ops.py::TestMathBitsCUDA::test_conj_view_stft_cuda_complex64 PASSED [1.2658s] [ 59%] 2025-12-04T13:35:25.9505302Z test_ops.py::TestMathBitsCUDA::test_conj_view_tensor_split_cuda_complex64 PASSED [0.0112s] [ 59%] 2025-12-04T13:35:25.9505409Z test_ops.py::TestMathBitsCUDA::test_conj_view_transpose_cuda_complex64 PASSED [1.2717s] [ 59%] 2025-12-04T13:35:25.9505517Z test_ops.py::TestMathBitsCUDA::test_conj_view_unsqueeze_cuda_complex64 PASSED [0.0172s] [ 59%] 2025-12-04T13:35:25.9505633Z test_ops.py::TestMathBitsCUDA::test_conj_view_where_cuda_complex64 PASSED [1.2526s] [ 59%] 2025-12-04T13:35:25.9505740Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_T_cuda_complex128 PASSED [0.0041s] [ 59%] 2025-12-04T13:35:25.9505873Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_cfloat_cuda_complex128 PASSED [1.2729s] [ 59%] 2025-12-04T13:35:25.9506003Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_chalf_cuda_complex128 PASSED [0.0042s] [ 59%] 2025-12-04T13:35:25.9506133Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_short_cuda_complex128 PASSED [1.2759s] [ 60%] 2025-12-04T13:35:25.9506243Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_abs_cuda_complex128 PASSED [0.0045s] [ 60%] 2025-12-04T13:35:25.9506360Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_addcmul_cuda_complex128 PASSED [1.2707s] [ 60%] 2025-12-04T13:35:25.9506478Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_allclose_cuda_complex128 PASSED [0.0040s] [ 60%] 2025-12-04T13:35:25.9506658Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_as_strided_copy_cuda_complex128 SKIPPED [0.0002s] (Errors when storage_offset is included) [ 60%] 2025-12-04T13:35:25.9506769Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_asin_cuda_complex128 PASSED [1.2468s] [ 60%] 2025-12-04T13:35:25.9506880Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atanh_cuda_complex128 PASSED [0.0063s] [ 60%] 2025-12-04T13:35:25.9507000Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_contiguous_cuda_complex128 PASSED [1.2624s] [ 60%] 2025-12-04T13:35:25.9507131Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_div_no_rounding_mode_cuda_complex128 PASSED [0.0046s] [ 60%] 2025-12-04T13:35:25.9507244Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_dstack_cuda_complex128 PASSED [0.0041s] [ 60%] 2025-12-04T13:35:25.9507412Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_empty_like_cuda_complex128 SKIPPED [0.0002s] (Expected: empty is not comparable) [ 60%] 2025-12-04T13:35:25.9507593Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_empty_strided_cuda_complex128 SKIPPED [0.0001s] (Expected: empty_strided is not comparable) [ 60%] 2025-12-04T13:35:25.9507704Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_eq_cuda_complex128 PASSED [0.0026s] [ 60%] 2025-12-04T13:35:25.9507816Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_expand_cuda_complex128 PASSED [1.2789s] [ 60%] 2025-12-04T13:35:25.9507930Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_fft_cuda_complex128 PASSED [0.6247s] [ 60%] 2025-12-04T13:35:25.9508045Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_irfft_cuda_complex128 PASSED [1.5701s] [ 60%] 2025-12-04T13:35:25.9508233Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_irfftn_cuda_complex128 PASSED [1.4042s] [ 60%] 2025-12-04T13:35:25.9508355Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fill_cuda_complex128 PASSED [1.2607s] [ 60%] 2025-12-04T13:35:25.9508479Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_index_select_cuda_complex128 PASSED [0.0041s] [ 60%] 2025-12-04T13:35:25.9508594Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isclose_cuda_complex128 PASSED [1.2806s] [ 60%] 2025-12-04T13:35:25.9508707Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isinf_cuda_complex128 PASSED [0.0041s] [ 60%] 2025-12-04T13:35:25.9508819Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_istft_cuda_complex128 PASSED [1.2776s] [ 60%] 2025-12-04T13:35:25.9508942Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_cross_cuda_complex128 PASSED [1.2676s] [ 60%] 2025-12-04T13:35:25.9509052Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log2_cuda_complex128 PASSED [0.0060s] [ 60%] 2025-12-04T13:35:25.9509187Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log_softmax_with_dtype_cuda_complex128 PASSED [1.2629s] [ 60%] 2025-12-04T13:35:25.9509322Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_and_cuda_complex128 PASSED [0.0040s] [ 60%] 2025-12-04T13:35:25.9509441Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_xor_cuda_complex128 PASSED [1.2867s] [ 60%] 2025-12-04T13:35:25.9509559Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logsumexp_cuda_complex128 PASSED [0.0061s] [ 60%] 2025-12-04T13:35:25.9509678Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_masked_fill_cuda_complex128 PASSED [1.2691s] [ 60%] 2025-12-04T13:35:25.9509817Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_meshgrid_list_of_tensors_cuda_complex128 PASSED [0.0045s] [ 60%] 2025-12-04T13:35:25.9509959Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_meshgrid_variadic_tensors_cuda_complex128 PASSED [1.2710s] [ 61%] 2025-12-04T13:35:25.9510073Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_neg_cuda_complex128 PASSED [0.0045s] [ 61%] 2025-12-04T13:35:25.9510261Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_empty_strided_cuda_complex128 SKIPPED [0.0002s] (Expected: empty_strided is not comparable) [ 61%] 2025-12-04T13:35:25.9510380Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_full_cuda_complex128 PASSED [1.2888s] [ 61%] 2025-12-04T13:35:25.9510495Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_zeros_cuda_complex128 PASSED [0.0040s] [ 61%] 2025-12-04T13:35:25.9510649Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_log_softmax_with_dtype_cuda_complex128 PASSED [1.2842s] [ 61%] 2025-12-04T13:35:25.9510760Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_prod_cuda_complex128 PASSED [0.0064s] [ 61%] 2025-12-04T13:35:25.9510870Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sinc_cuda_complex128 PASSED [1.2959s] [ 61%] 2025-12-04T13:35:25.9511003Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_softmax_with_dtype_cuda_complex128 PASSED [0.0052s] [ 61%] 2025-12-04T13:35:25.9511131Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_split_with_sizes_cuda_complex128 PASSED [1.2383s] [ 61%] 2025-12-04T13:35:25.9511241Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_t_cuda_complex128 PASSED [0.0040s] [ 61%] 2025-12-04T13:35:25.9511349Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_tanh_cuda_complex128 PASSED [1.2611s] [ 61%] 2025-12-04T13:35:25.9511461Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_trace_cuda_complex128 PASSED [0.0041s] [ 61%] 2025-12-04T13:35:25.9511580Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unbind_copy_cuda_complex128 PASSED [1.2414s] [ 61%] 2025-12-04T13:35:25.9511693Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unbind_cuda_complex128 PASSED [0.0048s] [ 61%] 2025-12-04T13:35:25.9511800Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_acosh_cuda_complex128 PASSED [1.2518s] [ 61%] 2025-12-04T13:35:25.9511928Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_angle_cuda_complex128 PASSED [0.0072s] [ 61%] 2025-12-04T13:35:25.9512049Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_as_strided_cuda_complex128 PASSED [1.2508s] [ 61%] 2025-12-04T13:35:25.9512156Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atanh_cuda_complex128 PASSED [0.0062s] [ 61%] 2025-12-04T13:35:25.9512266Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atleast_2d_cuda_complex128 PASSED [1.2678s] [ 61%] 2025-12-04T13:35:25.9512372Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_bool_cuda_complex128 PASSED [0.0037s] [ 61%] 2025-12-04T13:35:25.9512489Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cartesian_prod_cuda_complex128 PASSED [1.2383s] [ 61%] 2025-12-04T13:35:25.9512594Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cat_cuda_complex128 PASSED [0.0068s] [ 61%] 2025-12-04T13:35:25.9512698Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_char_cuda_complex128 PASSED [1.2450s] [ 61%] 2025-12-04T13:35:25.9512816Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cholesky_solve_cuda_complex128 PASSED [0.0073s] [ 61%] 2025-12-04T13:35:25.9512943Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_combinations_cuda_complex128 PASSED [1.2554s] [ 61%] 2025-12-04T13:35:25.9513053Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_einsum_cuda_complex128 PASSED [0.0057s] [ 61%] 2025-12-04T13:35:25.9513172Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_eye_cuda_complex128 SKIPPED [0.0002s] (Skipped!) [ 61%] 2025-12-04T13:35:25.9513286Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_ifftshift_cuda_complex128 PASSED [1.2308s] [ 61%] 2025-12-04T13:35:25.9513397Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_irfft2_cuda_complex128 PASSED [0.0127s] [ 61%] 2025-12-04T13:35:25.9513503Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_geqrf_cuda_complex128 PASSED [1.2683s] [ 62%] 2025-12-04T13:35:25.9513614Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_isfinite_cuda_complex128 PASSED [0.0035s] [ 62%] 2025-12-04T13:35:25.9513720Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_kron_cuda_complex128 PASSED [1.2424s] [ 62%] 2025-12-04T13:35:25.9513837Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_inv_ex_cuda_complex128 PASSED [0.0190s] [ 62%] 2025-12-04T13:35:25.9513958Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_ldl_factor_cuda_complex128 PASSED [1.3263s] [ 62%] 2025-12-04T13:35:25.9514164Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_ldl_solve_cuda_complex128 SKIPPED [0.0012s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 62%] 2025-12-04T13:35:25.9514278Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lstsq_cuda_complex128 PASSED [1.2769s] [ 62%] 2025-12-04T13:35:25.9514402Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_rank_cuda_complex128 PASSED [0.0056s] [ 62%] 2025-12-04T13:35:25.9514538Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_rank_hermitian_cuda_complex128 PASSED [1.2822s] [ 62%] 2025-12-04T13:35:25.9514672Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_solve_triangular_cuda_complex128 PASSED [0.0073s] [ 62%] 2025-12-04T13:35:25.9514780Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_log2_cuda_complex128 PASSED [1.2610s] [ 62%] 2025-12-04T13:35:25.9514894Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logcumsumexp_cuda_complex128 PASSED [0.0189s] [ 62%] 2025-12-04T13:35:25.9515007Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_not_cuda_complex128 PASSED [1.2504s] [ 62%] 2025-12-04T13:35:25.9515119Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_or_cuda_complex128 PASSED [0.0055s] [ 62%] 2025-12-04T13:35:25.9515254Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_meshgrid_variadic_tensors_cuda_complex128 PASSED [1.2632s] [ 62%] 2025-12-04T13:35:25.9515363Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_movedim_cuda_complex128 PASSED [0.0056s] [ 62%] 2025-12-04T13:35:25.9515469Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mul_cuda_complex128 PASSED [1.2659s] [ 62%] 2025-12-04T13:35:25.9515592Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mv_cuda_complex128 PASSED [0.0065s] [ 62%] 2025-12-04T13:35:25.9515711Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nanmean_cuda_complex128 PASSED [1.2832s] [ 62%] 2025-12-04T13:35:25.9515819Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_narrow_cuda_complex128 PASSED [0.0058s] [ 62%] 2025-12-04T13:35:25.9516003Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_empty_strided_cuda_complex128 SKIPPED [0.0002s] (Expected: new_empty_strided is not comparable) [ 62%] 2025-12-04T13:35:25.9516112Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_ones_cuda_complex128 PASSED [1.2473s] [ 62%] 2025-12-04T13:35:25.9516223Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_zeros_cuda_complex128 PASSED [0.0041s] [ 62%] 2025-12-04T13:35:25.9516348Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv2d_cuda_complex128 PASSED [1.3279s] [ 62%] 2025-12-04T13:35:25.9516492Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv_transpose2d_cuda_complex128 PASSED [1.2508s] [ 62%] 2025-12-04T13:35:25.9516627Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_circular_cuda_complex128 PASSED [0.0056s] [ 62%] 2025-12-04T13:35:25.9516776Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pixel_unshuffle_cuda_complex128 PASSED [1.2523s] [ 62%] 2025-12-04T13:35:25.9516905Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_softsign_cuda_complex128 PASSED [0.0096s] [ 62%] 2025-12-04T13:35:25.9517031Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_unfold_cuda_complex128 PASSED [0.0033s] [ 62%] 2025-12-04T13:35:25.9517149Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_normal_in_place_cuda_complex128 XFAIL [0.0038s] [ 62%] 2025-12-04T13:35:25.9517256Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_outer_cuda_complex128 PASSED [2.4917s] [ 63%] 2025-12-04T13:35:25.9517366Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_positive_cuda_complex128 PASSED [0.0056s] [ 63%] 2025-12-04T13:35:25.9517472Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_qr_cuda_complex128 PASSED [1.2282s] [ 63%] 2025-12-04T13:35:25.9517584Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_randn_like_cuda_complex128 PASSED [0.0044s] [ 63%] 2025-12-04T13:35:25.9517707Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_repeat_interleave_cuda_complex128 PASSED [1.2372s] [ 63%] 2025-12-04T13:35:25.9517814Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_rsqrt_cuda_complex128 PASSED [0.0059s] [ 63%] 2025-12-04T13:35:25.9517925Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_scatter_add_cuda_complex128 PASSED [1.2491s] [ 63%] 2025-12-04T13:35:25.9518036Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_scatter_cuda_complex128 PASSED [0.0079s] [ 63%] 2025-12-04T13:35:25.9518171Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sgn_cuda_complex128 PASSED [1.3049s] [ 63%] 2025-12-04T13:35:25.9518291Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_split_list_args_cuda_complex128 PASSED [0.0044s] [ 63%] 2025-12-04T13:35:25.9518397Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_stack_cuda_complex128 PASSED [1.3109s] [ 63%] 2025-12-04T13:35:25.9518508Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_std_mean_cuda_complex128 PASSED [0.0044s] [ 63%] 2025-12-04T13:35:25.9518619Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sum_to_size_cuda_complex128 PASSED [1.2671s] [ 63%] 2025-12-04T13:35:25.9518724Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_svd_cuda_complex128 PASSED [0.0044s] [ 63%] 2025-12-04T13:35:25.9518831Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_take_cuda_complex128 PASSED [1.2508s] [ 63%] 2025-12-04T13:35:25.9518943Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tensordot_cuda_complex128 PASSED [0.0072s] [ 63%] 2025-12-04T13:35:25.9519053Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_to_sparse_cuda_complex128 PASSED [1.2625s] [ 63%] 2025-12-04T13:35:25.9519168Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unsafe_chunk_cuda_complex128 PASSED [0.0043s] [ 63%] 2025-12-04T13:35:25.9519317Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_mean_cuda_complex128 PASSED [1.2833s] [ 63%] 2025-12-04T13:35:25.9519444Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_view_as_cuda_complex128 PASSED [0.0056s] [ 63%] 2025-12-04T13:35:25.9519553Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_view_copy_cuda_complex128 PASSED [1.2722s] [ 63%] 2025-12-04T13:35:25.9519659Z test_ops.py::TestMathBitsCUDA::test_neg_conj_view_view_cuda_complex128 PASSED [0.0054s] [ 63%] 2025-12-04T13:35:25.9519762Z test_ops.py::TestMathBitsCUDA::test_neg_view___getitem___cuda_float64 PASSED [1.2910s] [ 63%] 2025-12-04T13:35:25.9519867Z test_ops.py::TestMathBitsCUDA::test_neg_view___rmatmul___cuda_float64 PASSED [0.0260s] [ 63%] 2025-12-04T13:35:25.9519966Z test_ops.py::TestMathBitsCUDA::test_neg_view___rmod___cuda_float64 PASSED [0.0129s] [ 63%] 2025-12-04T13:35:25.9520086Z test_ops.py::TestMathBitsCUDA::test_neg_view__batch_norm_with_update_cuda_float64 PASSED [1.2884s] [ 63%] 2025-12-04T13:35:25.9520207Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_char_cuda_float64 PASSED [0.0047s] [ 63%] 2025-12-04T13:35:25.9520346Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_complex_cuda_float64 PASSED [0.0059s] [ 63%] 2025-12-04T13:35:25.9520463Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_int_cuda_float64 PASSED [1.2504s] [ 63%] 2025-12-04T13:35:25.9520582Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_polar_cuda_float64 PASSED [0.0080s] [ 64%] 2025-12-04T13:35:25.9520687Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_acosh_cuda_float64 PASSED [1.2514s] [ 64%] 2025-12-04T13:35:25.9520794Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_allclose_cuda_float64 PASSED [0.0119s] [ 64%] 2025-12-04T13:35:25.9520895Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_any_cuda_float64 PASSED [1.2600s] [ 64%] 2025-12-04T13:35:25.9520998Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_asinh_cuda_float64 PASSED [0.0044s] [ 64%] 2025-12-04T13:35:25.9521101Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atan2_cuda_float64 PASSED [0.0070s] [ 64%] 2025-12-04T13:35:25.9521205Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atanh_cuda_float64 PASSED [1.2867s] [ 64%] 2025-12-04T13:35:25.9521308Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_chunk_cuda_float64 PASSED [0.0064s] [ 64%] 2025-12-04T13:35:25.9521415Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_clamp_max_cuda_float64 PASSED [0.0052s] [ 64%] 2025-12-04T13:35:25.9521517Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_clone_cuda_float64 PASSED [1.2657s] [ 64%] 2025-12-04T13:35:25.9521627Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_column_stack_cuda_float64 PASSED [0.0059s] [ 64%] 2025-12-04T13:35:25.9521733Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_deg2rad_cuda_float64 PASSED [1.2493s] [ 64%] 2025-12-04T13:35:25.9521852Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_div_no_rounding_mode_cuda_float64 PASSED [0.0091s] [ 64%] 2025-12-04T13:35:25.9521957Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_equal_cuda_float64 PASSED [1.2949s] [ 64%] 2025-12-04T13:35:25.9522059Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_erf_cuda_float64 PASSED [0.0045s] [ 64%] 2025-12-04T13:35:25.9522162Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_erfc_cuda_float64 PASSED [1.2886s] [ 64%] 2025-12-04T13:35:25.9522262Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_exp2_cuda_float64 PASSED [0.0064s] [ 64%] 2025-12-04T13:35:25.9522368Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_fftn_cuda_float64 PASSED [1.2716s] [ 64%] 2025-12-04T13:35:25.9522474Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_hfft2_cuda_float64 PASSED [0.6334s] [ 64%] 2025-12-04T13:35:25.9522580Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_rfft_cuda_float64 PASSED [1.2856s] [ 64%] 2025-12-04T13:35:25.9522686Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_rfftn_cuda_float64 PASSED [1.2775s] [ 64%] 2025-12-04T13:35:25.9522807Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_flip_cuda_float64 PASSED [0.0070s] [ 64%] 2025-12-04T13:35:25.9522920Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fmax_cuda_float64 PASSED [0.0068s] [ 64%] 2025-12-04T13:35:25.9523020Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fmin_cuda_float64 PASSED [0.0062s] [ 64%] 2025-12-04T13:35:25.9523131Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_select_cuda_float64 PASSED [1.2927s] [ 64%] 2025-12-04T13:35:25.9523236Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isclose_cuda_float64 PASSED [0.0282s] [ 64%] 2025-12-04T13:35:25.9523337Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_le_cuda_float64 PASSED [0.0058s] [ 64%] 2025-12-04T13:35:25.9523437Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_lerp_cuda_float64 PASSED [1.2853s] [ 64%] 2025-12-04T13:35:25.9523559Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_matrix_norm_cuda_float64 PASSED [0.0470s] [ 64%] 2025-12-04T13:35:25.9523671Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_norm_cuda_float64 PASSED [0.0528s] [ 64%] 2025-12-04T13:35:25.9523784Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_svd_cuda_float64 PASSED [1.4353s] [ 65%] 2025-12-04T13:35:25.9523907Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_vecdot_cuda_float64 PASSED [0.0175s] [ 65%] 2025-12-04T13:35:25.9524034Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linspace_tensor_overload_cuda_float64 XFAIL [0.0034s] [ 65%] 2025-12-04T13:35:25.9524143Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logical_and_cuda_float64 PASSED [1.3005s] [ 65%] 2025-12-04T13:35:25.9524244Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_mul_cuda_float64 PASSED [0.0073s] [ 65%] 2025-12-04T13:35:25.9524353Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_narrow_copy_cuda_float64 PASSED [1.2934s] [ 65%] 2025-12-04T13:35:25.9524472Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_celu_cuda_float64 PASSED [0.0053s] [ 65%] 2025-12-04T13:35:25.9524594Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_l1_loss_cuda_float64 PASSED [1.2815s] [ 65%] 2025-12-04T13:35:25.9524737Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_margin_ranking_loss_cuda_float64 PASSED [0.0124s] [ 65%] 2025-12-04T13:35:25.9524872Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_pixel_unshuffle_cuda_float64 PASSED [0.0038s] [ 65%] 2025-12-04T13:35:25.9524974Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_ones_cuda_float64 XFAIL [0.0025s] [ 65%] 2025-12-04T13:35:25.9525081Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_reshape_cuda_float64 PASSED [1.2917s] [ 65%] 2025-12-04T13:35:25.9525185Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_round_cuda_float64 PASSED [1.2736s] [ 65%] 2025-12-04T13:35:25.9525288Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_rsub_cuda_float64 PASSED [0.0080s] [ 65%] 2025-12-04T13:35:25.9525408Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_softmax_with_dtype_cuda_float64 PASSED [0.0094s] [ 65%] 2025-12-04T13:35:25.9525529Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_bessel_j0_cuda_float64 PASSED [1.2678s] [ 65%] 2025-12-04T13:35:25.9525642Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_logit_cuda_float64 PASSED [0.0061s] [ 65%] 2025-12-04T13:35:25.9525787Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_1_cuda_float64 PASSED [1.2941s] [ 65%] 2025-12-04T13:35:25.9525929Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_3_cuda_float64 PASSED [0.0085s] [ 65%] 2025-12-04T13:35:25.9526031Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sqrt_cuda_float64 PASSED [1.2739s] [ 65%] 2025-12-04T13:35:25.9526135Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_square_cuda_float64 PASSED [0.0054s] [ 65%] 2025-12-04T13:35:25.9526236Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_std_cuda_float64 PASSED [1.2688s] [ 65%] 2025-12-04T13:35:25.9526360Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_t_copy_cuda_float64 PASSED [1.2742s] [ 65%] 2025-12-04T13:35:25.9526476Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_trace_cuda_float64 PASSED [0.0041s] [ 65%] 2025-12-04T13:35:25.9526584Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_transpose_cuda_float64 PASSED [1.2725s] [ 65%] 2025-12-04T13:35:25.9526687Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_trunc_cuda_float64 PASSED [0.0044s] [ 65%] 2025-12-04T13:35:25.9526790Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unfold_cuda_float64 PASSED [1.2665s] [ 65%] 2025-12-04T13:35:25.9526898Z test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unsqueeze_cuda_float64 PASSED [0.0086s] [ 65%] 2025-12-04T13:35:25.9527012Z test_ops.py::TestMathBitsCUDA::test_neg_view__unsafe_masked_index_cuda_float64 PASSED [1.2762s] [ 65%] 2025-12-04T13:35:25.9527131Z test_ops.py::TestMathBitsCUDA::test_neg_view__upsample_bilinear2d_aa_cuda_float64 XFAIL [0.0073s] [ 65%] 2025-12-04T13:35:25.9527230Z test_ops.py::TestMathBitsCUDA::test_neg_view_acosh_cuda_float64 PASSED [2.5943s] [ 66%] 2025-12-04T13:35:25.9527327Z test_ops.py::TestMathBitsCUDA::test_neg_view_add_cuda_float64 PASSED [0.0171s] [ 66%] 2025-12-04T13:35:25.9527434Z test_ops.py::TestMathBitsCUDA::test_neg_view_addmm_cuda_float64 PASSED [1.2789s] [ 66%] 2025-12-04T13:35:25.9527548Z test_ops.py::TestMathBitsCUDA::test_neg_view_addmm_decomposed_cuda_float64 PASSED [0.0113s] [ 66%] 2025-12-04T13:35:25.9527651Z test_ops.py::TestMathBitsCUDA::test_neg_view_alias_copy_cuda_float64 PASSED [1.2774s] [ 66%] 2025-12-04T13:35:25.9527746Z test_ops.py::TestMathBitsCUDA::test_neg_view_arange_cuda_float64 XFAIL [0.0033s] [ 66%] 2025-12-04T13:35:25.9527841Z test_ops.py::TestMathBitsCUDA::test_neg_view_atanh_cuda_float64 PASSED [1.2824s] [ 66%] 2025-12-04T13:35:25.9527943Z test_ops.py::TestMathBitsCUDA::test_neg_view_atleast_1d_cuda_float64 PASSED [0.0105s] [ 66%] 2025-12-04T13:35:25.9528054Z test_ops.py::TestMathBitsCUDA::test_neg_view_broadcast_tensors_cuda_float64 PASSED [1.2393s] [ 66%] 2025-12-04T13:35:25.9528184Z test_ops.py::TestMathBitsCUDA::test_neg_view_ceil_cuda_float64 PASSED [0.0057s] [ 66%] 2025-12-04T13:35:25.9528289Z test_ops.py::TestMathBitsCUDA::test_neg_view_clamp_min_cuda_float64 PASSED [0.0159s] [ 66%] 2025-12-04T13:35:25.9528387Z test_ops.py::TestMathBitsCUDA::test_neg_view_complex_cuda_float64 PASSED [0.0127s] [ 66%] 2025-12-04T13:35:25.9528484Z test_ops.py::TestMathBitsCUDA::test_neg_view_cumsum_cuda_float64 PASSED [0.0069s] [ 66%] 2025-12-04T13:35:25.9528578Z test_ops.py::TestMathBitsCUDA::test_neg_view_diag_cuda_float64 PASSED [1.2650s] [ 66%] 2025-12-04T13:35:25.9528672Z test_ops.py::TestMathBitsCUDA::test_neg_view_diff_cuda_float64 PASSED [0.1199s] [ 66%] 2025-12-04T13:35:25.9528785Z test_ops.py::TestMathBitsCUDA::test_neg_view_div_no_rounding_mode_cuda_float64 PASSED [0.0151s] [ 66%] 2025-12-04T13:35:25.9528882Z test_ops.py::TestMathBitsCUDA::test_neg_view_erfinv_cuda_float64 PASSED [1.2501s] [ 66%] 2025-12-04T13:35:25.9528979Z test_ops.py::TestMathBitsCUDA::test_neg_view_fft_fft_cuda_float64 PASSED [0.0524s] [ 66%] 2025-12-04T13:35:25.9529079Z test_ops.py::TestMathBitsCUDA::test_neg_view_fft_hfft_cuda_float64 PASSED [2.4697s] [ 66%] 2025-12-04T13:35:25.9529179Z test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ifft_cuda_float64 PASSED [1.3210s] [ 66%] 2025-12-04T13:35:25.9529279Z test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ihfft_cuda_float64 PASSED [0.0162s] [ 66%] 2025-12-04T13:35:25.9529379Z test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ihfftn_cuda_float64 PASSED [1.3669s] [ 66%] 2025-12-04T13:35:25.9529477Z test_ops.py::TestMathBitsCUDA::test_neg_view_fft_rfft_cuda_float64 PASSED [1.3143s] [ 66%] 2025-12-04T13:35:25.9529573Z test_ops.py::TestMathBitsCUDA::test_neg_view_fliplr_cuda_float64 PASSED [0.0061s] [ 66%] 2025-12-04T13:35:25.9529669Z test_ops.py::TestMathBitsCUDA::test_neg_view_flipud_cuda_float64 PASSED [0.0040s] [ 66%] 2025-12-04T13:35:25.9529763Z test_ops.py::TestMathBitsCUDA::test_neg_view_floor_cuda_float64 PASSED [1.3223s] [ 66%] 2025-12-04T13:35:25.9529883Z test_ops.py::TestMathBitsCUDA::test_neg_view_fmax_cuda_float64 PASSED [0.0177s] [ 66%] 2025-12-04T13:35:25.9530002Z test_ops.py::TestMathBitsCUDA::test_neg_view_full_like_cuda_float64 PASSED [1.2760s] [ 66%] 2025-12-04T13:35:25.9530098Z test_ops.py::TestMathBitsCUDA::test_neg_view_geqrf_cuda_float64 PASSED [0.0245s] [ 66%] 2025-12-04T13:35:25.9530207Z test_ops.py::TestMathBitsCUDA::test_neg_view_grid_sampler_2d_cuda_float64 PASSED [1.2908s] [ 66%] 2025-12-04T13:35:25.9530311Z test_ops.py::TestMathBitsCUDA::test_neg_view_hash_tensor_cuda_float64 PASSED [0.0078s] [ 67%] 2025-12-04T13:35:25.9530408Z test_ops.py::TestMathBitsCUDA::test_neg_view_hsplit_cuda_float64 PASSED [1.2571s] [ 67%] 2025-12-04T13:35:25.9530500Z test_ops.py::TestMathBitsCUDA::test_neg_view_i0_cuda_float64 PASSED [0.0108s] [ 67%] 2025-12-04T13:35:25.9530596Z test_ops.py::TestMathBitsCUDA::test_neg_view_int_cuda_float64 PASSED [1.2734s] [ 67%] 2025-12-04T13:35:25.9530730Z test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_4inputs_with_extra_args_cuda_float64 XFAIL [0.1198s] [ 67%] 2025-12-04T13:35:25.9530828Z test_ops.py::TestMathBitsCUDA::test_neg_view_lgamma_cuda_float64 PASSED [2.5186s] [ 67%] 2025-12-04T13:35:25.9530950Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_eigvalsh_cuda_float64 PASSED [0.0159s] [ 67%] 2025-12-04T13:35:25.9531171Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_inv_ex_cuda_float64 PASSED [0.0578s] [ 67%] 2025-12-04T13:35:25.9531368Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_ldl_solve_cuda_float64 SKIPPED [0.0010s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 67%] 2025-12-04T13:35:25.9531477Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lstsq_cuda_float64 PASSED [1.4916s] [ 67%] 2025-12-04T13:35:25.9531588Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_factor_cuda_float64 PASSED [0.0383s] [ 67%] 2025-12-04T13:35:25.9531694Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_norm_cuda_float64 PASSED [0.1082s] [ 67%] 2025-12-04T13:35:25.9531888Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_pinv_singular_cuda_float64 SKIPPED [0.0011s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 67%] 2025-12-04T13:35:25.9531999Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_solve_ex_cuda_float64 PASSED [0.0153s] [ 67%] 2025-12-04T13:35:25.9532101Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_svd_cuda_float64 PASSED [1.4157s] [ 67%] 2025-12-04T13:35:25.9532213Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_tensorinv_cuda_float64 PASSED [0.0098s] [ 67%] 2025-12-04T13:35:25.9532318Z test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_vander_cuda_float64 PASSED [1.2978s] [ 67%] 2025-12-04T13:35:25.9532438Z test_ops.py::TestMathBitsCUDA::test_neg_view_linspace_tensor_overload_cuda_float64 XFAIL [0.0048s] [ 67%] 2025-12-04T13:35:25.9532533Z test_ops.py::TestMathBitsCUDA::test_neg_view_log10_cuda_float64 PASSED [2.5352s] [ 67%] 2025-12-04T13:35:25.9532629Z test_ops.py::TestMathBitsCUDA::test_neg_view_logdet_cuda_float64 PASSED [0.0191s] [ 67%] 2025-12-04T13:35:25.9532724Z test_ops.py::TestMathBitsCUDA::test_neg_view_lu_cuda_float64 PASSED [1.3055s] [ 67%] 2025-12-04T13:35:25.9532825Z test_ops.py::TestMathBitsCUDA::test_neg_view_lu_solve_cuda_float64 PASSED [0.2080s] [ 67%] 2025-12-04T13:35:25.9532920Z test_ops.py::TestMathBitsCUDA::test_neg_view_mT_cuda_float64 PASSED [1.2832s] [ 67%] 2025-12-04T13:35:25.9533027Z test_ops.py::TestMathBitsCUDA::test_neg_view_masked_argmax_cuda_float64 PASSED [0.0253s] [ 67%] 2025-12-04T13:35:25.9533134Z test_ops.py::TestMathBitsCUDA::test_neg_view_masked_argmin_cuda_float64 PASSED [0.0232s] [ 67%] 2025-12-04T13:35:25.9533236Z test_ops.py::TestMathBitsCUDA::test_neg_view_matrix_exp_cuda_float64 PASSED [1.2872s] [ 67%] 2025-12-04T13:35:25.9533336Z test_ops.py::TestMathBitsCUDA::test_neg_view_maximum_cuda_float64 PASSED [0.0172s] [ 67%] 2025-12-04T13:35:25.9533432Z test_ops.py::TestMathBitsCUDA::test_neg_view_median_cuda_float64 PASSED [1.2919s] [ 67%] 2025-12-04T13:35:25.9533563Z test_ops.py::TestMathBitsCUDA::test_neg_view_min_binary_cuda_float64 PASSED [0.0172s] [ 67%] 2025-12-04T13:35:25.9533688Z test_ops.py::TestMathBitsCUDA::test_neg_view_min_reduction_no_dim_cuda_float64 PASSED [1.2584s] [ 68%] 2025-12-04T13:35:25.9533784Z test_ops.py::TestMathBitsCUDA::test_neg_view_mode_cuda_float64 PASSED [0.0076s] [ 68%] 2025-12-04T13:35:25.9533881Z test_ops.py::TestMathBitsCUDA::test_neg_view_movedim_cuda_float64 PASSED [1.2489s] [ 68%] 2025-12-04T13:35:25.9534001Z test_ops.py::TestMathBitsCUDA::test_neg_view_mvlgamma_mvlgamma_p_5_cuda_float64 PASSED [0.0176s] [ 68%] 2025-12-04T13:35:25.9534099Z test_ops.py::TestMathBitsCUDA::test_neg_view_new_full_cuda_float64 PASSED [1.2963s] [ 68%] 2025-12-04T13:35:25.9534219Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_avg_pool3d_cuda_float64 PASSED [0.0215s] [ 68%] 2025-12-04T13:35:25.9534330Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_celu_cuda_float64 PASSED [1.2695s] [ 68%] 2025-12-04T13:35:25.9534445Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv1d_cuda_float64 PASSED [0.0242s] [ 68%] 2025-12-04T13:35:25.9534558Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv3d_cuda_float64 PASSED [1.3060s] [ 68%] 2025-12-04T13:35:25.9534698Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv_transpose1d_cuda_float64 PASSED [0.0226s] [ 68%] 2025-12-04T13:35:25.9534835Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_cosine_embedding_loss_cuda_float64 PASSED [1.3612s] [ 68%] 2025-12-04T13:35:25.9534969Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_dropout_cuda_float64 SKIPPED [0.0008s] (Skipped!) [ 68%] 2025-12-04T13:35:25.9535078Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_gelu_cuda_float64 PASSED [1.2665s] [ 68%] 2025-12-04T13:35:25.9535199Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_grid_sample_cuda_float64 PASSED [0.0404s] [ 68%] 2025-12-04T13:35:25.9535333Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_nearest_cuda_float64 PASSED [1.2791s] [ 68%] 2025-12-04T13:35:25.9535448Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_kl_div_cuda_float64 PASSED [0.0292s] [ 68%] 2025-12-04T13:35:25.9535570Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_leaky_relu_cuda_float64 PASSED [1.2694s] [ 68%] 2025-12-04T13:35:25.9535681Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_linear_cuda_float64 PASSED [0.0305s] [ 68%] 2025-12-04T13:35:25.9535800Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_pool3d_cuda_float64 PASSED [1.2935s] [ 68%] 2025-12-04T13:35:25.9535928Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool1d_grad_cuda_float64 PASSED [0.0444s] [ 68%] 2025-12-04T13:35:25.9536050Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool2d_cuda_float64 PASSED [1.6477s] [ 68%] 2025-12-04T13:35:25.9536195Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_multi_head_attention_forward_cuda_float64 PASSED [0.0064s] [ 68%] 2025-12-04T13:35:25.9536313Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_nll_loss_cuda_float64 PASSED [1.3605s] [ 68%] 2025-12-04T13:35:25.9536435Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pad_circular_cuda_float64 PASSED [0.0163s] [ 68%] 2025-12-04T13:35:25.9536564Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pixel_unshuffle_cuda_float64 PASSED [0.0061s] [ 68%] 2025-12-04T13:35:25.9536675Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_rrelu_cuda_float64 PASSED [1.3246s] [ 68%] 2025-12-04T13:35:25.9536820Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_scaled_dot_product_attention_cuda_float64 PASSED [0.0685s] [ 68%] 2025-12-04T13:35:25.9536936Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_softplus_cuda_float64 PASSED [1.3156s] [ 68%] 2025-12-04T13:35:25.9537071Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_triplet_margin_loss_cuda_float64 PASSED [0.0188s] [ 68%] 2025-12-04T13:35:25.9537218Z test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_upsample_nearest_cuda_float64 PASSED [1.3226s] [ 68%] 2025-12-04T13:35:25.9537330Z test_ops.py::TestMathBitsCUDA::test_neg_view_norm_fro_cuda_float64 PASSED [0.0078s] [ 69%] 2025-12-04T13:35:25.9537431Z test_ops.py::TestMathBitsCUDA::test_neg_view_norm_nuc_cuda_float64 PASSED [1.3011s] [ 69%] 2025-12-04T13:35:25.9537555Z test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_1_cuda_float64 PASSED [0.0188s] [ 69%] 2025-12-04T13:35:25.9537677Z test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_2_cuda_float64 PASSED [1.3201s] [ 69%] 2025-12-04T13:35:25.9537796Z test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_4_cuda_float64 PASSED [0.0158s] [ 69%] 2025-12-04T13:35:25.9537898Z test_ops.py::TestMathBitsCUDA::test_neg_view_positive_cuda_float64 PASSED [1.3180s] [ 69%] 2025-12-04T13:35:25.9537997Z test_ops.py::TestMathBitsCUDA::test_neg_view_quantile_cuda_float64 PASSED [0.1103s] [ 69%] 2025-12-04T13:35:25.9538129Z test_ops.py::TestMathBitsCUDA::test_neg_view_real_cuda_float64 PASSED [1.3034s] [ 69%] 2025-12-04T13:35:25.9538245Z test_ops.py::TestMathBitsCUDA::test_neg_view_repeat_interleave_cuda_float64 PASSED [0.0105s] [ 69%] 2025-12-04T13:35:25.9538432Z test_ops.py::TestMathBitsCUDA::test_neg_view_resize__cuda_float64 SKIPPED [0.0012s] (Operation not tested with tensors with negative bit.) [ 69%] 2025-12-04T13:35:25.9538535Z test_ops.py::TestMathBitsCUDA::test_neg_view_resize_as__cuda_float64 PASSED [1.2778s] [ 69%] 2025-12-04T13:35:25.9538643Z test_ops.py::TestMathBitsCUDA::test_neg_view_resolve_conj_cuda_float64 PASSED [0.0064s] [ 69%] 2025-12-04T13:35:25.9538739Z test_ops.py::TestMathBitsCUDA::test_neg_view_rot90_cuda_float64 PASSED [1.3480s] [ 69%] 2025-12-04T13:35:25.9538834Z test_ops.py::TestMathBitsCUDA::test_neg_view_rsub_cuda_float64 PASSED [0.0157s] [ 69%] 2025-12-04T13:35:25.9538940Z test_ops.py::TestMathBitsCUDA::test_neg_view_searchsorted_cuda_float64 PASSED [1.3620s] [ 69%] 2025-12-04T13:35:25.9539049Z test_ops.py::TestMathBitsCUDA::test_neg_view_select_scatter_cuda_float64 PASSED [0.0103s] [ 69%] 2025-12-04T13:35:25.9539187Z test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_bartlett_cuda_float64 SKIPPED [0.0002s] (Skipped!) [ 69%] 2025-12-04T13:35:25.9539331Z test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_exponential_cuda_float64 SKIPPED [0.0001s] (Skipped!) [ 69%] 2025-12-04T13:35:25.9539464Z test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_nuttall_cuda_float64 SKIPPED [0.0001s] (Skipped!) [ 69%] 2025-12-04T13:35:25.9539562Z test_ops.py::TestMathBitsCUDA::test_neg_view_slice_cuda_float64 PASSED [1.3068s] [ 69%] 2025-12-04T13:35:25.9539675Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_bessel_j1_cuda_float64 PASSED [0.0063s] [ 69%] 2025-12-04T13:35:25.9539781Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_entr_cuda_float64 PASSED [1.3078s] [ 69%] 2025-12-04T13:35:25.9539911Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_laguerre_polynomial_l_cuda_float64 PASSED [0.0088s] [ 69%] 2025-12-04T13:35:25.9540023Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_log_ndtr_cuda_float64 PASSED [1.3000s] [ 69%] 2025-12-04T13:35:25.9540148Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_modified_bessel_i0_cuda_float64 PASSED [0.0064s] [ 69%] 2025-12-04T13:35:25.9540272Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_modified_bessel_k0_cuda_float64 PASSED [1.4754s] [ 69%] 2025-12-04T13:35:25.9540406Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_scaled_modified_bessel_k0_cuda_float64 PASSED [0.1815s] [ 69%] 2025-12-04T13:35:25.9540547Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_shifted_chebyshev_polynomial_u_cuda_float64 PASSED [0.0067s] [ 69%] 2025-12-04T13:35:25.9540658Z test_ops.py::TestMathBitsCUDA::test_neg_view_special_xlog1py_cuda_float64 PASSED [0.0164s] [ 69%] 2025-12-04T13:35:25.9540769Z test_ops.py::TestMathBitsCUDA::test_neg_view_squeeze_multiple_cuda_float64 PASSED [0.0096s] [ 69%] 2025-12-04T13:35:25.9540866Z test_ops.py::TestMathBitsCUDA::test_neg_view_stft_cuda_float64 PASSED [1.9700s] [ 70%] 2025-12-04T13:35:25.9540985Z test_ops.py::TestMathBitsCUDA::test_neg_view_sub_cuda_float64 PASSED [1.3094s] [ 70%] 2025-12-04T13:35:25.9541094Z test_ops.py::TestMathBitsCUDA::test_neg_view_t_cuda_float64 PASSED [0.0082s] [ 70%] 2025-12-04T13:35:25.9541202Z test_ops.py::TestMathBitsCUDA::test_neg_view_take_along_dim_cuda_float64 PASSED [1.2996s] [ 70%] 2025-12-04T13:35:25.9541305Z test_ops.py::TestMathBitsCUDA::test_neg_view_tensordot_cuda_float64 PASSED [0.0106s] [ 70%] 2025-12-04T13:35:25.9541398Z test_ops.py::TestMathBitsCUDA::test_neg_view_to_cuda_float64 PASSED [1.3347s] [ 70%] 2025-12-04T13:35:25.9541494Z test_ops.py::TestMathBitsCUDA::test_neg_view_trace_cuda_float64 PASSED [0.0059s] [ 70%] 2025-12-04T13:35:25.9541603Z test_ops.py::TestMathBitsCUDA::test_neg_view_transpose_copy_cuda_float64 PASSED [1.2881s] [ 70%] 2025-12-04T13:35:25.9541705Z test_ops.py::TestMathBitsCUDA::test_neg_view_trapezoid_cuda_float64 PASSED [0.0216s] [ 70%] 2025-12-04T13:35:25.9541800Z test_ops.py::TestMathBitsCUDA::test_neg_view_vdot_cuda_float64 PASSED [1.2739s] [ 70%] 2025-12-04T13:35:25.9541981Z test_ops.py::TestMathBitsCUDA::test_neg_view_view_as_complex_cuda_float64 SKIPPED [0.0017s] (Operation not tested with tensors with negative bit.) [ 70%] 2025-12-04T13:35:25.9542092Z test_ops.py::TestMathBitsCUDA::test_neg_view_view_copy_cuda_float64 PASSED [1.3074s] [ 70%] 2025-12-04T13:35:25.9542186Z test_ops.py::TestMathBitsCUDA::test_neg_view_view_cuda_float64 PASSED [0.0122s] [ 70%] 2025-12-04T13:35:25.9542280Z test_ops.py::TestMathBitsCUDA::test_neg_view_zero__cuda_float64 PASSED [1.3013s] [ 70%] 2025-12-04T13:35:25.9542384Z test_ops.py::TestMathBitsCUDA::test_neg_view_zeros_like_cuda_float64 PASSED [0.0063s] [ 70%] 2025-12-04T13:35:25.9542477Z test_ops.py::TestFakeTensorCUDA::test_fake_T_cuda_float32 PASSED [0.0157s] [ 70%] 2025-12-04T13:35:25.9542576Z test_ops.py::TestFakeTensorCUDA::test_fake___rmul___cuda_float32 PASSED [0.0107s] [ 70%] 2025-12-04T13:35:25.9542673Z test_ops.py::TestFakeTensorCUDA::test_fake___rsub___cuda_float32 PASSED [0.0107s] [ 70%] 2025-12-04T13:35:25.9542792Z test_ops.py::TestFakeTensorCUDA::test_fake__segment_reduce_offsets_cuda_float32 PASSED [0.0780s] [ 70%] 2025-12-04T13:35:25.9542915Z test_ops.py::TestFakeTensorCUDA::test_fake__upsample_bilinear2d_aa_cuda_float32 PASSED [0.0079s] [ 70%] 2025-12-04T13:35:25.9543010Z test_ops.py::TestFakeTensorCUDA::test_fake_acos_cuda_float32 PASSED [0.0045s] [ 70%] 2025-12-04T13:35:25.9543106Z test_ops.py::TestFakeTensorCUDA::test_fake_addbmm_cuda_float32 PASSED [0.0083s] [ 70%] 2025-12-04T13:35:25.9543202Z test_ops.py::TestFakeTensorCUDA::test_fake_addcdiv_cuda_float32 PASSED [0.0142s] [ 70%] 2025-12-04T13:35:25.9543296Z test_ops.py::TestFakeTensorCUDA::test_fake_addr_cuda_float32 PASSED [0.0089s] [ 70%] 2025-12-04T13:35:25.9543390Z test_ops.py::TestFakeTensorCUDA::test_fake_arange_cuda_float32 PASSED [0.0205s] [ 70%] 2025-12-04T13:35:25.9543487Z test_ops.py::TestFakeTensorCUDA::test_fake_argmin_cuda_float32 PASSED [0.0106s] [ 70%] 2025-12-04T13:35:25.9543590Z test_ops.py::TestFakeTensorCUDA::test_fake_atleast_1d_cuda_float32 PASSED [0.0061s] [ 70%] 2025-12-04T13:35:25.9543694Z test_ops.py::TestFakeTensorCUDA::test_fake_atleast_2d_cuda_float32 PASSED [0.0070s] [ 70%] 2025-12-04T13:35:25.9543802Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast___ror___cuda_int64 PASSED [0.0102s] [ 70%] 2025-12-04T13:35:25.9543932Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast__batch_norm_with_update_cuda_float32 PASSED [0.0500s] [ 70%] 2025-12-04T13:35:25.9544064Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast__native_batch_norm_legit_cuda_float32 PASSED [0.0497s] [ 71%] 2025-12-04T13:35:25.9544194Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast__segment_reduce_lengths_cuda_float32 PASSED [0.0804s] [ 71%] 2025-12-04T13:35:25.9544304Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addcdiv_cuda_float32 PASSED [0.0151s] [ 71%] 2025-12-04T13:35:25.9544412Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_amin_cuda_float32 PASSED [0.0159s] [ 71%] 2025-12-04T13:35:25.9544570Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_aminmax_cuda_float32 SKIPPED [0.0010s] (Skip failing test) [ 71%] 2025-12-04T13:35:25.9544690Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_angle_cuda_float32 PASSED [1.3325s] [ 71%] 2025-12-04T13:35:25.9544799Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_argsort_cuda_float32 PASSED [0.0324s] [ 71%] 2025-12-04T13:35:25.9544912Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_argwhere_cuda_float32 PASSED [0.0067s] [ 71%] 2025-12-04T13:35:25.9545025Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atleast_2d_cuda_float32 PASSED [0.0072s] [ 71%] 2025-12-04T13:35:25.9545137Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bfloat16_cuda_float32 PASSED [0.0079s] [ 71%] 2025-12-04T13:35:25.9545250Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_not_cuda_int64 PASSED [0.0045s] [ 71%] 2025-12-04T13:35:25.9545374Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_right_shift_cuda_int64 PASSED [0.0102s] [ 71%] 2025-12-04T13:35:25.9545489Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clamp_min_cuda_float32 PASSED [0.0121s] [ 71%] 2025-12-04T13:35:25.9545606Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clone_cuda_float32 PASSED [0.0035s] [ 71%] 2025-12-04T13:35:25.9545724Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_column_stack_cuda_float32 PASSED [0.0057s] [ 71%] 2025-12-04T13:35:25.9545845Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_constant_pad_nd_cuda_float32 PASSED [0.0370s] [ 71%] 2025-12-04T13:35:25.9545955Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cummax_cuda_float32 PASSED [0.0046s] [ 71%] 2025-12-04T13:35:25.9546064Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cummin_cuda_float32 PASSED [0.0045s] [ 71%] 2025-12-04T13:35:25.9546173Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diag_cuda_float32 PASSED [0.0133s] [ 71%] 2025-12-04T13:35:25.9546290Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagonal_copy_cuda_float32 PASSED [0.0124s] [ 71%] 2025-12-04T13:35:25.9546415Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagonal_scatter_cuda_float32 PASSED [0.0141s] [ 71%] 2025-12-04T13:35:25.9546521Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_dot_cuda_float32 PASSED [1.3382s] [ 71%] 2025-12-04T13:35:25.9546630Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_double_cuda_float32 PASSED [0.0109s] [ 71%] 2025-12-04T13:35:25.9546737Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_dstack_cuda_float32 PASSED [1.3013s] [ 71%] 2025-12-04T13:35:25.9546852Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_expand_as_cuda_float32 PASSED [0.0076s] [ 71%] 2025-12-04T13:35:25.9546956Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_eye_cuda_float32 PASSED [0.0641s] [ 71%] 2025-12-04T13:35:25.9547068Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ifft2_cuda_float32 PASSED [0.0120s] [ 71%] 2025-12-04T13:35:25.9547179Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ihfft_cuda_float32 PASSED [0.0095s] [ 71%] 2025-12-04T13:35:25.9547295Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ihfftn_cuda_float32 PASSED [0.0117s] [ 71%] 2025-12-04T13:35:25.9547406Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_irfft_cuda_float32 PASSED [0.0111s] [ 72%] 2025-12-04T13:35:25.9547519Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_geometric_cuda_float32 PASSED [0.0061s] [ 72%] 2025-12-04T13:35:25.9547657Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_grid_sampler_3d_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 72%] 2025-12-04T13:35:25.9547772Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_hash_tensor_cuda_float32 PASSED [0.0153s] [ 72%] 2025-12-04T13:35:25.9547884Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_heaviside_cuda_float32 PASSED [0.0135s] [ 72%] 2025-12-04T13:35:25.9547991Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_hstack_cuda_float32 PASSED [0.0042s] [ 72%] 2025-12-04T13:35:25.9548131Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_i0_cuda_float32 PASSED [0.0048s] [ 72%] 2025-12-04T13:35:25.9548272Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_igamma_cuda_float32 PASSED [0.0100s] [ 72%] 2025-12-04T13:35:25.9548403Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_put_cuda_float32 PASSED [0.0065s] [ 72%] 2025-12-04T13:35:25.9548526Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_reduce_amin_cuda_float32 PASSED [1.3078s] [ 72%] 2025-12-04T13:35:25.9548678Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_binary_cuda_float32 SKIPPED [0.0018s] (Skip failing test) [ 72%] 2025-12-04T13:35:25.9548785Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_kron_cuda_float32 PASSED [0.0103s] [ 72%] 2025-12-04T13:35:25.9548894Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lgamma_cuda_float32 PASSED [0.0062s] [ 72%] 2025-12-04T13:35:25.9549042Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_eigvalsh_cuda_float32 SKIPPED [0.0011s] (Skip failing test) [ 72%] 2025-12-04T13:35:25.9549159Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_inv_cuda_float32 PASSED [0.0132s] [ 72%] 2025-12-04T13:35:25.9549281Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lu_factor_cuda_float32 PASSED [0.0576s] [ 72%] 2025-12-04T13:35:25.9549414Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lu_solve_cuda_float32 PASSED [0.1016s] [ 72%] 2025-12-04T13:35:25.9549534Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_multi_dot_cuda_float32 PASSED [0.5598s] [ 72%] 2025-12-04T13:35:25.9549680Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_norm_subgradients_at_zero_cuda_float32 PASSED [0.0977s] [ 72%] 2025-12-04T13:35:25.9549822Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_pinv_cuda_float32 SKIPPED [0.0011s] (Skip failing test) [ 72%] 2025-12-04T13:35:25.9549940Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_vecdot_cuda_float32 PASSED [0.0781s] [ 72%] 2025-12-04T13:35:25.9550050Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log2_cuda_float32 PASSED [0.0045s] [ 72%] 2025-12-04T13:35:25.9550166Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logaddexp2_cuda_float32 PASSED [0.0040s] [ 72%] 2025-12-04T13:35:25.9550284Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logcumsumexp_cuda_float32 PASSED [0.0063s] [ 72%] 2025-12-04T13:35:25.9550392Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logdet_cuda_float32 PASSED [0.0201s] [ 72%] 2025-12-04T13:35:25.9550505Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logsumexp_cuda_float32 PASSED [0.0204s] [ 72%] 2025-12-04T13:35:25.9550610Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lu_cuda_float32 PASSED [0.0439s] [ 72%] 2025-12-04T13:35:25.9550724Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_amin_cuda_float32 PASSED [0.1257s] [ 72%] 2025-12-04T13:35:25.9550842Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_cumprod_cuda_float32 PASSED [0.0321s] [ 72%] 2025-12-04T13:35:25.9550965Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_logsumexp_cuda_float32 PASSED [0.1729s] [ 72%] 2025-12-04T13:35:25.9551074Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_matmul_cuda_float32 PASSED [0.0472s] [ 73%] 2025-12-04T13:35:25.9551183Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mean_cuda_float32 PASSED [0.0174s] [ 73%] 2025-12-04T13:35:25.9551296Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_min_binary_cuda_float32 PASSED [0.0100s] [ 73%] 2025-12-04T13:35:25.9551407Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_minimum_cuda_float32 PASSED [0.0101s] [ 73%] 2025-12-04T13:35:25.9551520Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nan_to_num_cuda_float32 PASSED [0.0042s] [ 73%] 2025-12-04T13:35:25.9551652Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_native_dropout_backward_cuda_float32 PASSED [0.0194s] [ 73%] 2025-12-04T13:35:25.9551764Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_zeros_cuda_float32 PASSED [0.0088s] [ 73%] 2025-12-04T13:35:25.9551898Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_avg_pool2d_cuda_float32 PASSED [0.0082s] [ 73%] 2025-12-04T13:35:25.9552223Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv2d_cuda_float32 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 1200, provided ptr: 0x763b67c01200 size: 768 2025-12-04T13:35:25.9552417Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 1200, provided ptr: 0x763b67c01200 size: 768 2025-12-04T13:35:25.9552459Z PASSED [0.1189s] [ 73%] 2025-12-04T13:35:25.9552603Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv_transpose3d_cuda_float32 PASSED [0.9988s] [ 73%] 2025-12-04T13:35:25.9552735Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_dropout3d_cuda_float32 PASSED [0.0189s] [ 73%] 2025-12-04T13:35:25.9552856Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_elu_cuda_float32 PASSED [0.0054s] [ 73%] 2025-12-04T13:35:25.9552988Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_embedding_cuda_float32 PASSED [0.0118s] [ 73%] 2025-12-04T13:35:25.9553121Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardshrink_cuda_float32 PASSED [0.0075s] [ 73%] 2025-12-04T13:35:25.9553266Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardsigmoid_cuda_float32 PASSED [0.0086s] [ 73%] 2025-12-04T13:35:25.9553395Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardswish_cuda_float32 PASSED [0.0104s] [ 73%] 2025-12-04T13:35:25.9553542Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hinge_embedding_loss_cuda_float32 PASSED [0.0345s] [ 73%] 2025-12-04T13:35:25.9553678Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_instance_norm_cuda_float32 PASSED [0.0565s] [ 73%] 2025-12-04T13:35:25.9553821Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_area_cuda_float32 PASSED [0.0201s] [ 73%] 2025-12-04T13:35:25.9553968Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_bicubic_cuda_float32 PASSED [1.0616s] [ 73%] 2025-12-04T13:35:25.9554101Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_layer_norm_cuda_float32 PASSED [0.0196s] [ 73%] 2025-12-04T13:35:25.9554227Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_prelu_cuda_float32 PASSED [0.0610s] [ 73%] 2025-12-04T13:35:25.9554385Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_scaled_dot_product_attention_cuda_float32 PASSED [0.2139s] [ 73%] 2025-12-04T13:35:25.9554508Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_silu_cuda_float32 PASSED [0.0139s] [ 73%] 2025-12-04T13:35:25.9554647Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_soft_margin_loss_cuda_float32 PASSED [0.0150s] [ 73%] 2025-12-04T13:35:25.9554775Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softmin_cuda_float32 PASSED [0.0086s] [ 73%] 2025-12-04T13:35:25.9554908Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softshrink_cuda_float32 PASSED [0.0074s] [ 73%] 2025-12-04T13:35:25.9555054Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_triplet_margin_loss_cuda_float32 PASSED [0.0278s] [ 73%] 2025-12-04T13:35:25.9555217Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_triplet_margin_with_distance_loss_cuda_float32 PASSED [0.0284s] [ 73%] 2025-12-04T13:35:25.9555344Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_unfold_cuda_float32 PASSED [0.6954s] [ 73%] 2025-12-04T13:35:25.9555484Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_upsample_nearest_cuda_float32 PASSED [0.0113s] [ 74%] 2025-12-04T13:35:25.9555597Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nonzero_cuda_float32 PASSED [0.1361s] [ 74%] 2025-12-04T13:35:25.9555710Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_norm_fro_cuda_float32 PASSED [0.0059s] [ 74%] 2025-12-04T13:35:25.9555822Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_norm_nuc_cuda_float32 PASSED [0.0079s] [ 74%] 2025-12-04T13:35:25.9555960Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_normal_in_place_cuda_float32 PASSED [0.0047s] [ 74%] 2025-12-04T13:35:25.9556079Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ones_cuda_float32 PASSED [0.0032s] [ 74%] 2025-12-04T13:35:25.9556189Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ormqr_cuda_float32 PASSED [0.1324s] [ 74%] 2025-12-04T13:35:25.9556332Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_pinverse_cuda_float32 SKIPPED [0.0010s] (Skip failing test) [ 74%] 2025-12-04T13:35:25.9556464Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_1_cuda_float32 PASSED [0.0092s] [ 74%] 2025-12-04T13:35:25.9556597Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_3_cuda_float32 PASSED [0.0090s] [ 74%] 2025-12-04T13:35:25.9556706Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_resize__cuda_float32 PASSED [0.0047s] [ 74%] 2025-12-04T13:35:25.9556825Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scalar_tensor_cuda_float32 PASSED [0.0037s] [ 74%] 2025-12-04T13:35:25.9556934Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_cuda_float32 PASSED [0.0258s] [ 74%] 2025-12-04T13:35:25.9557072Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_amax_cuda_float32 PASSED [0.0216s] [ 74%] 2025-12-04T13:35:25.9557191Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_searchsorted_cuda_float32 PASSED [0.2627s] [ 74%] 2025-12-04T13:35:25.9557329Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_general_cosine_cuda_float32 PASSED [0.0253s] [ 74%] 2025-12-04T13:35:25.9557460Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_hamming_cuda_float32 PASSED [0.0243s] [ 74%] 2025-12-04T13:35:25.9557584Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_hann_cuda_float32 PASSED [0.0240s] [ 74%] 2025-12-04T13:35:25.9557712Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_nuttall_cuda_float32 PASSED [0.0241s] [ 74%] 2025-12-04T13:35:25.9557823Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signbit_cuda_float32 PASSED [0.0029s] [ 74%] 2025-12-04T13:35:25.9557933Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sin_cuda_float32 PASSED [1.3459s] [ 74%] 2025-12-04T13:35:25.9558041Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sort_cuda_float32 PASSED [0.0350s] [ 74%] 2025-12-04T13:35:25.9558230Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sparse_mm_reduce_cuda_float32 SKIPPED [0.0008s] (Only runs on cpu) [ 74%] 2025-12-04T13:35:25.9558385Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sparse_sampled_addmm_cuda_float32 SKIPPED [0.0011s] (Skip failing test) [ 74%] 2025-12-04T13:35:25.9558506Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_airy_ai_cuda_float32 PASSED [0.0048s] [ 74%] 2025-12-04T13:35:25.9558629Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_j1_cuda_float32 PASSED [0.0045s] [ 74%] 2025-12-04T13:35:25.9558776Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_chebyshev_polynomial_u_cuda_float32 PASSED [0.0098s] [ 74%] 2025-12-04T13:35:25.9558917Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_chebyshev_polynomial_v_cuda_float32 PASSED [0.0118s] [ 74%] 2025-12-04T13:35:25.9559061Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_chebyshev_polynomial_w_cuda_float32 PASSED [0.0097s] [ 74%] 2025-12-04T13:35:25.9559180Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_ndtr_cuda_float32 PASSED [1.3489s] [ 74%] 2025-12-04T13:35:25.9559326Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_scaled_modified_bessel_k1_cuda_float32 PASSED [0.0073s] [ 75%] 2025-12-04T13:35:25.9559481Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_shifted_chebyshev_polynomial_t_cuda_float32 PASSED [0.0106s] [ 75%] 2025-12-04T13:35:25.9559600Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_xlog1py_cuda_float32 PASSED [0.0133s] [ 75%] 2025-12-04T13:35:25.9559738Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_split_cuda_float32 PASSED [0.0042s] [ 75%] 2025-12-04T13:35:25.9559858Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sqrt_cuda_float32 PASSED [1.3354s] [ 75%] 2025-12-04T13:35:25.9559966Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sub_cuda_float32 PASSED [0.0150s] [ 75%] 2025-12-04T13:35:25.9560070Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sum_cuda_float32 PASSED [0.0171s] [ 75%] 2025-12-04T13:35:25.9560175Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_svd_cuda_float32 PASSED [0.2911s] [ 75%] 2025-12-04T13:35:25.9560295Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_take_along_dim_cuda_float32 PASSED [0.0115s] [ 75%] 2025-12-04T13:35:25.9560460Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_torch__scaled_mm_cuda_float8_e4m3fn SKIPPED [0.0007s] (Requires CUDA SM >= 8.9) [ 75%] 2025-12-04T13:35:25.9560608Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_torch_ops_aten__safe_softmax_default_cuda_float32 PASSED [0.0188s] [ 75%] 2025-12-04T13:35:25.9560721Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_trace_cuda_float32 PASSED [1.3398s] [ 75%] 2025-12-04T13:35:25.9560854Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_true_divide_cuda_float32 PASSED [0.0147s] [ 75%] 2025-12-04T13:35:25.9560963Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_trunc_cuda_float32 PASSED [0.0034s] [ 75%] 2025-12-04T13:35:25.9561071Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unfold_cuda_float32 PASSED [0.0171s] [ 75%] 2025-12-04T13:35:25.9561199Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unique_consecutive_cuda_float32 PASSED [0.1683s] [ 75%] 2025-12-04T13:35:25.9561303Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_var_cuda_float32 PASSED [0.0125s] [ 75%] 2025-12-04T13:35:25.9561410Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_cuda_float32 PASSED [0.0085s] [ 75%] 2025-12-04T13:35:25.9561518Z test_ops.py::TestFakeTensorCUDA::test_fake_autocast_vstack_cuda_float32 PASSED [0.0048s] [ 75%] 2025-12-04T13:35:25.9561618Z test_ops.py::TestFakeTensorCUDA::test_fake_bfloat16_cuda_float32 PASSED [0.0070s] [ 75%] 2025-12-04T13:35:25.9561714Z test_ops.py::TestFakeTensorCUDA::test_fake_bmm_cuda_float32 PASSED [1.3370s] [ 75%] 2025-12-04T13:35:25.9561813Z test_ops.py::TestFakeTensorCUDA::test_fake_cauchy_cuda_float32 PASSED [0.0096s] [ 75%] 2025-12-04T13:35:25.9561908Z test_ops.py::TestFakeTensorCUDA::test_fake_char_cuda_float32 PASSED [0.0084s] [ 75%] 2025-12-04T13:35:25.9562014Z test_ops.py::TestFakeTensorCUDA::test_fake_conj_physical_cuda_float32 PASSED [1.3446s] [ 75%] 2025-12-04T13:35:25.9562124Z test_ops.py::TestFakeTensorCUDA::test_fake_constant_pad_nd_cuda_float32 PASSED [0.0391s] [ 75%] 2025-12-04T13:35:25.9562252Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rsub___cuda_float32 PASSED [0.0290s] [ 75%] 2025-12-04T13:35:25.9562403Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__upsample_bilinear2d_aa_cuda_float32 PASSED [0.0148s] [ 75%] 2025-12-04T13:35:25.9562527Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_acos_cuda_float32 PASSED [0.0155s] [ 75%] 2025-12-04T13:35:25.9562650Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_add_cuda_float32 PASSED [0.0262s] [ 75%] 2025-12-04T13:35:25.9562779Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addcmul_cuda_float32 PASSED [0.0638s] [ 76%] 2025-12-04T13:35:25.9562902Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addr_cuda_float32 PASSED [0.0554s] [ 76%] 2025-12-04T13:35:25.9563030Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_as_strided_cuda_float32 PASSED [0.0257s] [ 76%] 2025-12-04T13:35:25.9563160Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atleast_2d_cuda_float32 PASSED [0.0136s] [ 76%] 2025-12-04T13:35:25.9563288Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_block_diag_cuda_float32 PASSED [0.0318s] [ 76%] 2025-12-04T13:35:25.9563411Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_ceil_cuda_float32 PASSED [0.0040s] [ 76%] 2025-12-04T13:35:25.9563558Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cholesky_cuda_float32 PASSED [0.6628s] [ 76%] 2025-12-04T13:35:25.9563713Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cholesky_inverse_cuda_float32 PASSED [0.3067s] [ 76%] 2025-12-04T13:35:25.9563834Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cov_cuda_float32 PASSED [1.7145s] [ 76%] 2025-12-04T13:35:25.9563972Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagonal_copy_cuda_float32 PASSED [0.0320s] [ 76%] 2025-12-04T13:35:25.9564100Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_digamma_cuda_float32 PASSED [0.0093s] [ 76%] 2025-12-04T13:35:25.9564224Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_dot_cuda_float32 PASSED [0.0049s] [ 76%] 2025-12-04T13:35:25.9564350Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_dsplit_cuda_float32 PASSED [1.3613s] [ 76%] 2025-12-04T13:35:25.9564472Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_erf_cuda_float32 PASSED [0.0105s] [ 76%] 2025-12-04T13:35:25.9564613Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_hfftn_cuda_float32 PASSED [0.0862s] [ 76%] 2025-12-04T13:35:25.9564748Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ifftshift_cuda_float32 PASSED [0.0137s] [ 76%] 2025-12-04T13:35:25.9564879Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ihfftn_cuda_float32 PASSED [0.0617s] [ 76%] 2025-12-04T13:35:25.9565006Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_flatten_cuda_float32 PASSED [1.3662s] [ 76%] 2025-12-04T13:35:25.9565132Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_float_cuda_float32 PASSED [0.0105s] [ 76%] 2025-12-04T13:35:25.9565258Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_gather_cuda_float32 PASSED [1.3459s] [ 76%] 2025-12-04T13:35:25.9565401Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_reduce_mean_cuda_float32 PASSED [0.0620s] [ 76%] 2025-12-04T13:35:25.9565542Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_reduce_prod_cuda_float32 PASSED [0.1380s] [ 76%] 2025-12-04T13:35:25.9565676Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_cross_cuda_float32 PASSED [0.0114s] [ 76%] 2025-12-04T13:35:25.9565915Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_householder_product_cuda_float32 SKIPPED [0.0008s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 76%] 2025-12-04T13:35:25.9566044Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_cuda_float32 PASSED [1.1585s] [ 76%] 2025-12-04T13:35:25.9566183Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_multi_dot_cuda_float32 PASSED [0.1260s] [ 76%] 2025-12-04T13:35:25.9566332Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_norm_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 76%] 2025-12-04T13:35:25.9566476Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_tensorsolve_cuda_float32 PASSED [0.0771s] [ 76%] 2025-12-04T13:35:25.9566598Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mH_cuda_float32 PASSED [0.0104s] [ 76%] 2025-12-04T13:35:25.9566731Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_fill_cuda_float32 PASSED [0.0285s] [ 76%] 2025-12-04T13:35:25.9566863Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_norm_cuda_float32 PASSED [4.8319s] [ 77%] 2025-12-04T13:35:25.9566994Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_std_cuda_float32 PASSED [1.6811s] [ 77%] 2025-12-04T13:35:25.9567124Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_min_binary_cuda_float32 PASSED [0.0762s] [ 77%] 2025-12-04T13:35:25.9567269Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_min_reduction_no_dim_cuda_float32 PASSED [0.0154s] [ 77%] 2025-12-04T13:35:25.9567420Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nanquantile_cuda_float32 PASSED [1.6153s] [ 77%] 2025-12-04T13:35:25.9567581Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_dropout_backward_cuda_float32 PASSED [0.0408s] [ 77%] 2025-12-04T13:35:25.9567720Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_layer_norm_cuda_float32 PASSED [0.1400s] [ 77%] 2025-12-04T13:35:25.9567877Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv_transpose2d_cuda_float32 PASSED [0.8348s] [ 77%] 2025-12-04T13:35:25.9568035Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv_transpose3d_cuda_float32 PASSED [0.1080s] [ 77%] 2025-12-04T13:35:25.9568336Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_cosine_embedding_loss_cuda_float32 PASSED [0.2739s] [ 77%] 2025-12-04T13:35:25.9568486Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_embedding_cuda_float32 PASSED [0.0079s] [ 77%] 2025-12-04T13:35:25.9568637Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_huber_loss_cuda_float32 PASSED [1.4347s] [ 77%] 2025-12-04T13:35:25.9568809Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_instance_norm_cuda_float32 PASSED [0.4309s] [ 77%] 2025-12-04T13:35:25.9568966Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_area_cuda_float32 PASSED [0.0635s] [ 77%] 2025-12-04T13:35:25.9569128Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_bicubic_cuda_float32 PASSED [1.1160s] [ 77%] 2025-12-04T13:35:25.9569287Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_linear_cuda_float32 PASSED [0.1160s] [ 77%] 2025-12-04T13:35:25.9569434Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_l1_loss_cuda_float32 PASSED [0.0389s] [ 77%] 2025-12-04T13:35:25.9569585Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_leaky_relu_cuda_float32 PASSED [0.0268s] [ 77%] 2025-12-04T13:35:25.9569733Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_linear_cuda_float32 PASSED [0.3239s] [ 77%] 2025-12-04T13:35:25.9569893Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_local_response_norm_cuda_float32 PASSED [0.2209s] [ 77%] 2025-12-04T13:35:25.9570042Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_pool2d_cuda_float32 PASSED [2.9539s] [ 77%] 2025-12-04T13:35:25.9570196Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool1d_cuda_float32 PASSED [1.6219s] [ 77%] 2025-12-04T13:35:25.9570370Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_multi_head_attention_forward_cuda_float32 PASSED [0.1658s] [ 77%] 2025-12-04T13:35:25.9570522Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_constant_cuda_float32 PASSED [0.0859s] [ 77%] 2025-12-04T13:35:25.9570690Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_replicate_negative_cuda_float32 PASSED [1.4002s] [ 77%] 2025-12-04T13:35:25.9570864Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_scaled_dot_product_attention_cuda_float32 PASSED [1.1814s] [ 77%] 2025-12-04T13:35:25.9571005Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_selu_cuda_float32 PASSED [0.0152s] [ 77%] 2025-12-04T13:35:25.9571145Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_silu_cuda_float32 PASSED [0.0148s] [ 77%] 2025-12-04T13:35:25.9571303Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softmin_with_dtype_cuda_float32 PASSED [0.0479s] [ 77%] 2025-12-04T13:35:25.9571487Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_upsample_bilinear_cuda_float32 PASSED [0.0539s] [ 77%] 2025-12-04T13:35:25.9571654Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_upsample_nearest_cuda_float32 PASSED [0.0235s] [ 78%] 2025-12-04T13:35:25.9571786Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_norm_nuc_cuda_float32 PASSED [1.3713s] [ 78%] 2025-12-04T13:35:25.9571910Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polar_cuda_float32 PASSED [0.1075s] [ 78%] 2025-12-04T13:35:25.9572034Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_qr_cuda_float32 PASSED [1.0171s] [ 78%] 2025-12-04T13:35:25.9572174Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_repeat_interleave_cuda_float32 PASSED [0.0297s] [ 78%] 2025-12-04T13:35:25.9572296Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rsub_cuda_float32 PASSED [0.0319s] [ 78%] 2025-12-04T13:35:25.9572440Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_prod_cuda_float32 PASSED [0.3380s] [ 78%] 2025-12-04T13:35:25.9572577Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_select_scatter_cuda_float32 PASSED [0.0171s] [ 78%] 2025-12-04T13:35:25.9572715Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sigmoid_cuda_float32 PASSED [0.0120s] [ 78%] 2025-12-04T13:35:25.9572840Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sort_cuda_float32 PASSED [0.0824s] [ 78%] 2025-12-04T13:35:25.9572972Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_i0e_cuda_float32 PASSED [0.0115s] [ 78%] 2025-12-04T13:35:25.9573103Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_i1e_cuda_float32 PASSED [0.0283s] [ 78%] 2025-12-04T13:35:25.9573237Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_ndtr_cuda_float32 PASSED [0.0260s] [ 78%] 2025-12-04T13:35:25.9573375Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_squeeze_multiple_cuda_float32 PASSED [0.0119s] [ 78%] 2025-12-04T13:35:25.9573505Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_std_mean_cuda_float32 PASSED [0.1363s] [ 78%] 2025-12-04T13:35:25.9573643Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_take_along_dim_cuda_float32 PASSED [0.0268s] [ 78%] 2025-12-04T13:35:25.9573768Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tan_cuda_float32 PASSED [0.0057s] [ 78%] 2025-12-04T13:35:25.9573892Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tanh_cuda_float32 PASSED [0.0055s] [ 78%] 2025-12-04T13:35:25.9574031Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_triangular_solve_cuda_float32 PASSED [0.2213s] [ 78%] 2025-12-04T13:35:25.9574152Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_triu_cuda_float32 PASSED [0.0205s] [ 78%] 2025-12-04T13:35:25.9574273Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_var_cuda_float32 PASSED [0.0560s] [ 78%] 2025-12-04T13:35:25.9574396Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_vdot_cuda_float32 PASSED [1.3833s] [ 78%] 2025-12-04T13:35:25.9574534Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_view_as_complex_cuda_float32 PASSED [0.0068s] [ 78%] 2025-12-04T13:35:25.9574656Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_view_cuda_float32 PASSED [0.0192s] [ 78%] 2025-12-04T13:35:25.9574781Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_xlogy_cuda_float32 PASSED [0.0698s] [ 78%] 2025-12-04T13:35:25.9574903Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_zero__cuda_float32 PASSED [0.0102s] [ 78%] 2025-12-04T13:35:25.9575025Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_T_cuda_float32 PASSED [0.0056s] [ 78%] 2025-12-04T13:35:25.9575175Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__native_batch_norm_legit_cuda_float32 PASSED [0.1092s] [ 78%] 2025-12-04T13:35:25.9575358Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__unsafe_masked_index_put_accumulate_cuda_float32 PASSED [0.0505s] [ 78%] 2025-12-04T13:35:25.9575499Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_abs_cuda_float32 PASSED [0.0047s] [ 78%] 2025-12-04T13:35:25.9575626Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_acos_cuda_float32 PASSED [0.0155s] [ 79%] 2025-12-04T13:35:25.9575755Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addbmm_cuda_float32 PASSED [0.0526s] [ 79%] 2025-12-04T13:35:25.9575881Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_amin_cuda_float32 PASSED [0.1131s] [ 79%] 2025-12-04T13:35:25.9576017Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_as_strided_cuda_float32 PASSED [0.0247s] [ 79%] 2025-12-04T13:35:25.9576145Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atan2_cuda_float32 PASSED [0.0793s] [ 79%] 2025-12-04T13:35:25.9576282Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_block_diag_cuda_float32 PASSED [0.0297s] [ 79%] 2025-12-04T13:35:25.9576411Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cdouble_cuda_float32 PASSED [0.0234s] [ 79%] 2025-12-04T13:35:25.9576552Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cholesky_cuda_float32 PASSED [0.2846s] [ 79%] 2025-12-04T13:35:25.9576685Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_clamp_max_cuda_float32 PASSED [0.0551s] [ 79%] 2025-12-04T13:35:25.9576825Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_constant_pad_nd_cuda_float32 PASSED [0.0853s] [ 79%] 2025-12-04T13:35:25.9576960Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_contiguous_cuda_float32 PASSED [0.0034s] [ 79%] 2025-12-04T13:35:25.9577088Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cosh_cuda_float32 PASSED [0.0092s] [ 79%] 2025-12-04T13:35:25.9577217Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cumsum_cuda_float32 PASSED [0.0129s] [ 79%] 2025-12-04T13:35:25.9577344Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diag_cuda_float32 PASSED [0.0316s] [ 79%] 2025-12-04T13:35:25.9577477Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_hfft2_cuda_float32 PASSED [0.0655s] [ 79%] 2025-12-04T13:35:25.9577608Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ifftn_cuda_float32 PASSED [0.0412s] [ 79%] 2025-12-04T13:35:25.9577741Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ihfftn_cuda_float32 PASSED [0.0601s] [ 79%] 2025-12-04T13:35:25.9577878Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_grid_sampler_2d_cuda_float32 PASSED [1.5056s] [ 79%] 2025-12-04T13:35:25.9578035Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_grid_sampler_3d_cuda_float32 SKIPPED [0.0002s] (Skipped!) [ 79%] 2025-12-04T13:35:25.9578200Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_hstack_cuda_float32 PASSED [0.0092s] [ 79%] 2025-12-04T13:35:25.9578338Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_add_cuda_float32 PASSED [0.0368s] [ 79%] 2025-12-04T13:35:25.9578482Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_reduce_prod_cuda_float32 PASSED [0.1356s] [ 79%] 2025-12-04T13:35:25.9578621Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_eigvals_cuda_float32 PASSED [0.1477s] [ 79%] 2025-12-04T13:35:25.9578756Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_inv_ex_cuda_float32 PASSED [0.1045s] [ 79%] 2025-12-04T13:35:25.9578981Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_pinv_singular_cuda_float32 SKIPPED [0.0007s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 79%] 2025-12-04T13:35:25.9579126Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vector_norm_cuda_float32 PASSED [1.1463s] [ 79%] 2025-12-04T13:35:25.9579279Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log10_cuda_float32 PASSED [0.8863s] [ 79%] 2025-12-04T13:35:25.9579419Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log1p_cuda_float32 PASSED [0.0072s] [ 79%] 2025-12-04T13:35:25.9579547Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log_cuda_float32 PASSED [0.0084s] [ 79%] 2025-12-04T13:35:25.9579699Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log_softmax_with_dtype_cuda_float32 PASSED [0.0471s] [ 80%] 2025-12-04T13:35:25.9579832Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_amin_cuda_float32 PASSED [0.9053s] [ 80%] 2025-12-04T13:35:25.9579967Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_fill_cuda_float32 PASSED [0.0292s] [ 80%] 2025-12-04T13:35:25.9580108Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_logsumexp_cuda_float32 PASSED [0.9428s] [ 80%] 2025-12-04T13:35:25.9580247Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_mean_cuda_float32 PASSED [0.8473s] [ 80%] 2025-12-04T13:35:25.9580399Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_median_cuda_float32 PASSED [0.1416s] [ 80%] 2025-12-04T13:35:25.9580542Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_normalize_cuda_float32 PASSED [0.4786s] [ 80%] 2025-12-04T13:35:25.9580695Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_meshgrid_variadic_tensors_cuda_float32 PASSED [0.0781s] [ 80%] 2025-12-04T13:35:25.9580829Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_min_binary_cuda_float32 PASSED [0.0748s] [ 80%] 2025-12-04T13:35:25.9580955Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_neg_cuda_float32 PASSED [0.0039s] [ 80%] 2025-12-04T13:35:25.9581118Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_avg_pool1d_cuda_float32 PASSED [0.0331s] [ 80%] 2025-12-04T13:35:25.9581281Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_avg_pool3d_cuda_float32 PASSED [0.0246s] [ 80%] 2025-12-04T13:35:25.9581434Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_avg_pool3d_cuda_float32 PASSED [0.0172s] [ 80%] 2025-12-04T13:35:25.9581760Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv2d_cuda_float32 MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 2400, provided ptr: 0x763b74e00e00 size: 1024 2025-12-04T13:35:25.9581942Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 2400, provided ptr: 0x763b74e00e00 size: 1024 2025-12-04T13:35:25.9582145Z MIOpen(HIP): Warning [IsEnoughWorkspace] [GetSolutionsFallback AI] Solver , workspace required: 2400, provided ptr: 0x763b74e01000 size: 1024 2025-12-04T13:35:25.9582337Z MIOpen(HIP): Warning [IsEnoughWorkspace] [EvaluateInvokers] Solver , workspace required: 2400, provided ptr: 0x763b74e01000 size: 1024 2025-12-04T13:35:25.9582380Z PASSED [0.1657s] [ 80%] 2025-12-04T13:35:25.9582548Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_fractional_max_pool2d_cuda_float32 PASSED [0.0963s] [ 80%] 2025-12-04T13:35:25.9582692Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_glu_cuda_float32 PASSED [0.2210s] [ 80%] 2025-12-04T13:35:25.9582844Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_pool1d_cuda_float32 PASSED [4.8704s] [ 80%] 2025-12-04T13:35:25.9583020Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multi_head_attention_forward_cuda_float32 PASSED [0.1171s] [ 80%] 2025-12-04T13:35:25.9583171Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_nll_loss_cuda_float32 PASSED [0.4707s] [ 80%] 2025-12-04T13:35:25.9583344Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_circular_cuda_float32 PASSED [0.1552s] [ 80%] 2025-12-04T13:35:25.9583516Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_poisson_nll_loss_cuda_float32 PASSED [1.2071s] [ 80%] 2025-12-04T13:35:25.9583664Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_relu6_cuda_float32 PASSED [0.0136s] [ 80%] 2025-12-04T13:35:25.9583810Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_relu_cuda_float32 PASSED [0.0150s] [ 80%] 2025-12-04T13:35:25.9583952Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_selu_cuda_float32 PASSED [0.0119s] [ 80%] 2025-12-04T13:35:25.9584109Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_soft_margin_loss_cuda_float32 PASSED [0.0309s] [ 80%] 2025-12-04T13:35:25.9584271Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softmin_with_dtype_cuda_float32 PASSED [0.0431s] [ 80%] 2025-12-04T13:35:25.9584426Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softshrink_cuda_float32 PASSED [0.0126s] [ 80%] 2025-12-04T13:35:25.9584570Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_nuc_cuda_float32 PASSED [0.0263s] [ 80%] 2025-12-04T13:35:25.9584721Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_pca_lowrank_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 80%] 2025-12-04T13:35:25.9584852Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_pinverse_cuda_float32 PASSED [0.3317s] [ 80%] 2025-12-04T13:35:25.9585004Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_3_cuda_float32 PASSED [0.0249s] [ 81%] 2025-12-04T13:35:25.9585154Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_4_cuda_float32 PASSED [0.0244s] [ 81%] 2025-12-04T13:35:25.9585282Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_pow_cuda_float32 PASSED [0.1008s] [ 81%] 2025-12-04T13:35:25.9585416Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_quantile_cuda_float32 PASSED [1.5720s] [ 81%] 2025-12-04T13:35:25.9585547Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sigmoid_cuda_float32 PASSED [0.0107s] [ 81%] 2025-12-04T13:35:25.9585683Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_erfcx_cuda_float32 PASSED [0.0135s] [ 81%] 2025-12-04T13:35:25.9585820Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_ndtr_cuda_float32 PASSED [0.0257s] [ 81%] 2025-12-04T13:35:25.9585957Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_ndtri_cuda_float32 PASSED [0.0149s] [ 81%] 2025-12-04T13:35:25.9586094Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_list_args_cuda_float32 PASSED [0.8869s] [ 81%] 2025-12-04T13:35:25.9586245Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_with_sizes_copy_cuda_float32 PASSED [0.0160s] [ 81%] 2025-12-04T13:35:25.9586372Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sqrt_cuda_float32 PASSED [0.0056s] [ 81%] 2025-12-04T13:35:25.9586501Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_square_cuda_float32 PASSED [0.8751s] [ 81%] 2025-12-04T13:35:25.9586635Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_std_unbiased_cuda_float32 PASSED [0.0184s] [ 81%] 2025-12-04T13:35:25.9586763Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_take_cuda_float32 PASSED [0.0257s] [ 81%] 2025-12-04T13:35:25.9586886Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_to_cuda_float32 PASSED [0.0567s] [ 81%] 2025-12-04T13:35:25.9587174Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_torch_ops_aten__efficient_attention_forward_cuda_float32 SKIPPED [0.0006s] (Efficient attention on ROCM doesn't support custom_mask_type==2) [ 81%] 2025-12-04T13:35:25.9587325Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_transpose_copy_cuda_float32 PASSED [0.0151s] [ 81%] 2025-12-04T13:35:25.9587460Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_trapezoid_cuda_float32 PASSED [0.1456s] [ 81%] 2025-12-04T13:35:25.9587595Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_true_divide_cuda_float32 PASSED [0.0503s] [ 81%] 2025-12-04T13:35:25.9587725Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unbind_cuda_float32 PASSED [0.0238s] [ 81%] 2025-12-04T13:35:25.9587856Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unsqueeze_cuda_float32 PASSED [0.0174s] [ 81%] 2025-12-04T13:35:25.9587983Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_var_cuda_float32 PASSED [0.0555s] [ 81%] 2025-12-04T13:35:25.9588141Z test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_zero__cuda_float32 PASSED [0.8984s] [ 81%] 2025-12-04T13:35:25.9588256Z test_ops.py::TestFakeTensorCUDA::test_fake_div_no_rounding_mode_cuda_float32 PASSED [0.0130s] [ 81%] 2025-12-04T13:35:25.9588368Z test_ops.py::TestFakeTensorCUDA::test_fake_erfc_cuda_float32 PASSED [0.0046s] [ 81%] 2025-12-04T13:35:25.9588465Z test_ops.py::TestFakeTensorCUDA::test_fake_exp2_cuda_float32 PASSED [0.0044s] [ 81%] 2025-12-04T13:35:25.9588571Z test_ops.py::TestFakeTensorCUDA::test_fake_fft_hfft2_cuda_float32 PASSED [0.0139s] [ 81%] 2025-12-04T13:35:25.9588670Z test_ops.py::TestFakeTensorCUDA::test_fake_fft_ifft_cuda_float32 PASSED [0.0092s] [ 81%] 2025-12-04T13:35:25.9588780Z test_ops.py::TestFakeTensorCUDA::test_fake_fft_ifftshift_cuda_float32 PASSED [0.0068s] [ 81%] 2025-12-04T13:35:25.9588880Z test_ops.py::TestFakeTensorCUDA::test_fake_fft_irfft_cuda_float32 PASSED [0.8901s] [ 81%] 2025-12-04T13:35:25.9588979Z test_ops.py::TestFakeTensorCUDA::test_fake_flatten_cuda_float32 PASSED [0.0086s] [ 82%] 2025-12-04T13:35:25.9589076Z test_ops.py::TestFakeTensorCUDA::test_fake_flip_cuda_float32 PASSED [0.0094s] [ 82%] 2025-12-04T13:35:25.9589175Z test_ops.py::TestFakeTensorCUDA::test_fake_flipud_cuda_float32 PASSED [0.0037s] [ 82%] 2025-12-04T13:35:25.9589281Z test_ops.py::TestFakeTensorCUDA::test_fake_float_power_cuda_float32 PASSED [0.0172s] [ 82%] 2025-12-04T13:35:25.9589375Z test_ops.py::TestFakeTensorCUDA::test_fake_fmin_cuda_float32 PASSED [0.0099s] [ 82%] 2025-12-04T13:35:25.9589469Z test_ops.py::TestFakeTensorCUDA::test_fake_frac_cuda_float32 PASSED [0.0032s] [ 82%] 2025-12-04T13:35:25.9589562Z test_ops.py::TestFakeTensorCUDA::test_fake_full_cuda_float32 PASSED [0.0043s] [ 82%] 2025-12-04T13:35:25.9589658Z test_ops.py::TestFakeTensorCUDA::test_fake_geqrf_cuda_float32 PASSED [0.0448s] [ 82%] 2025-12-04T13:35:25.9589763Z test_ops.py::TestFakeTensorCUDA::test_fake_hash_tensor_cuda_float32 PASSED [0.0150s] [ 82%] 2025-12-04T13:35:25.9589860Z test_ops.py::TestFakeTensorCUDA::test_fake_igammac_cuda_float32 PASSED [0.0098s] [ 82%] 2025-12-04T13:35:25.9589965Z test_ops.py::TestFakeTensorCUDA::test_fake_index_copy_cuda_float32 PASSED [0.0060s] [ 82%] 2025-12-04T13:35:25.9590067Z test_ops.py::TestFakeTensorCUDA::test_fake_index_fill_cuda_float32 PASSED [0.0075s] [ 82%] 2025-12-04T13:35:25.9590182Z test_ops.py::TestFakeTensorCUDA::test_fake_index_reduce_prod_cuda_float32 PASSED [0.0091s] [ 82%] 2025-12-04T13:35:25.9590279Z test_ops.py::TestFakeTensorCUDA::test_fake_inner_cuda_float32 PASSED [0.0048s] [ 82%] 2025-12-04T13:35:25.9590420Z test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_unary_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 82%] 2025-12-04T13:35:25.9590513Z test_ops.py::TestFakeTensorCUDA::test_fake_lcm_cuda_int64 PASSED [0.0146s] [ 82%] 2025-12-04T13:35:25.9590611Z test_ops.py::TestFakeTensorCUDA::test_fake_ldexp_cuda_float32 PASSED [0.0125s] [ 82%] 2025-12-04T13:35:25.9590705Z test_ops.py::TestFakeTensorCUDA::test_fake_le_cuda_float32 PASSED [0.0098s] [ 82%] 2025-12-04T13:35:25.9590838Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_eig_cuda_float32 PASSED [0.0446s] [ 82%] 2025-12-04T13:35:25.9590953Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_inv_cuda_float32 PASSED [0.0123s] [ 82%] 2025-12-04T13:35:25.9591061Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_inv_ex_cuda_float32 PASSED [0.0107s] [ 82%] 2025-12-04T13:35:25.9591176Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_ldl_factor_ex_cuda_float32 PASSED [0.0061s] [ 82%] 2025-12-04T13:35:25.9591370Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_ldl_solve_cuda_float32 SKIPPED [0.0006s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 82%] 2025-12-04T13:35:25.9591476Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lstsq_cuda_float32 PASSED [0.1270s] [ 82%] 2025-12-04T13:35:25.9591586Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lu_factor_cuda_float32 PASSED [0.0591s] [ 82%] 2025-12-04T13:35:25.9591690Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_pinv_cuda_float32 PASSED [0.0353s] [ 82%] 2025-12-04T13:35:25.9591824Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_solve_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 82%] 2025-12-04T13:35:25.9591935Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_solve_ex_cuda_float32 PASSED [0.8921s] [ 82%] 2025-12-04T13:35:25.9592052Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_svdvals_cuda_float32 PASSED [0.0356s] [ 82%] 2025-12-04T13:35:25.9592164Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_tensorinv_cuda_float32 PASSED [0.0105s] [ 82%] 2025-12-04T13:35:25.9592270Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_vecdot_cuda_float32 PASSED [0.0464s] [ 83%] 2025-12-04T13:35:25.9592384Z test_ops.py::TestFakeTensorCUDA::test_fake_linalg_vector_norm_cuda_float32 PASSED [0.1404s] [ 83%] 2025-12-04T13:35:25.9592483Z test_ops.py::TestFakeTensorCUDA::test_fake_linspace_cuda_float32 PASSED [0.0321s] [ 83%] 2025-12-04T13:35:25.9592606Z test_ops.py::TestFakeTensorCUDA::test_fake_linspace_tensor_overload_cuda_float32 PASSED [0.1342s] [ 83%] 2025-12-04T13:35:25.9592709Z test_ops.py::TestFakeTensorCUDA::test_fake_log_normal_cuda_float32 PASSED [0.0062s] [ 83%] 2025-12-04T13:35:25.9592816Z test_ops.py::TestFakeTensorCUDA::test_fake_logcumsumexp_cuda_float32 PASSED [0.0063s] [ 83%] 2025-12-04T13:35:25.9592921Z test_ops.py::TestFakeTensorCUDA::test_fake_logical_and_cuda_float32 PASSED [0.0113s] [ 83%] 2025-12-04T13:35:25.9593026Z test_ops.py::TestFakeTensorCUDA::test_fake_logical_not_cuda_float32 PASSED [0.0045s] [ 83%] 2025-12-04T13:35:25.9593122Z test_ops.py::TestFakeTensorCUDA::test_fake_logit_cuda_float32 PASSED [0.0067s] [ 83%] 2025-12-04T13:35:25.9593222Z test_ops.py::TestFakeTensorCUDA::test_fake_logspace_cuda_float32 PASSED [0.1943s] [ 83%] 2025-12-04T13:35:25.9593316Z test_ops.py::TestFakeTensorCUDA::test_fake_lt_cuda_float32 PASSED [0.0101s] [ 83%] 2025-12-04T13:35:25.9593419Z test_ops.py::TestFakeTensorCUDA::test_fake_masked_amin_cuda_float32 PASSED [0.1240s] [ 83%] 2025-12-04T13:35:25.9593527Z test_ops.py::TestFakeTensorCUDA::test_fake_masked_cumprod_cuda_float32 PASSED [0.0318s] [ 83%] 2025-12-04T13:35:25.9593634Z test_ops.py::TestFakeTensorCUDA::test_fake_masked_cumsum_cuda_float32 PASSED [0.0317s] [ 83%] 2025-12-04T13:35:25.9593743Z test_ops.py::TestFakeTensorCUDA::test_fake_masked_median_cuda_float32 PASSED [0.0241s] [ 83%] 2025-12-04T13:35:25.9593844Z test_ops.py::TestFakeTensorCUDA::test_fake_matrix_exp_cuda_float32 PASSED [0.0052s] [ 83%] 2025-12-04T13:35:25.9593942Z test_ops.py::TestFakeTensorCUDA::test_fake_maximum_cuda_float32 PASSED [0.0097s] [ 83%] 2025-12-04T13:35:25.9594064Z test_ops.py::TestFakeTensorCUDA::test_fake_meshgrid_variadic_tensors_cuda_float32 PASSED [0.0274s] [ 83%] 2025-12-04T13:35:25.9594161Z test_ops.py::TestFakeTensorCUDA::test_fake_mode_cuda_float32 PASSED [0.0078s] [ 83%] 2025-12-04T13:35:25.9594254Z test_ops.py::TestFakeTensorCUDA::test_fake_mv_cuda_float32 PASSED [0.0038s] [ 83%] 2025-12-04T13:35:25.9594378Z test_ops.py::TestFakeTensorCUDA::test_fake_narrow_cuda_float32 SKIPPED [0.0009s] (Skip failing test) [ 83%] 2025-12-04T13:35:25.9594524Z test_ops.py::TestFakeTensorCUDA::test_fake_native_batch_norm_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 83%] 2025-12-04T13:35:25.9594656Z test_ops.py::TestFakeTensorCUDA::test_fake_native_dropout_backward_cuda_float32 PASSED [0.0154s] [ 83%] 2025-12-04T13:35:25.9594768Z test_ops.py::TestFakeTensorCUDA::test_fake_new_empty_strided_cuda_float32 PASSED [0.0077s] [ 83%] 2025-12-04T13:35:25.9594902Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_max_pool1d_cuda_float32 PASSED [0.0136s] [ 83%] 2025-12-04T13:35:25.9595034Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_max_pool3d_cuda_float32 PASSED [0.0173s] [ 83%] 2025-12-04T13:35:25.9595161Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_alpha_dropout_cuda_float32 PASSED [0.0208s] [ 83%] 2025-12-04T13:35:25.9595289Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_channel_shuffle_cuda_float32 PASSED [0.0043s] [ 83%] 2025-12-04T13:35:25.9595403Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_conv1d_cuda_float32 PASSED [0.0238s] [ 83%] 2025-12-04T13:35:25.9595540Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_fractional_max_pool2d_cuda_float32 PASSED [0.0473s] [ 84%] 2025-12-04T13:35:25.9595698Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_nearest-exact_cuda_float32 PASSED [0.0172s] [ 84%] 2025-12-04T13:35:25.9595833Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_nearest_cuda_float32 PASSED [0.0175s] [ 84%] 2025-12-04T13:35:25.9595946Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_linear_cuda_float32 PASSED [0.0416s] [ 84%] 2025-12-04T13:35:25.9596065Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_pool2d_cuda_float32 PASSED [1.1237s] [ 84%] 2025-12-04T13:35:25.9596193Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool1d_grad_cuda_float32 PASSED [0.0944s] [ 84%] 2025-12-04T13:35:25.9596338Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multi_head_attention_forward_cuda_float32 PASSED [2.8172s] [ 84%] 2025-12-04T13:35:25.9596467Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multi_margin_loss_cuda_float32 PASSED [0.0469s] [ 84%] 2025-12-04T13:35:25.9596606Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multilabel_margin_loss_cuda_float32 PASSED [0.0482s] [ 84%] 2025-12-04T13:35:25.9596752Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_nll_loss_cuda_float32 SKIPPED [0.0010s] (Skip failing test) [ 84%] 2025-12-04T13:35:25.9596876Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_circular_cuda_float32 PASSED [0.0282s] [ 84%] 2025-12-04T13:35:25.9596989Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pdist_cuda_float32 PASSED [0.0080s] [ 84%] 2025-12-04T13:35:25.9597110Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_threshold_cuda_float32 PASSED [0.0059s] [ 84%] 2025-12-04T13:35:25.9597244Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_triplet_margin_loss_cuda_float32 PASSED [0.0276s] [ 84%] 2025-12-04T13:35:25.9597400Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_triplet_margin_with_distance_loss_cuda_float32 PASSED [0.0277s] [ 84%] 2025-12-04T13:35:25.9597515Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_unfold_cuda_float32 PASSED [0.4811s] [ 84%] 2025-12-04T13:35:25.9597645Z test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_upsample_bilinear_cuda_float32 PASSED [0.0483s] [ 84%] 2025-12-04T13:35:25.9597745Z test_ops.py::TestFakeTensorCUDA::test_fake_nonzero_cuda_float32 PASSED [0.1020s] [ 84%] 2025-12-04T13:35:25.9597847Z test_ops.py::TestFakeTensorCUDA::test_fake_norm_inf_cuda_float32 PASSED [0.0053s] [ 84%] 2025-12-04T13:35:25.9597967Z test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_0_cuda_float32 PASSED [0.0089s] [ 84%] 2025-12-04T13:35:25.9598098Z test_ops.py::TestFakeTensorCUDA::test_fake_positive_cuda_float32 PASSED [0.0025s] [ 84%] 2025-12-04T13:35:25.9598195Z test_ops.py::TestFakeTensorCUDA::test_fake_pow_cuda_float32 PASSED [0.0099s] [ 84%] 2025-12-04T13:35:25.9598318Z test_ops.py::TestFakeTensorCUDA::test_fake_rad2deg_cuda_float32 PASSED [0.0030s] [ 84%] 2025-12-04T13:35:25.9598428Z test_ops.py::TestFakeTensorCUDA::test_fake_randn_cuda_float32 PASSED [0.0039s] [ 84%] 2025-12-04T13:35:25.9598530Z test_ops.py::TestFakeTensorCUDA::test_fake_reciprocal_cuda_float32 PASSED [0.0044s] [ 84%] 2025-12-04T13:35:25.9598635Z test_ops.py::TestFakeTensorCUDA::test_fake_resolve_conj_cuda_float32 PASSED [0.8835s] [ 84%] 2025-12-04T13:35:25.9598731Z test_ops.py::TestFakeTensorCUDA::test_fake_roll_cuda_float32 PASSED [0.0175s] [ 84%] 2025-12-04T13:35:25.9598830Z test_ops.py::TestFakeTensorCUDA::test_fake_rsub_cuda_float32 PASSED [0.0122s] [ 84%] 2025-12-04T13:35:25.9598945Z test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_amax_cuda_float32 PASSED [0.0222s] [ 84%] 2025-12-04T13:35:25.9599060Z test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_amin_cuda_float32 PASSED [0.0217s] [ 84%] 2025-12-04T13:35:25.9599165Z test_ops.py::TestFakeTensorCUDA::test_fake_searchsorted_cuda_float32 PASSED [0.2623s] [ 85%] 2025-12-04T13:35:25.9599273Z test_ops.py::TestFakeTensorCUDA::test_fake_select_scatter_cuda_float32 PASSED [0.0064s] [ 85%] 2025-12-04T13:35:25.9599382Z test_ops.py::TestFakeTensorCUDA::test_fake_sgn_cuda_float32 PASSED [0.0030s] [ 85%] 2025-12-04T13:35:25.9599513Z test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_general_cosine_cuda_float32 PASSED [0.0246s] [ 85%] 2025-12-04T13:35:25.9599642Z test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_general_hamming_cuda_float32 PASSED [0.0241s] [ 85%] 2025-12-04T13:35:25.9599757Z test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_hann_cuda_float32 PASSED [0.0242s] [ 85%] 2025-12-04T13:35:25.9599856Z test_ops.py::TestFakeTensorCUDA::test_fake_softmax_cuda_float32 PASSED [0.0069s] [ 85%] 2025-12-04T13:35:25.9599969Z test_ops.py::TestFakeTensorCUDA::test_fake_softmax_with_dtype_cuda_float32 PASSED [0.0100s] [ 85%] 2025-12-04T13:35:25.9600080Z test_ops.py::TestFakeTensorCUDA::test_fake_special_bessel_j1_cuda_float32 PASSED [0.0043s] [ 85%] 2025-12-04T13:35:25.9600193Z test_ops.py::TestFakeTensorCUDA::test_fake_special_log_ndtr_cuda_float32 PASSED [0.0058s] [ 85%] 2025-12-04T13:35:25.9600318Z test_ops.py::TestFakeTensorCUDA::test_fake_special_modified_bessel_k0_cuda_float32 PASSED [0.0054s] [ 85%] 2025-12-04T13:35:25.9600443Z test_ops.py::TestFakeTensorCUDA::test_fake_special_modified_bessel_k1_cuda_float32 PASSED [0.0042s] [ 85%] 2025-12-04T13:35:25.9600547Z test_ops.py::TestFakeTensorCUDA::test_fake_special_ndtr_cuda_float32 PASSED [0.0064s] [ 85%] 2025-12-04T13:35:25.9600673Z test_ops.py::TestFakeTensorCUDA::test_fake_special_spherical_bessel_j0_cuda_float32 PASSED [0.0043s] [ 85%] 2025-12-04T13:35:25.9600777Z test_ops.py::TestFakeTensorCUDA::test_fake_std_unbiased_cuda_float32 PASSED [0.0037s] [ 85%] 2025-12-04T13:35:25.9600873Z test_ops.py::TestFakeTensorCUDA::test_fake_stft_cuda_float32 PASSED [0.0219s] [ 85%] 2025-12-04T13:35:25.9600969Z test_ops.py::TestFakeTensorCUDA::test_fake_sum_cuda_float32 PASSED [0.0160s] [ 85%] 2025-12-04T13:35:25.9601185Z test_ops.py::TestFakeTensorCUDA::test_fake_take_along_dim_cuda_float32 PASSED [0.0104s] [ 85%] 2025-12-04T13:35:25.9601282Z test_ops.py::TestFakeTensorCUDA::test_fake_tan_cuda_float32 PASSED [0.0029s] [ 85%] 2025-12-04T13:35:25.9601384Z test_ops.py::TestFakeTensorCUDA::test_fake_tensordot_cuda_float32 PASSED [0.0104s] [ 85%] 2025-12-04T13:35:25.9601479Z test_ops.py::TestFakeTensorCUDA::test_fake_tile_cuda_float32 PASSED [0.0375s] [ 85%] 2025-12-04T13:35:25.9601580Z test_ops.py::TestFakeTensorCUDA::test_fake_transpose_cuda_float32 PASSED [0.0074s] [ 85%] 2025-12-04T13:35:25.9601693Z test_ops.py::TestFakeTensorCUDA::test_fake_triangular_solve_cuda_float32 PASSED [0.0232s] [ 85%] 2025-12-04T13:35:25.9601804Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rdiv___cuda_float32 PASSED [0.0165s] [ 85%] 2025-12-04T13:35:25.9601914Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rpow___cuda_float32 PASSED [0.0117s] [ 85%] 2025-12-04T13:35:25.9602056Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rsub___cuda_float32 PASSED [0.0111s] [ 85%] 2025-12-04T13:35:25.9602183Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_acos_cuda_float32 PASSED [0.8914s] [ 85%] 2025-12-04T13:35:25.9602298Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_alias_copy_cuda_float32 PASSED [0.0055s] [ 85%] 2025-12-04T13:35:25.9602403Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_all_cuda_float32 PASSED [0.8880s] [ 85%] 2025-12-04T13:35:25.9602511Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_angle_cuda_float32 PASSED [0.0047s] [ 85%] 2025-12-04T13:35:25.9602623Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_argmin_cuda_float32 PASSED [0.0118s] [ 86%] 2025-12-04T13:35:25.9602730Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atan_cuda_float32 PASSED [0.8767s] [ 86%] 2025-12-04T13:35:25.9602844Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atleast_1d_cuda_float32 PASSED [0.0078s] [ 86%] 2025-12-04T13:35:25.9602957Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atleast_3d_cuda_float32 PASSED [0.0100s] [ 86%] 2025-12-04T13:35:25.9603074Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bernoulli_cuda_float32 PASSED [0.0062s] [ 86%] 2025-12-04T13:35:25.9603198Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_block_diag_cuda_float32 PASSED [0.0079s] [ 86%] 2025-12-04T13:35:25.9603321Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_broadcast_shapes_cuda_float32 PASSED [0.0042s] [ 86%] 2025-12-04T13:35:25.9603427Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_byte_cuda_float32 PASSED [0.0088s] [ 86%] 2025-12-04T13:35:25.9603546Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cartesian_prod_cuda_float32 PASSED [0.0159s] [ 86%] 2025-12-04T13:35:25.9603659Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cauchy_cuda_float32 PASSED [0.0072s] [ 86%] 2025-12-04T13:35:25.9603767Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cdist_cuda_float32 PASSED [0.3097s] [ 86%] 2025-12-04T13:35:25.9603875Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_char_cuda_float32 PASSED [0.0081s] [ 86%] 2025-12-04T13:35:25.9603995Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cholesky_solve_cuda_float32 PASSED [0.8921s] [ 86%] 2025-12-04T13:35:25.9604105Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_clamp_cuda_float32 PASSED [0.0167s] [ 86%] 2025-12-04T13:35:25.9604222Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_conj_physical_cuda_float32 PASSED [0.8691s] [ 86%] 2025-12-04T13:35:25.9604337Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_contiguous_cuda_float32 PASSED [0.0039s] [ 86%] 2025-12-04T13:35:25.9604448Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cumprod_cuda_float32 PASSED [0.0144s] [ 86%] 2025-12-04T13:35:25.9604556Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dist_cuda_float32 PASSED [0.0898s] [ 86%] 2025-12-04T13:35:25.9604679Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_div_trunc_rounding_cuda_float32 PASSED [0.0134s] [ 86%] 2025-12-04T13:35:25.9604789Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_double_cuda_float32 PASSED [0.0081s] [ 86%] 2025-12-04T13:35:25.9604899Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dstack_cuda_float32 PASSED [0.0076s] [ 86%] 2025-12-04T13:35:25.9605011Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_einsum_cuda_float32 PASSED [0.0510s] [ 86%] 2025-12-04T13:35:25.9605130Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_permuted_cuda_float32 PASSED [0.0173s] [ 86%] 2025-12-04T13:35:25.9605246Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_strided_cuda_float32 PASSED [0.0047s] [ 86%] 2025-12-04T13:35:25.9605352Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_erfc_cuda_float32 PASSED [0.0047s] [ 86%] 2025-12-04T13:35:25.9605468Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_expand_copy_cuda_float32 PASSED [0.0087s] [ 86%] 2025-12-04T13:35:25.9605579Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_rfft_cuda_float32 PASSED [0.0082s] [ 86%] 2025-12-04T13:35:25.9605694Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_rfftn_cuda_float32 PASSED [0.0098s] [ 86%] 2025-12-04T13:35:25.9605836Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_flatten_cuda_float32 PASSED [0.8749s] [ 86%] 2025-12-04T13:35:25.9605954Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_flip_cuda_float32 PASSED [0.0117s] [ 86%] 2025-12-04T13:35:25.9606062Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_flipud_cuda_float32 PASSED [0.0040s] [ 87%] 2025-12-04T13:35:25.9606178Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_float_power_cuda_float32 PASSED [0.0241s] [ 87%] 2025-12-04T13:35:25.9606285Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fmax_cuda_float32 PASSED [0.0118s] [ 87%] 2025-12-04T13:35:25.9606391Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_full_cuda_float32 PASSED [0.0038s] [ 87%] 2025-12-04T13:35:25.9606505Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_geometric_cuda_float32 PASSED [0.8664s] [ 87%] 2025-12-04T13:35:25.9606609Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_gt_cuda_float32 PASSED [0.0136s] [ 87%] 2025-12-04T13:35:25.9606720Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_hstack_cuda_float32 PASSED [0.0047s] [ 87%] 2025-12-04T13:35:25.9606843Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_hypot_cuda_float32 PASSED [0.0118s] [ 87%] 2025-12-04T13:35:25.9606958Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_copy_cuda_float32 PASSED [0.0062s] [ 87%] 2025-12-04T13:35:25.9607082Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_reduce_amax_cuda_float32 PASSED [0.0096s] [ 87%] 2025-12-04T13:35:25.9607199Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_select_cuda_float32 PASSED [0.0047s] [ 87%] 2025-12-04T13:35:25.9607308Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_inner_cuda_float32 PASSED [0.0059s] [ 87%] 2025-12-04T13:35:25.9607414Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_int_cuda_float32 PASSED [0.0088s] [ 87%] 2025-12-04T13:35:25.9607526Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isneginf_cuda_float32 PASSED [0.8784s] [ 87%] 2025-12-04T13:35:25.9607678Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_binary_cuda_float32 SKIPPED [0.0015s] (Skip failing test) [ 87%] 2025-12-04T13:35:25.9607806Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_cholesky_ex_cuda_float32 PASSED [0.0184s] [ 87%] 2025-12-04T13:35:25.9607958Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_eigvals_cuda_float32 SKIPPED [0.0010s] (Skip failing test) [ 87%] 2025-12-04T13:35:25.9608116Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_ldl_factor_cuda_float32 PASSED [0.0075s] [ 87%] 2025-12-04T13:35:25.9608323Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_ldl_solve_cuda_float32 SKIPPED [0.0006s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 87%] 2025-12-04T13:35:25.9608460Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lstsq_grad_oriented_cuda_float32 PASSED [0.8768s] [ 87%] 2025-12-04T13:35:25.9608576Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_svd_cuda_float32 PASSED [0.2509s] [ 87%] 2025-12-04T13:35:25.9608691Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linspace_cuda_float32 PASSED [0.0278s] [ 87%] 2025-12-04T13:35:25.9608826Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linspace_tensor_overload_cuda_float32 PASSED [0.1377s] [ 87%] 2025-12-04T13:35:25.9608935Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log2_cuda_float32 PASSED [0.0049s] [ 87%] 2025-12-04T13:35:25.9609065Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log_softmax_with_dtype_cuda_float32 PASSED [0.0274s] [ 87%] 2025-12-04T13:35:25.9609180Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logaddexp2_cuda_float32 PASSED [0.0049s] [ 87%] 2025-12-04T13:35:25.9609294Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logical_or_cuda_float32 PASSED [0.0141s] [ 87%] 2025-12-04T13:35:25.9609411Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_cumsum_cuda_float32 PASSED [0.0215s] [ 87%] 2025-12-04T13:35:25.9609561Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_log_softmax_cuda_float32 PASSED [0.9436s] [ 87%] 2025-12-04T13:35:25.9609696Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_logsumexp_cuda_float32 PASSED [0.1146s] [ 88%] 2025-12-04T13:35:25.9609812Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_mean_cuda_float32 PASSED [0.1135s] [ 88%] 2025-12-04T13:35:25.9609944Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_meshgrid_list_of_tensors_cuda_float32 PASSED [0.0410s] [ 88%] 2025-12-04T13:35:25.9610056Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_min_binary_cuda_float32 PASSED [0.0127s] [ 88%] 2025-12-04T13:35:25.9610184Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_min_reduction_no_dim_cuda_float32 PASSED [0.9177s] [ 88%] 2025-12-04T13:35:25.9610289Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mv_cuda_float32 PASSED [0.0058s] [ 88%] 2025-12-04T13:35:25.9610402Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nan_to_num_cuda_float32 PASSED [0.8868s] [ 88%] 2025-12-04T13:35:25.9610518Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_narrow_copy_cuda_float32 PASSED [0.0073s] [ 88%] 2025-12-04T13:35:25.9610655Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_native_batch_norm_cuda_float32 PASSED [0.0712s] [ 88%] 2025-12-04T13:35:25.9610786Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_native_dropout_backward_cuda_float32 PASSED [0.0225s] [ 88%] 2025-12-04T13:35:25.9610893Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_neg_cuda_float32 PASSED [0.0031s] [ 88%] 2025-12-04T13:35:25.9611008Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_new_ones_cuda_float32 PASSED [0.0082s] [ 88%] 2025-12-04T13:35:25.9611153Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_avg_pool2d_cuda_float32 PASSED [0.0093s] [ 88%] 2025-12-04T13:35:25.9611299Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_max_pool3d_cuda_float32 PASSED [0.9023s] [ 88%] 2025-12-04T13:35:25.9611446Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_binary_cross_entropy_cuda_float32 PASSED [0.0650s] [ 88%] 2025-12-04T13:35:25.9611609Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_binary_cross_entropy_with_logits_cuda_float32 PASSED [0.1511s] [ 88%] 2025-12-04T13:35:25.9611735Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_celu_cuda_float32 PASSED [0.0061s] [ 88%] 2025-12-04T13:35:25.9611863Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv3d_cuda_float32 PASSED [0.0341s] [ 88%] 2025-12-04T13:35:25.9612005Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv_transpose1d_cuda_float32 PASSED [0.0318s] [ 88%] 2025-12-04T13:35:25.9612138Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_dropout2d_cuda_float32 PASSED [0.0163s] [ 88%] 2025-12-04T13:35:25.9612269Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_dropout3d_cuda_float32 PASSED [0.0207s] [ 88%] 2025-12-04T13:35:25.9612438Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_embedding_bag_cuda_float32 SKIPPED [0.0011s] (Skip failing test) [ 88%] 2025-12-04T13:35:25.9612561Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_gelu_cuda_float32 PASSED [0.0149s] [ 88%] 2025-12-04T13:35:25.9612696Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_group_norm_cuda_float32 PASSED [0.0500s] [ 88%] 2025-12-04T13:35:25.9612824Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hardtanh_cuda_float32 PASSED [0.0108s] [ 88%] 2025-12-04T13:35:25.9612968Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_bicubic_cuda_float32 PASSED [1.6304s] [ 88%] 2025-12-04T13:35:25.9613115Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_bilinear_cuda_float32 PASSED [0.4184s] [ 88%] 2025-12-04T13:35:25.9613259Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_linear_cuda_float32 PASSED [0.1538s] [ 88%] 2025-12-04T13:35:25.9613425Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_nearest_cuda_float32 PASSED [0.0187s] [ 88%] 2025-12-04T13:35:25.9613584Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_trilinear_cuda_float32 PASSED [0.7379s] [ 88%] 2025-12-04T13:35:25.9613714Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_l1_loss_cuda_float32 PASSED [0.0141s] [ 89%] 2025-12-04T13:35:25.9613842Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_mse_loss_cuda_float32 PASSED [0.0163s] [ 89%] 2025-12-04T13:35:25.9614000Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_multilabel_soft_margin_loss_cuda_float32 PASSED [0.0434s] [ 89%] 2025-12-04T13:35:25.9614141Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pixel_unshuffle_cuda_float32 PASSED [0.0060s] [ 89%] 2025-12-04T13:35:25.9614268Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_prelu_cuda_float32 PASSED [0.0631s] [ 89%] 2025-12-04T13:35:25.9614394Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_relu6_cuda_float32 PASSED [0.0070s] [ 89%] 2025-12-04T13:35:25.9614554Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_scaled_dot_product_attention_cuda_float32 PASSED [0.1453s] [ 89%] 2025-12-04T13:35:25.9614695Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_tanhshrink_cuda_float32 PASSED [0.0064s] [ 89%] 2025-12-04T13:35:25.9614826Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_threshold_cuda_float32 PASSED [0.0069s] [ 89%] 2025-12-04T13:35:25.9614938Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nonzero_cuda_float32 PASSED [0.0124s] [ 89%] 2025-12-04T13:35:25.9615048Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ones_cuda_float32 PASSED [0.0030s] [ 89%] 2025-12-04T13:35:25.9615160Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_permute_cuda_float32 PASSED [0.0052s] [ 89%] 2025-12-04T13:35:25.9615293Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_3_cuda_float32 PASSED [0.0098s] [ 89%] 2025-12-04T13:35:25.9615426Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_4_cuda_float32 PASSED [0.0098s] [ 89%] 2025-12-04T13:35:25.9615535Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_pow_cuda_float32 PASSED [0.0117s] [ 89%] 2025-12-04T13:35:25.9615642Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_put_cuda_float32 PASSED [0.0259s] [ 89%] 2025-12-04T13:35:25.9615753Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_randint_cuda_float32 PASSED [0.0129s] [ 89%] 2025-12-04T13:35:25.9615870Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_resolve_neg_cuda_float32 PASSED [0.0027s] [ 89%] 2025-12-04T13:35:25.9615977Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_roll_cuda_float32 PASSED [0.0178s] [ 89%] 2025-12-04T13:35:25.9616101Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_round_decimals_3_cuda_float32 PASSED [0.0045s] [ 89%] 2025-12-04T13:35:25.9616210Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rsqrt_cuda_float32 PASSED [0.0045s] [ 89%] 2025-12-04T13:35:25.9616345Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_mean_cuda_float32 PASSED [0.0227s] [ 89%] 2025-12-04T13:35:25.9616475Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_prod_cuda_float32 PASSED [0.9164s] [ 89%] 2025-12-04T13:35:25.9616602Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_sum_cuda_float32 PASSED [0.0252s] [ 89%] 2025-12-04T13:35:25.9616711Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sign_cuda_float32 PASSED [0.0034s] [ 89%] 2025-12-04T13:35:25.9616843Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_bartlett_cuda_float32 PASSED [0.0120s] [ 89%] 2025-12-04T13:35:25.9616971Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_cosine_cuda_float32 PASSED [0.0099s] [ 89%] 2025-12-04T13:35:25.9617114Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_general_hamming_cuda_float32 PASSED [0.0244s] [ 89%] 2025-12-04T13:35:25.9617264Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_hamming_cuda_float32 PASSED [0.0239s] [ 89%] 2025-12-04T13:35:25.9617402Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_hann_cuda_float32 PASSED [0.0242s] [ 89%] 2025-12-04T13:35:25.9617545Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_u_cuda_float32 PASSED [0.0111s] [ 90%] 2025-12-04T13:35:25.9617688Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_w_cuda_float32 PASSED [0.0111s] [ 90%] 2025-12-04T13:35:25.9617806Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_entr_cuda_float32 PASSED [0.0054s] [ 90%] 2025-12-04T13:35:25.9617947Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_hermite_polynomial_he_cuda_float32 PASSED [0.0112s] [ 90%] 2025-12-04T13:35:25.9618102Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_log_ndtr_cuda_float32 PASSED [0.0075s] [ 90%] 2025-12-04T13:35:25.9618240Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_modified_bessel_k1_cuda_float32 PASSED [0.8943s] [ 90%] 2025-12-04T13:35:25.9618359Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_ndtr_cuda_float32 PASSED [0.0107s] [ 90%] 2025-12-04T13:35:25.9618501Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_ndtri_cuda_float32 PASSED [0.9030s] [ 90%] 2025-12-04T13:35:25.9618647Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_scaled_modified_bessel_k1_cuda_float32 PASSED [0.0064s] [ 90%] 2025-12-04T13:35:25.9618803Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_shifted_chebyshev_polynomial_v_cuda_float32 PASSED [0.0114s] [ 90%] 2025-12-04T13:35:25.9618925Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_split_list_args_cuda_float32 PASSED [0.0046s] [ 90%] 2025-12-04T13:35:25.9619053Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_split_with_sizes_copy_cuda_float32 PASSED [0.0050s] [ 90%] 2025-12-04T13:35:25.9619170Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_squeeze_copy_cuda_float32 PASSED [0.0080s] [ 90%] 2025-12-04T13:35:25.9619293Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_squeeze_multiple_cuda_float32 PASSED [0.0067s] [ 90%] 2025-12-04T13:35:25.9619405Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_stack_cuda_float32 PASSED [0.0112s] [ 90%] 2025-12-04T13:35:25.9619528Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_std_mean_unbiased_cuda_float32 PASSED [0.0045s] [ 90%] 2025-12-04T13:35:25.9619637Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_stft_cuda_float32 PASSED [0.0319s] [ 90%] 2025-12-04T13:35:25.9619743Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sub_cuda_float32 PASSED [0.0141s] [ 90%] 2025-12-04T13:35:25.9619849Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_to_cuda_float32 PASSED [0.0360s] [ 90%] 2025-12-04T13:35:25.9619990Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_to_sparse_cuda_float32 SKIPPED [0.0010s] (Skip failing test) [ 90%] 2025-12-04T13:35:25.9620098Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_topk_cuda_float32 PASSED [0.0135s] [ 90%] 2025-12-04T13:35:25.9620208Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trace_cuda_float32 PASSED [0.0030s] [ 90%] 2025-12-04T13:35:25.9620325Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_triu_indices_cuda_int64 PASSED [0.0113s] [ 90%] 2025-12-04T13:35:25.9620437Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unbind_cuda_float32 PASSED [0.8983s] [ 90%] 2025-12-04T13:35:25.9620546Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unique_cuda_float32 PASSED [0.2276s] [ 90%] 2025-12-04T13:35:25.9620662Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unravel_index_cuda_int64 PASSED [0.0163s] [ 90%] 2025-12-04T13:35:25.9620779Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsafe_chunk_cuda_float32 PASSED [0.8839s] [ 90%] 2025-12-04T13:35:25.9620901Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsqueeze_copy_cuda_float32 PASSED [0.0106s] [ 90%] 2025-12-04T13:35:25.9621040Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsqueeze_cuda_float32 PASSED [0.0089s] [ 90%] 2025-12-04T13:35:25.9621162Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_vdot_cuda_float32 PASSED [0.8730s] [ 90%] 2025-12-04T13:35:25.9621282Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_as_complex_cuda_float32 PASSED [0.0044s] [ 91%] 2025-12-04T13:35:25.9621397Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_copy_cuda_float32 PASSED [0.0131s] [ 91%] 2025-12-04T13:35:25.9621505Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_where_cuda_float32 PASSED [0.9076s] [ 91%] 2025-12-04T13:35:25.9621620Z test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_zeros_like_cuda_float32 PASSED [0.0099s] [ 91%] 2025-12-04T13:35:25.9621737Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_float32 PASSED [0.0073s] [ 91%] 2025-12-04T13:35:25.9621852Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_int16 PASSED [0.0062s] [ 91%] 2025-12-04T13:35:25.9621978Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_complex128 PASSED [0.0116s] [ 91%] 2025-12-04T13:35:25.9622095Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_int64 PASSED [0.0115s] [ 91%] 2025-12-04T13:35:25.9622254Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_complex128 PASSED [0.0403s] [ 91%] 2025-12-04T13:35:25.9622396Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_float16 PASSED [0.0406s] [ 91%] 2025-12-04T13:35:25.9622536Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_float64 PASSED [0.0403s] [ 91%] 2025-12-04T13:35:25.9622676Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_int64 PASSED [0.0397s] [ 91%] 2025-12-04T13:35:25.9622816Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_uint8 PASSED [0.0247s] [ 91%] 2025-12-04T13:35:25.9622940Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_bfloat16 PASSED [0.0609s] [ 91%] 2025-12-04T13:35:25.9623065Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_complex64 PASSED [0.0606s] [ 91%] 2025-12-04T13:35:25.9623186Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_float64 PASSED [0.0577s] [ 91%] 2025-12-04T13:35:25.9623303Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_int64 PASSED [0.0567s] [ 91%] 2025-12-04T13:35:25.9623446Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_bfloat16 PASSED [0.2577s] [ 91%] 2025-12-04T13:35:25.9623561Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_float16 PASSED [0.8903s] [ 91%] 2025-12-04T13:35:25.9623674Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_float32 PASSED [0.0033s] [ 91%] 2025-12-04T13:35:25.9623787Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_bool PASSED [0.8764s] [ 91%] 2025-12-04T13:35:25.9623908Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_complex128 PASSED [0.0031s] [ 91%] 2025-12-04T13:35:25.9624028Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_complex32 PASSED [0.8843s] [ 91%] 2025-12-04T13:35:25.9624142Z test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_uint8 PASSED [0.0036s] [ 91%] 2025-12-04T13:35:25.9624252Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_float16 PASSED [0.0067s] [ 91%] 2025-12-04T13:35:25.9624364Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_complex128 PASSED [0.8835s] [ 91%] 2025-12-04T13:35:25.9624474Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_float32 PASSED [0.0036s] [ 91%] 2025-12-04T13:35:25.9624587Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_float16 PASSED [0.0101s] [ 91%] 2025-12-04T13:35:25.9624699Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int64 PASSED [0.0099s] [ 91%] 2025-12-04T13:35:25.9624855Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_complex64 PASSED [0.0353s] [ 92%] 2025-12-04T13:35:25.9625001Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_float32 PASSED [0.0355s] [ 92%] 2025-12-04T13:35:25.9625134Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_int32 PASSED [0.0360s] [ 92%] 2025-12-04T13:35:25.9625264Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_int64 PASSED [0.0354s] [ 92%] 2025-12-04T13:35:25.9625378Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_float32 PASSED [0.0514s] [ 92%] 2025-12-04T13:35:25.9625487Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int16 PASSED [0.0481s] [ 92%] 2025-12-04T13:35:25.9625596Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int64 PASSED [0.0477s] [ 92%] 2025-12-04T13:35:25.9625705Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_uint8 PASSED [0.0169s] [ 92%] 2025-12-04T13:35:25.9625843Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_bfloat16 PASSED [0.2268s] [ 92%] 2025-12-04T13:35:25.9625988Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_float64 PASSED [0.2286s] [ 92%] 2025-12-04T13:35:25.9628882Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_uint8 PASSED [0.0689s] [ 92%] 2025-12-04T13:35:25.9629009Z test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_complex32 PASSED [0.8817s] [ 92%] 2025-12-04T13:35:25.9629124Z test_ops.py::TestTagsCUDA::test_tags___rdiv___cuda_float32 SKIPPED [0.0014s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9629233Z test_ops.py::TestTagsCUDA::test_tags___rsub___cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9629369Z test_ops.py::TestTagsCUDA::test_tags__refs__conversions_bool_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9629507Z test_ops.py::TestTagsCUDA::test_tags__refs__conversions_short_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9629633Z test_ops.py::TestTagsCUDA::test_tags__refs_allclose_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9629750Z test_ops.py::TestTagsCUDA::test_tags__refs_asinh_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9629865Z test_ops.py::TestTagsCUDA::test_tags__refs_atan_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9629996Z test_ops.py::TestTagsCUDA::test_tags__refs_bitwise_left_shift_cuda_int64 SKIPPED [0.0009s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9630129Z test_ops.py::TestTagsCUDA::test_tags__refs_broadcast_shapes_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9630245Z test_ops.py::TestTagsCUDA::test_tags__refs_chunk_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9630365Z test_ops.py::TestTagsCUDA::test_tags__refs_clamp_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9630487Z test_ops.py::TestTagsCUDA::test_tags__refs_clamp_max_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9630609Z test_ops.py::TestTagsCUDA::test_tags__refs_clamp_min_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9630724Z test_ops.py::TestTagsCUDA::test_tags__refs_clone_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9630851Z test_ops.py::TestTagsCUDA::test_tags__refs_conj_physical_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9630977Z test_ops.py::TestTagsCUDA::test_tags__refs_contiguous_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9631097Z test_ops.py::TestTagsCUDA::test_tags__refs_copysign_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9631218Z test_ops.py::TestTagsCUDA::test_tags__refs_diag_embed_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 92%] 2025-12-04T13:35:25.9631384Z test_ops.py::TestTagsCUDA::test_tags__refs_diagonal_copy_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9631521Z test_ops.py::TestTagsCUDA::test_tags__refs_diagonal_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9631652Z test_ops.py::TestTagsCUDA::test_tags__refs_diagonal_scatter_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9631771Z test_ops.py::TestTagsCUDA::test_tags__refs_dstack_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9631886Z test_ops.py::TestTagsCUDA::test_tags__refs_erfc_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9632006Z test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifft_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9632130Z test_ops.py::TestTagsCUDA::test_tags__refs_fft_ihfft2_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9632254Z test_ops.py::TestTagsCUDA::test_tags__refs_fft_ihfftn_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9632374Z test_ops.py::TestTagsCUDA::test_tags__refs_fft_irfft_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9632506Z test_ops.py::TestTagsCUDA::test_tags__refs_fliplr_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9632621Z test_ops.py::TestTagsCUDA::test_tags__refs_floor_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9632746Z test_ops.py::TestTagsCUDA::test_tags__refs_floor_divide_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9632860Z test_ops.py::TestTagsCUDA::test_tags__refs_fmax_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9632973Z test_ops.py::TestTagsCUDA::test_tags__refs_frac_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9633089Z test_ops.py::TestTagsCUDA::test_tags__refs_hstack_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9633205Z test_ops.py::TestTagsCUDA::test_tags__refs_isnan_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9633326Z test_ops.py::TestTagsCUDA::test_tags__refs_istft_cuda_complex64 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9633438Z test_ops.py::TestTagsCUDA::test_tags__refs_lerp_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9633571Z test_ops.py::TestTagsCUDA::test_tags__refs_linalg_diagonal_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9633692Z test_ops.py::TestTagsCUDA::test_tags__refs_linalg_svd_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9633818Z test_ops.py::TestTagsCUDA::test_tags__refs_linalg_vecdot_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9633950Z test_ops.py::TestTagsCUDA::test_tags__refs_linalg_vector_norm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9634074Z test_ops.py::TestTagsCUDA::test_tags__refs_linspace_cuda_float32 SKIPPED [0.0012s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9634189Z test_ops.py::TestTagsCUDA::test_tags__refs_log_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9634311Z test_ops.py::TestTagsCUDA::test_tags__refs_logaddexp2_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9634434Z test_ops.py::TestTagsCUDA::test_tags__refs_logical_not_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9634553Z test_ops.py::TestTagsCUDA::test_tags__refs_maximum_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9634670Z test_ops.py::TestTagsCUDA::test_tags__refs_new_ones_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9634789Z test_ops.py::TestTagsCUDA::test_tags__refs_new_zeros_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9634928Z test_ops.py::TestTagsCUDA::test_tags__refs_nextafter_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 93%] 2025-12-04T13:35:25.9635077Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_elu_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9635220Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_group_norm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9635360Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_hardtanh_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9635500Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_huber_loss_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9635635Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_l1_loss_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9635791Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_log_softmax_with_dtype_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9635924Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_relu_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9636075Z test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_softplus_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9636189Z test_ops.py::TestTagsCUDA::test_tags__refs_norm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9636308Z test_ops.py::TestTagsCUDA::test_tags__refs_rad2deg_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9636424Z test_ops.py::TestTagsCUDA::test_tags__refs_round_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9636537Z test_ops.py::TestTagsCUDA::test_tags__refs_rsub_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9636665Z test_ops.py::TestTagsCUDA::test_tags__refs_select_scatter_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9636780Z test_ops.py::TestTagsCUDA::test_tags__refs_sinc_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9636907Z test_ops.py::TestTagsCUDA::test_tags__refs_special_ndtri_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9637039Z test_ops.py::TestTagsCUDA::test_tags__refs_special_xlog1py_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9637158Z test_ops.py::TestTagsCUDA::test_tags__refs_squeeze_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9637289Z test_ops.py::TestTagsCUDA::test_tags__refs_squeeze_multiple_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9637403Z test_ops.py::TestTagsCUDA::test_tags__refs_std_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9637513Z test_ops.py::TestTagsCUDA::test_tags__refs_t_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9637626Z test_ops.py::TestTagsCUDA::test_tags__refs_tan_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9637740Z test_ops.py::TestTagsCUDA::test_tags__refs_tril_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9637854Z test_ops.py::TestTagsCUDA::test_tags__refs_triu_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9637969Z test_ops.py::TestTagsCUDA::test_tags__refs_trunc_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9638141Z test_ops.py::TestTagsCUDA::test_tags__refs_view_as_complex_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9638256Z test_ops.py::TestTagsCUDA::test_tags__refs_view_as_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9638372Z test_ops.py::TestTagsCUDA::test_tags__refs_xlogy_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9638481Z test_ops.py::TestTagsCUDA::test_tags_acosh_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9638614Z test_ops.py::TestTagsCUDA::test_tags_add_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9638737Z test_ops.py::TestTagsCUDA::test_tags_addbmm_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 94%] 2025-12-04T13:35:25.9638847Z test_ops.py::TestTagsCUDA::test_tags_addmv_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9638954Z test_ops.py::TestTagsCUDA::test_tags_amin_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9639066Z test_ops.py::TestTagsCUDA::test_tags_arange_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9639176Z test_ops.py::TestTagsCUDA::test_tags_argmax_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9639289Z test_ops.py::TestTagsCUDA::test_tags_argsort_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9639410Z test_ops.py::TestTagsCUDA::test_tags_as_strided_copy_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9639517Z test_ops.py::TestTagsCUDA::test_tags_asin_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9639635Z test_ops.py::TestTagsCUDA::test_tags_bernoulli_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9639768Z test_ops.py::TestTagsCUDA::test_tags_bfloat16_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9639898Z test_ops.py::TestTagsCUDA::test_tags_bitwise_right_shift_cuda_int64 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9640004Z test_ops.py::TestTagsCUDA::test_tags_bmm_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9640123Z test_ops.py::TestTagsCUDA::test_tags_broadcast_to_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9640229Z test_ops.py::TestTagsCUDA::test_tags_ceil_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9640338Z test_ops.py::TestTagsCUDA::test_tags_cfloat_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9640444Z test_ops.py::TestTagsCUDA::test_tags_char_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9640558Z test_ops.py::TestTagsCUDA::test_tags_cholesky_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9640666Z test_ops.py::TestTagsCUDA::test_tags_chunk_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9640775Z test_ops.py::TestTagsCUDA::test_tags_cummax_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9640885Z test_ops.py::TestTagsCUDA::test_tags_deg2rad_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9640994Z test_ops.py::TestTagsCUDA::test_tags_double_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9641098Z test_ops.py::TestTagsCUDA::test_tags_eq_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9641207Z test_ops.py::TestTagsCUDA::test_tags_equal_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9641313Z test_ops.py::TestTagsCUDA::test_tags_erfc_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9641421Z test_ops.py::TestTagsCUDA::test_tags_exp2_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9641540Z test_ops.py::TestTagsCUDA::test_tags_exponential_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9641656Z test_ops.py::TestTagsCUDA::test_tags_fft_ihfft2_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9641769Z test_ops.py::TestTagsCUDA::test_tags_fft_rfft2_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9641878Z test_ops.py::TestTagsCUDA::test_tags_fft_rfft_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9641984Z test_ops.py::TestTagsCUDA::test_tags_full_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 95%] 2025-12-04T13:35:25.9642094Z test_ops.py::TestTagsCUDA::test_tags_full_like_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9642223Z test_ops.py::TestTagsCUDA::test_tags_hsplit_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9642346Z test_ops.py::TestTagsCUDA::test_tags_index_add_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9642459Z test_ops.py::TestTagsCUDA::test_tags_index_put_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9642584Z test_ops.py::TestTagsCUDA::test_tags_index_reduce_prod_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9642703Z test_ops.py::TestTagsCUDA::test_tags_index_select_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9642813Z test_ops.py::TestTagsCUDA::test_tags_inner_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9642924Z test_ops.py::TestTagsCUDA::test_tags_isclose_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9643034Z test_ops.py::TestTagsCUDA::test_tags_isreal_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9643144Z test_ops.py::TestTagsCUDA::test_tags_ldexp_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9643258Z test_ops.py::TestTagsCUDA::test_tags_le_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9643368Z test_ops.py::TestTagsCUDA::test_tags_lgamma_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9643570Z test_ops.py::TestTagsCUDA::test_tags_linalg_householder_product_cuda_float32 SKIPPED [0.0005s] (skipCUDAIfRocm: test doesn't currently work on the ROCm stack) [ 96%] 2025-12-04T13:35:25.9643713Z test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_rank_hermitian_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9643829Z test_ops.py::TestTagsCUDA::test_tags_linalg_norm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9643946Z test_ops.py::TestTagsCUDA::test_tags_linalg_pinv_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9644130Z test_ops.py::TestTagsCUDA::test_tags_linalg_pinv_singular_cuda_float32 SKIPPED [0.0005s] (test is slow; run with PYTORCH_TEST_WITH_SLOW to enable test) [ 96%] 2025-12-04T13:35:25.9644249Z test_ops.py::TestTagsCUDA::test_tags_linalg_solve_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9644386Z test_ops.py::TestTagsCUDA::test_tags_linspace_tensor_overload_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9644493Z test_ops.py::TestTagsCUDA::test_tags_log2_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9644612Z test_ops.py::TestTagsCUDA::test_tags_logaddexp_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9644729Z test_ops.py::TestTagsCUDA::test_tags_logical_and_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9644846Z test_ops.py::TestTagsCUDA::test_tags_logical_xor_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9644983Z test_ops.py::TestTagsCUDA::test_tags_logspace_tensor_overload_cuda_float32 SKIPPED [0.0012s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9645101Z test_ops.py::TestTagsCUDA::test_tags_masked_amin_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9645220Z test_ops.py::TestTagsCUDA::test_tags_masked_argmin_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9645335Z test_ops.py::TestTagsCUDA::test_tags_masked_mean_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9645449Z test_ops.py::TestTagsCUDA::test_tags_masked_norm_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9645571Z test_ops.py::TestTagsCUDA::test_tags_masked_softmin_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9645685Z test_ops.py::TestTagsCUDA::test_tags_masked_std_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 96%] 2025-12-04T13:35:25.9645820Z test_ops.py::TestTagsCUDA::test_tags_matrix_exp_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9645959Z test_ops.py::TestTagsCUDA::test_tags_max_reduction_no_dim_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9646092Z test_ops.py::TestTagsCUDA::test_tags_max_reduction_with_dim_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9646224Z test_ops.py::TestTagsCUDA::test_tags_meshgrid_list_of_tensors_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9646340Z test_ops.py::TestTagsCUDA::test_tags_multinomial_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9646455Z test_ops.py::TestTagsCUDA::test_tags_narrow_copy_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9646578Z test_ops.py::TestTagsCUDA::test_tags_native_layer_norm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9646686Z test_ops.py::TestTagsCUDA::test_tags_ne_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9646794Z test_ops.py::TestTagsCUDA::test_tags_neg_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9646920Z test_ops.py::TestTagsCUDA::test_tags_new_zeros_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9647034Z test_ops.py::TestTagsCUDA::test_tags_nextafter_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9647182Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool3d_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9647345Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_binary_cross_entropy_with_logits_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9647472Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_celu_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9647600Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv2d_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9647730Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv3d_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9647874Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv_transpose3d_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9648043Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_feature_alpha_dropout_without_train_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9648236Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_fractional_max_pool2d_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9648386Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_fractional_max_pool3d_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9648530Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_gaussian_nll_loss_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9648665Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_group_norm_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9648800Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_hardshrink_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9648932Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_hardtanh_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9649081Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_bilinear_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9649238Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_nearest-exact_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9649367Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_linear_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9649493Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_pool1d_cuda_float32 SKIPPED [0.0001s] (Skipped!) [ 97%] 2025-12-04T13:35:25.9649666Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool2d_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9649804Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_mish_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 97%] 2025-12-04T13:35:25.9649941Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_circular_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9650085Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_pairwise_distance_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9650223Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_pixel_shuffle_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9650364Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_pixel_unshuffle_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9650489Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_selu_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9650631Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_silu_complex_cuda_complex64 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9650777Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_softsign_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9650912Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_tanhshrink_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9651056Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_triplet_margin_loss_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9651220Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_triplet_margin_with_distance_loss_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9651348Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_unfold_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9651492Z test_ops.py::TestTagsCUDA::test_tags_nn_functional_upsample_bilinear_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9651610Z test_ops.py::TestTagsCUDA::test_tags_pca_lowrank_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9651726Z test_ops.py::TestTagsCUDA::test_tags_permute_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9651839Z test_ops.py::TestTagsCUDA::test_tags_pinverse_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9651974Z test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_2_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9652081Z test_ops.py::TestTagsCUDA::test_tags_put_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9652194Z test_ops.py::TestTagsCUDA::test_tags_randint_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9652311Z test_ops.py::TestTagsCUDA::test_tags_reciprocal_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9652428Z test_ops.py::TestTagsCUDA::test_tags_reshape_as_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9652543Z test_ops.py::TestTagsCUDA::test_tags_reshape_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9652654Z test_ops.py::TestTagsCUDA::test_tags_resize__cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9652761Z test_ops.py::TestTagsCUDA::test_tags_roll_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9652889Z test_ops.py::TestTagsCUDA::test_tags_round_decimals_neg_3_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9653011Z test_ops.py::TestTagsCUDA::test_tags_scatter_add_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9653138Z test_ops.py::TestTagsCUDA::test_tags_scatter_reduce_amin_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9653272Z test_ops.py::TestTagsCUDA::test_tags_signal_windows_bartlett_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9653424Z test_ops.py::TestTagsCUDA::test_tags_signal_windows_blackman_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9653567Z test_ops.py::TestTagsCUDA::test_tags_signal_windows_cosine_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9653697Z test_ops.py::TestTagsCUDA::test_tags_signal_windows_kaiser_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 98%] 2025-12-04T13:35:25.9653823Z test_ops.py::TestTagsCUDA::test_tags_special_bessel_j1_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9653945Z test_ops.py::TestTagsCUDA::test_tags_special_bessel_y1_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9654090Z test_ops.py::TestTagsCUDA::test_tags_special_chebyshev_polynomial_t_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9654228Z test_ops.py::TestTagsCUDA::test_tags_special_hermite_polynomial_h_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9654372Z test_ops.py::TestTagsCUDA::test_tags_special_laguerre_polynomial_l_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9654524Z test_ops.py::TestTagsCUDA::test_tags_special_legendre_polynomial_p_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9654648Z test_ops.py::TestTagsCUDA::test_tags_special_log_ndtr_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9654783Z test_ops.py::TestTagsCUDA::test_tags_special_modified_bessel_k0_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9654937Z test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_w_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9655061Z test_ops.py::TestTagsCUDA::test_tags_special_xlog1py_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9655189Z test_ops.py::TestTagsCUDA::test_tags_split_with_sizes_copy_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9655302Z test_ops.py::TestTagsCUDA::test_tags_square_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9655427Z test_ops.py::TestTagsCUDA::test_tags_squeeze_multiple_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9655546Z test_ops.py::TestTagsCUDA::test_tags_std_unbiased_cuda_float32 SKIPPED [0.0011s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9655664Z test_ops.py::TestTagsCUDA::test_tags_take_along_dim_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9655773Z test_ops.py::TestTagsCUDA::test_tags_tanh_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9655890Z test_ops.py::TestTagsCUDA::test_tags_to_sparse_cuda_float32 SKIPPED [0.0001s] (Allowed exception) [ 99%] 2025-12-04T13:35:25.9656038Z test_ops.py::TestTagsCUDA::test_tags_torch_ops_aten__safe_softmax_default_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9656150Z test_ops.py::TestTagsCUDA::test_tags_trapz_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9656260Z test_ops.py::TestTagsCUDA::test_tags_tril_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9656368Z test_ops.py::TestTagsCUDA::test_tags_trunc_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9656485Z test_ops.py::TestTagsCUDA::test_tags_unbind_copy_cuda_float32 SKIPPED [0.0010s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9656598Z test_ops.py::TestTagsCUDA::test_tags_unsqueeze_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9656722Z test_ops.py::TestTagsCUDA::test_tags_var_mean_unbiased_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9656827Z test_ops.py::TestTagsCUDA::test_tags_vdot_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9656936Z test_ops.py::TestTagsCUDA::test_tags_view_as_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9657062Z test_ops.py::TestTagsCUDA::test_tags_zero__cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9657186Z test_ops.py::TestTagsCUDA::test_tags_zeros_like_cuda_float32 SKIPPED [0.0009s] (Only runs on cpu) [ 99%] 2025-12-04T13:35:25.9657348Z test_ops.py::TestForwardADWithScalarsCUDA::test_0d_tensor_with_python_scalar_div_floor_rounding_cuda_float32 PASSED [0.0019s] [ 99%] 2025-12-04T13:35:25.9657489Z test_ops.py::TestForwardADWithScalarsCUDA::test_0d_tensor_with_python_scalar_mul_cuda_float32 PASSED [0.0015s] [100%] 2025-12-04T13:35:25.9657493Z 2025-12-04T13:35:25.9657661Z - generated xml file: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/test_ops/test_ops-7a7b1e95275579c0.xml - 2025-12-04T13:35:25.9657752Z = 2591 passed, 355 skipped, 3936 deselected, 29 xfailed in 1080.22s (0:18:00) == 2025-12-04T13:35:25.9657948Z The following tests failed and then succeeded when run in a new process['test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_complex128'] 2025-12-04T13:35:25.9657953Z 2025-12-04T13:35:25.9658106Z FINISHED PRINTING LOG FILE of test_ops 1/5 (test/test-reports/test_ops_1.5_495b45214c930060_.log) 2025-12-04T13:35:25.9658122Z 2025-12-04T13:35:25.9658211Z Finished test_ops 1/5 ... [2025-12-04 13:35:25.669410][4980108.165594253], took 48.53min 2025-12-04T13:35:25.9658444Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:35:25.9658534Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:35:25.9658629Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T13:35:25.9658679Z Uploading artifacts took 0.00 seconds 2025-12-04T13:35:25.9658795Z Running torch_np/numpy_tests/linalg/test_linalg 1/1 ... [2025-12-04 13:35:25.688630][4980108.184815657] 2025-12-04T13:35:25.9658844Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:35:25.9659168Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'torch_np/numpy_tests/linalg/test_linalg.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:35:25.689103] 2025-12-04T13:35:33.3403020Z 2025-12-04T13:35:33.3404164Z torch_np/numpy_tests/linalg/test_linalg 1/1 was successful, full logs can be found in artifacts with path test/test-reports/torch_np.numpy_tests.linalg.test_linalg_1.1_cde7a1b6e68ef76e_.log 2025-12-04T13:35:33.3508911Z Running 268 items in this shard: test/torch_np/numpy_tests/linalg/test_linalg.py::TestSolve::test_0_size, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSolve::test_0_size_k, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSolve::test_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSolve::test_generalized_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSolve::test_generalized_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSolve::test_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSolve::test_types_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSolve::test_types_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSolve::test_types_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSolve::test_types_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestInv::test_0_size, test/torch_np/numpy_tests/linalg/test_linalg.py::TestInv::test_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestInv::test_generalized_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestInv::test_generalized_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestInv::test_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestInv::test_types_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestInv::test_types_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestInv::test_types_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestInv::test_types_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvals::test_0_size, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvals::test_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvals::test_generalized_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvals::test_generalized_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvals::test_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvals::test_types_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvals::test_types_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvals::test_types_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvals::test_types_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEig::test_0_size, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEig::test_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEig::test_generalized_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEig::test_generalized_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEig::test_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEig::test_types_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEig::test_types_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEig::test_types_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEig::test_types_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVD::test_empty_identity, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVD::test_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVD::test_generalized_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVD::test_generalized_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVD::test_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVD::test_types_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVD::test_types_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVD::test_types_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVD::test_types_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVDHermitian::test_empty_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVDHermitian::test_generalized_empty_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVDHermitian::test_generalized_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVDHermitian::test_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVDHermitian::test_types_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVDHermitian::test_types_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVDHermitian::test_types_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestSVDHermitian::test_types_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCond::test_basic_nonsvd, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCond::test_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCond::test_generalized_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCond::test_generalized_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCond::test_nan, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCond::test_singular, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCond::test_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCond::test_stacked_singular, test/torch_np/numpy_tests/linalg/test_linalg.py::TestPinv::test_empty_nonsq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestPinv::test_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestPinv::test_generalized_empty_nonsq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestPinv::test_generalized_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestPinv::test_generalized_nonsq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestPinv::test_generalized_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestPinv::test_nonsq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestPinv::test_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestPinvHermitian::test_empty_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestPinvHermitian::test_generalized_empty_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestPinvHermitian::test_generalized_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestPinvHermitian::test_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestDet::test_0_size, test/torch_np/numpy_tests/linalg/test_linalg.py::TestDet::test_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestDet::test_generalized_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestDet::test_generalized_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestDet::test_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestDet::test_types_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestDet::test_types_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestDet::test_types_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestDet::test_types_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestDet::test_zero, test/torch_np/numpy_tests/linalg/test_linalg.py::TestLstsq::test_empty_a_b_m_0_n_0_n_rhs_0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestLstsq::test_empty_a_b_m_0_n_4_n_rhs_1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestLstsq::test_empty_a_b_m_0_n_4_n_rhs_2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestLstsq::test_empty_a_b_m_4_n_0_n_rhs_1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestLstsq::test_empty_a_b_m_4_n_0_n_rhs_2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestLstsq::test_empty_a_b_m_4_n_2_n_rhs_2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestLstsq::test_empty_nonsq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestLstsq::test_empty_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestLstsq::test_future_rcond, test/torch_np/numpy_tests/linalg/test_linalg.py::TestLstsq::test_incompatible_dims, test/torch_np/numpy_tests/linalg/test_linalg.py::TestLstsq::test_nonsq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestLstsq::test_sq_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvalshCases::test_generalized_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvalshCases::test_generalized_empty_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvalshCases::test_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvalshCases::test_empty_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvalsh::test_0_size, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvalsh::test_UPLO, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvalsh::test_invalid, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvalsh::test_types_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvalsh::test_types_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvalsh::test_types_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigvalsh::test_types_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEighCases::test_generalized_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEighCases::test_generalized_empty_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEighCases::test_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEighCases::test_empty_herm_cases, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigh::test_0_size, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigh::test_UPLO, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigh::test_invalid, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigh::test_types_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigh::test_types_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigh::test_types_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestEigh::test_types_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNorm_NonSystematic::test_intmin, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormDouble::test_axis, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormDouble::test_bad_args, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormDouble::test_empty, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormDouble::test_keepdims, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormDouble::test_matrix_2x2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormDouble::test_matrix_3x3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormDouble::test_matrix_empty, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormDouble::test_matrix_return_type, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormDouble::test_vector, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormDouble::test_vector_return_type, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormSingle::test_axis, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormSingle::test_bad_args, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormSingle::test_empty, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormSingle::test_keepdims, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormSingle::test_matrix_2x2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormSingle::test_matrix_3x3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormSingle::test_matrix_empty, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormSingle::test_matrix_return_type, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormSingle::test_vector, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormSingle::test_vector_return_type, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormInt64::test_axis, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormInt64::test_bad_args, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormInt64::test_empty, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormInt64::test_keepdims, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormInt64::test_matrix_2x2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormInt64::test_matrix_3x3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormInt64::test_matrix_empty, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormInt64::test_matrix_return_type, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormInt64::test_vector, test/torch_np/numpy_tests/linalg/test_linalg.py::TestNormInt64::test_vector_return_type, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMatrixRank::test_matrix_rank, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMatrixRank::test_reduced_rank, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMatrixRank::test_symmetric_rank, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_mode_all_but_economic, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_mode_raw, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_qr_empty_m_0_n_0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_qr_empty_m_0_n_3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_qr_empty_m_3_n_0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size0_outer_size0_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size0_outer_size0_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size0_outer_size0_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size0_outer_size0_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size0_outer_size1_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size0_outer_size1_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size0_outer_size1_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size0_outer_size1_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size0_outer_size2_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size0_outer_size2_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size0_outer_size2_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size0_outer_size2_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size1_outer_size0_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size1_outer_size0_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size1_outer_size0_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size1_outer_size0_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size1_outer_size1_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size1_outer_size1_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size1_outer_size1_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size1_outer_size1_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size1_outer_size2_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size1_outer_size2_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size1_outer_size2_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size1_outer_size2_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size2_outer_size0_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size2_outer_size0_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size2_outer_size0_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size2_outer_size0_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size2_outer_size1_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size2_outer_size1_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size2_outer_size1_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size2_outer_size1_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size2_outer_size2_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size2_outer_size2_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size2_outer_size2_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size2_outer_size2_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size3_outer_size0_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size3_outer_size0_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size3_outer_size0_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size3_outer_size0_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size3_outer_size1_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size3_outer_size1_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size3_outer_size1_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size3_outer_size1_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size3_outer_size2_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size3_outer_size2_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size3_outer_size2_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size3_outer_size2_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size4_outer_size0_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size4_outer_size0_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size4_outer_size0_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size4_outer_size0_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size4_outer_size1_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size4_outer_size1_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size4_outer_size1_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size4_outer_size1_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size4_outer_size2_dt0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size4_outer_size2_dt1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size4_outer_size2_dt2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestQR::test_stacked_inputs_size4_outer_size2_dt3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_0_size, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape0_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape0_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape0_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape0_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape1_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape1_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape1_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape1_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape2_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape2_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape2_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape2_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape3_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape3_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape3_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape3_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape4_dtype0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape4_dtype1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape4_dtype2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestCholesky::test_basic_property_shape4_dtype3, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMisc::test_byteorder_check, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMisc::test_generalized_raise_multiloop, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMisc::test_sdot_bug_8577, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMisc::test_xerbla_override, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMultiDot::test_basic_function_with_dynamic_programming_optimization, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMultiDot::test_basic_function_with_three_arguments, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMultiDot::test_basic_function_with_two_arguments, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMultiDot::test_dynamic_programming_logic, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMultiDot::test_dynamic_programming_optimization_and_out, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMultiDot::test_three_arguments_and_out, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMultiDot::test_too_few_input_arrays, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMultiDot::test_two_arguments_and_out, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMultiDot::test_vector_as_first_and_last_argument, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMultiDot::test_vector_as_first_argument, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMultiDot::test_vector_as_last_argument, test/torch_np/numpy_tests/linalg/test_linalg.py::TestTensorinv::test_non_square_handling_arr0_ind_2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestTensorinv::test_non_square_handling_arr1_ind_1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestTensorinv::test_tensorinv_ind_limit_ind_-2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestTensorinv::test_tensorinv_ind_limit_ind_0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestTensorinv::test_tensorinv_result, test/torch_np/numpy_tests/linalg/test_linalg.py::TestTensorinv::test_tensorinv_shape_shape0_ind_2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestTensorinv::test_tensorinv_shape_shape1_ind_1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestTensorsolve::test_non_square_handling_a0_axes0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestTensorsolve::test_non_square_handling_a1_axes1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestTensorsolve::test_tensorsolve_result_shape0, test/torch_np/numpy_tests/linalg/test_linalg.py::TestTensorsolve::test_tensorsolve_result_shape1, test/torch_np/numpy_tests/linalg/test_linalg.py::TestTensorsolve::test_tensorsolve_result_shape2, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMisc2::test_blas64_dot, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMisc2::test_blas64_geqrf_lwork_smoketest, test/torch_np/numpy_tests/linalg/test_linalg.py::TestMisc2::test_unsupported_commontype 2025-12-04T13:35:33.3611803Z 2025-12-04T13:35:33.3612270Z Finished torch_np/numpy_tests/linalg/test_linalg 1/1 ... [2025-12-04 13:35:33.340467][4980115.836651879], took 0.13min 2025-12-04T13:35:33.3613632Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:35:33.3614800Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:35:33.3615503Z Running nn/test_module_hooks 1/1 ... [2025-12-04 13:35:33.359723][4980115.855908943] 2025-12-04T13:35:33.3616092Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:35:33.3617353Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'nn/test_module_hooks.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:35:33.360201] 2025-12-04T13:35:35.7316072Z 2025-12-04T13:35:35.7317172Z nn/test_module_hooks 1/1 was successful, full logs can be found in artifacts with path test/test-reports/nn.test_module_hooks_1.1_9ab86a9ddeac745c_.log 2025-12-04T13:35:35.7338619Z Running 53 items in this shard: test/nn/test_module_hooks.py::TestModuleHooks::test_always_called_forward_hooks, test/nn/test_module_hooks.py::TestModuleHooks::test_bw_hook_warning_for_non_tensor_or_tuple, test/nn/test_module_hooks.py::TestModuleHooks::test_forward_hooks_named_tuple_False, test/nn/test_module_hooks.py::TestModuleHooks::test_forward_hooks_named_tuple_True, test/nn/test_module_hooks.py::TestModuleHooks::test_forward_pre_hooks_named_tuple_False, test/nn/test_module_hooks.py::TestModuleHooks::test_forward_pre_hooks_named_tuple_True, test/nn/test_module_hooks.py::TestModuleHooks::test_full_backward_hooks_named_tuple_False, test/nn/test_module_hooks.py::TestModuleHooks::test_full_backward_hooks_named_tuple_True, test/nn/test_module_hooks.py::TestModuleHooks::test_full_backward_pre_hooks_named_tuple_False, test/nn/test_module_hooks.py::TestModuleHooks::test_full_backward_pre_hooks_named_tuple_True, test/nn/test_module_hooks.py::TestModuleHooks::test_kwarg_hooks, test/nn/test_module_hooks.py::TestModuleHooks::test_mixed_hooks_named_tuple_False, test/nn/test_module_hooks.py::TestModuleHooks::test_mixed_hooks_named_tuple_True, test/nn/test_module_hooks.py::TestModuleHooks::test_remove_kwarg_hooks, test/nn/test_module_hooks.py::TestStateDictHooks::test_load_state_dict_module_pre_hook_swap_False, test/nn/test_module_hooks.py::TestStateDictHooks::test_load_state_dict_module_pre_hook_swap_True, test/nn/test_module_hooks.py::TestStateDictHooks::test_load_state_dict_post_hook_backward_compatibility_swap_False, test/nn/test_module_hooks.py::TestStateDictHooks::test_load_state_dict_post_hook_backward_compatibility_swap_True, test/nn/test_module_hooks.py::TestStateDictHooks::test_load_state_dict_post_hook_swap_False, test/nn/test_module_hooks.py::TestStateDictHooks::test_load_state_dict_post_hook_swap_True, test/nn/test_module_hooks.py::TestStateDictHooks::test_load_state_dict_pre_hook_swap_False, test/nn/test_module_hooks.py::TestStateDictHooks::test_load_state_dict_pre_hook_swap_True, test/nn/test_module_hooks.py::TestStateDictHooks::test_no_extra_ref_to_module, test/nn/test_module_hooks.py::TestStateDictHooks::test_pickled_hook, test/nn/test_module_hooks.py::TestStateDictHooks::test_register_state_dict_post_hook_private_False, test/nn/test_module_hooks.py::TestStateDictHooks::test_register_state_dict_post_hook_private_True, test/nn/test_module_hooks.py::TestStateDictHooks::test_register_state_dict_pre_hook, test/nn/test_module_hooks.py::TestStateDictHooks::test_register_state_dict_pre_hook_backward_compat, test/nn/test_module_hooks.py::TestStateDictHooks::test_register_state_dict_pre_hook_lazy_module, test/nn/test_module_hooks.py::TestModuleGlobalHooks::test_global_and_local_hooks_order, test/nn/test_module_hooks.py::TestModuleGlobalHooks::test_module_backward_global_hook_writeable, test/nn/test_module_hooks.py::TestModuleGlobalHooks::test_module_forward_forward_hook_removable, test/nn/test_module_hooks.py::TestModuleGlobalHooks::test_module_forward_preforward_hook_removable, test/nn/test_module_hooks.py::TestModuleGlobalHooks::test_module_global_forward_preforward_hook_writeable, test/nn/test_module_hooks.py::TestModuleGlobalHooks::test_module_global_hook_invalid_outputs, test/nn/test_module_hooks.py::TestModuleGlobalHooks::test_module_global_hooks, test/nn/test_module_hooks.py::TestModuleGlobalHooks::test_module_global_hooks_with_kwargs, test/nn/test_module_hooks.py::TestModuleHookNN::test_backward_hooks_interaction, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_backward_size, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_backward_writeable, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_buffer_registration, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_cpp, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_extra_input, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_forward_preforward_writable, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_inplace, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_invalid_outputs, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_last_arg_requires_grad, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_no_requires_grad, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_non_full_warning, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_parameter_registration, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_requires_grad, test/nn/test_module_hooks.py::TestModuleHookNN::test_hook_submodule_registration, test/nn/test_module_hooks.py::TestModuleHookNN::test_hooks 2025-12-04T13:35:35.7358560Z 2025-12-04T13:35:35.7358933Z Finished nn/test_module_hooks 1/1 ... [2025-12-04 13:35:35.731387][4980118.227572425], took 0.04min 2025-12-04T13:35:35.7360216Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:35:35.7504665Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:35:35.7511430Z Running torch_np/numpy_tests/lib/test_twodim_base 1/1 ... [2025-12-04 13:35:35.750883][4980118.247068975] 2025-12-04T13:35:35.7512196Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:35:35.7515868Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'torch_np/numpy_tests/lib/test_twodim_base.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:35:35.751355] 2025-12-04T13:35:38.8236403Z 2025-12-04T13:35:38.8237786Z torch_np/numpy_tests/lib/test_twodim_base 1/1 was successful, full logs can be found in artifacts with path test/test-reports/torch_np.numpy_tests.lib.test_twodim_base_1.1_ee737928c3c0eeb6_.log 2025-12-04T13:35:38.8250551Z Running 34 items in this shard: test/torch_np/numpy_tests/lib/test_twodim_base.py::TestEye::test_2d, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestEye::test_basic, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestEye::test_bool, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestEye::test_diag, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestEye::test_diag2d, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestEye::test_eye_bounds, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestEye::test_order, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestDiag::test_diag_bounds, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestDiag::test_failure, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestDiag::test_fortran_order, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestDiag::test_matrix, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestDiag::test_vector, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestFliplr::test_basic, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestFlipud::test_basic, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestHistogram2d::test_all_outliers, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestHistogram2d::test_asym, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestHistogram2d::test_bad_length_x_len_10_y_len_11, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestHistogram2d::test_bad_length_x_len_20_y_len_19, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestHistogram2d::test_binparameter_combination, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestHistogram2d::test_density, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestHistogram2d::test_empty, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestHistogram2d::test_simple, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestTri::test_dtype, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestTri::test_mask_indices, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestTri::test_tril_indices, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestTri::test_tril_triu_dtype, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestTri::test_tril_triu_ndim2, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestTri::test_tril_triu_ndim3, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestTri::test_tril_triu_with_inf, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestTriuIndices::test_triu_indices, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestTrilIndicesFrom::test_exceptions, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestTriuIndicesFrom::test_exceptions, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestVander::test_basic, test/torch_np/numpy_tests/lib/test_twodim_base.py::TestVander::test_dtypes 2025-12-04T13:35:38.8262898Z 2025-12-04T13:35:38.8263354Z Finished torch_np/numpy_tests/lib/test_twodim_base 1/1 ... [2025-12-04 13:35:38.823340][4980121.31952494], took 0.05min 2025-12-04T13:35:38.8271601Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:35:38.8424665Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:35:38.8431325Z Running profiler/test_memory_profiler 1/1 ... [2025-12-04 13:35:38.842778][4980121.33896435] 2025-12-04T13:35:38.8432058Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:35:38.8435266Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'profiler/test_memory_profiler.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:35:38.843257] 2025-12-04T13:35:43.6244403Z 2025-12-04T13:35:43.6245616Z profiler/test_memory_profiler 1/1 was successful, full logs can be found in artifacts with path test/test-reports/profiler.test_memory_profiler_1.1_2aa8cac9b154c706_.log 2025-12-04T13:35:43.6258798Z Running 33 items in this shard: test/profiler/test_memory_profiler.py::TestMemoryProfiler::test_config_check, test/profiler/test_memory_profiler.py::TestIdentifyGradients::test_extract_gradients_from_module, test/profiler/test_memory_profiler.py::TestIdentifyGradients::test_extract_gradients_from_module_and_optimizer, test/profiler/test_memory_profiler.py::TestIdentifyGradients::test_extract_gradients_from_optimizer, test/profiler/test_memory_profiler.py::TestIdentifyGradients::test_extract_gradients_from_optimizer_set_to_none, test/profiler/test_memory_profiler.py::TestIdentifyGradients::test_extract_gradients_low_level, test/profiler/test_memory_profiler.py::TestDataFlow::test_data_flow_graph_complicated, test/profiler/test_memory_profiler.py::TestDataFlow::test_data_flow_graph_non_op_allocations, test/profiler/test_memory_profiler.py::TestDataFlow::test_data_flow_graph_simple, test/profiler/test_memory_profiler.py::TestDataFlow::test_data_flow_graph_simple_backward, test/profiler/test_memory_profiler.py::TestDataFlow::test_data_flow_graph_simple_inplace, test/profiler/test_memory_profiler.py::TestDataFlow::test_data_flow_graph_stacked, test/profiler/test_memory_profiler.py::TestDataFlow::test_data_flow_graph_with_annotations, test/profiler/test_memory_profiler.py::TestDataFlow::test_match_schemas, test/profiler/test_memory_profiler.py::TestDataFlow::test_match_schemas_backward, test/profiler/test_memory_profiler.py::TestDataFlow::test_match_schemas_tensorlist, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_categories_e2e_sequential_fwd, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_categories_e2e_sequential_fwd_bwd, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_categories_e2e_simple_fwd, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_categories_e2e_simple_fwd_bwd, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_categories_e2e_simple_fwd_bwd_step, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_categories_e2e_simple_module_fwd, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_categories_e2e_simple_module_fwd_bwd, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_categories_e2e_simple_module_fwd_bwd_step, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_inputs_fwd, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_inputs_fwd_bwd, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_inputs_fwd_lazy, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_lazily_initialized, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_manual_optimizer_step, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_memory_timeline, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_parameters_and_gradients, test/profiler/test_memory_profiler.py::TestMemoryProfilerE2E::test_parameters_and_gradients_set_to_none, test/profiler/test_memory_profiler.py::TestMemoryProfilerTimelineCUDA::test_memory_timeline_no_id_cuda 2025-12-04T13:35:43.6273558Z 2025-12-04T13:35:43.6273976Z Finished profiler/test_memory_profiler 1/1 ... [2025-12-04 13:35:43.624079][4980126.120262967], took 0.08min 2025-12-04T13:35:43.6278346Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:35:43.6434906Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:35:43.6439312Z Running test_serialization 1/1 ... [2025-12-04 13:35:43.643707][4980126.139893524] 2025-12-04T13:35:43.6439911Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:35:43.6444199Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_serialization.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:35:43.644196] 2025-12-04T13:36:06.6003728Z 2025-12-04T13:36:06.6004414Z test_serialization 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_serialization_1.1_780fc43d037c01c0_.log 2025-12-04T13:36:06.6049227Z Running 206 items in this shard: test/test_serialization.py::TestOldSerialization::test_debug_set_in_ci, test/test_serialization.py::TestOldSerialization::test_load_error_msg, test/test_serialization.py::TestOldSerialization::test_load_nonexistent_device, test/test_serialization.py::TestOldSerialization::test_load_python2_unicode_module, test/test_serialization.py::TestOldSerialization::test_load_unicode_error_msg, test/test_serialization.py::TestOldSerialization::test_pickle_module, test/test_serialization.py::TestOldSerialization::test_safe_load_basic_types, test/test_serialization.py::TestOldSerialization::test_save_different_dtype_error, test/test_serialization.py::TestOldSerialization::test_save_different_dtype_unallocated, test/test_serialization.py::TestOldSerialization::test_serialization, test/test_serialization.py::TestOldSerialization::test_serialization_backwards_compat, test/test_serialization.py::TestOldSerialization::test_serialization_backwards_compat_safe, test/test_serialization.py::TestOldSerialization::test_serialization_container, test/test_serialization.py::TestOldSerialization::test_serialization_container_filelike, test/test_serialization.py::TestOldSerialization::test_serialization_dill, test/test_serialization.py::TestOldSerialization::test_serialization_dill_version_not_supported, test/test_serialization.py::TestOldSerialization::test_serialization_fake_zip, test/test_serialization.py::TestOldSerialization::test_serialization_filelike, test/test_serialization.py::TestOldSerialization::test_serialization_filelike_api_requirements, test/test_serialization.py::TestOldSerialization::test_serialization_filelike_exceptions, test/test_serialization.py::TestOldSerialization::test_serialization_filelike_missing_attrs, test/test_serialization.py::TestOldSerialization::test_serialization_filelike_stress, test/test_serialization.py::TestOldSerialization::test_serialization_filelike_uses_readinto, test/test_serialization.py::TestOldSerialization::test_serialization_gzip, test/test_serialization.py::TestOldSerialization::test_serialization_map_location, test/test_serialization.py::TestOldSerialization::test_serialization_offset, test/test_serialization.py::TestOldSerialization::test_serialization_offset_filelike_weights_only_False, test/test_serialization.py::TestOldSerialization::test_serialization_offset_filelike_weights_only_True, test/test_serialization.py::TestOldSerialization::test_serialization_offset_gzip, test/test_serialization.py::TestOldSerialization::test_serialization_safe, test/test_serialization.py::TestOldSerialization::test_serialization_save_warnings, test/test_serialization.py::TestOldSerialization::test_serialization_sparse, test/test_serialization.py::TestOldSerialization::test_serialization_sparse_bsc_invalid, test/test_serialization.py::TestOldSerialization::test_serialization_sparse_bsr_invalid, test/test_serialization.py::TestOldSerialization::test_serialization_sparse_csc_invalid, test/test_serialization.py::TestOldSerialization::test_serialization_sparse_csr_invalid, test/test_serialization.py::TestOldSerialization::test_serialization_sparse_invalid, test/test_serialization.py::TestOldSerialization::test_serialization_sparse_invalid_legacy_ctor, test/test_serialization.py::TestOldSerialization::test_serialization_sparse_safe, test/test_serialization.py::TestOldSerialization::test_serialization_storage_slice, test/test_serialization.py::TestOldSerialization::test_serialization_zipfile_utils, test/test_serialization.py::TestOldSerialization::test_serialize_device, test/test_serialization.py::TestOldSerialization::test_skip_data_load, test/test_serialization.py::TestSerialization::test_crc32_options_compute_crc32_False_filename_False, test/test_serialization.py::TestSerialization::test_crc32_options_compute_crc32_False_filename_True, test/test_serialization.py::TestSerialization::test_crc32_options_compute_crc32_True_filename_False, test/test_serialization.py::TestSerialization::test_crc32_options_compute_crc32_True_filename_True, test/test_serialization.py::TestSerialization::test_debug_set_in_ci, test/test_serialization.py::TestSerialization::test_filewriter_metadata_writing_filename_False, test/test_serialization.py::TestSerialization::test_filewriter_metadata_writing_filename_True, test/test_serialization.py::TestSerialization::test_get_unsafe_globals_in_checkpoint, test/test_serialization.py::TestSerialization::test_has_format_version, test/test_serialization.py::TestSerialization::test_load_error_msg, test/test_serialization.py::TestSerialization::test_load_njt_weights_only_should_import_False, test/test_serialization.py::TestSerialization::test_load_njt_weights_only_should_import_True, test/test_serialization.py::TestSerialization::test_load_nonexistent_device, test/test_serialization.py::TestSerialization::test_load_python2_unicode_module, test/test_serialization.py::TestSerialization::test_load_unicode_error_msg, test/test_serialization.py::TestSerialization::test_lr_scheduler_serialization, test/test_serialization.py::TestSerialization::test_meta_serialization_weights_only_False, test/test_serialization.py::TestSerialization::test_meta_serialization_weights_only_True, test/test_serialization.py::TestSerialization::test_mmap_load_offset_calculation_path_type0, test/test_serialization.py::TestSerialization::test_mmap_load_offset_calculation_path_type1, test/test_serialization.py::TestSerialization::test_pathlike_serialization_weights_only_False, test/test_serialization.py::TestSerialization::test_pathlike_serialization_weights_only_True, test/test_serialization.py::TestSerialization::test_pickle_module, test/test_serialization.py::TestSerialization::test_safe_load_basic_types, test/test_serialization.py::TestSerialization::test_save_different_dtype_error, test/test_serialization.py::TestSerialization::test_save_different_dtype_unallocated, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_bfloat16_weights_only_False, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_bfloat16_weights_only_True, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_bool_weights_only_False, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_bool_weights_only_True, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_complex128_weights_only_False, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_complex128_weights_only_True, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_complex64_weights_only_False, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_complex64_weights_only_True, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_float16_weights_only_False, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_float16_weights_only_True, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_float32_weights_only_False, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_float32_weights_only_True, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_float64_weights_only_False, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_float64_weights_only_True, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_int16_weights_only_False, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_int16_weights_only_True, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_int32_weights_only_False, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_int32_weights_only_True, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_int64_weights_only_False, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_int64_weights_only_True, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_int8_weights_only_False, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_int8_weights_only_True, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_uint8_weights_only_False, test/test_serialization.py::TestSerialization::test_save_load_preserves_dtype_uint8_weights_only_True, test/test_serialization.py::TestSerialization::test_serializaion_no_storage_leak, test/test_serialization.py::TestSerialization::test_serialization, test/test_serialization.py::TestSerialization::test_serialization_2gb_file, test/test_serialization.py::TestSerialization::test_serialization_4gb_file, test/test_serialization.py::TestSerialization::test_serialization_backwards_compat, test/test_serialization.py::TestSerialization::test_serialization_backwards_compat_safe, test/test_serialization.py::TestSerialization::test_serialization_byte_literal_byte_literals0_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_byte_literal_byte_literals0_weights_only_True, test/test_serialization.py::TestSerialization::test_serialization_byte_literal_byte_literals1_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_byte_literal_byte_literals1_weights_only_True, test/test_serialization.py::TestSerialization::test_serialization_byteorder_mark, test/test_serialization.py::TestSerialization::test_serialization_dill, test/test_serialization.py::TestSerialization::test_serialization_dill_version_not_supported, test/test_serialization.py::TestSerialization::test_serialization_dtype_complex32_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_dtype_complex32_weights_only_True, test/test_serialization.py::TestSerialization::test_serialization_dtype_float8_e4m3fn_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_dtype_float8_e4m3fn_weights_only_True, test/test_serialization.py::TestSerialization::test_serialization_dtype_float8_e5m2_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_dtype_float8_e5m2_weights_only_True, test/test_serialization.py::TestSerialization::test_serialization_dtype_uint16_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_dtype_uint16_weights_only_True, test/test_serialization.py::TestSerialization::test_serialization_dtype_uint32_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_dtype_uint32_weights_only_True, test/test_serialization.py::TestSerialization::test_serialization_dtype_uint64_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_dtype_uint64_weights_only_True, test/test_serialization.py::TestSerialization::test_serialization_efficient_zerotensor_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_efficient_zerotensor_weights_only_True, test/test_serialization.py::TestSerialization::test_serialization_fake_zip, test/test_serialization.py::TestSerialization::test_serialization_filelike, test/test_serialization.py::TestSerialization::test_serialization_filelike_api_requirements, test/test_serialization.py::TestSerialization::test_serialization_filelike_exceptions, test/test_serialization.py::TestSerialization::test_serialization_filelike_missing_attrs, test/test_serialization.py::TestSerialization::test_serialization_filelike_stress, test/test_serialization.py::TestSerialization::test_serialization_filelike_uses_readinto, test/test_serialization.py::TestSerialization::test_serialization_gzip, test/test_serialization.py::TestSerialization::test_serialization_load_bom_data, test/test_serialization.py::TestSerialization::test_serialization_load_bom_data_bfloat16, test/test_serialization.py::TestSerialization::test_serialization_load_bom_data_bool, test/test_serialization.py::TestSerialization::test_serialization_load_bom_data_cdouble, test/test_serialization.py::TestSerialization::test_serialization_load_bom_data_cfloat, test/test_serialization.py::TestSerialization::test_serialization_load_bom_data_double, test/test_serialization.py::TestSerialization::test_serialization_load_bom_data_float, test/test_serialization.py::TestSerialization::test_serialization_load_bom_data_half, test/test_serialization.py::TestSerialization::test_serialization_load_bom_data_int, test/test_serialization.py::TestSerialization::test_serialization_load_bom_data_int16, test/test_serialization.py::TestSerialization::test_serialization_load_bom_data_int8, test/test_serialization.py::TestSerialization::test_serialization_load_bom_data_long, test/test_serialization.py::TestSerialization::test_serialization_load_bom_data_uint8, test/test_serialization.py::TestSerialization::test_serialization_map_location, test/test_serialization.py::TestSerialization::test_serialization_math_bits_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_math_bits_weights_only_True, test/test_serialization.py::TestSerialization::test_serialization_mmap_loading, test/test_serialization.py::TestSerialization::test_serialization_mmap_loading_ctx, test/test_serialization.py::TestSerialization::test_serialization_mmap_loading_options_path_type0_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_mmap_loading_options_path_type0_weights_only_True, test/test_serialization.py::TestSerialization::test_serialization_mmap_loading_options_path_type1_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_mmap_loading_options_path_type1_weights_only_True, test/test_serialization.py::TestSerialization::test_serialization_mmap_loading_with_map_location, test/test_serialization.py::TestSerialization::test_serialization_nested_class, test/test_serialization.py::TestSerialization::test_serialization_offset_gzip, test/test_serialization.py::TestSerialization::test_serialization_python_attr_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_python_attr_weights_only_True, test/test_serialization.py::TestSerialization::test_serialization_safe, test/test_serialization.py::TestSerialization::test_serialization_save_warnings, test/test_serialization.py::TestSerialization::test_serialization_sparse, test/test_serialization.py::TestSerialization::test_serialization_sparse_bsc_invalid, test/test_serialization.py::TestSerialization::test_serialization_sparse_bsr_invalid, test/test_serialization.py::TestSerialization::test_serialization_sparse_csc_invalid, test/test_serialization.py::TestSerialization::test_serialization_sparse_csr_invalid, test/test_serialization.py::TestSerialization::test_serialization_sparse_invalid, test/test_serialization.py::TestSerialization::test_serialization_sparse_invalid_legacy_ctor, test/test_serialization.py::TestSerialization::test_serialization_sparse_safe, test/test_serialization.py::TestSerialization::test_serialization_storage_slice, test/test_serialization.py::TestSerialization::test_serialization_uintx_intx, test/test_serialization.py::TestSerialization::test_serialization_warning_s390x, test/test_serialization.py::TestSerialization::test_serialization_with_header, test/test_serialization.py::TestSerialization::test_serialization_zipfile_actually_jit, test/test_serialization.py::TestSerialization::test_serialization_zipfile_utils, test/test_serialization.py::TestSerialization::test_serialization_zipfile_weights_only_False, test/test_serialization.py::TestSerialization::test_serialization_zipfile_weights_only_True, test/test_serialization.py::TestSerialization::test_serialize_device, test/test_serialization.py::TestSerialization::test_skip_data_load, test/test_serialization.py::TestSerialization::test_skip_data_serialization_error_cases, test/test_serialization.py::TestSerialization::test_skip_data_serialization_materialize_fake_False, test/test_serialization.py::TestSerialization::test_skip_data_serialization_materialize_fake_True, test/test_serialization.py::TestSerialization::test_skip_data_serialization_preserves_views_materialize_fake_False, test/test_serialization.py::TestSerialization::test_skip_data_serialization_preserves_views_materialize_fake_True, test/test_serialization.py::TestSerialization::test_storage_alignment, test/test_serialization.py::TestSerialization::test_use_pinned_memory_for_d2h, test/test_serialization.py::TestSerialization::test_weights_only_assert, test/test_serialization.py::TestSerialization::test_weights_only_blocked_func_error_msg, test/test_serialization.py::TestSerialization::test_weights_only_env_variables_force_weights_only_False, test/test_serialization.py::TestSerialization::test_weights_only_env_variables_force_weights_only_True, test/test_serialization.py::TestSerialization::test_weights_only_error_unsafe_global_False, test/test_serialization.py::TestSerialization::test_weights_only_error_unsafe_global_True, test/test_serialization.py::TestSerialization::test_weights_only_safe_globals_blocklist, test/test_serialization.py::TestSerialization::test_weights_only_safe_globals_build, test/test_serialization.py::TestSerialization::test_weights_only_safe_globals_build_with_slots_slots_all, test/test_serialization.py::TestSerialization::test_weights_only_safe_globals_build_with_slots_slots_some, test/test_serialization.py::TestSerialization::test_weights_only_safe_globals_newobj, test/test_serialization.py::TestSerialization::test_weights_only_with_zoneinfo_unpickle_registration_success, test/test_serialization.py::TestSubclassSerialization::test_cloned_deepcopy_requires_grad_False, test/test_serialization.py::TestSubclassSerialization::test_cloned_deepcopy_requires_grad_True, test/test_serialization.py::TestSubclassSerialization::test_empty_class_serialization, test/test_serialization.py::TestSubclassSerialization::test_safe_globals_context_manager_weights_only, test/test_serialization.py::TestSubclassSerialization::test_safe_globals_for_weights_only, test/test_serialization.py::TestSubclassSerialization::test_sets_are_loadable_with_weights_only, test/test_serialization.py::TestSubclassSerialization::test_tensor_subclass_deepcopy, test/test_serialization.py::TestSubclassSerialization::test_tensor_subclass_getstate_overwrite, test/test_serialization.py::TestSubclassSerialization::test_tensor_subclass_map_location, test/test_serialization.py::TestSubclassSerialization::test_tensor_subclass_wrapper_serialization, test/test_serialization.py::TestBothSerializationCUDA::test_serialization_new_format_old_format_compat_weights_only_False_cuda, test/test_serialization.py::TestBothSerializationCUDA::test_serialization_new_format_old_format_compat_weights_only_True_cuda 2025-12-04T13:36:06.6074236Z 2025-12-04T13:36:06.6074351Z Finished test_serialization 1/1 ... [2025-12-04 13:36:06.600404][4980149.096590028], took 0.38min 2025-12-04T13:36:06.6074739Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:36:06.6192834Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:36:06.6199779Z Running test_indexing 1/1 ... [2025-12-04 13:36:06.619757][4980149.11594361] 2025-12-04T13:36:06.6200395Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:36:06.6207059Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_indexing.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:36:06.620210] 2025-12-04T13:36:20.3832774Z 2025-12-04T13:36:20.3834450Z test_indexing 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_indexing_1.1_89697147382f554f_.log 2025-12-04T13:36:20.3895370Z Running 186 items in this shard: test/test_indexing.py::TestIndexingCUDA::test_advancedindex_big_cuda, test/test_indexing.py::TestIndexingCUDA::test_advancedindex_cuda_float16, test/test_indexing.py::TestIndexingCUDA::test_advancedindex_cuda_float64, test/test_indexing.py::TestIndexingCUDA::test_basic_advanced_combined_cuda, test/test_indexing.py::TestIndexingCUDA::test_bool_indices_accumulate_cuda, test/test_indexing.py::TestIndexingCUDA::test_bool_indices_cuda, test/test_indexing.py::TestIndexingCUDA::test_bool_mask_assignment_cuda, test/test_indexing.py::TestIndexingCUDA::test_byte_mask2d_cuda, test/test_indexing.py::TestIndexingCUDA::test_byte_mask_accumulate_cuda, test/test_indexing.py::TestIndexingCUDA::test_byte_mask_cuda, test/test_indexing.py::TestIndexingCUDA::test_byte_tensor_assignment_cuda, test/test_indexing.py::TestIndexingCUDA::test_cpu_indices_cuda, test/test_indexing.py::TestIndexingCUDA::test_cuda_broadcast_index_use_deterministic_algorithms_cuda, test/test_indexing.py::TestIndexingCUDA::test_ellipsis_tensor_cuda, test/test_indexing.py::TestIndexingCUDA::test_empty_index_cuda, test/test_indexing.py::TestIndexingCUDA::test_empty_ndim_index_bool_cuda, test/test_indexing.py::TestIndexingCUDA::test_empty_ndim_index_cuda, test/test_indexing.py::TestIndexingCUDA::test_empty_slice_cuda, test/test_indexing.py::TestIndexingCUDA::test_errors_index_copy_cuda, test/test_indexing.py::TestIndexingCUDA::test_gather_take_along_dim_cross_device_cuda_float32, test/test_indexing.py::TestIndexingCUDA::test_getitem_scalars_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_add_deterministic_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_copy_cuda_bfloat16, test/test_indexing.py::TestIndexingCUDA::test_index_copy_cuda_bool, test/test_indexing.py::TestIndexingCUDA::test_index_copy_cuda_complex128, test/test_indexing.py::TestIndexingCUDA::test_index_copy_cuda_complex64, test/test_indexing.py::TestIndexingCUDA::test_index_copy_cuda_float16, test/test_indexing.py::TestIndexingCUDA::test_index_copy_cuda_float32, test/test_indexing.py::TestIndexingCUDA::test_index_copy_cuda_float64, test/test_indexing.py::TestIndexingCUDA::test_index_copy_cuda_int16, test/test_indexing.py::TestIndexingCUDA::test_index_copy_cuda_int32, test/test_indexing.py::TestIndexingCUDA::test_index_copy_cuda_int64, test/test_indexing.py::TestIndexingCUDA::test_index_copy_cuda_int8, test/test_indexing.py::TestIndexingCUDA::test_index_copy_cuda_uint8, test/test_indexing.py::TestIndexingCUDA::test_index_copy_deterministic_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_copy_scalars_cuda_bfloat16, test/test_indexing.py::TestIndexingCUDA::test_index_copy_scalars_cuda_bool, test/test_indexing.py::TestIndexingCUDA::test_index_copy_scalars_cuda_complex128, test/test_indexing.py::TestIndexingCUDA::test_index_copy_scalars_cuda_complex64, test/test_indexing.py::TestIndexingCUDA::test_index_copy_scalars_cuda_float16, test/test_indexing.py::TestIndexingCUDA::test_index_copy_scalars_cuda_float32, test/test_indexing.py::TestIndexingCUDA::test_index_copy_scalars_cuda_float64, test/test_indexing.py::TestIndexingCUDA::test_index_copy_scalars_cuda_int16, test/test_indexing.py::TestIndexingCUDA::test_index_copy_scalars_cuda_int32, test/test_indexing.py::TestIndexingCUDA::test_index_copy_scalars_cuda_int64, test/test_indexing.py::TestIndexingCUDA::test_index_copy_scalars_cuda_int8, test/test_indexing.py::TestIndexingCUDA::test_index_copy_scalars_cuda_uint8, test/test_indexing.py::TestIndexingCUDA::test_index_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_fill_cuda_bfloat16, test/test_indexing.py::TestIndexingCUDA::test_index_fill_cuda_bool, test/test_indexing.py::TestIndexingCUDA::test_index_fill_cuda_complex128, test/test_indexing.py::TestIndexingCUDA::test_index_fill_cuda_complex64, test/test_indexing.py::TestIndexingCUDA::test_index_fill_cuda_float16, test/test_indexing.py::TestIndexingCUDA::test_index_fill_cuda_float32, test/test_indexing.py::TestIndexingCUDA::test_index_fill_cuda_float64, test/test_indexing.py::TestIndexingCUDA::test_index_fill_cuda_int16, test/test_indexing.py::TestIndexingCUDA::test_index_fill_cuda_int32, test/test_indexing.py::TestIndexingCUDA::test_index_fill_cuda_int64, test/test_indexing.py::TestIndexingCUDA::test_index_fill_cuda_int8, test/test_indexing.py::TestIndexingCUDA::test_index_fill_cuda_uint8, test/test_indexing.py::TestIndexingCUDA::test_index_getitem_copy_bools_slices_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_ind_dtype_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_limits_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_put_accumulate_duplicate_indices_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_put_accumulate_empty_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_put_accumulate_expanded_values_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_put_accumulate_large_tensor_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_put_accumulate_non_contiguous_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_put_deterministic_with_optional_tensors_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_put_large_indices_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_put_non_accumulate_deterministic_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_put_src_datatype_cuda_bfloat16, test/test_indexing.py::TestIndexingCUDA::test_index_put_src_datatype_cuda_bool, test/test_indexing.py::TestIndexingCUDA::test_index_put_src_datatype_cuda_complex128, test/test_indexing.py::TestIndexingCUDA::test_index_put_src_datatype_cuda_complex64, test/test_indexing.py::TestIndexingCUDA::test_index_put_src_datatype_cuda_float16, test/test_indexing.py::TestIndexingCUDA::test_index_put_src_datatype_cuda_float8_e4m3fn, test/test_indexing.py::TestIndexingCUDA::test_index_put_src_datatype_cuda_float8_e5m2, test/test_indexing.py::TestIndexingCUDA::test_index_put_src_datatype_cuda_int64, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amax_cuda_bfloat16, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amax_cuda_float16, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amax_cuda_float32, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amax_cuda_float64, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amax_cuda_int16, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amax_cuda_int32, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amax_cuda_int64, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amax_cuda_int8, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amax_cuda_uint8, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amin_cuda_bfloat16, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amin_cuda_float16, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amin_cuda_float32, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amin_cuda_float64, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amin_cuda_int16, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amin_cuda_int32, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amin_cuda_int64, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amin_cuda_int8, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_amin_cuda_uint8, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_mean_cuda_bfloat16, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_mean_cuda_float16, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_mean_cuda_float32, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_mean_cuda_float64, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_mean_cuda_int16, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_mean_cuda_int32, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_mean_cuda_int64, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_mean_cuda_int8, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_mean_cuda_uint8, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_prod_cuda_bfloat16, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_prod_cuda_float16, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_prod_cuda_float32, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_prod_cuda_float64, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_prod_cuda_int16, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_prod_cuda_int32, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_prod_cuda_int64, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_prod_cuda_int8, test/test_indexing.py::TestIndexingCUDA::test_index_reduce_reduce_prod_cuda_uint8, test/test_indexing.py::TestIndexingCUDA::test_index_scalar_with_bool_mask_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_bfloat16, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_bool, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_complex128, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_complex64, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_float16, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_float32, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_float64, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_float8_e4m3fn, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_float8_e4m3fnuz, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_float8_e5m2, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_float8_e5m2fnuz, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_int16, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_int32, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_int64, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_int8, test/test_indexing.py::TestIndexingCUDA::test_index_select_cuda_uint8, test/test_indexing.py::TestIndexingCUDA::test_index_setitem_bools_slices_cuda, test/test_indexing.py::TestIndexingCUDA::test_index_src_datatype_cuda_bfloat16, test/test_indexing.py::TestIndexingCUDA::test_index_src_datatype_cuda_bool, test/test_indexing.py::TestIndexingCUDA::test_index_src_datatype_cuda_float16, test/test_indexing.py::TestIndexingCUDA::test_index_src_datatype_cuda_int64, test/test_indexing.py::TestIndexingCUDA::test_int_assignment_cuda, test/test_indexing.py::TestIndexingCUDA::test_int_indices2d_cuda, test/test_indexing.py::TestIndexingCUDA::test_int_indices_broadcast_cuda, test/test_indexing.py::TestIndexingCUDA::test_int_indices_cuda, test/test_indexing.py::TestIndexingCUDA::test_invalid_device_cuda, test/test_indexing.py::TestIndexingCUDA::test_invalid_index_cuda, test/test_indexing.py::TestIndexingCUDA::test_jit_indexing_cuda, test/test_indexing.py::TestIndexingCUDA::test_list_indices_cuda, test/test_indexing.py::TestIndexingCUDA::test_multi_dimensional_bool_mask_assignment_cuda, test/test_indexing.py::TestIndexingCUDA::test_multi_dimensional_bool_mask_cuda, test/test_indexing.py::TestIndexingCUDA::test_multiple_bool_indices_cuda, test/test_indexing.py::TestIndexingCUDA::test_multiple_byte_mask_cuda, test/test_indexing.py::TestIndexingCUDA::test_multiple_int_cuda, test/test_indexing.py::TestIndexingCUDA::test_none_cuda, test/test_indexing.py::TestIndexingCUDA::test_out_of_bound_index_cuda, test/test_indexing.py::TestIndexingCUDA::test_set_item_to_scalar_tensor_cuda, test/test_indexing.py::TestIndexingCUDA::test_setitem_expansion_error_cuda, test/test_indexing.py::TestIndexingCUDA::test_setitem_scalars_cuda, test/test_indexing.py::TestIndexingCUDA::test_single_int_cuda, test/test_indexing.py::TestIndexingCUDA::test_step_assignment_cuda, test/test_indexing.py::TestIndexingCUDA::test_step_cuda, test/test_indexing.py::TestIndexingCUDA::test_take_along_dim_cuda_float32, test/test_indexing.py::TestIndexingCUDA::test_take_along_dim_cuda_int64, test/test_indexing.py::TestIndexingCUDA::test_take_along_dim_invalid_cuda_float32, test/test_indexing.py::TestIndexingCUDA::test_take_along_dim_invalid_cuda_int64, test/test_indexing.py::TestIndexingCUDA::test_unravel_index_errors_cuda, test/test_indexing.py::TestIndexingCUDA::test_variable_slicing_cuda, test/test_indexing.py::TestIndexingCUDA::test_zero_dim_index_cuda, test/test_indexing.py::NumpyTestsCUDA::test_boolean_assignment_value_mismatch_cuda, test/test_indexing.py::NumpyTestsCUDA::test_boolean_indexing_alldims_cuda, test/test_indexing.py::NumpyTestsCUDA::test_boolean_indexing_onedim_cuda, test/test_indexing.py::NumpyTestsCUDA::test_boolean_indexing_twodim_cuda, test/test_indexing.py::NumpyTestsCUDA::test_boolean_indexing_weirdness_cuda, test/test_indexing.py::NumpyTestsCUDA::test_boolean_indexing_weirdness_tensors_cuda, test/test_indexing.py::NumpyTestsCUDA::test_boolean_list_indexing_cuda, test/test_indexing.py::NumpyTestsCUDA::test_boolean_shape_mismatch_cuda, test/test_indexing.py::NumpyTestsCUDA::test_broadcast_subspace_cuda, test/test_indexing.py::NumpyTestsCUDA::test_broaderrors_indexing_cuda, test/test_indexing.py::NumpyTestsCUDA::test_ellipsis_index_cuda, test/test_indexing.py::NumpyTestsCUDA::test_empty_fancy_index_cuda, test/test_indexing.py::NumpyTestsCUDA::test_empty_tuple_index_cuda, test/test_indexing.py::NumpyTestsCUDA::test_everything_returns_views_cuda, test/test_indexing.py::NumpyTestsCUDA::test_index_is_larger_cuda, test/test_indexing.py::NumpyTestsCUDA::test_index_no_floats_cuda, test/test_indexing.py::NumpyTestsCUDA::test_none_index_cuda, test/test_indexing.py::NumpyTestsCUDA::test_single_bool_index_cuda, test/test_indexing.py::NumpyTestsCUDA::test_single_int_index_cuda, test/test_indexing.py::NumpyTestsCUDA::test_trivial_fancy_out_of_bounds_cuda, test/test_indexing.py::NumpyTestsCUDA::test_truncate_leading_1s_cuda 2025-12-04T13:36:20.3954626Z 2025-12-04T13:36:20.3954963Z Finished test_indexing 1/1 ... [2025-12-04 13:36:20.383139][4980162.879320464], took 0.23min 2025-12-04T13:36:20.3956188Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:36:20.4025267Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:36:20.4030478Z Running torch_np/numpy_tests/fft/test_pocketfft 1/1 ... [2025-12-04 13:36:20.402744][4980162.898930598] 2025-12-04T13:36:20.4031164Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:36:20.4034765Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'torch_np/numpy_tests/fft/test_pocketfft.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:36:20.403227] 2025-12-04T13:36:45.0884140Z 2025-12-04T13:36:45.0884815Z torch_np/numpy_tests/fft/test_pocketfft 1/1 was successful, full logs can be found in artifacts with path test/test-reports/torch_np.numpy_tests.fft.test_pocketfft_1.1_b3e72097c0ffcf59_.log 2025-12-04T13:36:45.0909104Z Running 79 items in this shard: test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFTShift::test_fft_n, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_all_1d_norm_preserving, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_axes_op0, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_axes_op1, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_axes_op2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_axes_op3, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_dtypes_dtype0, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_dtypes_dtype1, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_dtypes_dtype2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype0_order_F_fft0, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype0_order_F_fft1, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype0_order_F_fft2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype0_order_F_fft3, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype0_order_F_fft4, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype0_order_F_fft5, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype0_order_non-contiguous_fft0, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype0_order_non-contiguous_fft1, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype0_order_non-contiguous_fft2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype0_order_non-contiguous_fft3, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype0_order_non-contiguous_fft4, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype0_order_non-contiguous_fft5, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype1_order_F_fft0, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype1_order_F_fft1, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype1_order_F_fft2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype1_order_F_fft3, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype1_order_F_fft4, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype1_order_F_fft5, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype1_order_non-contiguous_fft0, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype1_order_non-contiguous_fft1, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype1_order_non-contiguous_fft2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype1_order_non-contiguous_fft3, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype1_order_non-contiguous_fft4, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype1_order_non-contiguous_fft5, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype2_order_F_fft0, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype2_order_F_fft1, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype2_order_F_fft2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype2_order_F_fft3, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype2_order_F_fft4, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype2_order_F_fft5, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype2_order_non-contiguous_fft0, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype2_order_non-contiguous_fft1, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype2_order_non-contiguous_fft2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype2_order_non-contiguous_fft3, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype2_order_non-contiguous_fft4, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype2_order_non-contiguous_fft5, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype3_order_F_fft0, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype3_order_F_fft1, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype3_order_F_fft2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype3_order_F_fft3, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype3_order_F_fft4, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype3_order_F_fft5, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype3_order_non-contiguous_fft0, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype3_order_non-contiguous_fft1, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype3_order_non-contiguous_fft2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype3_order_non-contiguous_fft3, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype3_order_non-contiguous_fft4, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fft_with_order_dtype3_order_non-contiguous_fft5, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_fftn, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_hfft, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_identity, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_ifft2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_ifft_norm0, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_ifft_norm_backward, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_ifft_norm_forward, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_ifft_norm_ortho, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_ifftn, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_ihfft, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_irfft, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_irfft2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_irfftn, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_rfft, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_rfft2, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFT1D::test_rfftn, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFTThreadSafe::test_fft, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFTThreadSafe::test_ifft, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFTThreadSafe::test_irfft, test/torch_np/numpy_tests/fft/test_pocketfft.py::TestFFTThreadSafe::test_rfft 2025-12-04T13:36:45.0941404Z 2025-12-04T13:36:45.0941970Z Finished torch_np/numpy_tests/fft/test_pocketfft 1/1 ... [2025-12-04 13:36:45.088183][4980187.584368731], took 0.41min 2025-12-04T13:36:45.0943317Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:36:45.1070758Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:36:45.1076860Z Running functorch/test_ops 2/4 ... [2025-12-04 13:36:45.107392][4980187.603579024] 2025-12-04T13:36:45.1077471Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:36:45.1081485Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'functorch/test_ops.py', '--shard-id=2', '--num-shards=4', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:36:45.107847] 2025-12-04T13:49:13.1788656Z 2025-12-04T13:49:13.1790396Z functorch/test_ops 2/4 was successful, full logs can be found in artifacts with path test/test-reports/functorch.test_ops_2.4_4218ff053ca95328_.log 2025-12-04T13:49:13.2459064Z Running 2549 items in this shard: test/functorch/test_ops.py::TestOperatorsCUDA::test_extremal_numerics_binary_cross_entropy_cuda, test/functorch/test_ops.py::TestOperatorsCUDA::test_extremal_numerics_log_softmax_cuda, test/functorch/test_ops.py::TestOperatorsCUDA::test_extremal_numerics_nll_loss_cuda, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_ForwardHasDefaultArgsAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_MulGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_SelectGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_SortGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad___rmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad__native_batch_norm_legit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad__segment_reduce_offsets_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad__unsafe_masked_index_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad__upsample_bilinear2d_aa_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_abs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_argmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_argsort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_asin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_atan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_atanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_bool_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_bool_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_broadcast_shapes_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_byte_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_byte_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_cauchy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_cdouble_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_chalf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_combinations_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_cos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_cummax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_diagflat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_diagonal_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_dist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_double_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_dstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_empty_permuted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_expand_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_expand_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_expand_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_expm1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_fft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_fftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_fftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_ifft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_irfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_fft_rfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_frexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_gather_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_half_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_hsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_hypot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_igamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_index_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_index_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_index_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_inner_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_int_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_isclose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_isin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_isneginf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_item_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_jiterator_binary_return_by_ref_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_kthvalue_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_eig_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_eigvalsh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_lstsq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_pinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_solve_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_solve_triangular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_tensorsolve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_linalg_vector_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_log_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_logdet_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_logical_not_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_logical_xor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_long_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_masked_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_matrix_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_max_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_max_pool2d_with_indices_backward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_minimum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_mm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_movedim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_mul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_multinomial_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_mv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_mvlgamma_mvlgamma_p_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nanmedian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nanquantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_ne_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_new_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_new_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_adaptive_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_channel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_conv2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_conv2d_stride_depthwise_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_conv2d_stride_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_conv2d_strided_padding_dilation_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_conv3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_conv_transpose1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_cosine_embedding_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_cross_entropy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_fractional_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_gelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_grid_sample_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_hardsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_huber_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_instance_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_interpolate_bicubic_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_interpolate_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_interpolate_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_logsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_max_unpool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_max_unpool2d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_mse_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_multilabel_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_pad_constant_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_pad_replicate_negative_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_pdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_poisson_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_prelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_tanhshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_upsample_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_nn_functional_upsample_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_norm_fro_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_ormqr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_randint_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_real_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_remainder_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_repeat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_reshape_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_round_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_round_decimals_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_round_decimals_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_rsub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_scalar_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_sgn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_signal_windows_general_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_special_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_special_chebyshev_polynomial_v_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_special_chebyshev_polynomial_w_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_special_legendre_polynomial_p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_special_shifted_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_special_shifted_chebyshev_polynomial_w_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_split_list_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_stft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_sub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_sum_to_size_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_svd_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_torch_ops_aten__efficient_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_trace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_transpose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_tril_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_unbind_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_unbind_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_unflatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_unique_consecutive_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_unsqueeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_var_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_vdot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_view_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_where_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_grad_xlogy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_ForwardHasDefaultArgsAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_ScaleGradGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_SortGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp__chunk_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp__softmax_backward_data_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp__upsample_bilinear2d_aa_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_addr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_argmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_argsort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_as_strided_partial_views_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_atan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_bfloat16_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_broadcast_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_byte_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_cauchy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_cdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_cfloat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_cholesky_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_conj_physical_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_cov_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_cummax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_cumulative_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_diagonal_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_diff_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_div_floor_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_div_trunc_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_empty_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_expand_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_expand_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_hfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_ifft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_ifftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_ihfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_ihfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_irfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fft_rfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_flatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_flip_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_fmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_frexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_ge_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_gradient_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_grid_sampler_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_histc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_index_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_index_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_index_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_isnan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_isneginf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_isposinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_jiterator_2inputs_2outputs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_cholesky_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_eigvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_eigvalsh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_householder_product_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_matrix_rank_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_norm_subgradients_at_zero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linalg_svdvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_linspace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_log_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_log_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_logaddexp2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_mH_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_masked_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_matmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_matrix_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_meshgrid_list_of_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_min_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_min_reduction_with_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_mul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_mvlgamma_mvlgamma_p_5_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nanmean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_native_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_ne_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_new_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_adaptive_avg_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_adaptive_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_alpha_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_avg_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_conv2d_stride_groups_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_conv2d_stride_padding_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_conv2d_stride_padding_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_conv2d_stride_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_conv2d_strided_padding_dilation_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_conv_transpose2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_cross_entropy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_fractional_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_grid_sample_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_hinge_embedding_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_interpolate_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_interpolate_nearest-exact_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_interpolate_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_max_unpool3d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_mse_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_pad_constant_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_pad_replicate_negative_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_pairwise_distance_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_pixel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_prelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_silu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_softshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_threshold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nn_functional_upsample_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_norm_inf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_norm_nuc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_normal_in_place_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_ops_aten_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_ormqr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_permute_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_pow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_quantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_real_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_reshape_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_reshape_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_rot90_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_round_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_scatter_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_scatter_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_scatter_reduce_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_select_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_sgn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_short_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_sigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_sign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_signal_windows_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_signal_windows_gaussian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_signal_windows_general_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_signal_windows_hann_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_bessel_j1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_chebyshev_polynomial_w_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_ndtri_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_special_scaled_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_split_with_sizes_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_sqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_std_mean_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_std_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_t_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_to_sparse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_tril_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_unflatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_unsafe_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_unsafe_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_var_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_var_mean_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_var_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_view_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_view_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvp_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpjvpvmap_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpjvpvmap_NumpyExpMarkDirtyAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpjvpvmap_SortGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_CubeGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp___getitem___functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_addbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_addcmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_addmv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_addr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_aminmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_argsort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_as_strided_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_atan2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_atleast_1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_atleast_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_bucketize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_byte_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cartesian_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_char_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cholesky_inverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_clamp_min_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_column_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_conj_physical_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_constant_pad_nd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_corrcoef_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_count_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cummin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_deg2rad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_diagonal_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_dsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_dstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_einsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_empty_permuted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_erf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_erfc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_erfinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_expand_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_expand_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fft_fftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fft_hfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fft_rfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fft_rfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_float_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_float_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_floor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_fmod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_grid_sampler_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_igammac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_index_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_isin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_jiterator_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_jiterator_binary_return_by_ref_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_lerp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_eig_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_lstsq_grad_oriented_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_lu_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_lu_factor_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_matrix_rank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_linalg_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_log10_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_log_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_logspace_tensor_overload_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_long_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_long_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_mT_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_fill_functorch_Scalar_only_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_masked_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_max_pool2d_with_indices_backward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_meshgrid_variadic_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_min_reduction_with_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_msort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_mul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_narrow_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_new_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nextafter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_adaptive_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_adaptive_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_adaptive_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_celu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_conv1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_conv2d_stride_groups_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_conv2d_stride_padding_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_conv2d_stride_padding_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_conv3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_conv_transpose2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_cosine_similarity_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_ctc_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_dropout2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_elu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_embedding_bag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_embedding_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_gaussian_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_gelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_glu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_hardsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_interpolate_bicubic_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_interpolate_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_interpolate_nearest-exact_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_interpolate_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_interpolate_trilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_kl_div_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_logsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_margin_ranking_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_max_unpool1d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_max_unpool3d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_mse_loss_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_pad_circular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_prelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_scaled_dot_product_attention_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_selu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_softmin_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_nn_functional_softplus_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_norm_fro_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_polygamma_polygamma_n_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_pow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_real_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_repeat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_repeat_interleave_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_round_decimals_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_round_decimals_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_round_decimals_neg_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_rsqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_scatter_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_scatter_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_sgn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_sigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_sign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_signal_windows_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_signbit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_sinc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_sort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_airy_ai_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_hermite_polynomial_h_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_i1e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_laguerre_polynomial_l_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_modified_bessel_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_polygamma_special_polygamma_n_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_scaled_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_scaled_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_shifted_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_special_zeta_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_split_list_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_square_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_std_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_std_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_stft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_tanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_tensordot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_tile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_transpose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_trapezoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_trunc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_unflatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_var_mean_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_view_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_zero__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjp_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjpvmap_CubeGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjpvmap_NumpyCubeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjpvmap_NumpyExpMarkDirtyAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjpvmap_ScaleGradGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvjpvmap_SelectGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmap_NumpyMulAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmap_ScaleGradGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmap_SelectGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmapvmap_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_jvpvmapvmap_SelectGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_bool_raises_argmin_cuda_bool, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_amin_cuda_complex128, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_amin_cuda_complex64, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_argmin_cuda_complex128, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_argmin_cuda_complex64, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_clamp_cuda_complex128, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_clamp_cuda_complex32, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_gt_cuda_complex32, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_le_cuda_complex128, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_le_cuda_complex64, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_maximum_cuda_complex32, test/functorch/test_ops.py::TestOperatorsCUDA::test_ordered_complex_raises_sort_cuda_complex128, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_T_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_broadcast_to_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_conj_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_contiguous_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_diagonal_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_list_return_dsplit_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_list_return_split_list_args_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_list_return_split_list_args_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_permute_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_real_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_reshape_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_resolve_neg_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_select_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_squeeze_multiple_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_unflatten_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_view_as_complex_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_view_as_grad_op_jvp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_view_then_inplace_view_as_grad_op_vjp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_ForwardHasDefaultArgsAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_NumpyMulAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_ScaleGradGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_SortGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_T_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp__native_batch_norm_legit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp__unsafe_masked_index_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp__upsample_bilinear2d_aa_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_abs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_acosh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_allclose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_aminmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_arange_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_as_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_as_strided_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_asin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_baddbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_bool_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_byte_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cholesky_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_conj_physical_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_constant_pad_nd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_copysign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_diagflat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_diagonal_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_diagonal_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_div_floor_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_einsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_erfc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_expand_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_fft_fft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_fft_fftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_fft_ifft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_fft_rfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_flipud_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_fmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_frac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_gather_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_ge_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_grid_sampler_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_half_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_index_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_isnan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_isneginf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_jiterator_4inputs_with_extra_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_lgamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_cond_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_eig_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_eigvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_matrix_rank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_matrix_rank_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_multi_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_linalg_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_log10_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_log2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_logdet_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_logit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_masked_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_max_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_maximum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_mul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_mvlgamma_mvlgamma_p_5_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nansum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_narrow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_native_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_new_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nextafter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_adaptive_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_adaptive_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_adaptive_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_alpha_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_celu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_conv2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_conv2d_stride_groups_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_conv2d_strided_padding_dilation_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_cosine_similarity_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_ctc_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_embedding_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_fractional_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_gaussian_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_gelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_glu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_group_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_hardsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_hardtanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_interpolate_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_interpolate_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_interpolate_trilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_margin_ranking_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_max_unpool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_mish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_multilabel_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_pad_constant_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_pad_replicate_negative_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_poisson_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_smooth_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_upsample_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_nn_functional_upsample_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_norm_fro_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_norm_nuc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_pca_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_polygamma_polygamma_n_4_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_pow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_randn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_repeat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_reshape_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_resize__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_roll_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_round_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_scatter_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_scatter_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_sgn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_short_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_signal_windows_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_sparse_sampled_addmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_hermite_polynomial_h_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_hermite_polynomial_he_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_ndtri_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_shifted_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_special_shifted_chebyshev_polynomial_w_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_stft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_sum_to_size_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_torch_ops_aten__safe_softmax_default_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_trace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_transpose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_triu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_unbind_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_unbind_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_uniform_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_unique_consecutive_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_var_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_view_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_where_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjp_zero__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_NumpyCubeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_ScaleGradGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp___getitem___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp___rdiv___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp___rmatmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp__unsafe_masked_index_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_addcmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_addr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_alias_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_angle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_argmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_argwhere_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_as_strided_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_atan2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_atan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_block_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_bucketize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_char_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_constant_pad_nd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_cos_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_deg2rad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_diagonal_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_dstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_empty_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_erfc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_exp2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_eye_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fft_fft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fft_hfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fft_ifft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fft_ifftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fft_ihfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fft_ihfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fft_irfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_flatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_flip_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_full_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_geqrf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_gradient_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_grid_sampler_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_half_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_hash_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_heaviside_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_index_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_index_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_index_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_index_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_inner_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_int_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_isneginf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_item_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_jiterator_4inputs_with_extra_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_le_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_inv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_ldl_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_lu_factor_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_matrix_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_tensorinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_linalg_vector_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_log1p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_log_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_logaddexp2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_logical_or_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_logit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_long_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_long_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_masked_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_matmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_matrix_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_max_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_meshgrid_list_of_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_min_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_mm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_msort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_multinomial_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_mv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_mvlgamma_mvlgamma_p_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nansum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_native_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_new_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nextafter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_adaptive_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_adaptive_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_adaptive_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_adaptive_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv2d_stride_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv2d_stride_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv2d_strided_padding_dilation_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_conv_transpose3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_dropout2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_embedding_bag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_grid_sample_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_hardtanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_instance_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_interpolate_area_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_interpolate_bicubic_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_interpolate_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_interpolate_nearest-exact_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_max_unpool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_max_unpool2d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_max_unpool3d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_multi_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_pdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_pixel_unshuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_softplus_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_threshold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_nn_functional_unfold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_norm_fro_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_norm_inf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_outer_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_polygamma_polygamma_n_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_pow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_reshape_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_resolve_conj_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_resolve_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_roll_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_rsub_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_scatter_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_select_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_signal_windows_kaiser_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_signbit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_sinc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_sinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_airy_ai_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_bessel_j1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_bessel_y0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_bessel_y1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_chebyshev_polynomial_v_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_hermite_polynomial_h_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_hermite_polynomial_he_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_special_i1e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_squeeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_std_mean_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_t_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_take_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_trapz_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_triu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_unsafe_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_vdot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_where_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_xlogy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjp_zeros_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjpvmap_CubeGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjpvmap_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjpvmap_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvjpvmap_SortGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_ForwardHasDefaultArgsAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_H_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_SelectGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap___radd___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap___rdiv___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap__upsample_bilinear2d_aa_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_addbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_any_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_argmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_as_strided_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_atan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_atanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_broadcast_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_byte_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_cdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_cfloat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_char_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_clamp_max_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_conj_physical_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_cummax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_cummin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_diag_embed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_diagonal_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_diagonal_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_div_floor_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_dstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_eq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_erfinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_hfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_hfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_ihfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_irfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fft_rfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_fmod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_frac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_frexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_geometric_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_geqrf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_grid_sampler_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_hash_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_igamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_index_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_index_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_isinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_isreal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_jiterator_2inputs_2outputs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_jiterator_unary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_kthvalue_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_ldexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_ldl_factor_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_lu_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_matrix_rank_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_multi_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_pinv_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_solve_triangular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_svdvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_linalg_tensorsolve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_log_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_logaddexp2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_logical_not_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_logit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_logspace_tensor_overload_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_masked_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_matrix_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_min_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_minimum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nanquantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_narrow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_new_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_new_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_new_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nextafter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_adaptive_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_celu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_channel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv2d_stride_depthwise_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv2d_stride_padding_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_conv2d_stride_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_group_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_interpolate_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_local_response_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_margin_ranking_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_max_unpool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_mse_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_pad_constant_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_pairwise_distance_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_softshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_tanhshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_nn_functional_triplet_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_norm_inf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_ops_aten__new_zeros_with_same_feature_meta_functorchonly_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_permute_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_polar_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_polygamma_polygamma_n_2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_polygamma_polygamma_n_4_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_quantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_reciprocal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_reshape_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_reshape_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_resize__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_resolve_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_round_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_scatter_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_scatter_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_scatter_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_short_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_signal_windows_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_sin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_sinc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_sinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_slice_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_sparse_mm_reduce_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_bessel_y0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_bessel_y1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_i0e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_special_modified_bessel_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_split_with_sizes_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_squeeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_std_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_sum_to_size_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_tensordot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_to_sparse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_tril_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_unbind_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_unsafe_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_unsqueeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_var_mean_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_view_as_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_view_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_view_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmap_where_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmapvmap_NumpyMulAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vjpvmapvmap_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ForwardHasDefaultArgsAutogradFunction_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_NumpyCubeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_NumpyExpMarkDirtyAutogradFunction_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_SelectGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_T_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ZeroGradientsGenVmapAutogradFunction_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad___radd___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad___rdiv___cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad___rpow___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__batch_norm_with_update_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__native_batch_norm_legit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__native_batch_norm_legit_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__segment_reduce_lengths_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__segment_reduce_offsets_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__segment_reduce_offsets_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__unsafe_masked_index_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad__unsafe_masked_index_put_accumulate_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_abs_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_addbmm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_addcdiv_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_addmm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_addmm_decomposed_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_addr_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_alias_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_amin_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_aminmax_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_any_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_as_strided_copy_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_asinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_atan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_atleast_1d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_atleast_2d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_baddbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_bfloat16_functorch_no_channels_last_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_block_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_bmm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_bool_functorch_no_channels_last_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_broadcast_tensors_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_byte_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ceil_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ceil_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cfloat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_char_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_clamp_max_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_clone_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_column_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_combinations_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_conj_physical_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cos_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cross_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_cummax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_diag_embed_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_diagonal_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_diff_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_div_no_rounding_mode_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_dot_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_double_functorch_no_channels_last_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_dstack_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_einsum_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_empty_permuted_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_eq_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_equal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_erf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_erfinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_exp2_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_expand_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_expm1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_exponential_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_fft2_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_fft_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_fftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_fftn_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_hfftn_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_ifft2_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_ifft_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_ihfft_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_ihfftn_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_irfftn_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fft_rfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_flatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_flatten_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fliplr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_float_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_fmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_frexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_geometric_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_geqrf_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_gradient_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_gt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_half_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_heaviside_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_histc_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_hstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_index_copy_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_index_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_index_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_inner_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_int_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_isin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_isinf_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_isnan_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_isreal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_jiterator_2inputs_2outputs_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_jiterator_unary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_kron_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_le_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_lgamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_cholesky_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_cholesky_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_eig_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_eigvalsh_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_ldl_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_ldl_factor_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_lstsq_grad_oriented_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_lu_solve_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_multi_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_multi_dot_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_pinv_hermitian_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_solve_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_svd_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_tensorinv_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_tensorsolve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_tensorsolve_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_linalg_vecdot_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_log1p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_log2_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_log_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_log_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_logaddexp2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_logical_not_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_logical_xor_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_long_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mH_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mT_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_cumprod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_fill_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_fill_functorch_Scalar_only_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_logaddexp_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_mean_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_norm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_masked_sum_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_matmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_matmul_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_max_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_max_binary_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_median_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_meshgrid_list_of_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_meshgrid_variadic_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_min_reduction_no_dim_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_minimum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_minimum_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_msort_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_multinomial_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_mvlgamma_mvlgamma_p_1_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nan_to_num_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nanmean_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nanmedian_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nansum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_native_layer_norm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_new_zeros_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_adaptive_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_adaptive_avg_pool1d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_adaptive_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_adaptive_avg_pool3d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_alpha_dropout_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_avg_pool2d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_avg_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_batch_norm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_stride_depthwise_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_stride_depthwise_with_bias_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_stride_groups_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_stride_no_bias_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_stride_padding_no_bias_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_stride_padding_with_bias_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_strided_padding_dilation_with_bias_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv2d_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_conv3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_cosine_embedding_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_ctc_loss_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_dropout3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_dropout3d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_embedding_bag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_feature_alpha_dropout_without_train_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_fractional_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_fractional_max_pool2d_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_fractional_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_gelu_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_grid_sample_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_grid_sample_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_group_norm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_hardshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_hardshrink_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_interpolate_bicubic_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_interpolate_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_interpolate_trilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_kl_div_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_local_response_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_max_unpool1d_grad_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_multi_head_attention_forward_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_multi_margin_loss_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_normalize_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pad_replicate_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pad_replicate_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pixel_unshuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_pixel_unshuffle_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_rrelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_selu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_softplus_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_threshold_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_triplet_margin_with_distance_loss_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nn_functional_upsample_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_nonzero_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_norm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_normal_in_place_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_normal_in_place_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ones_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ops_aten__new_zeros_with_same_feature_meta_functorchonly_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_ormqr_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_pca_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_pca_lowrank_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_pinverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_pinverse_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_polar_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_polar_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_polygamma_polygamma_n_0_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_polygamma_polygamma_n_2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_positive_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_put_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_randint_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_randn_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_reciprocal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_remainder_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_repeat_interleave_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_reshape_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_resolve_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_roll_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_roll_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_round_decimals_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_round_decimals_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_scatter_reduce_sum_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_select_scatter_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_short_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_short_functorch_no_channels_last_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sign_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_gaussian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_general_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_general_hamming_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_hamming_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_hann_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_hann_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signal_windows_nuttall_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_signbit_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sinc_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sinh_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_slice_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_slice_scatter_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_softmax_with_dtype_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sparse_mm_reduce_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sparse_sampled_addmm_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_airy_ai_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_bessel_j0_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_bessel_j1_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_bessel_y1_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_chebyshev_polynomial_v_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_i0e_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_i1_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_laguerre_polynomial_l_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_laguerre_polynomial_l_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_legendre_polynomial_p_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_log_ndtr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_modified_bessel_i1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_modified_bessel_i1_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_modified_bessel_k0_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_modified_bessel_k1_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_ndtri_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_ndtri_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_polygamma_special_polygamma_n_0_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_shifted_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_shifted_chebyshev_polynomial_u_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_shifted_chebyshev_polynomial_w_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_special_xlog1py_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_split_with_sizes_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_square_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_std_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_std_mean_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_std_mean_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_stft_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_sum_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_t_copy_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_tanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_tensordot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_tensordot_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_tile_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_torch_ops_aten__efficient_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_torch_ops_aten__safe_softmax_default_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_trace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_transpose_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_trapz_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_triangular_solve_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_tril_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_true_divide_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unbind_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unbind_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unbind_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unfold_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unfold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_unsafe_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_var_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_var_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_vdot_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_view_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_vsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_vstack_cuda_float64, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmap_autograd_grad_xlogy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_NumpyCubeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall___getitem___functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall___rdiv___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall___rmatmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall___rmod___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall___rsub___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_addcdiv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_addcmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_addmv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_addr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_angle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_arange_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_argmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_argsort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_asinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_atleast_1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_atleast_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_atleast_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_bernoulli_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_bfloat16_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_broadcast_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_cartesian_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_cauchy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_cfloat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_clamp_min_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_clone_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_column_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_constant_pad_nd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_copysign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_corrcoef_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_cosh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_cummax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_diagflat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_diagonal_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_dist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_div_no_rounding_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_div_trunc_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_double_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_double_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_dstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_empty_permuted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_exp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_hfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_hfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_ihfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fft_rfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_flip_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_float_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_floor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_fmod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_frac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_gather_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_geometric_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_geqrf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_half_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_NumpyMulAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_SelectGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_SortGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule__softmax_backward_data_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule__unsafe_masked_index_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule__unsafe_masked_index_put_accumulate_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_addcdiv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_argmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_argsort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_as_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_asin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_atan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_baddbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_bernoulli_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_bfloat16_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_bmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_bool_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_bucketize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_cartesian_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_cdouble_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_clamp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_column_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_conj_physical_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_contiguous_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_cosh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_cov_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_cummax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_cummin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_div_trunc_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_double_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_dsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_empty_permuted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_eye_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_hfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_hfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_ifft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_ifftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_ihfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fft_rfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_flip_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_flipud_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_float_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_fmod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_frexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_full_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_hash_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_index_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_index_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_inner_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_int_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_isclose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_jiterator_binary_return_by_ref_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_kthvalue_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_le_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_eigvalsh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_lu_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_matrix_rank_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_norm_subgradients_at_zero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_pinv_singular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_slogdet_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_svdvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_tensorinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_linalg_vander_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_log2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_log_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_logdet_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_logical_not_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_fill_functorch_Scalar_only_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_masked_var_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_mul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_mv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_mvlgamma_mvlgamma_p_5_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_narrow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_native_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_new_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_adaptive_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_alpha_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_avg_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_binary_cross_entropy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_conv2d_stride_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_conv3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_ctc_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_embedding_bag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_embedding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_glu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_grid_sample_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_hardtanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_hinge_embedding_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_interpolate_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_interpolate_nearest-exact_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_interpolate_trilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_kl_div_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_margin_ranking_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_multi_head_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_pad_circular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_pad_replicate_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_pdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_rms_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nn_functional_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_norm_inf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_norm_nuc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_ones_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_pinverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_polygamma_polygamma_n_4_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_repeat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_resolve_conj_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_scatter_reduce_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_signal_windows_bartlett_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_signal_windows_blackman_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_signal_windows_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_signal_windows_gaussian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_signal_windows_kaiser_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_signal_windows_nuttall_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_slice_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_sparse_mm_reduce_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_chebyshev_polynomial_u_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_i1e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_modified_bessel_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_special_scaled_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_split_list_args_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_split_with_sizes_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_split_with_sizes_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_squeeze_multiple_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_std_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_std_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_trace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_unbind_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_unique_consecutive_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_unique_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_unsqueeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_var_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_view_as_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_has_batch_rule_where_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_hsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_hstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_index_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_index_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_index_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_index_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_isposinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_jiterator_binary_return_by_ref_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_kthvalue_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_lerp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_lgamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_eigvalsh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_lstsq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_matrix_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_norm_subgradients_at_zero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_solve_triangular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linalg_svdvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linspace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_linspace_tensor_overload_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_log1p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_log_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_logaddexp2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_logical_or_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_logical_xor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_logspace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_masked_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_max_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_meshgrid_list_of_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_min_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_mvlgamma_mvlgamma_p_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nan_to_num_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nansum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_native_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_new_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_new_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_new_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nextafter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_adaptive_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_adaptive_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_adaptive_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_batch_norm_without_cudnn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_conv2d_stride_padding_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_conv_transpose3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_cosine_similarity_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_embedding_bag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_gaussian_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_glu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_instance_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_interpolate_bicubic_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_interpolate_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_interpolate_trilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_leaky_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_linear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_max_unpool3d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_mse_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_pad_constant_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_pad_reflect_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_pixel_unshuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_prelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_rms_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_nn_functional_triplet_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_normal_in_place_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_ops_aten__new_zeros_with_same_feature_meta_functorchonly_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_ops_aten_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_polygamma_polygamma_n_4_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_randint_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_randn_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_reshape_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_resize__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_round_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_rsqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_scatter_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_sign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_signal_windows_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_signal_windows_gaussian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_signal_windows_nuttall_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_slice_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_airy_ai_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_bessel_y1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_hermite_polynomial_he_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_ndtri_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_scaled_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_special_xlog1py_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_square_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_squeeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_squeeze_multiple_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_stft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_tan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_unbind_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_unfold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_var_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_var_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_vdot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpall_view_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_NumpyCubeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_SelectGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_ZeroGradientsGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp___radd___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp___rmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp__segment_reduce_offsets_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_abs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_acosh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_all_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_as_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_asin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_atan2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_atleast_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_bernoulli_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_bfloat16_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_broadcast_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_cartesian_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_cdouble_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_char_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_char_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_clamp_max_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_column_stack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_combinations_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_constant_pad_nd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_contiguous_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_copysign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_cov_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_cummax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_cummin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_diag_embed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_digamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_div_no_rounding_mode_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_div_trunc_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_dsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_einsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_erfc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_erfinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_expand_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_expand_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_expm1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_eye_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_fft_ifftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_fft_ihfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_fft_rfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_flip_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_float_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_floor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_frac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_geqrf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_grid_sampler_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_heaviside_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_hsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_isinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_isnan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_isneginf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_isreal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_jiterator_2inputs_2outputs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_cond_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_eigvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_householder_product_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_ldl_factor_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_lstsq_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_linalg_vander_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_log_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_log_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_logdet_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_logical_xor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_logit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_logspace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_long_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_lu_unpack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_fill_functorch_Scalar_only_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_log_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_masked_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_max_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_minimum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_msort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_mv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nan_to_num_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nanquantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_narrow_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_narrow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_native_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_native_dropout_backward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_new_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_new_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_adaptive_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_adaptive_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_binary_cross_entropy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_channel_shuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_conv2d_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_conv2d_stride_padding_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_conv2d_strided_padding_dilation_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_conv2d_strided_padding_dilation_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_cosine_similarity_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_embedding_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_fractional_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_gelu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_hardswish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_huber_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_instance_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_interpolate_area_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_interpolate_nearest-exact_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_interpolate_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_interpolate_trilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_layer_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_logsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_max_unpool2d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_multi_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_multilabel_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_pad_replicate_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_pdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_pixel_unshuffle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_relu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_rms_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_silu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_softplus_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_nn_functional_tanhshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_norm_inf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_norm_nuc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_normal_number_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_ones_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_ops_aten_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_outer_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_polygamma_polygamma_n_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_polygamma_polygamma_n_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_randn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_real_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_reciprocal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_reshape_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_resolve_conj_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_rsqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_scalar_tensor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_scatter_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_scatter_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_select_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_sgn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_short_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_sign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_signal_windows_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_signal_windows_gaussian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_signal_windows_nuttall_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_sin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_sinc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_sinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_airy_ai_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_bessel_y0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_entr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_i1e_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_log_ndtr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_special_shifted_chebyshev_polynomial_v_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_split_with_sizes_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_squeeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_squeeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_std_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_torch_ops_aten__efficient_attention_forward_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_transpose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_trapz_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_triangular_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_unfold_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_uniform_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_var_mean_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_vdot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_view_as_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_view_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_vsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvjp_where_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvmap_ForwardHasDefaultArgsAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvmap_NumpyMulAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvmap_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvmap_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapjvpvmap_SortGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_NumpyCubeNotComposableAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_NumpyMulAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp___rmod___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp___rmul___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp___rsub___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp__batch_norm_with_update_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_abs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_acosh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_addr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_aminmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_argsort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_as_strided_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_as_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_atan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_atleast_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_baddbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_bmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_bool_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_broadcast_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cauchy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_ceil_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_chalf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cholesky_inverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_clamp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_combinations_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_corrcoef_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cummax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cummin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_deg2rad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_diff_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_dist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_div_floor_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_div_trunc_rounding_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_empty_permuted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_equal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_erfc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_expand_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_expm1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_fft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_fftshift_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_hfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_ihfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_rfft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fft_rfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_flatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_flipud_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_float_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_float_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_fmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_frac_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_gather_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_grid_sampler_2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_H_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_MulGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_NumpyCubeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_SelectAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_SelectGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule___radd___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule___rmod___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule___rpow___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule__unsafe_masked_index_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_acosh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_addbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_addcmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_angle_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_arange_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_as_strided_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_atleast_3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_baddbmm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_byte_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cfloat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_char_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cholesky_inverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_clamp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_constant_pad_nd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_count_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cummax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cumprod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_diag_embed_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_diagflat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_dist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_double_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_dstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_empty_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_equal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_fft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_fft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_fftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_hfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_ifftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fft_irfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_flip_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fliplr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_float_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_floor_divide_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_fmod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_frexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_geometric_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_gradient_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_half_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_histc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_hsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_hstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_index_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_index_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_index_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_index_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_inner_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_int_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_lgamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_cholesky_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_cross_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_eigh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_householder_product_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_inv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_lu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_matrix_rank_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_pinv_singular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linalg_tensorinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_linspace_tensor_overload_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_log2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_log_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_logcumsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_logical_not_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_long_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_argmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_cumprod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_masked_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_matmul_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_max_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_median_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_min_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_min_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_msort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_mvlgamma_mvlgamma_p_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nan_to_num_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nanmean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nanmedian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nanquantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_new_empty_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_new_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_new_full_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_new_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nextafter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_adaptive_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_adaptive_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_adaptive_max_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_avg_pool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_batch_norm_without_cudnn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_conv2d_stride_depthwise_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_conv_transpose1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_conv_transpose3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_elu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_group_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_interpolate_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_interpolate_nearest-exact_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_local_response_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_margin_ranking_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_max_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_multilabel_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_nll_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_pad_replicate_negative_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_relu6_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_nn_functional_upsample_nearest_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_norm_nuc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_permute_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_polygamma_polygamma_n_2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_polygamma_polygamma_n_4_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_qr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_rand_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_randint_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_randn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_renorm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_repeat_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_reshape_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_resize_as__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_roll_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_round_decimals_neg_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_rsqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_scatter_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_searchsorted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_sigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_signal_windows_bartlett_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_signal_windows_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_signal_windows_hann_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_sinc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_slice_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_slice_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_sparse_mm_reduce_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_airy_ai_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_bessel_j0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_chebyshev_polynomial_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_erfcx_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_hermite_polynomial_h_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_special_xlog1py_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_split_with_sizes_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_sqrt_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_square_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_squeeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_squeeze_multiple_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_std_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_svd_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_take_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_tensordot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_tile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_torch_ops_aten__safe_softmax_default_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_trace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_unsqueeze_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_var_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_var_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_vdot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_vsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_where_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_xlogy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_zero__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_has_batch_rule_zeros_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_index_put_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_index_reduce_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_isfinite_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_isinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_isnan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_isposinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_lgamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_cholesky_ex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_cond_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_diagonal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_ldl_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_lu_factor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_lu_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_matrix_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_matrix_power_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_matrix_rank_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_norm_subgradients_at_zero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_solve_triangular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_svdvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_linalg_tensorinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_log_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_log_softmax_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_logaddexp2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_logspace_tensor_overload_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_long_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_cumsum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_fill_functorch_Scalar_only_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_logsumexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_normalize_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_masked_softmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_max_binary_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_max_reduction_no_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_mvlgamma_mvlgamma_p_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_mvlgamma_mvlgamma_p_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nanmean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nanquantile_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_native_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_new_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nextafter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_batch_norm_without_cudnn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_conv2d_strided_padding_dilation_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_conv2d_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_conv3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_conv_transpose1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_cosine_embedding_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_embedding_bag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_glu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_interpolate_bilinear_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_l1_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_logsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_max_unpool3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_mish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_multi_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_multilabel_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_pad_circular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_pairwise_distance_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_pdist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_relu6_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_selu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_softmin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_tanhshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_threshold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nn_functional_triplet_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_nonzero_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_norm_inf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_norm_nuc_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_normal_in_place_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_ops_aten_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_ormqr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_polygamma_polygamma_n_1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_polygamma_polygamma_n_2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_polygamma_polygamma_n_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_pow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_rad2deg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_randint_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_real_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_renorm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_resolve_neg_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_round_decimals_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_round_decimals_neg_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_scatter_reduce_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_scatter_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_searchsorted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_select_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_sgn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_sigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_signal_windows_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_slice_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_sparse_mm_reduce_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_bessel_y1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_hermite_polynomial_h_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_polygamma_special_polygamma_n_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_scaled_modified_bessel_k1_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_special_zeta_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_split_with_sizes_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_std_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_stft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_sum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_t_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_to_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_trace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_transpose_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_unbind_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_unbind_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_unsafe_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_unsafe_split_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_unsqueeze_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_view_as_complex_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_view_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjp_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_ForwardHasDefaultArgsAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_H_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_NumpyExpMarkDirtyAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_NumpyMulAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_NumpySortAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_NumpyTakeAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_ScaleGradGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_SortGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_T_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp___getitem___functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp___rpow___cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp__native_batch_norm_legit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp__upsample_bilinear2d_aa_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_alias_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_allclose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_aminmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_as_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_as_strided_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_atan_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_bernoulli_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_broadcast_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_byte_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_cartesian_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_cdouble_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_cholesky_inverse_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_cholesky_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_chunk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_clamp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_clamp_max_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_clone_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_conj_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_copysign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_diag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_dist_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_double_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_dsplit_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_dstack_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_exponential_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fft_fft_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fft_fftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fft_hfftn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fft_rfft2_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fill_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_flatten_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fliplr_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_flipud_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_floor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_fmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_frexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_full_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_gradient_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_half_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_igamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_index_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_index_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_index_put_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_index_reduce_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_int_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_isclose_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_isin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_isinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_isposinf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_isreal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_jiterator_2inputs_2outputs_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_kthvalue_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_ldexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_lgamma_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_eigvals_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_ldl_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_lstsq_grad_oriented_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_multi_dot_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_pinv_hermitian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_pinv_singular_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_solve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_svd_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_tensorinv_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_tensorsolve_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_vander_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linalg_vector_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_linspace_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_logical_and_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_logical_xor_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_masked_amax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_masked_argmax_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_masked_cumprod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_masked_fill_functorch_Scalar_only_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_masked_logaddexp_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_masked_mean_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_meshgrid_list_of_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_meshgrid_variadic_tensors_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_min_reduction_with_dim_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_mvlgamma_mvlgamma_p_3_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nanmedian_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nansum_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_narrow_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_native_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_new_empty_strided_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_new_ones_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_adaptive_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_adaptive_max_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_alpha_dropout_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_avg_pool1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_avg_pool2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_batch_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_celu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_conv2d_stride_groups_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_conv2d_stride_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_conv2d_stride_padding_no_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_conv2d_stride_with_bias_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_conv3d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_conv_transpose1d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_conv_transpose2d_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_cross_entropy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_ctc_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_elu_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_embedding_bag_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_embedding_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_group_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_hardsigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_instance_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_local_response_norm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_max_unpool2d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_max_unpool3d_grad_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_mish_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_mse_loss_functorch_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_pad_constant_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_pad_reflect_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_pad_replicate_negative_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_soft_margin_loss_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_softmin_with_dtype_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_softshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_softsign_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_tanhshrink_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_nn_functional_threshold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_norm_inf_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_normal_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_ones_like_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_ops_aten__new_zeros_with_same_feature_meta_functorchonly_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_outer_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_pca_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_permute_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_prod_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_randn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_renorm_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_repeat_interleave_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_resize_as__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_rot90_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_round_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_scatter_add_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_scatter_reduce_amin_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_searchsorted_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_sgn_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_short_functorch_no_channels_last_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_sigmoid_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_signal_windows_cosine_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_signal_windows_nuttall_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_sinh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_slice_scatter_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_sort_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_chebyshev_polynomial_w_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_hermite_polynomial_he_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_legendre_polynomial_p_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_modified_bessel_i0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_modified_bessel_k0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_special_polygamma_special_polygamma_n_0_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_split_with_sizes_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_split_with_sizes_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_squeeze_multiple_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_std_mean_unbiased_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_svd_lowrank_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_t_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_take_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_tanh_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_topk_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_torch_ops_aten__safe_softmax_default_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_unbind_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_unbind_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_unfold_copy_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_unfold_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_uniform_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_unique_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_view_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_zero__cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvjp_zeros_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvmap_CubeGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvmap_NumpyExpMarkDirtyAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvmap_NumpyMulAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvmap_SelectGenVmapAutogradFunction_cuda_float32, test/functorch/test_ops.py::TestOperatorsCUDA::test_vmapvjpvmap_SortGenVmapAutogradFunction_cuda_float32 2025-12-04T13:49:13.2794108Z 2025-12-04T13:49:13.2794238Z Finished functorch/test_ops 2/4 ... [2025-12-04 13:49:13.182208][4980935.678391721], took 12.47min 2025-12-04T13:49:13.2794653Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:49:13.2795033Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:49:13.2795292Z Running torch_np/numpy_tests/core/test_dlpack 1/1 ... [2025-12-04 13:49:13.201821][4980935.698007149] 2025-12-04T13:49:13.2795509Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:49:13.2795948Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'torch_np/numpy_tests/core/test_dlpack.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:49:13.202295] 2025-12-04T13:49:15.1210289Z 2025-12-04T13:49:15.1211906Z torch_np/numpy_tests/core/test_dlpack 1/1 was successful, full logs can be found in artifacts with path test/test-reports/torch_np.numpy_tests.core.test_dlpack_1.1_2be2c4cd5a14f741_.log 2025-12-04T13:49:15.1233078Z Running 53 items in this shard: test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dlpack_destructor_exception, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dlpack_device, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dtype_passthrough_dtype0, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dtype_passthrough_dtype1, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dtype_passthrough_dtype2, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dtype_passthrough_dtype3, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dtype_passthrough_dtype4, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dtype_passthrough_dtype5, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dtype_passthrough_dtype6, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dtype_passthrough_dtype7, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dtype_passthrough_dtype8, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dtype_passthrough_dtype9, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dunder_dlpack_refcount, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_dunder_dlpack_stream, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_from_dlpack_refcount, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_from_torch, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_0, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_1, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_10, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_11, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_12, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_13, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_14, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_15, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_16, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_17, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_18, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_19, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_2, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_20, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_21, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_22, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_23, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_24, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_25, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_26, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_27, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_28, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_29, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_3, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_30, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_31, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_32, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_4, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_5, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_6, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_7, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_8, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_higher_dims_ndim_9, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_ndim0, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_non_contiguous, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_readonly, test/torch_np/numpy_tests/core/test_dlpack.py::TestDLPack::test_to_torch 2025-12-04T13:49:15.1253006Z 2025-12-04T13:49:15.1253453Z Finished torch_np/numpy_tests/core/test_dlpack 1/1 ... [2025-12-04 13:49:15.120646][4980937.616831313], took 0.03min 2025-12-04T13:49:15.1254818Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:49:15.1403250Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:49:15.1415508Z Running test_multiprocessing_spawn 1/1 ... [2025-12-04 13:49:15.141036][4980937.637214018] 2025-12-04T13:49:15.1416121Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:49:15.1419795Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_multiprocessing_spawn.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:49:15.141626] 2025-12-04T13:51:16.7151267Z 2025-12-04T13:51:16.7152664Z test_multiprocessing_spawn 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_multiprocessing_spawn_1.1_b50040e0132212f3_.log 2025-12-04T13:51:16.7157950Z Running 31 items in this shard: test/test_multiprocessing_spawn.py::SpawnTest::test_exception_all, test/test_multiprocessing_spawn.py::SpawnTest::test_exception_raises, test/test_multiprocessing_spawn.py::SpawnTest::test_exception_single, test/test_multiprocessing_spawn.py::SpawnTest::test_first_argument_index, test/test_multiprocessing_spawn.py::SpawnTest::test_signal_raises, test/test_multiprocessing_spawn.py::SpawnTest::test_success, test/test_multiprocessing_spawn.py::SpawnTest::test_success_first_then_exception, test/test_multiprocessing_spawn.py::SpawnTest::test_success_non_blocking, test/test_multiprocessing_spawn.py::SpawnTest::test_terminate_exit_grace_period0, test/test_multiprocessing_spawn.py::SpawnTest::test_terminate_exit_grace_period_20, test/test_multiprocessing_spawn.py::SpawnTest::test_terminate_signal, test/test_multiprocessing_spawn.py::ForkTest::test_exception_all, test/test_multiprocessing_spawn.py::ForkTest::test_exception_single, test/test_multiprocessing_spawn.py::ForkTest::test_first_argument_index, test/test_multiprocessing_spawn.py::ForkTest::test_success, test/test_multiprocessing_spawn.py::ForkTest::test_success_first_then_exception, test/test_multiprocessing_spawn.py::ForkTest::test_success_non_blocking, test/test_multiprocessing_spawn.py::ForkTest::test_terminate_exit_grace_period0, test/test_multiprocessing_spawn.py::ForkTest::test_terminate_exit_grace_period_20, test/test_multiprocessing_spawn.py::ForkTest::test_terminate_signal, test/test_multiprocessing_spawn.py::ParallelForkServerShouldWorkTest::test_exception_all, test/test_multiprocessing_spawn.py::ParallelForkServerShouldWorkTest::test_exception_single, test/test_multiprocessing_spawn.py::ParallelForkServerShouldWorkTest::test_first_argument_index, test/test_multiprocessing_spawn.py::ParallelForkServerShouldWorkTest::test_success, test/test_multiprocessing_spawn.py::ParallelForkServerShouldWorkTest::test_success_first_then_exception, test/test_multiprocessing_spawn.py::ParallelForkServerShouldWorkTest::test_success_non_blocking, test/test_multiprocessing_spawn.py::ParallelForkServerShouldWorkTest::test_terminate_exit_grace_period0, test/test_multiprocessing_spawn.py::ParallelForkServerShouldWorkTest::test_terminate_exit_grace_period_20, test/test_multiprocessing_spawn.py::ParallelForkServerShouldWorkTest::test_terminate_signal, test/test_multiprocessing_spawn.py::ParallelForkServerPerfTest::test_forkserver_perf, test/test_multiprocessing_spawn.py::ErrorTest::test_errors_pickleable 2025-12-04T13:51:16.7162032Z 2025-12-04T13:51:16.7162159Z Finished test_multiprocessing_spawn 1/1 ... [2025-12-04 13:51:16.714672][4981059.2108586], took 2.03min 2025-12-04T13:51:16.7186129Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:51:16.7341884Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:51:16.7346157Z Running test_mkldnn 1/1 ... [2025-12-04 13:51:16.734440][4981059.230626167] 2025-12-04T13:51:16.7346349Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:51:16.7351494Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_mkldnn.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:51:16.734914] 2025-12-04T13:51:19.0180600Z 2025-12-04T13:51:19.0182427Z test_mkldnn 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_mkldnn_1.1_2d38102d099d582c_.log 2025-12-04T13:51:19.0183367Z Running 0 items in this shard: 2025-12-04T13:51:19.0183632Z 2025-12-04T13:51:19.0183977Z Finished test_mkldnn 1/1 ... [2025-12-04 13:51:19.017690][4981061.513875426], took 0.04min 2025-12-04T13:51:19.0216189Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:51:19.0369678Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:51:19.0376349Z Running torch_np/numpy_tests/lib/test_index_tricks 1/1 ... [2025-12-04 13:51:19.037342][4981061.533528035] 2025-12-04T13:51:19.0377081Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:51:19.0382955Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'torch_np/numpy_tests/lib/test_index_tricks.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:51:19.037814] 2025-12-04T13:51:21.4069966Z 2025-12-04T13:51:21.4071622Z torch_np/numpy_tests/lib/test_index_tricks 1/1 was successful, full logs can be found in artifacts with path test/test-reports/torch_np.numpy_tests.lib.test_index_tricks_1.1_e634aab09da50348_.log 2025-12-04T13:51:21.4091668Z Running 47 items in this shard: test/torch_np/numpy_tests/lib/test_index_tricks.py::TestRavelUnravelIndex::test_0d, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestRavelUnravelIndex::test_basic, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestRavelUnravelIndex::test_big_indices, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestRavelUnravelIndex::test_clipmodes, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestRavelUnravelIndex::test_dtypes, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestRavelUnravelIndex::test_empty_array_ravel_mode_clip, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestRavelUnravelIndex::test_empty_array_ravel_mode_raise, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestRavelUnravelIndex::test_empty_array_ravel_mode_wrap, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestRavelUnravelIndex::test_empty_array_unravel, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestRavelUnravelIndex::test_empty_indices, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestRavelUnravelIndex::test_writeability, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestGrid::test_accepts_longdouble, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestGrid::test_accepts_npcomplexfloating, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestGrid::test_accepts_npfloating, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestGrid::test_basic, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestGrid::test_linspace_equivalence, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestGrid::test_mgrid_size_none_handling_start0_stop_10_step0_expected0, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestGrid::test_mgrid_size_none_handling_start_-10_stop_20_step1_expected1, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestGrid::test_nd, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestGrid::test_sparse, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestConcatenator::test_0d, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestConcatenator::test_1d, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestConcatenator::test_2d, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestConcatenator::test_complex_step, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestConcatenator::test_mixed_type, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestConcatenator::test_more_mixed_type, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestNdenumerate::test_basic, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestIndexExpression::test_regression_1, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestIndexExpression::test_simple_1, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestIx_::test_1d_only, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestIx_::test_bool, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestIx_::test_regression_1, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestIx_::test_repeated_input, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestIx_::test_shape_and_dtype, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestC::test_c_, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestFillDiagonal::test_basic, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestFillDiagonal::test_hetero_shape_handling, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestFillDiagonal::test_low_dim_handling, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestFillDiagonal::test_operate_4d_array, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestFillDiagonal::test_tall_matrix, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestFillDiagonal::test_tall_matrix_wrap, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestFillDiagonal::test_wide_matrix, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestDiagIndices::test_diag_indices, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestDiagIndicesFrom::test_diag_indices_from, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestDiagIndicesFrom::test_error_shape_mismatch, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestDiagIndicesFrom::test_error_small_input, test/torch_np/numpy_tests/lib/test_index_tricks.py::TestNdIndex::test_ndindex 2025-12-04T13:51:21.4110450Z 2025-12-04T13:51:21.4110923Z Finished torch_np/numpy_tests/lib/test_index_tricks 1/1 ... [2025-12-04 13:51:21.406635][4981063.902821255], took 0.04min 2025-12-04T13:51:21.4112283Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:51:21.4255375Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:51:21.4260434Z Running test_jit_autocast 1/1 ... [2025-12-04 13:51:21.425729][4981063.921914403] 2025-12-04T13:51:21.4261055Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:51:21.4265035Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_jit_autocast.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:51:21.426202] 2025-12-04T13:51:38.9882622Z 2025-12-04T13:51:38.9883966Z test_jit_autocast 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_jit_autocast_1.1_85913ee3ba350dda_.log 2025-12-04T13:51:38.9901531Z Running 54 items in this shard: test/test_jit_autocast.py::TestAutocast::test_autocast_api, test/test_jit_autocast.py::TestAutocast::test_autocast_api_not_supported, test/test_jit_autocast.py::TestAutocast::test_autocast_autodiff, test/test_jit_autocast.py::TestAutocast::test_autocast_decorator, test/test_jit_autocast.py::TestAutocast::test_autocast_decorator_outside_jit, test/test_jit_autocast.py::TestAutocast::test_autocast_mixed_dtypes, test/test_jit_autocast.py::TestAutocast::test_callees, test/test_jit_autocast.py::TestAutocast::test_callees_with_autocast_off, test/test_jit_autocast.py::TestAutocast::test_callees_with_autocast_on, test/test_jit_autocast.py::TestAutocast::test_conditional_autocast, test/test_jit_autocast.py::TestAutocast::test_control_flow, test/test_jit_autocast.py::TestAutocast::test_divergent_autocast, test/test_jit_autocast.py::TestAutocast::test_divergent_types, test/test_jit_autocast.py::TestAutocast::test_duplicate_inputs, test/test_jit_autocast.py::TestAutocast::test_eager_and_script, test/test_jit_autocast.py::TestAutocast::test_explicit_casts, test/test_jit_autocast.py::TestAutocast::test_fp32_policy, test/test_jit_autocast.py::TestAutocast::test_fp32_policy_with_fp64, test/test_jit_autocast.py::TestAutocast::test_fp32_set_opt_dtype_policy, test/test_jit_autocast.py::TestAutocast::test_fp32_set_opt_dtype_policy_fp64, test/test_jit_autocast.py::TestAutocast::test_ignore_amp, test/test_jit_autocast.py::TestAutocast::test_implicitly_nested_autocast, test/test_jit_autocast.py::TestAutocast::test_inplace, test/test_jit_autocast.py::TestAutocast::test_jit_autocast_softmax_cpu, test/test_jit_autocast.py::TestAutocast::test_jit_autocast_softmax_gpu, test/test_jit_autocast.py::TestAutocast::test_jit_call_method_under_autocast, test/test_jit_autocast.py::TestAutocast::test_jit_executor_under_autocast, test/test_jit_autocast.py::TestAutocast::test_jit_freeze_autocast_basic, test/test_jit_autocast.py::TestAutocast::test_jit_freeze_autocast_constants, test/test_jit_autocast.py::TestAutocast::test_jit_generic_autocast, test/test_jit_autocast.py::TestAutocast::test_linear_bf16, test/test_jit_autocast.py::TestAutocast::test_minimal, test/test_jit_autocast.py::TestAutocast::test_minimal_cpu, test/test_jit_autocast.py::TestAutocast::test_minimal_off, test/test_jit_autocast.py::TestAutocast::test_nested_autocast, test/test_jit_autocast.py::TestAutocast::test_promote_policy, test/test_jit_autocast.py::TestAutocast::test_promote_policy_fp64, test/test_jit_autocast.py::TestAutocast::test_reused_autocast, test/test_jit_autocast.py::TestAutocast::test_reused_autocast_expr, test/test_jit_autocast.py::TestAutocast::test_runtime_autocast_state, test/test_jit_autocast.py::TestAutocast::test_runtime_autocast_state_expr, test/test_jit_autocast.py::TestAutocast::test_script_and_tracing, test/test_jit_autocast.py::TestAutocast::test_script_and_tracing_with_autocast, test/test_jit_autocast.py::TestAutocast::test_script_module, test/test_jit_autocast.py::TestAutocast::test_tracing_and_script, test/test_jit_autocast.py::TestAutocast::test_tracing_with_autocast_and_script, test/test_jit_autocast.py::TestJitTraceAutocast::test_cat_promote, test/test_jit_autocast.py::TestJitTraceAutocast::test_generate_autocast_jit_trace_model, test/test_jit_autocast.py::TestJitTraceAutocast::test_nchw_autocast_jit_trace_model, test/test_jit_autocast.py::TestJitTraceAutocast::test_nhwc_autocast_jit_trace_model, test/test_jit_autocast.py::TestJitTraceAutocast::test_script_autocast_cpu, test/test_jit_autocast.py::TestJitTraceAutocast::test_script_autocast_cuda, test/test_jit_autocast.py::TestJitTraceAutocast::test_script_autocast_enable_and_check, test/test_jit_autocast.py::TestJitTraceAutocast::test_scripted_aliasing 2025-12-04T13:51:38.9917699Z 2025-12-04T13:51:38.9918045Z Finished test_jit_autocast 1/1 ... [2025-12-04 13:51:38.987901][4981081.484084473], took 0.29min 2025-12-04T13:51:38.9919783Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:51:39.0071266Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:51:39.0080134Z Running nn/test_pooling 1/1 ... [2025-12-04 13:51:39.007526][4981081.503713183] 2025-12-04T13:51:39.0080755Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:51:39.0083116Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'nn/test_pooling.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:51:39.008011] 2025-12-04T13:52:05.9358685Z 2025-12-04T13:52:05.9359786Z nn/test_pooling 1/1 was successful, full logs can be found in artifacts with path test/test-reports/nn.test_pooling_1.1_b23e5ebf72bd6df7_.log 2025-12-04T13:52:05.9423817Z Running 147 items in this shard: test/nn/test_pooling.py::TestAvgPool::test_avg_pool1d_ceil_mode, test/nn/test_pooling.py::TestAvgPool::test_avg_pool2d_ceil_mode, test/nn/test_pooling.py::TestAvgPool::test_avg_pool3d_ceil_mode, test/nn/test_pooling.py::TestAvgPool::test_doubletensor_avg_pool2d, test/nn/test_pooling.py::TestAvgPool::test_doubletensor_avg_pool2d_with_divisor, test/nn/test_pooling.py::TestAvgPool::test_doubletensor_avg_pool3d, test/nn/test_pooling.py::TestAvgPool::test_doubletensor_avg_pool3d_with_divisor, test/nn/test_pooling.py::TestPoolingNN::test_MaxUnpool2d_output_size, test/nn/test_pooling.py::TestPoolingNN::test_adaptive_avg_pooling_nhwc_overflow, test/nn/test_pooling.py::TestPoolingNN::test_adaptive_avg_pooling_overflow, test/nn/test_pooling.py::TestPoolingNN::test_adaptive_pooling_avg_nhwc, test/nn/test_pooling.py::TestPoolingNN::test_adaptive_pooling_avg_nhwc_launch_config_backward, test/nn/test_pooling.py::TestPoolingNN::test_adaptive_pooling_avg_nhwc_launch_config_forward, test/nn/test_pooling.py::TestPoolingNN::test_adaptive_pooling_avg_nhwc_non_contiguous, test/nn/test_pooling.py::TestPoolingNN::test_adaptive_pooling_lower_precision, test/nn/test_pooling.py::TestPoolingNN::test_adaptive_pooling_size_none, test/nn/test_pooling.py::TestPoolingNN::test_adaptive_pooling_size_overflow, test/nn/test_pooling.py::TestPoolingNN::test_max_unpool, test/nn/test_pooling.py::TestPoolingNN::test_max_unpool2d_nhwc_cpu, test/nn/test_pooling.py::TestPoolingNN::test_max_unpool3d_input_check, test/nn/test_pooling.py::TestPoolingNN::test_quantized_max_pool1d_empty_kernel, test/nn/test_pooling.py::TestPoolingNN::test_quantized_max_pool3d, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AdaptiveMaxPool1d_indices_cuda_bfloat16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AdaptiveMaxPool1d_indices_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AdaptiveMaxPool1d_indices_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AdaptiveMaxPool1d_indices_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AdaptiveMaxPool2d_indices_cuda_bfloat16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AdaptiveMaxPool2d_indices_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AdaptiveMaxPool2d_indices_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AdaptiveMaxPool2d_indices_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AdaptiveMaxPool3d_indices_cuda_bfloat16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AdaptiveMaxPool3d_indices_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AdaptiveMaxPool3d_indices_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AdaptiveMaxPool3d_indices_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AdaptiveMaxPool_zero_batch_dim_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AvgPool2d_empty_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_AvgPool3d_backward_after_cat_dim1_device_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_FractionalMaxPool2d_zero_batch_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_FractionalMaxPool2d_zero_out_size_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_FractionalMaxPool2d_zero_samples_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_FractionalMaxPool3d_errors_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_FractionalMaxPool3d_zero_batch_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_FractionalMaxPool3d_zero_out_size_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_FractionalMaxPool3d_zero_samples_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_LPPool1d_kernel_size_overflow_large_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool1d_indices_cuda_bfloat16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool1d_indices_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool1d_indices_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool1d_indices_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool2d_indices_cuda_bfloat16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool2d_indices_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool2d_indices_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool2d_indices_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool3d_errors_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool3d_indices_cuda_bfloat16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool3d_indices_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool3d_indices_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool3d_indices_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxPool_zero_batch_dim_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxUnpool_index_errors_case10_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxUnpool_index_errors_case1_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxUnpool_index_errors_case2_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxUnpool_index_errors_case3_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxUnpool_index_errors_case4_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxUnpool_index_errors_case5_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxUnpool_index_errors_case6_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxUnpool_index_errors_case7_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxUnpool_index_errors_case8_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxUnpool_index_errors_case9_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxUnpool_invalid_output_size_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_MaxUnpool_zero_batch_dim_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_avg_pool2d_output_size_one_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_avg_pool3d_output_size_one_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_avg_pooling_backward_fails_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_max_pooling_backward_fails_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pool_odd_size_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pooling_empty_output_size_cuda_bfloat16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pooling_empty_output_size_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pooling_empty_output_size_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pooling_empty_output_size_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pooling_max_nhwc_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pooling_max_nhwc_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pooling_no_suppot_input_cuda_int16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pooling_no_suppot_input_cuda_int32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pooling_no_suppot_input_cuda_int64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pooling_no_suppot_input_cuda_int8, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pooling_no_suppot_input_cuda_uint8, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pooling_zero_batch_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_adaptive_pooling_zero_batch_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_avg_pool2d_nhwc_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_avg_pool2d_nhwc_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_avg_pool2d_nhwc_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_avg_pool2d_reduced_floating_cuda_bfloat16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_avg_pool2d_reduced_floating_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_fractional_max_pool2d_backward_fails_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_fractional_max_pool2d_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_fractional_max_pool3d_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_fractional_max_pool_nan_inf_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_fractional_max_pool_nan_inf_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_fractional_max_pool_nan_inf_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool1d_corner_cases_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool1d_corner_cases_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool1d_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool1d_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool2d_corner_cases_cuda_int32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool2d_corner_cases_cuda_int64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool2d_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool2d_indices_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool2d_nhwc_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool2d_nhwc_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool2d_nhwc_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool2d_with_indices_backward_fails_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool3d_ndhwc_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool3d_ndhwc_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool3d_ndhwc_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool_bfloat16_half_cuda_bfloat16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool_bfloat16_half_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool_nan_inf_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool_nan_inf_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_pool_nan_inf_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_max_unpool_invalid_indices_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_maxpool3d_non_square_backward_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_maxpool_indices_no_batch_dim_cuda_bfloat16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_maxpool_indices_no_batch_dim_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_maxpool_indices_no_batch_dim_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_maxpool_indices_no_batch_dim_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pool3d_large_size_int64_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pool3d_size_one_feature_dim_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pool_invalid_size_cuda_bfloat16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pool_invalid_size_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pool_invalid_size_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pool_invalid_size_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pool_large_size_cuda_bfloat16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pool_large_size_cuda_float16, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pool_large_size_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pool_large_size_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pooling_bfloat16_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pooling_large_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pooling_max_nhwc_cuda_float32, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pooling_max_nhwc_cuda_float64, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pooling_shape_kernel_avg_pooling_dims_1_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pooling_shape_kernel_avg_pooling_dims_2_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pooling_shape_kernel_avg_pooling_dims_3_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pooling_shape_kernel_max_pooling_dims_1_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pooling_shape_kernel_max_pooling_dims_2_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pooling_shape_kernel_max_pooling_dims_3_cuda, test/nn/test_pooling.py::TestPoolingNNDeviceTypeCUDA::test_pooling_zero_stride_cuda 2025-12-04T13:52:05.9484389Z 2025-12-04T13:52:05.9484752Z Finished nn/test_pooling 1/1 ... [2025-12-04 13:52:05.935726][4981108.431910191], took 0.45min 2025-12-04T13:52:05.9485996Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:52:05.9556404Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:52:05.9560855Z Running lazy/test_reuse_ir 1/1 ... [2025-12-04 13:52:05.955853][4981108.452039362] 2025-12-04T13:52:05.9561428Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:52:05.9566485Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'lazy/test_reuse_ir.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:52:05.956327] 2025-12-04T13:52:09.8313206Z 2025-12-04T13:52:09.8313846Z lazy/test_reuse_ir 1/1 was successful, full logs can be found in artifacts with path test/test-reports/lazy.test_reuse_ir_1.1_01341d1169df7df0_.log 2025-12-04T13:52:09.8314509Z Running 4 items in this shard: test/lazy/test_reuse_ir.py::TestLazyReuseIr::testAdd, test/lazy/test_reuse_ir.py::TestLazyReuseIr::testAddSub, test/lazy/test_reuse_ir.py::TestLazyReuseIr::testAddSubFallback, test/lazy/test_reuse_ir.py::TestLazyReuseIr::testBatchNorm 2025-12-04T13:52:09.8314919Z 2025-12-04T13:52:09.8315027Z Finished lazy/test_reuse_ir 1/1 ... [2025-12-04 13:52:09.831062][4981112.327248434], took 0.06min 2025-12-04T13:52:09.8352361Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:52:09.8508842Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:52:09.8513359Z Running test_functional_autograd_benchmark 1/1 ... [2025-12-04 13:52:09.851014][4981112.347200808] 2025-12-04T13:52:09.8514060Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:52:09.8516898Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_functional_autograd_benchmark.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:52:09.851470] 2025-12-04T13:52:49.8386827Z 2025-12-04T13:52:49.8388606Z test_functional_autograd_benchmark 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_functional_autograd_benchmark_1.1_0b6a5781bce8d73a_.log 2025-12-04T13:52:49.8390557Z Running 2 items in this shard: test/test_functional_autograd_benchmark.py::TestFunctionalAutogradBenchmark::test_fast_tasks, test/test_functional_autograd_benchmark.py::TestFunctionalAutogradBenchmark::test_slow_tasks 2025-12-04T13:52:49.8391625Z 2025-12-04T13:52:49.8392060Z Finished test_functional_autograd_benchmark 1/1 ... [2025-12-04 13:52:49.838392][4981152.334577122], took 0.67min 2025-12-04T13:52:49.8424911Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:52:49.8579803Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:52:49.8585985Z Running cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility 1/1 ... [2025-12-04 13:52:49.858278][4981152.354464208] 2025-12-04T13:52:49.8586884Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:52:49.8592428Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:52:49.858750] 2025-12-04T13:53:03.1468425Z 2025-12-04T13:53:03.1470268Z cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp_extensions.libtorch_agnostic_2_10_extension.test_version_compatibility_1.1_88307c35fcc25eb9_.log 2025-12-04T13:53:03.1489260Z Running 24 items in this shard: test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_get_any_data_ptr_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_get_template_any_data_ptr_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_make_tensor_clones_and_call_foreach_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_mv_tensor_accessor_cpu_hip_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_mv_tensor_accessor_cpu_works_with_2_9, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_mv_tensor_accessor_cuda_works_with_2_9, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_my__foreach_mul__requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_my__foreach_mul_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_my_empty_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_my_reshape_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_my_shape_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_my_string_op_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_my_view_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_test_cublas_handle_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_test_cuda_stream_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_test_device_constructor_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_test_device_equality_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_test_device_index_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_test_device_is_cpu_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_test_device_is_cuda_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_test_device_set_index_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_test_get_num_threads_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_test_parallel_for_requires_2_10, test/cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility.py::FunctionVersionCompatibilityTest::test_test_tensor_device_requires_2_10 2025-12-04T13:53:03.1506718Z 2025-12-04T13:53:03.1507345Z Finished cpp_extensions/libtorch_agnostic_2_10_extension/test_version_compatibility 1/1 ... [2025-12-04 13:53:03.146432][4981165.642615991], took 0.22min 2025-12-04T13:53:03.1508918Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:53:03.1660502Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:53:03.1667204Z Running test_autoload_disable 1/1 ... [2025-12-04 13:53:03.166379][4981165.662565356] 2025-12-04T13:53:03.3210396Z Processing /var/lib/jenkins/pytorch/test/cpp_extensions 2025-12-04T13:53:05.5885204Z Preparing metadata (pyproject.toml) ... [?25l- done 2025-12-04T13:53:05.5916418Z [?25hBuilding wheels for collected packages: torch_test_cpp_extension 2025-12-04T13:55:26.0331385Z Building wheel for torch_test_cpp_extension (pyproject.toml) ... [?25l- \ | / - \ | / - \ | / - \ | / - done 2025-12-04T13:55:26.0544753Z [?25h Created wheel for torch_test_cpp_extension: filename=torch_test_cpp_extension-0.0.0-cp310-cp310-linux_x86_64.whl size=13062818 sha256=886fa300bb71ce0971d4c5c853e9c50833e0ec675aa041e1b106b3e1a9388a8c 2025-12-04T13:55:26.0546396Z Stored in directory: /tmp/pip-ephem-wheel-cache-8pdeewj6/wheels/a9/2e/d7/a9e103243c0b754e2324c4ee6ddd055c388a2eefc520cfc979 2025-12-04T13:55:26.0577032Z Successfully built torch_test_cpp_extension 2025-12-04T13:55:26.2243094Z Installing collected packages: torch_test_cpp_extension 2025-12-04T13:55:26.3280721Z Successfully installed torch_test_cpp_extension-0.0.0 2025-12-04T13:55:27.7163439Z 2025-12-04T13:55:27.7163946Z Running tests... 2025-12-04T13:55:27.7164474Z ---------------------------------------------------------------------- 2025-12-04T13:55:27.8803584Z . 2025-12-04T13:55:27.8804032Z ---------------------------------------------------------------------- 2025-12-04T13:55:27.8804534Z Ran 1 test in 0.164s 2025-12-04T13:55:27.8804759Z 2025-12-04T13:55:27.8804875Z OK 2025-12-04T13:55:27.8805028Z 2025-12-04T13:55:27.8805189Z Generating XML reports... 2025-12-04T13:55:28.3899680Z Finished test_autoload_disable 1/1 ... [2025-12-04 13:55:28.389538][4981310.885703263], took 2.42min 2025-12-04T13:55:28.3940568Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:55:28.4092315Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:55:28.4093102Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2025-12-04T13:55:28.4093707Z Uploading artifacts took 0.00 seconds 2025-12-04T13:55:28.4101366Z Running test_cpp_extensions_aot_ninja 1/1 ... [2025-12-04 13:55:28.409589][4981310.905775597] 2025-12-04T13:55:28.5921533Z Processing /var/lib/jenkins/pytorch/test/cpp_extensions 2025-12-04T13:55:30.8472703Z Preparing metadata (pyproject.toml) ... [?25l- done 2025-12-04T13:55:30.8499125Z [?25hBuilding wheels for collected packages: torch_test_cpp_extension 2025-12-04T13:57:20.3633729Z Building wheel for torch_test_cpp_extension (pyproject.toml) ... [?25l- \ | / - \ | / - \ | / - \ | / done 2025-12-04T13:57:20.3847655Z [?25h Created wheel for torch_test_cpp_extension: filename=torch_test_cpp_extension-0.0.0-cp310-cp310-linux_x86_64.whl size=13063082 sha256=6caa620b24e0de28532e426117ba328a174d378fc965680a89a3048bc52be668 2025-12-04T13:57:20.3849604Z Stored in directory: /tmp/pip-ephem-wheel-cache-mhh_dmwk/wheels/a9/2e/d7/a9e103243c0b754e2324c4ee6ddd055c388a2eefc520cfc979 2025-12-04T13:57:20.3879422Z Successfully built torch_test_cpp_extension 2025-12-04T13:57:20.5563890Z Installing collected packages: torch_test_cpp_extension 2025-12-04T13:57:20.6592515Z Successfully installed torch_test_cpp_extension-0.0.0 2025-12-04T13:57:20.8550722Z Processing /var/lib/jenkins/pytorch/test/cpp_extensions/no_python_abi_suffix_test 2025-12-04T13:57:21.8072023Z Preparing metadata (pyproject.toml) ... [?25l- done 2025-12-04T13:57:21.8091369Z [?25hBuilding wheels for collected packages: no_python_abi_suffix_test 2025-12-04T13:57:22.9407640Z Building wheel for no_python_abi_suffix_test (pyproject.toml) ... [?25l- \ done 2025-12-04T13:57:22.9418170Z [?25h Created wheel for no_python_abi_suffix_test: filename=no_python_abi_suffix_test-0.0.0-cp310-cp310-linux_x86_64.whl size=2944 sha256=62ba92e54c020ea10513225d4f71bd56e4a8c348ea16075b8f15d4ded1e65d9b 2025-12-04T13:57:22.9419583Z Stored in directory: /tmp/pip-ephem-wheel-cache-mvgrdwe7/wheels/01/96/31/d3c48c51cc163420d8b3b57e95a07fda055add3ed0ea48001b 2025-12-04T13:57:22.9448323Z Successfully built no_python_abi_suffix_test 2025-12-04T13:57:23.1250438Z Installing collected packages: no_python_abi_suffix_test 2025-12-04T13:57:23.1272678Z Successfully installed no_python_abi_suffix_test-0.0.0 2025-12-04T13:57:23.1800614Z * Getting build dependencies for wheel... 2025-12-04T13:57:24.1668915Z /var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/python_agnostic/csrc/ultra_norm.cu -> /var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/python_agnostic/csrc/ultra_norm.cu [skipped, no changes] 2025-12-04T13:57:24.1670681Z Successfully preprocessed all matching files. 2025-12-04T13:57:24.1671245Z Total number of unsupported CUDA function calls: 0 2025-12-04T13:57:24.1671583Z 2025-12-04T13:57:24.1671603Z 2025-12-04T13:57:24.1671784Z Total number of replaced kernel launches: 0 2025-12-04T13:57:24.1800677Z running egg_info 2025-12-04T13:57:24.1828578Z creating python_agnostic.egg-info 2025-12-04T13:57:24.1832586Z writing python_agnostic.egg-info/PKG-INFO 2025-12-04T13:57:24.1833267Z writing dependency_links to python_agnostic.egg-info/dependency_links.txt 2025-12-04T13:57:24.1833974Z writing top-level names to python_agnostic.egg-info/top_level.txt 2025-12-04T13:57:24.1836911Z writing manifest file 'python_agnostic.egg-info/SOURCES.txt' 2025-12-04T13:57:24.2071374Z reading manifest file 'python_agnostic.egg-info/SOURCES.txt' 2025-12-04T13:57:24.2076463Z writing manifest file 'python_agnostic.egg-info/SOURCES.txt' 2025-12-04T13:57:24.5577957Z * Building wheel... 2025-12-04T13:57:25.5282830Z /var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/python_agnostic/csrc/ultra_norm.cu -> /var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/python_agnostic/csrc/ultra_norm.cu [skipped, no changes] 2025-12-04T13:57:25.5284818Z Successfully preprocessed all matching files. 2025-12-04T13:57:25.5285379Z Total number of unsupported CUDA function calls: 0 2025-12-04T13:57:25.5285715Z 2025-12-04T13:57:25.5285724Z 2025-12-04T13:57:25.5285910Z Total number of replaced kernel launches: 0 2025-12-04T13:57:25.5383961Z running bdist_wheel 2025-12-04T13:57:25.5654557Z running build 2025-12-04T13:57:25.5654923Z running build_ext 2025-12-04T13:57:25.5672396Z building 'python_agnostic._C' extension 2025-12-04T13:57:25.5673433Z creating /var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/build/temp.linux-x86_64-cpython-310/python_agnostic/csrc 2025-12-04T13:57:37.2039480Z [1/1] /opt/rocm/bin/hipcc -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/torch/csrc/api/include -I/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/include/THH -I/opt/rocm/include -I/opt/conda/envs/py_3.10/include/python3.10 -c -c /var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/python_agnostic/csrc/ultra_norm.cu -o /var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/build/temp.linux-x86_64-cpython-310/python_agnostic/csrc/ultra_norm.o -D__HIP_PLATFORM_AMD__=1 -DUSE_ROCM=1 -DHIPBLAS_V2 -fPIC -DCUDA_HAS_FP16=1 -D__HIP_NO_HALF_OPERATORS__=1 -D__HIP_NO_HALF_CONVERSIONS__=1 -DHIP_ENABLE_WARP_SYNC_BUILTINS=1 -DTORCH_API_INCLUDE_EXTENSION_H -DPy_LIMITED_API=0x030A0000 -DTORCH_EXTENSION_NAME=_C --offload-arch=gfx90a --offload-arch=gfx942 --offload-arch=gfx950 --offload-arch=gfx1100 -fno-gpu-rdc -std=c++17 2025-12-04T13:57:37.2073927Z creating build/lib.linux-x86_64-cpython-310/python_agnostic 2025-12-04T13:57:37.2078637Z g++ -pthread -B /opt/conda/envs/py_3.10/compiler_compat -Wno-unused-result -Wsign-compare -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /opt/conda/envs/py_3.10/include -fPIC -O2 -isystem /opt/conda/envs/py_3.10/include -pthread -B /opt/conda/envs/py_3.10/compiler_compat -shared /var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/build/temp.linux-x86_64-cpython-310/python_agnostic/csrc/ultra_norm.o -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/rocm/lib -L/opt/rocm/hip/lib -lc10 -ltorch -ltorch_cpu -lamdhip64 -lc10_hip -ltorch_hip -o build/lib.linux-x86_64-cpython-310/python_agnostic/_C.so 2025-12-04T13:57:37.3947534Z installing to build/bdist.linux-x86_64/wheel 2025-12-04T13:57:37.3947983Z running install 2025-12-04T13:57:37.3967197Z running install_lib 2025-12-04T13:57:37.3988987Z creating build/bdist.linux-x86_64/wheel 2025-12-04T13:57:37.3989504Z creating build/bdist.linux-x86_64/wheel/python_agnostic 2025-12-04T13:57:37.3990375Z copying build/lib.linux-x86_64-cpython-310/python_agnostic/_C.so -> build/bdist.linux-x86_64/wheel/./python_agnostic 2025-12-04T13:57:37.3991459Z running install_egg_info 2025-12-04T13:57:37.4013936Z running egg_info 2025-12-04T13:57:37.4032724Z writing python_agnostic.egg-info/PKG-INFO 2025-12-04T13:57:37.4035398Z writing dependency_links to python_agnostic.egg-info/dependency_links.txt 2025-12-04T13:57:37.4036996Z writing top-level names to python_agnostic.egg-info/top_level.txt 2025-12-04T13:57:37.4060206Z reading manifest file 'python_agnostic.egg-info/SOURCES.txt' 2025-12-04T13:57:37.4065455Z writing manifest file 'python_agnostic.egg-info/SOURCES.txt' 2025-12-04T13:57:37.4066249Z Copying python_agnostic.egg-info to build/bdist.linux-x86_64/wheel/./python_agnostic-0.0-py3.10.egg-info 2025-12-04T13:57:37.4067860Z running install_scripts 2025-12-04T13:57:37.4115449Z creating build/bdist.linux-x86_64/wheel/python_agnostic-0.0.dist-info/WHEEL 2025-12-04T13:57:37.4117394Z creating '/var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/dist/.tmp-jvozxcve/python_agnostic-0.0-cp39-abi3-linux_x86_64.whl' and adding 'build/bdist.linux-x86_64/wheel' to it 2025-12-04T13:57:37.4131729Z adding 'python_agnostic/_C.so' 2025-12-04T13:57:37.4133530Z adding 'python_agnostic-0.0.dist-info/METADATA' 2025-12-04T13:57:37.4134029Z adding 'python_agnostic-0.0.dist-info/WHEEL' 2025-12-04T13:57:37.4134533Z adding 'python_agnostic-0.0.dist-info/top_level.txt' 2025-12-04T13:57:37.4135053Z adding 'python_agnostic-0.0.dist-info/RECORD' 2025-12-04T13:57:37.4135500Z removing build/bdist.linux-x86_64/wheel 2025-12-04T13:57:37.8579033Z Successfully built python_agnostic-0.0-cp39-abi3-linux_x86_64.whl 2025-12-04T13:57:38.0245797Z Processing /var/lib/jenkins/pytorch/test/cpp_extensions/libtorch_agnostic_2_9_extension 2025-12-04T13:57:39.4491367Z Preparing metadata (pyproject.toml) ... [?25l- done 2025-12-04T13:57:39.4525814Z [?25hRequirement already satisfied: torch in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from libtorch_agnostic_2_9==0.0) (2.10.0a0+gitffd9b0f) 2025-12-04T13:57:39.4565484Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_9==0.0) (3.18.0) 2025-12-04T13:57:39.4569084Z Requirement already satisfied: typing-extensions>=4.10.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_9==0.0) (4.12.2) 2025-12-04T13:57:39.4573914Z Requirement already satisfied: sympy>=1.13.3 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_9==0.0) (1.13.3) 2025-12-04T13:57:39.4580527Z Requirement already satisfied: networkx>=2.5.1 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_9==0.0) (2.8.8) 2025-12-04T13:57:39.4583780Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_9==0.0) (3.1.6) 2025-12-04T13:57:39.4589623Z Requirement already satisfied: fsspec>=0.8.5 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_9==0.0) (2025.10.0) 2025-12-04T13:57:39.5026651Z Requirement already satisfied: mpmath<1.4,>=1.1.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from sympy>=1.13.3->torch->libtorch_agnostic_2_9==0.0) (1.3.0) 2025-12-04T13:57:39.5058675Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from jinja2->torch->libtorch_agnostic_2_9==0.0) (3.0.3) 2025-12-04T13:57:39.5065750Z Building wheels for collected packages: libtorch_agnostic_2_9 2025-12-04T13:57:41.0896844Z Building wheel for libtorch_agnostic_2_9 (pyproject.toml) ... [?25l- \ done 2025-12-04T13:57:41.0904387Z [?25h Created wheel for libtorch_agnostic_2_9: filename=libtorch_agnostic_2_9-0.0-cp39-abi3-linux_x86_64.whl size=57687 sha256=22591bad7e199d71e0533eab110e257da2c422be221ad12ec53ef6a536056ecd 2025-12-04T13:57:41.0905919Z Stored in directory: /tmp/pip-ephem-wheel-cache-23tdlwji/wheels/5f/cf/13/81cd7305efc425780825ac0d330cbfca8f1ec0a935d4028550 2025-12-04T13:57:41.0925886Z Successfully built libtorch_agnostic_2_9 2025-12-04T13:57:41.2507589Z Installing collected packages: libtorch_agnostic_2_9 2025-12-04T13:57:41.2540745Z Successfully installed libtorch_agnostic_2_9-0.0 2025-12-04T13:57:41.4397548Z Processing /var/lib/jenkins/pytorch/test/cpp_extensions/libtorch_agnostic_2_10_extension 2025-12-04T13:57:42.7996045Z Preparing metadata (pyproject.toml) ... [?25l- done 2025-12-04T13:57:42.8019287Z [?25hRequirement already satisfied: torch in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from libtorch_agnostic_2_10==0.0) (2.10.0a0+gitffd9b0f) 2025-12-04T13:57:42.8056968Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_10==0.0) (3.18.0) 2025-12-04T13:57:42.8063467Z Requirement already satisfied: typing-extensions>=4.10.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_10==0.0) (4.12.2) 2025-12-04T13:57:42.8069010Z Requirement already satisfied: sympy>=1.13.3 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_10==0.0) (1.13.3) 2025-12-04T13:57:42.8074815Z Requirement already satisfied: networkx>=2.5.1 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_10==0.0) (2.8.8) 2025-12-04T13:57:42.8079073Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_10==0.0) (3.1.6) 2025-12-04T13:57:42.8084834Z Requirement already satisfied: fsspec>=0.8.5 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_10==0.0) (2025.10.0) 2025-12-04T13:57:42.8465309Z Requirement already satisfied: mpmath<1.4,>=1.1.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from sympy>=1.13.3->torch->libtorch_agnostic_2_10==0.0) (1.3.0) 2025-12-04T13:57:42.8490450Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from jinja2->torch->libtorch_agnostic_2_10==0.0) (3.0.3) 2025-12-04T13:57:42.8494221Z Building wheels for collected packages: libtorch_agnostic_2_10 2025-12-04T13:57:44.5122511Z Building wheel for libtorch_agnostic_2_10 (pyproject.toml) ... [?25l- \ done 2025-12-04T13:57:44.5135402Z [?25h Created wheel for libtorch_agnostic_2_10: filename=libtorch_agnostic_2_10-0.0-cp39-abi3-linux_x86_64.whl size=85717 sha256=e519f2606a0a1c5eb6828b4a9c65346fd0301bce7bfd819ffa7c033d0743b20f 2025-12-04T13:57:44.5136956Z Stored in directory: /tmp/pip-ephem-wheel-cache-d6a5hkw5/wheels/65/df/cc/92bed30159c7744af4875e08c646cf811fe190b78052f3234e 2025-12-04T13:57:44.5168821Z Successfully built libtorch_agnostic_2_10 2025-12-04T13:57:44.6747674Z Installing collected packages: libtorch_agnostic_2_10 2025-12-04T13:57:44.6784535Z Successfully installed libtorch_agnostic_2_10-0.0 2025-12-04T13:57:44.7064604Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:57:44.7071775Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_cpp_extensions_aot_ninja.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:57:44.706698] 2025-12-04T13:57:47.1435739Z 2025-12-04T13:57:47.1437126Z test_cpp_extensions_aot_ninja 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_cpp_extensions_aot_ninja_1.1_262f4b8b8cf43a7e_.log 2025-12-04T13:57:47.1445810Z Running 21 items in this shard: test/test_cpp_extensions_aot_ninja.py::TestCppExtensionAOT::test_backward, test/test_cpp_extensions_aot_ninja.py::TestCppExtensionAOT::test_cublas_extension, test/test_cpp_extensions_aot_ninja.py::TestCppExtensionAOT::test_cuda_dlink_libs, test/test_cpp_extensions_aot_ninja.py::TestCppExtensionAOT::test_cuda_extension, test/test_cpp_extensions_aot_ninja.py::TestCppExtensionAOT::test_cusolver_extension, test/test_cpp_extensions_aot_ninja.py::TestCppExtensionAOT::test_extension_function, test/test_cpp_extensions_aot_ninja.py::TestCppExtensionAOT::test_extension_module, test/test_cpp_extensions_aot_ninja.py::TestCppExtensionAOT::test_mps_extension, test/test_cpp_extensions_aot_ninja.py::TestCppExtensionAOT::test_no_python_abi_suffix_sets_the_correct_library_name, test/test_cpp_extensions_aot_ninja.py::TestCppExtensionAOT::test_optional, test/test_cpp_extensions_aot_ninja.py::TestCppExtensionAOT::test_sycl_extension, test/test_cpp_extensions_aot_ninja.py::TestPybindTypeCasters::test_pybind_return_types, test/test_cpp_extensions_aot_ninja.py::TestMAIATensor::test_add, test/test_cpp_extensions_aot_ninja.py::TestMAIATensor::test_autocast_apis_for_maia_device, test/test_cpp_extensions_aot_ninja.py::TestMAIATensor::test_conv_backend_override, test/test_cpp_extensions_aot_ninja.py::TestMAIATensor::test_matmul_autocast_default_precision, test/test_cpp_extensions_aot_ninja.py::TestMAIATensor::test_matmul_autocast_float16_precision, test/test_cpp_extensions_aot_ninja.py::TestMAIATensor::test_unregistered, test/test_cpp_extensions_aot_ninja.py::TestMAIATensor::test_zeros, test/test_cpp_extensions_aot_ninja.py::TestRNGExtension::test_rng, test/test_cpp_extensions_aot_ninja.py::TestTorchLibrary::test_torch_library 2025-12-04T13:57:47.1453719Z 2025-12-04T13:57:47.1454152Z Finished test_cpp_extensions_aot_ninja 1/1 ... [2025-12-04 13:57:47.143401][4981449.63958445], took 2.31min 2025-12-04T13:57:47.1478555Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:57:47.1630113Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:57:47.1637219Z Running test_cpp_extensions_aot_no_ninja 1/1 ... [2025-12-04 13:57:47.163331][4981449.659517967] 2025-12-04T13:57:47.3290228Z Processing /var/lib/jenkins/pytorch/test/cpp_extensions 2025-12-04T13:57:49.5379468Z Preparing metadata (pyproject.toml) ... [?25l- done 2025-12-04T13:57:49.5410306Z [?25hBuilding wheels for collected packages: torch_test_cpp_extension 2025-12-04T13:57:54.3277179Z Building wheel for torch_test_cpp_extension (pyproject.toml) ... [?25l- \ | / - \ | / - \ | done 2025-12-04T13:57:54.3486382Z [?25h Created wheel for torch_test_cpp_extension: filename=torch_test_cpp_extension-0.0.0-cp310-cp310-linux_x86_64.whl size=13062818 sha256=5dc9417bbacd5974cf3f7391c5a2d0fff6cd5d906840b8017969142754fb76bf 2025-12-04T13:57:54.3488509Z Stored in directory: /tmp/pip-ephem-wheel-cache-qkjqafvx/wheels/a9/2e/d7/a9e103243c0b754e2324c4ee6ddd055c388a2eefc520cfc979 2025-12-04T13:57:54.3517779Z Successfully built torch_test_cpp_extension 2025-12-04T13:57:54.5183790Z Installing collected packages: torch_test_cpp_extension 2025-12-04T13:57:54.6207114Z Successfully installed torch_test_cpp_extension-0.0.0 2025-12-04T13:57:54.8109922Z Processing /var/lib/jenkins/pytorch/test/cpp_extensions/no_python_abi_suffix_test 2025-12-04T13:57:55.8594871Z Preparing metadata (pyproject.toml) ... [?25l- done 2025-12-04T13:57:55.8620830Z [?25hBuilding wheels for collected packages: no_python_abi_suffix_test 2025-12-04T13:57:56.9811384Z Building wheel for no_python_abi_suffix_test (pyproject.toml) ... [?25l- \ done 2025-12-04T13:57:56.9820482Z [?25h Created wheel for no_python_abi_suffix_test: filename=no_python_abi_suffix_test-0.0.0-cp310-cp310-linux_x86_64.whl size=2944 sha256=ff40946f40e81989ca3cabee1386d708e5bee0b085bf5d6161e46310bd181ed6 2025-12-04T13:57:56.9822075Z Stored in directory: /tmp/pip-ephem-wheel-cache-hlvuacba/wheels/01/96/31/d3c48c51cc163420d8b3b57e95a07fda055add3ed0ea48001b 2025-12-04T13:57:56.9852476Z Successfully built no_python_abi_suffix_test 2025-12-04T13:57:57.1620859Z Installing collected packages: no_python_abi_suffix_test 2025-12-04T13:57:57.1640927Z Successfully installed no_python_abi_suffix_test-0.0.0 2025-12-04T13:57:57.2171019Z * Getting build dependencies for wheel... 2025-12-04T13:57:58.1857766Z /var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/python_agnostic/csrc/ultra_norm.cu -> /var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/python_agnostic/csrc/ultra_norm.cu [skipped, no changes] 2025-12-04T13:57:58.1859973Z Successfully preprocessed all matching files. 2025-12-04T13:57:58.1860536Z Total number of unsupported CUDA function calls: 0 2025-12-04T13:57:58.1860863Z 2025-12-04T13:57:58.1860873Z 2025-12-04T13:57:58.1861050Z Total number of replaced kernel launches: 0 2025-12-04T13:57:58.1990909Z running egg_info 2025-12-04T13:57:58.2018422Z writing python_agnostic.egg-info/PKG-INFO 2025-12-04T13:57:58.2019581Z writing dependency_links to python_agnostic.egg-info/dependency_links.txt 2025-12-04T13:57:58.2022491Z writing top-level names to python_agnostic.egg-info/top_level.txt 2025-12-04T13:57:58.2268347Z reading manifest file 'python_agnostic.egg-info/SOURCES.txt' 2025-12-04T13:57:58.2275855Z writing manifest file 'python_agnostic.egg-info/SOURCES.txt' 2025-12-04T13:57:58.7506321Z * Building wheel... 2025-12-04T13:57:59.7262733Z /var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/python_agnostic/csrc/ultra_norm.cu -> /var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/python_agnostic/csrc/ultra_norm.cu [skipped, no changes] 2025-12-04T13:57:59.7264905Z Successfully preprocessed all matching files. 2025-12-04T13:57:59.7265478Z Total number of unsupported CUDA function calls: 0 2025-12-04T13:57:59.7265822Z 2025-12-04T13:57:59.7265829Z 2025-12-04T13:57:59.7266016Z Total number of replaced kernel launches: 0 2025-12-04T13:57:59.7361799Z running bdist_wheel 2025-12-04T13:57:59.7631186Z running build 2025-12-04T13:57:59.7631543Z running build_ext 2025-12-04T13:57:59.7649974Z building 'python_agnostic._C' extension 2025-12-04T13:57:59.7989145Z ninja: no work to do. 2025-12-04T13:57:59.8015881Z g++ -pthread -B /opt/conda/envs/py_3.10/compiler_compat -Wno-unused-result -Wsign-compare -DNDEBUG -fwrapv -O2 -Wall -fPIC -O2 -isystem /opt/conda/envs/py_3.10/include -fPIC -O2 -isystem /opt/conda/envs/py_3.10/include -pthread -B /opt/conda/envs/py_3.10/compiler_compat -shared /var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/build/temp.linux-x86_64-cpython-310/python_agnostic/csrc/ultra_norm.o -L/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib -L/opt/rocm/lib -L/opt/rocm/hip/lib -lc10 -ltorch -ltorch_cpu -lamdhip64 -lc10_hip -ltorch_hip -o build/lib.linux-x86_64-cpython-310/python_agnostic/_C.so 2025-12-04T13:57:59.9978580Z installing to build/bdist.linux-x86_64/wheel 2025-12-04T13:57:59.9979124Z running install 2025-12-04T13:57:59.9999187Z running install_lib 2025-12-04T13:58:00.0020461Z creating build/bdist.linux-x86_64/wheel 2025-12-04T13:58:00.0021005Z creating build/bdist.linux-x86_64/wheel/python_agnostic 2025-12-04T13:58:00.0021903Z copying build/lib.linux-x86_64-cpython-310/python_agnostic/_C.so -> build/bdist.linux-x86_64/wheel/./python_agnostic 2025-12-04T13:58:00.0022810Z running install_egg_info 2025-12-04T13:58:00.0044590Z running egg_info 2025-12-04T13:58:00.0062711Z writing python_agnostic.egg-info/PKG-INFO 2025-12-04T13:58:00.0065079Z writing dependency_links to python_agnostic.egg-info/dependency_links.txt 2025-12-04T13:58:00.0066627Z writing top-level names to python_agnostic.egg-info/top_level.txt 2025-12-04T13:58:00.0089510Z reading manifest file 'python_agnostic.egg-info/SOURCES.txt' 2025-12-04T13:58:00.0093990Z writing manifest file 'python_agnostic.egg-info/SOURCES.txt' 2025-12-04T13:58:00.0095969Z Copying python_agnostic.egg-info to build/bdist.linux-x86_64/wheel/./python_agnostic-0.0-py3.10.egg-info 2025-12-04T13:58:00.0096909Z running install_scripts 2025-12-04T13:58:00.0143073Z creating build/bdist.linux-x86_64/wheel/python_agnostic-0.0.dist-info/WHEEL 2025-12-04T13:58:00.0145408Z creating '/var/lib/jenkins/pytorch/test/cpp_extensions/python_agnostic_extension/dist/.tmp-wob1jrbd/python_agnostic-0.0-cp39-abi3-linux_x86_64.whl' and adding 'build/bdist.linux-x86_64/wheel' to it 2025-12-04T13:58:00.0161191Z adding 'python_agnostic/_C.so' 2025-12-04T13:58:00.0162778Z adding 'python_agnostic-0.0.dist-info/METADATA' 2025-12-04T13:58:00.0163281Z adding 'python_agnostic-0.0.dist-info/WHEEL' 2025-12-04T13:58:00.0163801Z adding 'python_agnostic-0.0.dist-info/top_level.txt' 2025-12-04T13:58:00.0164309Z adding 'python_agnostic-0.0.dist-info/RECORD' 2025-12-04T13:58:00.0164759Z removing build/bdist.linux-x86_64/wheel 2025-12-04T13:58:00.3578632Z Successfully built python_agnostic-0.0-cp39-abi3-linux_x86_64.whl 2025-12-04T13:58:00.5199478Z Processing /var/lib/jenkins/pytorch/test/cpp_extensions/libtorch_agnostic_2_9_extension 2025-12-04T13:58:01.8480543Z Preparing metadata (pyproject.toml) ... [?25l- done 2025-12-04T13:58:01.8504327Z [?25hRequirement already satisfied: torch in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from libtorch_agnostic_2_9==0.0) (2.10.0a0+gitffd9b0f) 2025-12-04T13:58:01.8528974Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_9==0.0) (3.18.0) 2025-12-04T13:58:01.8533163Z Requirement already satisfied: typing-extensions>=4.10.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_9==0.0) (4.12.2) 2025-12-04T13:58:01.8536194Z Requirement already satisfied: sympy>=1.13.3 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_9==0.0) (1.13.3) 2025-12-04T13:58:01.8539889Z Requirement already satisfied: networkx>=2.5.1 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_9==0.0) (2.8.8) 2025-12-04T13:58:01.8542597Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_9==0.0) (3.1.6) 2025-12-04T13:58:01.8546151Z Requirement already satisfied: fsspec>=0.8.5 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_9==0.0) (2025.10.0) 2025-12-04T13:58:01.8883610Z Requirement already satisfied: mpmath<1.4,>=1.1.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from sympy>=1.13.3->torch->libtorch_agnostic_2_9==0.0) (1.3.0) 2025-12-04T13:58:01.8908997Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from jinja2->torch->libtorch_agnostic_2_9==0.0) (3.0.3) 2025-12-04T13:58:01.8913366Z Building wheels for collected packages: libtorch_agnostic_2_9 2025-12-04T13:58:03.5320919Z Building wheel for libtorch_agnostic_2_9 (pyproject.toml) ... [?25l- \ done 2025-12-04T13:58:03.5331870Z [?25h Created wheel for libtorch_agnostic_2_9: filename=libtorch_agnostic_2_9-0.0-cp39-abi3-linux_x86_64.whl size=57687 sha256=9c703c5090c39301515a3f14d8be20613ccc90762fc1842621f3d6add9548250 2025-12-04T13:58:03.5333397Z Stored in directory: /tmp/pip-ephem-wheel-cache-23j5wu2z/wheels/5f/cf/13/81cd7305efc425780825ac0d330cbfca8f1ec0a935d4028550 2025-12-04T13:58:03.5365652Z Successfully built libtorch_agnostic_2_9 2025-12-04T13:58:03.6960505Z Installing collected packages: libtorch_agnostic_2_9 2025-12-04T13:58:03.6994706Z Successfully installed libtorch_agnostic_2_9-0.0 2025-12-04T13:58:03.8799201Z Processing /var/lib/jenkins/pytorch/test/cpp_extensions/libtorch_agnostic_2_10_extension 2025-12-04T13:58:05.2398351Z Preparing metadata (pyproject.toml) ... [?25l- done 2025-12-04T13:58:05.2412768Z [?25hRequirement already satisfied: torch in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from libtorch_agnostic_2_10==0.0) (2.10.0a0+gitffd9b0f) 2025-12-04T13:58:05.2424893Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_10==0.0) (3.18.0) 2025-12-04T13:58:05.2427763Z Requirement already satisfied: typing-extensions>=4.10.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_10==0.0) (4.12.2) 2025-12-04T13:58:05.2429244Z Requirement already satisfied: sympy>=1.13.3 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_10==0.0) (1.13.3) 2025-12-04T13:58:05.2430794Z Requirement already satisfied: networkx>=2.5.1 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_10==0.0) (2.8.8) 2025-12-04T13:58:05.2432140Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_10==0.0) (3.1.6) 2025-12-04T13:58:05.2433503Z Requirement already satisfied: fsspec>=0.8.5 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->libtorch_agnostic_2_10==0.0) (2025.10.0) 2025-12-04T13:58:05.2597477Z Requirement already satisfied: mpmath<1.4,>=1.1.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from sympy>=1.13.3->torch->libtorch_agnostic_2_10==0.0) (1.3.0) 2025-12-04T13:58:05.2621873Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from jinja2->torch->libtorch_agnostic_2_10==0.0) (3.0.3) 2025-12-04T13:58:05.2626412Z Building wheels for collected packages: libtorch_agnostic_2_10 2025-12-04T13:58:07.1289796Z Building wheel for libtorch_agnostic_2_10 (pyproject.toml) ... [?25l- \ done 2025-12-04T13:58:07.1305337Z [?25h Created wheel for libtorch_agnostic_2_10: filename=libtorch_agnostic_2_10-0.0-cp39-abi3-linux_x86_64.whl size=85717 sha256=113a9a56137c4c4a7ae6936eb0f7255c252074d0a8cdeceb7e66e0065ee53a3f 2025-12-04T13:58:07.1306879Z Stored in directory: /tmp/pip-ephem-wheel-cache-_l2ee35m/wheels/65/df/cc/92bed30159c7744af4875e08c646cf811fe190b78052f3234e 2025-12-04T13:58:07.1340538Z Successfully built libtorch_agnostic_2_10 2025-12-04T13:58:07.2946629Z Installing collected packages: libtorch_agnostic_2_10 2025-12-04T13:58:07.2986919Z Successfully installed libtorch_agnostic_2_10-0.0 2025-12-04T13:58:07.3322236Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2025-12-04T13:58:07.3329210Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_cpp_extensions_aot_no_ninja.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2025-12-04 13:58:07.332412] 2025-12-04T13:58:09.7512968Z 2025-12-04T13:58:09.7514246Z test_cpp_extensions_aot_no_ninja 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_cpp_extensions_aot_no_ninja_1.1_b3a88d80afad44b9_.log 2025-12-04T13:58:09.7522982Z Running 21 items in this shard: test/test_cpp_extensions_aot_no_ninja.py::TestCppExtensionAOT::test_backward, test/test_cpp_extensions_aot_no_ninja.py::TestCppExtensionAOT::test_cublas_extension, test/test_cpp_extensions_aot_no_ninja.py::TestCppExtensionAOT::test_cuda_dlink_libs, test/test_cpp_extensions_aot_no_ninja.py::TestCppExtensionAOT::test_cuda_extension, test/test_cpp_extensions_aot_no_ninja.py::TestCppExtensionAOT::test_cusolver_extension, test/test_cpp_extensions_aot_no_ninja.py::TestCppExtensionAOT::test_extension_function, test/test_cpp_extensions_aot_no_ninja.py::TestCppExtensionAOT::test_extension_module, test/test_cpp_extensions_aot_no_ninja.py::TestCppExtensionAOT::test_mps_extension, test/test_cpp_extensions_aot_no_ninja.py::TestCppExtensionAOT::test_no_python_abi_suffix_sets_the_correct_library_name, test/test_cpp_extensions_aot_no_ninja.py::TestCppExtensionAOT::test_optional, test/test_cpp_extensions_aot_no_ninja.py::TestCppExtensionAOT::test_sycl_extension, test/test_cpp_extensions_aot_no_ninja.py::TestPybindTypeCasters::test_pybind_return_types, test/test_cpp_extensions_aot_no_ninja.py::TestMAIATensor::test_add, test/test_cpp_extensions_aot_no_ninja.py::TestMAIATensor::test_autocast_apis_for_maia_device, test/test_cpp_extensions_aot_no_ninja.py::TestMAIATensor::test_conv_backend_override, test/test_cpp_extensions_aot_no_ninja.py::TestMAIATensor::test_matmul_autocast_default_precision, test/test_cpp_extensions_aot_no_ninja.py::TestMAIATensor::test_matmul_autocast_float16_precision, test/test_cpp_extensions_aot_no_ninja.py::TestMAIATensor::test_unregistered, test/test_cpp_extensions_aot_no_ninja.py::TestMAIATensor::test_zeros, test/test_cpp_extensions_aot_no_ninja.py::TestRNGExtension::test_rng, test/test_cpp_extensions_aot_no_ninja.py::TestTorchLibrary::test_torch_library 2025-12-04T13:58:09.7531530Z 2025-12-04T13:58:09.7531941Z Finished test_cpp_extensions_aot_no_ninja 1/1 ... [2025-12-04 13:58:09.751024][4981472.247211466], took 0.38min 2025-12-04T13:58:09.7553179Z Parsing testcases for test report: /var/lib/jenkins/pytorch/test/test-reports/python-pytest/inductor.test_aot_inductor/inductor.test_aot_inductor-fc937ce9b79649b3.xml 2025-12-04T13:58:09.7704139Z Failed to parse and upload json test reports: Unable to locate credentials 2025-12-04T13:58:11.8242100Z Running test batch 'tests to run' cost 15963.87 seconds 2025-12-04T13:58:11.8253146Z Emitting td_test_failure_stats_v2 2025-12-04T13:58:11.8256982Z Writing 1 documents to S3 ossci-raw-job-status/ossci_uploaded_metrics/td_test_failure_stats_v2_1764856691_45764f54d11911f0bb2e16e97310a121 2025-12-04T13:58:13.8548444Z /var/lib/jenkins/pytorch/tools/stats/upload_metrics.py:156: UserWarning: Error uploading metric td_test_failure_stats_v2 to DynamoDB: Unable to locate credentials 2025-12-04T13:58:13.8549625Z warn(f"Error uploading metric {metric_name} to DynamoDB: {e}") 2025-12-04T13:58:13.8550148Z test_dataloader 1/1 failed! 2025-12-04T13:58:14.4891135Z 2025-12-04T13:58:14.4891650Z real 266m9.619s 2025-12-04T13:58:14.4892091Z user 1004m1.783s 2025-12-04T13:58:14.4892400Z sys 65m25.175s 2025-12-04T13:58:14.4892805Z + sccache_epilogue 2025-12-04T13:58:14.4893224Z + echo '::group::Sccache Compilation Log' 2025-12-04T13:58:14.4894252Z ##[group]Sccache Compilation Log 2025-12-04T13:58:14.4894725Z + echo '=================== sccache compilation log ===================' 2025-12-04T13:58:14.4895277Z =================== sccache compilation log =================== 2025-12-04T13:58:14.4896037Z + python /var/lib/jenkins/pytorch/.ci/pytorch/print_sccache_log.py /var/lib/jenkins/sccache_error.log 2025-12-04T13:58:14.4979351Z + echo '=========== If your build fails, please take a look at the log above for possible reasons ===========' 2025-12-04T13:58:14.4980825Z =========== If your build fails, please take a look at the log above for possible reasons =========== 2025-12-04T13:58:14.4981511Z + sccache --show-stats 2025-12-04T13:58:14.4995408Z Compile requests 4404 2025-12-04T13:58:14.4995901Z Compile requests executed 612 2025-12-04T13:58:14.4996298Z Cache hits 55 2025-12-04T13:58:14.4996681Z Cache hits (C/C++) 52 2025-12-04T13:58:14.4997057Z Cache hits (HIP) 3 2025-12-04T13:58:14.4997437Z Cache misses 535 2025-12-04T13:58:14.4997813Z Cache misses (C/C++) 515 2025-12-04T13:58:14.4998272Z Cache misses (HIP) 20 2025-12-04T13:58:14.4998666Z Cache hits rate 9.32 % 2025-12-04T13:58:14.4999071Z Cache hits rate (C/C++) 9.17 % 2025-12-04T13:58:14.4999472Z Cache hits rate (HIP) 13.04 % 2025-12-04T13:58:14.4999867Z Cache timeouts 0 2025-12-04T13:58:14.5000261Z Cache read errors 0 2025-12-04T13:58:14.5000649Z Forced recaches 0 2025-12-04T13:58:14.5001177Z Cache write errors 0 2025-12-04T13:58:14.5001560Z Cache errors 1 2025-12-04T13:58:14.5001935Z Cache errors (C/C++) 1 2025-12-04T13:58:14.5002321Z Compilations 535 2025-12-04T13:58:14.5002710Z Compilation failures 21 2025-12-04T13:58:14.5003119Z Non-cacheable compilations 0 2025-12-04T13:58:14.5003524Z Non-cacheable calls 296 2025-12-04T13:58:14.5003926Z Non-compilation calls 3496 2025-12-04T13:58:14.5004338Z Unsupported compiler calls 0 2025-12-04T13:58:14.5004749Z Average cache write 0.000 s 2025-12-04T13:58:14.5005172Z Average compiler 2.065 s 2025-12-04T13:58:14.5005588Z Average cache read hit 0.000 s 2025-12-04T13:58:14.5006013Z Failed distributed compilations 0 2025-12-04T13:58:14.5006292Z 2025-12-04T13:58:14.5006430Z Non-cacheable reasons: 2025-12-04T13:58:14.5006793Z unknown source language 244 2025-12-04T13:58:14.5007194Z -E 52 2025-12-04T13:58:14.5007453Z 2025-12-04T13:58:14.5007715Z Cache location Local disk: "/var/lib/jenkins/.cache/sccache" 2025-12-04T13:58:14.5008320Z Use direct/preprocessor mode? yes 2025-12-04T13:58:14.5008737Z Version (client) 0.10.0 2025-12-04T13:58:14.5009150Z Cache size 56 MiB 2025-12-04T13:58:14.5009565Z Max cache size 10 GiB 2025-12-04T13:58:14.5009982Z + sccache --stop-server 2025-12-04T13:58:14.5026033Z Stopping sccache server... 2025-12-04T13:58:14.5030898Z Compile requests 4404 2025-12-04T13:58:14.5031304Z Compile requests executed 612 2025-12-04T13:58:14.5031725Z Cache hits 55 2025-12-04T13:58:14.5032115Z Cache hits (C/C++) 52 2025-12-04T13:58:14.5032498Z Cache hits (HIP) 3 2025-12-04T13:58:14.5032888Z Cache misses 535 2025-12-04T13:58:14.5033271Z Cache misses (C/C++) 515 2025-12-04T13:58:14.5033659Z Cache misses (HIP) 20 2025-12-04T13:58:14.5034066Z Cache hits rate 9.32 % 2025-12-04T13:58:14.5034478Z Cache hits rate (C/C++) 9.17 % 2025-12-04T13:58:14.5034888Z Cache hits rate (HIP) 13.04 % 2025-12-04T13:58:14.5035291Z Cache timeouts 0 2025-12-04T13:58:14.5035676Z Cache read errors 0 2025-12-04T13:58:14.5036067Z Forced recaches 0 2025-12-04T13:58:14.5036456Z Cache write errors 0 2025-12-04T13:58:14.5036837Z Cache errors 1 2025-12-04T13:58:14.5037217Z Cache errors (C/C++) 1 2025-12-04T13:58:14.5037600Z Compilations 535 2025-12-04T13:58:14.5038173Z Compilation failures 21 2025-12-04T13:58:14.5038626Z Non-cacheable compilations 0 2025-12-04T13:58:14.5039038Z Non-cacheable calls 296 2025-12-04T13:58:14.5039436Z Non-compilation calls 3496 2025-12-04T13:58:14.5039839Z Unsupported compiler calls 0 2025-12-04T13:58:14.5040252Z Average cache write 0.000 s 2025-12-04T13:58:14.5040664Z Average compiler 2.065 s 2025-12-04T13:58:14.5041078Z Average cache read hit 0.000 s 2025-12-04T13:58:14.5041493Z Failed distributed compilations 0 2025-12-04T13:58:14.5041766Z 2025-12-04T13:58:14.5041900Z Non-cacheable reasons: 2025-12-04T13:58:14.5042254Z unknown source language 244 2025-12-04T13:58:14.5042638Z -E 52 2025-12-04T13:58:14.5042889Z 2025-12-04T13:58:14.5043144Z Cache location Local disk: "/var/lib/jenkins/.cache/sccache" 2025-12-04T13:58:14.5043673Z Use direct/preprocessor mode? yes 2025-12-04T13:58:14.5044084Z Version (client) 0.10.0 2025-12-04T13:58:14.5044495Z Cache size 56 MiB 2025-12-04T13:58:14.5044980Z Max cache size 10 GiB 2025-12-04T13:58:14.5045392Z + echo ::endgroup:: 2025-12-04T13:58:14.5045953Z ##[endgroup] 2025-12-04T13:58:14.5137765Z ##[error]Process completed with exit code 1. 2025-12-04T13:58:14.5167220Z ##[group]Run # copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 2025-12-04T13:58:14.5167545Z # copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 2025-12-04T13:58:14.5167921Z docker exec -t "2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63" sh -c "cd ../pytorch && sudo cp -R test/test-reports ../workspace/test" 2025-12-04T13:58:14.5172854Z shell: /usr/bin/bash -e {0} 2025-12-04T13:58:14.5172968Z env: 2025-12-04T13:58:14.5173064Z GIT_DEFAULT_BRANCH: main 2025-12-04T13:58:14.5173203Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T13:58:14.5173387Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T13:58:14.5173563Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T13:58:14.5173966Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T13:58:14.5174340Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T13:58:14.5174456Z AWS_REGION: us-east-1 2025-12-04T13:58:14.5174628Z AWS_ACCESS_KEY_ID: *** 2025-12-04T13:58:14.5174783Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T13:58:14.5176809Z AWS_SESSION_TOKEN: *** 2025-12-04T13:58:14.5176979Z CONTAINER_NAME: 2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63 2025-12-04T13:58:14.5177184Z ##[endgroup] 2025-12-04T13:58:14.5896484Z ##[group]Run docker exec -t "2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63" sh -c "sudo chown -R 1001:1001 test" 2025-12-04T13:58:14.5897801Z docker exec -t "2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63" sh -c "sudo chown -R 1001:1001 test" 2025-12-04T13:58:14.5907700Z shell: /usr/bin/bash -e {0} 2025-12-04T13:58:14.5908054Z env: 2025-12-04T13:58:14.5908421Z GIT_DEFAULT_BRANCH: main 2025-12-04T13:58:14.5908868Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T13:58:14.5909441Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T13:58:14.5909980Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T13:58:14.5911248Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T13:58:14.5912510Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T13:58:14.5912889Z AWS_REGION: us-east-1 2025-12-04T13:58:14.5913340Z AWS_ACCESS_KEY_ID: *** 2025-12-04T13:58:14.5913959Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T13:58:14.5920730Z AWS_SESSION_TOKEN: *** 2025-12-04T13:58:14.5921285Z CONTAINER_NAME: 2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63 2025-12-04T13:58:14.5921875Z ##[endgroup] 2025-12-04T13:58:14.6820650Z ##[group]Run cat test/**/*_toprint.log || true 2025-12-04T13:58:14.6821155Z cat test/**/*_toprint.log || true 2025-12-04T13:58:14.6830837Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2025-12-04T13:58:14.6831305Z env: 2025-12-04T13:58:14.6831606Z GIT_DEFAULT_BRANCH: main 2025-12-04T13:58:14.6832048Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T13:58:14.6832628Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T13:58:14.6833175Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T13:58:14.6834427Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T13:58:14.6835800Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T13:58:14.6836175Z AWS_REGION: us-east-1 2025-12-04T13:58:14.6836629Z AWS_ACCESS_KEY_ID: *** 2025-12-04T13:58:14.6837156Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T13:58:14.6844038Z AWS_SESSION_TOKEN: *** 2025-12-04T13:58:14.6844604Z CONTAINER_NAME: 2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63 2025-12-04T13:58:14.6845197Z ##[endgroup] 2025-12-04T13:58:14.6941400Z cat: 'test/**/*_toprint.log': No such file or directory 2025-12-04T13:58:14.7115524Z Prepare all required actions 2025-12-04T13:58:14.7116440Z Getting action download info 2025-12-04T13:58:15.0463751Z Download action repository 'seemethere/upload-artifact-s3@v5' (SHA:baba72d0712b404f646cebe0730933554ebce96a) 2025-12-04T13:58:15.9061032Z Download action repository 'actions/upload-artifact@v4' (SHA:ea165f8d65b6e75b540449e92b4886f43607fa02) 2025-12-04T13:58:16.8743957Z ##[group]Run ./.github/actions/upload-test-artifacts 2025-12-04T13:58:16.8744268Z with: 2025-12-04T13:58:16.8744452Z use-gha: true 2025-12-04T13:58:16.8744761Z file-suffix: test-default-4-6-linux.rocm.gpu.gfx942.1.b_57116213162 2025-12-04T13:58:16.8745109Z s3-bucket: gha-artifacts 2025-12-04T13:58:16.8745326Z env: 2025-12-04T13:58:16.8745509Z GIT_DEFAULT_BRANCH: main 2025-12-04T13:58:16.8745777Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T13:58:16.8746133Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T13:58:16.8746493Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T13:58:16.8747278Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T13:58:16.8748045Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T13:58:16.8748534Z AWS_REGION: us-east-1 2025-12-04T13:58:16.8748818Z AWS_ACCESS_KEY_ID: *** 2025-12-04T13:58:16.8749131Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T13:58:16.8753432Z AWS_SESSION_TOKEN: *** 2025-12-04T13:58:16.8753783Z CONTAINER_NAME: 2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63 2025-12-04T13:58:16.8754151Z ##[endgroup] 2025-12-04T13:58:16.8816860Z ##[group]Run actions/upload-artifact@v4 2025-12-04T13:58:16.8816984Z with: 2025-12-04T13:58:16.8817161Z name: test-jsons-runattempt1-test-default-4-6-linux.rocm.gpu.gfx942.1.b_57116213162.zip 2025-12-04T13:58:16.8817365Z retention-days: 14 2025-12-04T13:58:16.8817471Z if-no-files-found: warn 2025-12-04T13:58:16.8817581Z path: test/**/*.json 2025-12-04T13:58:16.8817682Z compression-level: 6 2025-12-04T13:58:16.8817781Z overwrite: false 2025-12-04T13:58:16.8817881Z include-hidden-files: false 2025-12-04T13:58:16.8817990Z env: 2025-12-04T13:58:16.8818142Z GIT_DEFAULT_BRANCH: main 2025-12-04T13:58:16.8818276Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T13:58:16.8818531Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T13:58:16.8818698Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T13:58:16.8819081Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T13:58:16.8819447Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T13:58:16.8819560Z AWS_REGION: us-east-1 2025-12-04T13:58:16.8819692Z AWS_ACCESS_KEY_ID: *** 2025-12-04T13:58:16.8819847Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T13:58:16.8821904Z AWS_SESSION_TOKEN: *** 2025-12-04T13:58:16.8822073Z CONTAINER_NAME: 2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63 2025-12-04T13:58:16.8822254Z ##[endgroup] 2025-12-04T13:58:17.3167717Z With the provided path, there will be 6 files uploaded 2025-12-04T13:58:17.3171001Z Artifact name is valid! 2025-12-04T13:58:17.3171560Z Root directory input is valid! 2025-12-04T13:58:17.5581911Z Beginning upload of artifact content to blob storage 2025-12-04T13:58:18.0165344Z Uploaded bytes 46621 2025-12-04T13:58:18.0983806Z Finished uploading artifact content to blob storage! 2025-12-04T13:58:18.0988466Z SHA256 digest of uploaded artifact zip is 325e1d5d2d7e773c7be4e1bd45dd89ebf3a176c0a5914de730958538da5053a9 2025-12-04T13:58:18.0990668Z Finalizing artifact upload 2025-12-04T13:58:18.2997707Z Artifact test-jsons-runattempt1-test-default-4-6-linux.rocm.gpu.gfx942.1.b_57116213162.zip.zip successfully finalized. Artifact ID 4764819718 2025-12-04T13:58:18.2999474Z Artifact test-jsons-runattempt1-test-default-4-6-linux.rocm.gpu.gfx942.1.b_57116213162.zip has been successfully uploaded! Final size is 46621 bytes. Artifact ID is 4764819718 2025-12-04T13:58:18.3009904Z Artifact download URL: https://github.com/pytorch/pytorch/actions/runs/19922849170/artifacts/4764819718 2025-12-04T13:58:18.3194970Z ##[group]Run actions/upload-artifact@v4 2025-12-04T13:58:18.3195277Z with: 2025-12-04T13:58:18.3195670Z name: test-reports-runattempt1-test-default-4-6-linux.rocm.gpu.gfx942.1.b_57116213162.zip 2025-12-04T13:58:18.3196117Z retention-days: 14 2025-12-04T13:58:18.3196350Z if-no-files-found: ignore 2025-12-04T13:58:18.3196594Z path: test/**/*.xml test/**/*.csv 2025-12-04T13:58:18.3196857Z compression-level: 6 2025-12-04T13:58:18.3197078Z overwrite: false 2025-12-04T13:58:18.3197303Z include-hidden-files: false 2025-12-04T13:58:18.3197535Z env: 2025-12-04T13:58:18.3197722Z GIT_DEFAULT_BRANCH: main 2025-12-04T13:58:18.3198024Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T13:58:18.3198454Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T13:58:18.3198798Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T13:58:18.3199603Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T13:58:18.3200400Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T13:58:18.3200652Z AWS_REGION: us-east-1 2025-12-04T13:58:18.3200977Z AWS_ACCESS_KEY_ID: *** 2025-12-04T13:58:18.3201302Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T13:58:18.3205492Z AWS_SESSION_TOKEN: *** 2025-12-04T13:58:18.3205859Z CONTAINER_NAME: 2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63 2025-12-04T13:58:18.3206245Z ##[endgroup] 2025-12-04T13:58:18.7904307Z With the provided path, there will be 110 files uploaded 2025-12-04T13:58:18.7907104Z Artifact name is valid! 2025-12-04T13:58:18.7907627Z Root directory input is valid! 2025-12-04T13:58:19.0221238Z Beginning upload of artifact content to blob storage 2025-12-04T13:58:19.8546790Z Uploaded bytes 685346 2025-12-04T13:58:19.9272578Z Finished uploading artifact content to blob storage! 2025-12-04T13:58:19.9277176Z SHA256 digest of uploaded artifact zip is 85f39d7aca5aa649b9fa8ac21ab85fb1b056efafb2c53f62f2709316c475a287 2025-12-04T13:58:19.9279104Z Finalizing artifact upload 2025-12-04T13:58:20.1043596Z Artifact test-reports-runattempt1-test-default-4-6-linux.rocm.gpu.gfx942.1.b_57116213162.zip.zip successfully finalized. Artifact ID 4764820062 2025-12-04T13:58:20.1045245Z Artifact test-reports-runattempt1-test-default-4-6-linux.rocm.gpu.gfx942.1.b_57116213162.zip has been successfully uploaded! Final size is 685346 bytes. Artifact ID is 4764820062 2025-12-04T13:58:20.1053263Z Artifact download URL: https://github.com/pytorch/pytorch/actions/runs/19922849170/artifacts/4764820062 2025-12-04T13:58:20.1262136Z ##[group]Run actions/upload-artifact@v4 2025-12-04T13:58:20.1262607Z with: 2025-12-04T13:58:20.1263188Z name: logs-runattempt1-test-default-4-6-linux.rocm.gpu.gfx942.1.b_57116213162.zip 2025-12-04T13:58:20.1263860Z retention-days: 14 2025-12-04T13:58:20.1264234Z if-no-files-found: ignore 2025-12-04T13:58:20.1264643Z path: usage_log.txt test/**/*.log 2025-12-04T13:58:20.1265063Z compression-level: 6 2025-12-04T13:58:20.1265436Z overwrite: false 2025-12-04T13:58:20.1265914Z include-hidden-files: false 2025-12-04T13:58:20.1266295Z env: 2025-12-04T13:58:20.1266608Z GIT_DEFAULT_BRANCH: main 2025-12-04T13:58:20.1267088Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T13:58:20.1267691Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T13:58:20.1268701Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T13:58:20.1270203Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T13:58:20.1271464Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T13:58:20.1271863Z AWS_REGION: us-east-1 2025-12-04T13:58:20.1272365Z AWS_ACCESS_KEY_ID: *** 2025-12-04T13:58:20.1272901Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T13:58:20.1279641Z AWS_SESSION_TOKEN: *** 2025-12-04T13:58:20.1280237Z CONTAINER_NAME: 2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63 2025-12-04T13:58:20.1280855Z ##[endgroup] 2025-12-04T13:58:20.6328784Z Multiple search paths detected. Calculating the least common ancestor of all paths 2025-12-04T13:58:20.6330250Z The least common ancestor is /home/runner/_work/pytorch/pytorch. This will be the root directory of the artifact 2025-12-04T13:58:20.6356968Z With the provided path, there will be 96 files uploaded 2025-12-04T13:58:20.6357729Z Artifact name is valid! 2025-12-04T13:58:20.6358193Z Root directory input is valid! 2025-12-04T13:58:20.8878831Z Beginning upload of artifact content to blob storage 2025-12-04T13:58:21.9305961Z Uploaded bytes 1171189 2025-12-04T13:58:22.0009727Z Finished uploading artifact content to blob storage! 2025-12-04T13:58:22.0014035Z SHA256 digest of uploaded artifact zip is f3f0f8b946b130abba831a5c272db411e6b8ad4ec0cb26a9f0571c534f49b240 2025-12-04T13:58:22.0016459Z Finalizing artifact upload 2025-12-04T13:58:22.1464754Z Artifact logs-runattempt1-test-default-4-6-linux.rocm.gpu.gfx942.1.b_57116213162.zip.zip successfully finalized. Artifact ID 4764820464 2025-12-04T13:58:22.1466383Z Artifact logs-runattempt1-test-default-4-6-linux.rocm.gpu.gfx942.1.b_57116213162.zip has been successfully uploaded! Final size is 1171189 bytes. Artifact ID is 4764820464 2025-12-04T13:58:22.1474807Z Artifact download URL: https://github.com/pytorch/pytorch/actions/runs/19922849170/artifacts/4764820464 2025-12-04T13:58:22.1697888Z ##[group]Run # shellcheck disable=SC2156 2025-12-04T13:58:22.1698482Z # shellcheck disable=SC2156 2025-12-04T13:58:22.1699243Z find . -iname "core.[1-9]*" -exec docker exec "${CONTAINER_NAME}" sh -c "gdb python {} -ex 'bt' -ex 'q'" \; 2025-12-04T13:58:22.1709112Z shell: /usr/bin/bash -e {0} 2025-12-04T13:58:22.1709511Z env: 2025-12-04T13:58:22.1709836Z GIT_DEFAULT_BRANCH: main 2025-12-04T13:58:22.1710316Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T13:58:22.1710943Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T13:58:22.1711658Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T13:58:22.1712963Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T13:58:22.1714235Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T13:58:22.1714682Z AWS_REGION: us-east-1 2025-12-04T13:58:22.1715172Z AWS_ACCESS_KEY_ID: *** 2025-12-04T13:58:22.1715717Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T13:58:22.1722611Z AWS_SESSION_TOKEN: *** 2025-12-04T13:58:22.1723214Z CONTAINER_NAME: 2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63 2025-12-04T13:58:22.1723839Z ##[endgroup] 2025-12-04T13:58:22.3132914Z ##[group]Run actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02 2025-12-04T13:58:22.3133517Z with: 2025-12-04T13:58:22.3133957Z name: coredumps-default-4-6-linux.rocm.gpu.gfx942.1.b 2025-12-04T13:58:22.3134489Z retention-days: 14 2025-12-04T13:58:22.3135004Z if-no-files-found: ignore 2025-12-04T13:58:22.3135394Z path: ./**/core.[1-9]* 2025-12-04T13:58:22.3135762Z compression-level: 6 2025-12-04T13:58:22.3136125Z overwrite: false 2025-12-04T13:58:22.3136484Z include-hidden-files: false 2025-12-04T13:58:22.3136867Z env: 2025-12-04T13:58:22.3137185Z GIT_DEFAULT_BRANCH: main 2025-12-04T13:58:22.3137659Z RUNNER_ARTIFACT_DIR: /home/runner/_work/_temp/artifacts 2025-12-04T13:58:22.3138355Z RUNNER_TEST_RESULTS_DIR: /home/runner/_work/_temp/test-results 2025-12-04T13:58:22.3138928Z RUNNER_DOCS_DIR: /home/runner/_work/_temp/docs 2025-12-04T13:58:22.3140275Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --group-add 110 --device /dev/dri/renderD152 --group-add video --group-add 109 --group-add daemon --group-add bin --cap-add=SYS_PTRACE --security-opt seccomp=unconfined --network=host 2025-12-04T13:58:22.3141542Z AWS_DEFAULT_REGION: us-east-1 2025-12-04T13:58:22.3141951Z AWS_REGION: us-east-1 2025-12-04T13:58:22.3142426Z AWS_ACCESS_KEY_ID: *** 2025-12-04T13:58:22.3142972Z AWS_SECRET_ACCESS_KEY: *** 2025-12-04T13:58:22.3150062Z AWS_SESSION_TOKEN: *** 2025-12-04T13:58:22.3150662Z CONTAINER_NAME: 2ab7e336d6015a57cd6d1e5bbfb96b93c7a05b35f086ff4567f6d3abdb9ddd63 2025-12-04T13:58:22.3151292Z ##[endgroup] 2025-12-04T13:58:27.9258366Z No files were found with the provided path: ./**/core.[1-9]*. No artifacts will be uploaded. 2025-12-04T13:58:27.9566918Z Post job cleanup. 2025-12-04T13:58:27.9605175Z Post job cleanup. 2025-12-04T13:58:27.9808029Z Logging out of registry 308535385114.dkr.ecr.us-east-1.amazonaws.com 2025-12-04T13:58:28.0067749Z Post job cleanup. 2025-12-04T13:58:28.0784373Z Post job cleanup. 2025-12-04T13:58:28.0874731Z Post job cleanup. 2025-12-04T13:58:28.1355788Z [command]/usr/bin/git version 2025-12-04T13:58:28.1387034Z git version 2.52.0 2025-12-04T13:58:28.1412348Z Copying '/home/runner/.gitconfig' to '/home/runner/_work/_temp/6439b654-6c57-4d14-9528-b300a5312fb2/.gitconfig' 2025-12-04T13:58:28.1419766Z Temporarily overriding HOME='/home/runner/_work/_temp/6439b654-6c57-4d14-9528-b300a5312fb2' before making global git config changes 2025-12-04T13:58:28.1420832Z Adding repository directory to the temporary git global config as a safe directory 2025-12-04T13:58:28.1422288Z [command]/usr/bin/git config --global --add safe.directory /home/runner/_work/pytorch/pytorch 2025-12-04T13:58:28.1456767Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-12-04T13:58:28.1479538Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-12-04T13:58:28.1751517Z Entering 'android/libs/fbjni' 2025-12-04T13:58:28.1813659Z Entering 'third_party/FP16' 2025-12-04T13:58:28.1840416Z Entering 'third_party/FXdiv' 2025-12-04T13:58:28.1865829Z Entering 'third_party/NNPACK' 2025-12-04T13:58:28.1892053Z Entering 'third_party/NVTX' 2025-12-04T13:58:28.1930392Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T13:58:28.1960093Z Entering 'third_party/XNNPACK' 2025-12-04T13:58:28.2018528Z Entering 'third_party/aiter' 2025-12-04T13:58:28.2046182Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T13:58:28.2072462Z Entering 'third_party/benchmark' 2025-12-04T13:58:28.2101677Z Entering 'third_party/composable_kernel' 2025-12-04T13:58:28.2129433Z Entering 'third_party/cpp-httplib' 2025-12-04T13:58:28.2173811Z Entering 'third_party/cpuinfo' 2025-12-04T13:58:28.2204946Z Entering 'third_party/cudnn_frontend' 2025-12-04T13:58:28.2248204Z Entering 'third_party/cutlass' 2025-12-04T13:58:28.2295862Z Entering 'third_party/fbgemm' 2025-12-04T13:58:28.2325891Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T13:58:28.2364998Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T13:58:28.2407085Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T13:58:28.2429735Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T13:58:28.2474564Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T13:58:28.2513521Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T13:58:28.2544552Z Entering 'third_party/fbgemm/external/json' 2025-12-04T13:58:28.2594159Z Entering 'third_party/flash-attention' 2025-12-04T13:58:28.2622981Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T13:58:28.2644426Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T13:58:28.2669479Z Entering 'third_party/flatbuffers' 2025-12-04T13:58:28.2712979Z Entering 'third_party/fmt' 2025-12-04T13:58:28.2745966Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T13:58:28.2774258Z Entering 'third_party/gloo' 2025-12-04T13:58:28.2806855Z Entering 'third_party/googletest' 2025-12-04T13:58:28.2829710Z Entering 'third_party/ideep' 2025-12-04T13:58:28.2853598Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T13:58:28.2878654Z Entering 'third_party/ittapi' 2025-12-04T13:58:28.2908866Z Entering 'third_party/kineto' 2025-12-04T13:58:28.2932799Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T13:58:28.2974603Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T13:58:28.3034361Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T13:58:28.3074182Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T13:58:28.3118507Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T13:58:28.3146803Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T13:58:28.3185798Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T13:58:28.3234855Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T13:58:28.3280527Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T13:58:28.3308290Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T13:58:28.3330116Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T13:58:28.3353833Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T13:58:28.3381163Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T13:58:28.3432118Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T13:58:28.3455160Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T13:58:28.3477717Z Entering 'third_party/kleidiai' 2025-12-04T13:58:28.3504127Z Entering 'third_party/mimalloc' 2025-12-04T13:58:28.3546120Z Entering 'third_party/nlohmann' 2025-12-04T13:58:28.3596842Z Entering 'third_party/onnx' 2025-12-04T13:58:28.3634467Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T13:58:28.3688497Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T13:58:28.3721158Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T13:58:28.3746464Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T13:58:28.3791553Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T13:58:28.3833276Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T13:58:28.3867443Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T13:58:28.3905903Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T13:58:28.3948627Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T13:58:28.3990813Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T13:58:28.4029082Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T13:58:28.4073244Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T13:58:28.4125565Z Entering 'third_party/pocketfft' 2025-12-04T13:58:28.4149640Z Entering 'third_party/protobuf' 2025-12-04T13:58:28.4177223Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T13:58:28.4211993Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T13:58:28.4248590Z Entering 'third_party/psimd' 2025-12-04T13:58:28.4281671Z Entering 'third_party/pthreadpool' 2025-12-04T13:58:28.4316483Z Entering 'third_party/pybind11' 2025-12-04T13:58:28.4355856Z Entering 'third_party/python-peachpy' 2025-12-04T13:58:28.4379837Z Entering 'third_party/sleef' 2025-12-04T13:58:28.4408009Z Entering 'third_party/tensorpipe' 2025-12-04T13:58:28.4439211Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T13:58:28.4473654Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T13:58:28.4515722Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T13:58:28.4564033Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T13:58:28.4606051Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T13:58:28.4678534Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-12-04T13:58:28.4707995Z http.https://github.com/.extraheader 2025-12-04T13:58:28.4720820Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2025-12-04T13:58:28.4754380Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-12-04T13:58:28.5034501Z Entering 'android/libs/fbjni' 2025-12-04T13:58:28.5064942Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5088821Z Entering 'third_party/FP16' 2025-12-04T13:58:28.5112373Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5141753Z Entering 'third_party/FXdiv' 2025-12-04T13:58:28.5166678Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5197975Z Entering 'third_party/NNPACK' 2025-12-04T13:58:28.5213382Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5231791Z Entering 'third_party/NVTX' 2025-12-04T13:58:28.5254774Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5279022Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T13:58:28.5292359Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5310372Z Entering 'third_party/XNNPACK' 2025-12-04T13:58:28.5323602Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5347000Z Entering 'third_party/aiter' 2025-12-04T13:58:28.5377101Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5395838Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T13:58:28.5418256Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5443547Z Entering 'third_party/benchmark' 2025-12-04T13:58:28.5457028Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5474681Z Entering 'third_party/composable_kernel' 2025-12-04T13:58:28.5508002Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5549107Z Entering 'third_party/cpp-httplib' 2025-12-04T13:58:28.5562376Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5596708Z Entering 'third_party/cpuinfo' 2025-12-04T13:58:28.5611379Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5628052Z Entering 'third_party/cudnn_frontend' 2025-12-04T13:58:28.5642155Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5679764Z Entering 'third_party/cutlass' 2025-12-04T13:58:28.5694845Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5730837Z Entering 'third_party/fbgemm' 2025-12-04T13:58:28.5744308Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5773171Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T13:58:28.5798420Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5827427Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T13:58:28.5838895Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5859444Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T13:58:28.5881909Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5922487Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T13:58:28.5953605Z http.https://github.com/.extraheader 2025-12-04T13:58:28.5978038Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T13:58:28.5997573Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6030972Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T13:58:28.6068230Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6096793Z Entering 'third_party/fbgemm/external/json' 2025-12-04T13:58:28.6122341Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6154710Z Entering 'third_party/flash-attention' 2025-12-04T13:58:28.6181377Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6200060Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T13:58:28.6226532Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6249553Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T13:58:28.6272545Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6297025Z Entering 'third_party/flatbuffers' 2025-12-04T13:58:28.6321376Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6340916Z Entering 'third_party/fmt' 2025-12-04T13:58:28.6370594Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6390416Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T13:58:28.6405362Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6422713Z Entering 'third_party/gloo' 2025-12-04T13:58:28.6436961Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6453958Z Entering 'third_party/googletest' 2025-12-04T13:58:28.6468850Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6485084Z Entering 'third_party/ideep' 2025-12-04T13:58:28.6503068Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6542313Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T13:58:28.6557437Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6589302Z Entering 'third_party/ittapi' 2025-12-04T13:58:28.6609167Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6639390Z Entering 'third_party/kineto' 2025-12-04T13:58:28.6663232Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6704084Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T13:58:28.6731309Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6761470Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T13:58:28.6780683Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6810039Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T13:58:28.6839064Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6868192Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T13:58:28.6891064Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6923003Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T13:58:28.6949604Z http.https://github.com/.extraheader 2025-12-04T13:58:28.6968290Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T13:58:28.6995104Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7016878Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T13:58:28.7052885Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7083349Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T13:58:28.7110273Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7147197Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T13:58:28.7181860Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7214093Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T13:58:28.7242063Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7259858Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T13:58:28.7277842Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7295639Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T13:58:28.7317702Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7335893Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T13:58:28.7366328Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7400517Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T13:58:28.7428316Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7458361Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T13:58:28.7480131Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7510129Z Entering 'third_party/kleidiai' 2025-12-04T13:58:28.7527977Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7545795Z Entering 'third_party/mimalloc' 2025-12-04T13:58:28.7558737Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7575427Z Entering 'third_party/nlohmann' 2025-12-04T13:58:28.7589391Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7607122Z Entering 'third_party/onnx' 2025-12-04T13:58:28.7623865Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7649122Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T13:58:28.7668031Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7691652Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T13:58:28.7705876Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7723184Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T13:58:28.7740957Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7757620Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T13:58:28.7770458Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7786123Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T13:58:28.7801515Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7821420Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T13:58:28.7838172Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7862802Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T13:58:28.7890302Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7914707Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T13:58:28.7926812Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7942648Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T13:58:28.7954626Z http.https://github.com/.extraheader 2025-12-04T13:58:28.7994958Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T13:58:28.8008358Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8048553Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T13:58:28.8068342Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8087290Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T13:58:28.8110118Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8146184Z Entering 'third_party/pocketfft' 2025-12-04T13:58:28.8169648Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8198590Z Entering 'third_party/protobuf' 2025-12-04T13:58:28.8212229Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8240507Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T13:58:28.8263193Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8295644Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T13:58:28.8308620Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8338607Z Entering 'third_party/psimd' 2025-12-04T13:58:28.8352267Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8380436Z Entering 'third_party/pthreadpool' 2025-12-04T13:58:28.8400885Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8428586Z Entering 'third_party/pybind11' 2025-12-04T13:58:28.8448753Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8478617Z Entering 'third_party/python-peachpy' 2025-12-04T13:58:28.8497675Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8537786Z Entering 'third_party/sleef' 2025-12-04T13:58:28.8556752Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8585272Z Entering 'third_party/tensorpipe' 2025-12-04T13:58:28.8603618Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8621012Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T13:58:28.8634437Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8650002Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T13:58:28.8673027Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8691280Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T13:58:28.8712828Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8729917Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T13:58:28.8747267Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8763533Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T13:58:28.8775647Z http.https://github.com/.extraheader 2025-12-04T13:58:28.8823479Z [command]/usr/bin/git config --local --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:28.8854494Z [command]/usr/bin/git submodule foreach --recursive git config --local --show-origin --name-only --get-regexp remote.origin.url 2025-12-04T13:58:28.9036268Z Entering 'android/libs/fbjni' 2025-12-04T13:58:28.9047392Z file:/home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T13:58:28.9057064Z Entering 'third_party/FP16' 2025-12-04T13:58:28.9084189Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T13:58:28.9096315Z Entering 'third_party/FXdiv' 2025-12-04T13:58:28.9117398Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T13:58:28.9127124Z Entering 'third_party/NNPACK' 2025-12-04T13:58:28.9144733Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T13:58:28.9162584Z Entering 'third_party/NVTX' 2025-12-04T13:58:28.9173085Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T13:58:28.9192283Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T13:58:28.9203544Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T13:58:28.9212982Z Entering 'third_party/XNNPACK' 2025-12-04T13:58:28.9224177Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T13:58:28.9250945Z Entering 'third_party/aiter' 2025-12-04T13:58:28.9261865Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T13:58:28.9272373Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T13:58:28.9283345Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T13:58:28.9298703Z Entering 'third_party/benchmark' 2025-12-04T13:58:28.9310034Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T13:58:28.9319420Z Entering 'third_party/composable_kernel' 2025-12-04T13:58:28.9334026Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T13:58:28.9347066Z Entering 'third_party/cpp-httplib' 2025-12-04T13:58:28.9357970Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T13:58:28.9377434Z Entering 'third_party/cpuinfo' 2025-12-04T13:58:28.9389809Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T13:58:28.9403628Z Entering 'third_party/cudnn_frontend' 2025-12-04T13:58:28.9416125Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T13:58:28.9438249Z Entering 'third_party/cutlass' 2025-12-04T13:58:28.9468973Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T13:58:28.9486285Z Entering 'third_party/fbgemm' 2025-12-04T13:58:28.9503788Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T13:58:28.9519748Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T13:58:28.9539017Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T13:58:28.9554288Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T13:58:28.9575507Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T13:58:28.9602217Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T13:58:28.9636688Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T13:58:28.9645597Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T13:58:28.9660841Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T13:58:28.9692709Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T13:58:28.9713273Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T13:58:28.9723550Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T13:58:28.9735545Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T13:58:28.9743476Z Entering 'third_party/fbgemm/external/json' 2025-12-04T13:58:28.9768590Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T13:58:28.9791552Z Entering 'third_party/flash-attention' 2025-12-04T13:58:28.9802924Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T13:58:28.9815658Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T13:58:28.9826046Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T13:58:28.9836885Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T13:58:28.9846446Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T13:58:28.9873770Z Entering 'third_party/flatbuffers' 2025-12-04T13:58:28.9884112Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T13:58:28.9894746Z Entering 'third_party/fmt' 2025-12-04T13:58:28.9911882Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T13:58:28.9931049Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T13:58:28.9943035Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T13:58:28.9953074Z Entering 'third_party/gloo' 2025-12-04T13:58:28.9969338Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T13:58:28.9979094Z Entering 'third_party/googletest' 2025-12-04T13:58:28.9999284Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T13:58:29.0010089Z Entering 'third_party/ideep' 2025-12-04T13:58:29.0021095Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T13:58:29.0040521Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T13:58:29.0071995Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T13:58:29.0085915Z Entering 'third_party/ittapi' 2025-12-04T13:58:29.0107259Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T13:58:29.0117263Z Entering 'third_party/kineto' 2025-12-04T13:58:29.0137418Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T13:58:29.0148898Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T13:58:29.0159125Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T13:58:29.0179485Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T13:58:29.0203610Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T13:58:29.0214930Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T13:58:29.0229866Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T13:58:29.0248666Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T13:58:29.0259010Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T13:58:29.0279350Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T13:58:29.0295037Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T13:58:29.0306279Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T13:58:29.0330494Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T13:58:29.0341095Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T13:58:29.0369184Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T13:58:29.0377744Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T13:58:29.0393373Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T13:58:29.0401842Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T13:58:29.0411892Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T13:58:29.0431411Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T13:58:29.0441589Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T13:58:29.0450172Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T13:58:29.0460109Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T13:58:29.0480375Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T13:58:29.0491945Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T13:58:29.0500875Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T13:58:29.0526439Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T13:58:29.0539184Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T13:58:29.0574679Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T13:58:29.0583556Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T13:58:29.0603980Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T13:58:29.0614047Z Entering 'third_party/kleidiai' 2025-12-04T13:58:29.0624968Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T13:58:29.0636031Z Entering 'third_party/mimalloc' 2025-12-04T13:58:29.0647699Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T13:58:29.0669532Z Entering 'third_party/nlohmann' 2025-12-04T13:58:29.0684337Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T13:58:29.0695421Z Entering 'third_party/onnx' 2025-12-04T13:58:29.0719292Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T13:58:29.0759128Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T13:58:29.0778779Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T13:58:29.0791925Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T13:58:29.0812278Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T13:58:29.0825025Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T13:58:29.0848044Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T13:58:29.0857776Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T13:58:29.0866704Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T13:58:29.0874654Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T13:58:29.0883235Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T13:58:29.0891634Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T13:58:29.0915593Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T13:58:29.0925773Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T13:58:29.0949888Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T13:58:29.0959006Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T13:58:29.0969417Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T13:58:29.0977699Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T13:58:29.0986799Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T13:58:29.0994418Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T13:58:29.1004330Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T13:58:29.1024292Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T13:58:29.1038985Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T13:58:29.1050330Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T13:58:29.1060210Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T13:58:29.1089890Z Entering 'third_party/pocketfft' 2025-12-04T13:58:29.1114592Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T13:58:29.1134827Z Entering 'third_party/protobuf' 2025-12-04T13:58:29.1146010Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T13:58:29.1157593Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T13:58:29.1171982Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T13:58:29.1180696Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T13:58:29.1196786Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T13:58:29.1217100Z Entering 'third_party/psimd' 2025-12-04T13:58:29.1228258Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T13:58:29.1238383Z Entering 'third_party/pthreadpool' 2025-12-04T13:58:29.1249716Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T13:58:29.1269883Z Entering 'third_party/pybind11' 2025-12-04T13:58:29.1286779Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T13:58:29.1308840Z Entering 'third_party/python-peachpy' 2025-12-04T13:58:29.1319514Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T13:58:29.1338403Z Entering 'third_party/sleef' 2025-12-04T13:58:29.1351012Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T13:58:29.1361320Z Entering 'third_party/tensorpipe' 2025-12-04T13:58:29.1371435Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T13:58:29.1381653Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T13:58:29.1390204Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T13:58:29.1398619Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T13:58:29.1416948Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T13:58:29.1427461Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T13:58:29.1436692Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T13:58:29.1444733Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T13:58:29.1454536Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T13:58:29.1463127Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T13:58:29.1490494Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T13:58:29.1540556Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.1582625Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.1620718Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.1646343Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.1684881Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.1709345Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.1749041Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.1773847Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.1812754Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.1850346Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.1892412Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.1914607Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.1951454Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.1986432Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2029988Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2053876Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2092396Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2127943Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2162042Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2194207Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2231867Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2266867Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2290599Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2325116Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2349931Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2386321Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2422100Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2446170Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2471614Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2504725Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2528527Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2564530Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2598856Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2634891Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2659604Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2693618Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2716538Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2740260Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2769995Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2794867Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2821604Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2845538Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2878283Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2911745Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2935488Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2958044Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.2981738Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3003854Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3026584Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3062186Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3081938Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3113639Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3146293Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3167028Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3184889Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3215935Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3234972Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3252961Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3284209Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3303798Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3340388Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3367298Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3402478Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3422708Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3454322Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3474000Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3493684Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3512267Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3545056Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3564157Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3583373Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3612104Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3641526Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3662076Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3680880Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3699745Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3718776Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3736776Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3755917Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3785941Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3814712Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:29.3978517Z Post job cleanup. 2025-12-04T13:58:29.4471509Z [command]/usr/bin/git version 2025-12-04T13:58:29.4492628Z git version 2.52.0 2025-12-04T13:58:29.4508023Z Copying '/home/runner/.gitconfig' to '/home/runner/_work/_temp/6459fc86-2827-486e-baa3-75e59797ea2b/.gitconfig' 2025-12-04T13:58:29.4513310Z Temporarily overriding HOME='/home/runner/_work/_temp/6459fc86-2827-486e-baa3-75e59797ea2b' before making global git config changes 2025-12-04T13:58:29.4514401Z Adding repository directory to the temporary git global config as a safe directory 2025-12-04T13:58:29.4515303Z [command]/usr/bin/git config --global --add safe.directory /home/runner/_work/pytorch/pytorch 2025-12-04T13:58:29.4545114Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2025-12-04T13:58:29.4570553Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2025-12-04T13:58:29.4862973Z Entering 'android/libs/fbjni' 2025-12-04T13:58:29.4911690Z Entering 'third_party/FP16' 2025-12-04T13:58:29.4962453Z Entering 'third_party/FXdiv' 2025-12-04T13:58:29.5001373Z Entering 'third_party/NNPACK' 2025-12-04T13:58:29.5037791Z Entering 'third_party/NVTX' 2025-12-04T13:58:29.5074770Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T13:58:29.5116320Z Entering 'third_party/XNNPACK' 2025-12-04T13:58:29.5176294Z Entering 'third_party/aiter' 2025-12-04T13:58:29.5210680Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T13:58:29.5236655Z Entering 'third_party/benchmark' 2025-12-04T13:58:29.5266957Z Entering 'third_party/composable_kernel' 2025-12-04T13:58:29.5298167Z Entering 'third_party/cpp-httplib' 2025-12-04T13:58:29.5322978Z Entering 'third_party/cpuinfo' 2025-12-04T13:58:29.5362134Z Entering 'third_party/cudnn_frontend' 2025-12-04T13:58:29.5406107Z Entering 'third_party/cutlass' 2025-12-04T13:58:29.5438628Z Entering 'third_party/fbgemm' 2025-12-04T13:58:29.5465126Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T13:58:29.5508131Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T13:58:29.5549240Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T13:58:29.5572099Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T13:58:29.5611238Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T13:58:29.5649974Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T13:58:29.5698559Z Entering 'third_party/fbgemm/external/json' 2025-12-04T13:58:29.5749389Z Entering 'third_party/flash-attention' 2025-12-04T13:58:29.5792822Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T13:58:29.5842732Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T13:58:29.5893754Z Entering 'third_party/flatbuffers' 2025-12-04T13:58:29.5954081Z Entering 'third_party/fmt' 2025-12-04T13:58:29.5985600Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T13:58:29.6019636Z Entering 'third_party/gloo' 2025-12-04T13:58:29.6064864Z Entering 'third_party/googletest' 2025-12-04T13:58:29.6103256Z Entering 'third_party/ideep' 2025-12-04T13:58:29.6133173Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T13:58:29.6191678Z Entering 'third_party/ittapi' 2025-12-04T13:58:29.6217933Z Entering 'third_party/kineto' 2025-12-04T13:58:29.6262394Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T13:58:29.6299198Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T13:58:29.6327563Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T13:58:29.6348209Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T13:58:29.6382238Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T13:58:29.6415923Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T13:58:29.6445825Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T13:58:29.6468895Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T13:58:29.6502262Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T13:58:29.6546767Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T13:58:29.6572158Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T13:58:29.6596150Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T13:58:29.6632500Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T13:58:29.6671119Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T13:58:29.6728557Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T13:58:29.6777407Z Entering 'third_party/kleidiai' 2025-12-04T13:58:29.6804084Z Entering 'third_party/mimalloc' 2025-12-04T13:58:29.6838211Z Entering 'third_party/nlohmann' 2025-12-04T13:58:29.6880552Z Entering 'third_party/onnx' 2025-12-04T13:58:29.6917861Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T13:58:29.6951993Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T13:58:29.6978614Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T13:58:29.7001511Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T13:58:29.7045094Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T13:58:29.7090464Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T13:58:29.7137115Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T13:58:29.7169591Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T13:58:29.7207412Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T13:58:29.7247562Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T13:58:29.7296672Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T13:58:29.7350326Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T13:58:29.7393432Z Entering 'third_party/pocketfft' 2025-12-04T13:58:29.7426705Z Entering 'third_party/protobuf' 2025-12-04T13:58:29.7468734Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T13:58:29.7499968Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T13:58:29.7544494Z Entering 'third_party/psimd' 2025-12-04T13:58:29.7576689Z Entering 'third_party/pthreadpool' 2025-12-04T13:58:29.7622728Z Entering 'third_party/pybind11' 2025-12-04T13:58:29.7665490Z Entering 'third_party/python-peachpy' 2025-12-04T13:58:29.7694989Z Entering 'third_party/sleef' 2025-12-04T13:58:29.7720166Z Entering 'third_party/tensorpipe' 2025-12-04T13:58:29.7747877Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T13:58:29.7786544Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T13:58:29.7826642Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T13:58:29.7870228Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T13:58:29.7908345Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T13:58:29.7975603Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2025-12-04T13:58:29.8008676Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2025-12-04T13:58:29.8227701Z Entering 'android/libs/fbjni' 2025-12-04T13:58:29.8258609Z Entering 'third_party/FP16' 2025-12-04T13:58:29.8300861Z Entering 'third_party/FXdiv' 2025-12-04T13:58:29.8345757Z Entering 'third_party/NNPACK' 2025-12-04T13:58:29.8396014Z Entering 'third_party/NVTX' 2025-12-04T13:58:29.8438922Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T13:58:29.8487762Z Entering 'third_party/XNNPACK' 2025-12-04T13:58:29.8533421Z Entering 'third_party/aiter' 2025-12-04T13:58:29.8562758Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T13:58:29.8602838Z Entering 'third_party/benchmark' 2025-12-04T13:58:29.8643777Z Entering 'third_party/composable_kernel' 2025-12-04T13:58:29.8696628Z Entering 'third_party/cpp-httplib' 2025-12-04T13:58:29.8723656Z Entering 'third_party/cpuinfo' 2025-12-04T13:58:29.8757834Z Entering 'third_party/cudnn_frontend' 2025-12-04T13:58:29.8783823Z Entering 'third_party/cutlass' 2025-12-04T13:58:29.8822914Z Entering 'third_party/fbgemm' 2025-12-04T13:58:29.8847803Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T13:58:29.8881161Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T13:58:29.8926932Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T13:58:29.8952086Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T13:58:29.9001816Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T13:58:29.9045495Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T13:58:29.9067918Z Entering 'third_party/fbgemm/external/json' 2025-12-04T13:58:29.9121927Z Entering 'third_party/flash-attention' 2025-12-04T13:58:29.9152431Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T13:58:29.9180275Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T13:58:29.9223685Z Entering 'third_party/flatbuffers' 2025-12-04T13:58:29.9267297Z Entering 'third_party/fmt' 2025-12-04T13:58:29.9291564Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T13:58:29.9316431Z Entering 'third_party/gloo' 2025-12-04T13:58:29.9350805Z Entering 'third_party/googletest' 2025-12-04T13:58:29.9387192Z Entering 'third_party/ideep' 2025-12-04T13:58:29.9413239Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T13:58:29.9440112Z Entering 'third_party/ittapi' 2025-12-04T13:58:29.9468753Z Entering 'third_party/kineto' 2025-12-04T13:58:29.9509910Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T13:58:29.9550545Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T13:58:29.9609374Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T13:58:29.9659916Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T13:58:29.9695429Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T13:58:29.9731980Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T13:58:29.9789639Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T13:58:29.9836837Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T13:58:29.9871695Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T13:58:29.9931105Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T13:58:29.9968308Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T13:58:30.0007711Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T13:58:30.0046484Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T13:58:30.0075782Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T13:58:30.0104263Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T13:58:30.0131098Z Entering 'third_party/kleidiai' 2025-12-04T13:58:30.0168919Z Entering 'third_party/mimalloc' 2025-12-04T13:58:30.0195446Z Entering 'third_party/nlohmann' 2025-12-04T13:58:30.0232293Z Entering 'third_party/onnx' 2025-12-04T13:58:30.0274319Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T13:58:30.0302578Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T13:58:30.0328789Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T13:58:30.0355615Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T13:58:30.0386411Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T13:58:30.0407676Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T13:58:30.0465653Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T13:58:30.0508490Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T13:58:30.0556088Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T13:58:30.0586593Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T13:58:30.0635005Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T13:58:30.0674706Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T13:58:30.0707229Z Entering 'third_party/pocketfft' 2025-12-04T13:58:30.0754916Z Entering 'third_party/protobuf' 2025-12-04T13:58:30.0797325Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T13:58:30.0825110Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T13:58:30.0864766Z Entering 'third_party/psimd' 2025-12-04T13:58:30.0894699Z Entering 'third_party/pthreadpool' 2025-12-04T13:58:30.0921088Z Entering 'third_party/pybind11' 2025-12-04T13:58:30.0947522Z Entering 'third_party/python-peachpy' 2025-12-04T13:58:30.0972933Z Entering 'third_party/sleef' 2025-12-04T13:58:30.1003783Z Entering 'third_party/tensorpipe' 2025-12-04T13:58:30.1040804Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T13:58:30.1062892Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T13:58:30.1092203Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T13:58:30.1125645Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T13:58:30.1170804Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T13:58:30.1239159Z [command]/usr/bin/git config --local --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.1271447Z [command]/usr/bin/git submodule foreach --recursive git config --local --show-origin --name-only --get-regexp remote.origin.url 2025-12-04T13:58:30.1460546Z Entering 'android/libs/fbjni' 2025-12-04T13:58:30.1475955Z file:/home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2025-12-04T13:58:30.1496827Z Entering 'third_party/FP16' 2025-12-04T13:58:30.1515737Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2025-12-04T13:58:30.1535352Z Entering 'third_party/FXdiv' 2025-12-04T13:58:30.1555621Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2025-12-04T13:58:30.1564859Z Entering 'third_party/NNPACK' 2025-12-04T13:58:30.1589130Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2025-12-04T13:58:30.1599995Z Entering 'third_party/NVTX' 2025-12-04T13:58:30.1630353Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2025-12-04T13:58:30.1641638Z Entering 'third_party/VulkanMemoryAllocator' 2025-12-04T13:58:30.1653513Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2025-12-04T13:58:30.1663367Z Entering 'third_party/XNNPACK' 2025-12-04T13:58:30.1685076Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2025-12-04T13:58:30.1699959Z Entering 'third_party/aiter' 2025-12-04T13:58:30.1714982Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config remote.origin.url 2025-12-04T13:58:30.1724091Z Entering 'third_party/aiter/3rdparty/composable_kernel' 2025-12-04T13:58:30.1752884Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config remote.origin.url 2025-12-04T13:58:30.1765114Z Entering 'third_party/benchmark' 2025-12-04T13:58:30.1784427Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2025-12-04T13:58:30.1805899Z Entering 'third_party/composable_kernel' 2025-12-04T13:58:30.1833134Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config remote.origin.url 2025-12-04T13:58:30.1848553Z Entering 'third_party/cpp-httplib' 2025-12-04T13:58:30.1863190Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2025-12-04T13:58:30.1885129Z Entering 'third_party/cpuinfo' 2025-12-04T13:58:30.1914078Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2025-12-04T13:58:30.1925256Z Entering 'third_party/cudnn_frontend' 2025-12-04T13:58:30.1947349Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2025-12-04T13:58:30.1967579Z Entering 'third_party/cutlass' 2025-12-04T13:58:30.1986764Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2025-12-04T13:58:30.2016892Z Entering 'third_party/fbgemm' 2025-12-04T13:58:30.2044443Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2025-12-04T13:58:30.2058388Z Entering 'third_party/fbgemm/external/asmjit' 2025-12-04T13:58:30.2083130Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config remote.origin.url 2025-12-04T13:58:30.2102484Z Entering 'third_party/fbgemm/external/composable_kernel' 2025-12-04T13:58:30.2126193Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config remote.origin.url 2025-12-04T13:58:30.2142253Z Entering 'third_party/fbgemm/external/cpuinfo' 2025-12-04T13:58:30.2158940Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config remote.origin.url 2025-12-04T13:58:30.2169193Z Entering 'third_party/fbgemm/external/cutlass' 2025-12-04T13:58:30.2186262Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config remote.origin.url 2025-12-04T13:58:30.2216256Z Entering 'third_party/fbgemm/external/googletest' 2025-12-04T13:58:30.2228654Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config remote.origin.url 2025-12-04T13:58:30.2250048Z Entering 'third_party/fbgemm/external/hipify_torch' 2025-12-04T13:58:30.2261845Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config remote.origin.url 2025-12-04T13:58:30.2277878Z Entering 'third_party/fbgemm/external/json' 2025-12-04T13:58:30.2302892Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config remote.origin.url 2025-12-04T13:58:30.2326225Z Entering 'third_party/flash-attention' 2025-12-04T13:58:30.2343937Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config remote.origin.url 2025-12-04T13:58:30.2365318Z Entering 'third_party/flash-attention/csrc/composable_kernel' 2025-12-04T13:58:30.2391668Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config remote.origin.url 2025-12-04T13:58:30.2402862Z Entering 'third_party/flash-attention/csrc/cutlass' 2025-12-04T13:58:30.2425968Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config remote.origin.url 2025-12-04T13:58:30.2441248Z Entering 'third_party/flatbuffers' 2025-12-04T13:58:30.2461005Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2025-12-04T13:58:30.2484370Z Entering 'third_party/fmt' 2025-12-04T13:58:30.2505306Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2025-12-04T13:58:30.2517863Z Entering 'third_party/gemmlowp/gemmlowp' 2025-12-04T13:58:30.2529191Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2025-12-04T13:58:30.2540868Z Entering 'third_party/gloo' 2025-12-04T13:58:30.2558173Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2025-12-04T13:58:30.2570391Z Entering 'third_party/googletest' 2025-12-04T13:58:30.2582356Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2025-12-04T13:58:30.2593807Z Entering 'third_party/ideep' 2025-12-04T13:58:30.2615893Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2025-12-04T13:58:30.2628388Z Entering 'third_party/ideep/mkl-dnn' 2025-12-04T13:58:30.2643543Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config remote.origin.url 2025-12-04T13:58:30.2657035Z Entering 'third_party/ittapi' 2025-12-04T13:58:30.2677722Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2025-12-04T13:58:30.2697977Z Entering 'third_party/kineto' 2025-12-04T13:58:30.2716181Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2025-12-04T13:58:30.2727727Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2025-12-04T13:58:30.2755522Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2025-12-04T13:58:30.2777892Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2025-12-04T13:58:30.2797575Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2025-12-04T13:58:30.2818896Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2025-12-04T13:58:30.2846646Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2025-12-04T13:58:30.2868938Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2025-12-04T13:58:30.2890530Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2025-12-04T13:58:30.2900832Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2025-12-04T13:58:30.2916457Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2025-12-04T13:58:30.2938012Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2025-12-04T13:58:30.2955446Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2025-12-04T13:58:30.2965857Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2025-12-04T13:58:30.2991359Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2025-12-04T13:58:30.3001760Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2025-12-04T13:58:30.3025859Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2025-12-04T13:58:30.3038288Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2025-12-04T13:58:30.3063233Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2025-12-04T13:58:30.3073911Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2025-12-04T13:58:30.3100779Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2025-12-04T13:58:30.3111696Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp' 2025-12-04T13:58:30.3131036Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T13:58:30.3141193Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T13:58:30.3162846Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T13:58:30.3174156Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T13:58:30.3200363Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T13:58:30.3214243Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2025-12-04T13:58:30.3230927Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2025-12-04T13:58:30.3240760Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2025-12-04T13:58:30.3252936Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2025-12-04T13:58:30.3264130Z Entering 'third_party/kleidiai' 2025-12-04T13:58:30.3282308Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config remote.origin.url 2025-12-04T13:58:30.3291904Z Entering 'third_party/mimalloc' 2025-12-04T13:58:30.3303592Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2025-12-04T13:58:30.3319627Z Entering 'third_party/nlohmann' 2025-12-04T13:58:30.3331023Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2025-12-04T13:58:30.3340734Z Entering 'third_party/onnx' 2025-12-04T13:58:30.3358551Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2025-12-04T13:58:30.3377072Z Entering 'third_party/onnx/third_party/pybind11' 2025-12-04T13:58:30.3395703Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2025-12-04T13:58:30.3408273Z Entering 'third_party/opentelemetry-cpp' 2025-12-04T13:58:30.3420584Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2025-12-04T13:58:30.3442505Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2025-12-04T13:58:30.3470406Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2025-12-04T13:58:30.3481047Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2025-12-04T13:58:30.3506693Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config remote.origin.url 2025-12-04T13:58:30.3518850Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2025-12-04T13:58:30.3526624Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2025-12-04T13:58:30.3535286Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2025-12-04T13:58:30.3561495Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2025-12-04T13:58:30.3571050Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2025-12-04T13:58:30.3580778Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2025-12-04T13:58:30.3591211Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2025-12-04T13:58:30.3601379Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2025-12-04T13:58:30.3609761Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2025-12-04T13:58:30.3630131Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2025-12-04T13:58:30.3640276Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2025-12-04T13:58:30.3651905Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2025-12-04T13:58:30.3660683Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2025-12-04T13:58:30.3678394Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2025-12-04T13:58:30.3689603Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2025-12-04T13:58:30.3705811Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2025-12-04T13:58:30.3729792Z Entering 'third_party/pocketfft' 2025-12-04T13:58:30.3747366Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2025-12-04T13:58:30.3757476Z Entering 'third_party/protobuf' 2025-12-04T13:58:30.3784496Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2025-12-04T13:58:30.3796485Z Entering 'third_party/protobuf/third_party/benchmark' 2025-12-04T13:58:30.3816018Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2025-12-04T13:58:30.3825993Z Entering 'third_party/protobuf/third_party/googletest' 2025-12-04T13:58:30.3836888Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2025-12-04T13:58:30.3847654Z Entering 'third_party/psimd' 2025-12-04T13:58:30.3861191Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2025-12-04T13:58:30.3880984Z Entering 'third_party/pthreadpool' 2025-12-04T13:58:30.3892794Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2025-12-04T13:58:30.3912042Z Entering 'third_party/pybind11' 2025-12-04T13:58:30.3929099Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config remote.origin.url 2025-12-04T13:58:30.3938574Z Entering 'third_party/python-peachpy' 2025-12-04T13:58:30.3948714Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2025-12-04T13:58:30.3968375Z Entering 'third_party/sleef' 2025-12-04T13:58:30.3982720Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2025-12-04T13:58:30.3991205Z Entering 'third_party/tensorpipe' 2025-12-04T13:58:30.4002539Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2025-12-04T13:58:30.4014093Z Entering 'third_party/tensorpipe/third_party/googletest' 2025-12-04T13:58:30.4023575Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2025-12-04T13:58:30.4034272Z Entering 'third_party/tensorpipe/third_party/libnop' 2025-12-04T13:58:30.4047662Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2025-12-04T13:58:30.4066829Z Entering 'third_party/tensorpipe/third_party/libuv' 2025-12-04T13:58:30.4082423Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2025-12-04T13:58:30.4101468Z Entering 'third_party/tensorpipe/third_party/pybind11' 2025-12-04T13:58:30.4122284Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2025-12-04T13:58:30.4132376Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2025-12-04T13:58:30.4146454Z file:/home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 2025-12-04T13:58:30.4176998Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4211403Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4243725Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4271833Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4292981Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4325262Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4345521Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4377273Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4412052Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/aiter/modules/3rdparty/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4443348Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4463300Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4497757Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4518479Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4537878Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4567249Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4599832Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4635380Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/asmjit/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4667167Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4698657Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cpuinfo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4716085Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4744620Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4774522Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/hipify_torch/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4797331Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/external/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4814981Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4842896Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/composable_kernel/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4872230Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flash-attention/modules/csrc/cutlass/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4891934Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4912206Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4930285Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4960356Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.4979647Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5009050Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5029817Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5058426Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5086332Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5115230Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5147476Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5165936Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5185898Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5219633Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5249027Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5285819Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5320506Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5351633Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5382287Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5411523Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5441248Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5460709Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5490364Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5509453Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5537789Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/kleidiai/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5556736Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5575613Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5593684Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5617693Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5637249Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5656874Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5688953Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5708932Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5739974Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5771839Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5792224Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5822205Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5848150Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5867333Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5900418Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5922456Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5940618Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5959135Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.5977524Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.6008760Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.6035133Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.6060125Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.6089845Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.6114136Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.6143972Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.6164591Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.6191652Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.6211664Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.6231122Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.6250780Z [command]/usr/bin/git config --file /home/runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config --name-only --get-regexp ^includeIf\.gitdir: 2025-12-04T13:58:30.6419329Z Cleaning up orphan processes